Displaying 20 results from an estimated 130 matches for "nema".
2015 Mar 24
3
[LLVMdev] RFC: Loop versioning for LICM
> On Mar 20, 2015, at 8:02 PM, Nema, Ashutosh <Ashutosh.Nema at amd.com> wrote:
>
> > Yes, this is what I was proposing above and here ;):
> Thanks Adam it’s for confirming J
NP :).
>
> > No, not hasLoopInvariantStore but hasAccessToLoopInvariantAddress.
> Its only for invariant stores[not loads], U...
2015 Mar 20
2
[LLVMdev] RFC: Loop versioning for LICM
> On Mar 19, 2015, at 9:46 PM, Nema, Ashutosh <Ashutosh.Nema at amd.com> wrote:
>
> Thanks Adam for your reply.
>
> From: Adam Nemet [mailto:anemet at apple.com <mailto:anemet at apple.com>]
> Sent: Friday, March 20, 2015 3:23 AM
> To: Nema, Ashutosh
> Cc: Hal Finkel; Philip Reames; llvmdev at cs....
2015 Mar 19
3
[LLVMdev] Cast to SCEVAddRecExpr
...,2}<%for.body4> to i64)".
So whenever SCEV cast to "SCEVAddRecExpr" fails, we have drill down for such patterns ?
Is that the right way ?
Regards,
Ashutosh
-----Original Message-----
From: Nick Lewycky [mailto:nicholas at mxc.ca]
Sent: Thursday, March 19, 2015 1:02 PM
To: Nema, Ashutosh
Cc: llvmdev at cs.uiuc.edu
Subject: Re: [LLVMdev] Cast to SCEVAddRecExpr
Nema, Ashutosh wrote:
> Hi Nick,
>
> Thanks for looking into it.
>
> I have tried that as well but it didn't worked.
>
> "AddExpr->getOperand(0))" node is:
> " (4 * (sex...
2016 Feb 25
2
how to force llvm generate gather intrinsic
...>> - * Elena*
>>
>>
>>
>> *From:* zhi chen [mailto:zchenhn at gmail.com]
>> *Sent:* Thursday, February 25, 2016 01:20
>> *To:* Demikhovsky, Elena <elena.demikhovsky at intel.com>
>> *Cc:* Sanjay Patel <spatel at rotateright.com>; Nema, Ashutosh <
>> Ashutosh.Nema at amd.com>; llvm-dev <llvm-dev at lists.llvm.org>
>>
>> *Subject:* Re: [llvm-dev] how to force llvm generate gather intrinsic
>>
>>
>>
>> Hi Elena,
>>
>>
>>
>> Are the masked_load and gather...
2016 Feb 26
2
how to force llvm generate gather intrinsic
...AVX2 processors.
>
>
>
> - * Elena*
>
>
>
> *From:* zhi chen [mailto:zchenhn at gmail.com]
> *Sent:* Thursday, February 25, 2016 20:48
> *To:* Sanjay Patel <spatel at rotateright.com>
> *Cc:* Demikhovsky, Elena <elena.demikhovsky at intel.com>; Nema, Ashutosh <
> Ashutosh.Nema at amd.com>; llvm-dev <llvm-dev at lists.llvm.org>
>
> *Subject:* Re: [llvm-dev] how to force llvm generate gather intrinsic
>
>
>
> It seems that http://reviews.llvm.org/D15690 only implemented
> gather/scatter for AVX-512, but not fo...
2016 Jan 20
2
Why getFunction() of CallGraphNode return NULL function?
So, I won't know the called function statically, Right?
--------------------------------------------
Qiuping Yi
Institute Of Software
Chinese Academy of Sciences
On Wed, Jan 20, 2016 at 2:24 PM, Nema, Ashutosh <Ashutosh.Nema at amd.com>
wrote:
> Typically for C++ virtual function you will see an indirect callSite
> (unless not de-virtualized).
>
>
>
> Regards,
>
> Ashutosh
>
>
>
> *From:* Qiuping Yi [mailto:yiqiuping at gmail.com]
> *Sent:* Wednesda...
2016 Feb 25
2
how to force llvm generate gather intrinsic
Yes, masked load/store/gather/scatter are completed.
- Elena
From: zhi chen [mailto:zchenhn at gmail.com]
Sent: Thursday, February 25, 2016 01:20
To: Demikhovsky, Elena <elena.demikhovsky at intel.com>
Cc: Sanjay Patel <spatel at rotateright.com>; Nema, Ashutosh <Ashutosh.Nema at amd.com>; llvm-dev <llvm-dev at lists.llvm.org>
Subject: Re: [llvm-dev] how to force llvm generate gather intrinsic
Hi Elena,
Are the masked_load and gather working now?
Best,
Zhi
On Sat, Jan 23, 2016 at 12:06 PM, Demikhovsky, Elena <elena.demikhovsky...
2016 Feb 26
0
how to force llvm generate gather intrinsic
No. Gather operation is slow on AVX2 processors.
- Elena
From: zhi chen [mailto:zchenhn at gmail.com]
Sent: Thursday, February 25, 2016 20:48
To: Sanjay Patel <spatel at rotateright.com>
Cc: Demikhovsky, Elena <elena.demikhovsky at intel.com>; Nema, Ashutosh <Ashutosh.Nema at amd.com>; llvm-dev <llvm-dev at lists.llvm.org>
Subject: Re: [llvm-dev] how to force llvm generate gather intrinsic
It seems that http://reviews.llvm.org/D15690 only implemented gather/scatter for AVX-512, but not for AVX/AVX2. Is there any plan to enable ga...
2016 Feb 26
0
how to force llvm generate gather intrinsic
...>> - * Elena*
>>
>>
>>
>> *From:* zhi chen [mailto:zchenhn at gmail.com]
>> *Sent:* Thursday, February 25, 2016 20:48
>> *To:* Sanjay Patel <spatel at rotateright.com>
>> *Cc:* Demikhovsky, Elena <elena.demikhovsky at intel.com>; Nema, Ashutosh <
>> Ashutosh.Nema at amd.com>; llvm-dev <llvm-dev at lists.llvm.org>
>>
>> *Subject:* Re: [llvm-dev] how to force llvm generate gather intrinsic
>>
>>
>>
>> It seems that http://reviews.llvm.org/D15690 only implemented
>> gather...
2016 Apr 12
2
X86 TRUNCATE cost for AVX & AVX2 mode
...for same operation in SSE2 mode.
http://reviews.llvm.org/rL256194
Looks like as the part of same patch we should reduce cost for TRUNCATE v16i32 to v16i8 in SSE4.1 as well.
Regards,
Ashutosh
From: Demikhovsky, Elena [mailto:elena.demikhovsky at intel.com]
Sent: Monday, April 11, 2016 9:05 PM
To: Nema, Ashutosh <Ashutosh.Nema at amd.com>
Cc: llvm-dev <llvm-dev at lists.llvm.org>; Zuckerman, Michael <michael.zuckerman at intel.com>
Subject: RE: X86 TRUNCATE cost for AVX & AVX2 mode
Hi,
One day I worked hard and refactored the cost calculation for all X86 targets.
http://re...
2018 May 16
1
ScalarEvolution questions
On Wed, May 16, 2018 at 1:24 AM, Nema, Ashutosh <Ashutosh.Nema at amd.com> wrote:
> Hi Sanjoy,
>
> Your inputs really helped.
>
> Using “isImpliedCond”, able to relate and find the min for cases like:
>
> SCEV1: (-1 + (sext i32 %n.addr.058 to i64))<nsw>
> SCEV2: 0
> Extra-Condition: (n.addr.058 &g...
2016 Feb 24
0
how to force llvm generate gather intrinsic
...wrote:
> Ø Can we legalize the same set of masked load/store operations for AVX1
> as AVX2?
>
> Yes, of course.
>
>
>
> - * Elena*
>
>
>
> *From:* Sanjay Patel [mailto:spatel at rotateright.com]
> *Sent:* Saturday, January 23, 2016 18:42
> *To:* Nema, Ashutosh <Ashutosh.Nema at amd.com>
> *Cc:* Demikhovsky, Elena <elena.demikhovsky at intel.com>; zhi chen <
> zchenhn at gmail.com>; llvm-dev <llvm-dev at lists.llvm.org>
> *Subject:* Re: [llvm-dev] how to force llvm generate gather intrinsic
>
>
>
>
&g...
2017 Mar 14
10
[Proposal][RFC] Epilog loop vectorization
...: vectorize’ again.
3) Block layout description with epilog loop vectorization is available at
https://reviews.llvm.org/file/data/fxg5vx3capyj257rrn5j/PHID-FILE-x6thnbf6ub55ep5yhalu/LayoutDescription.png
Approach-1 looks feasible, please comment if any objections.
Regards,
Ashutosh
From: Nema, Ashutosh
Sent: Wednesday, March 1, 2017 10:42 AM
To: 'Daniel Berlin' <dberlin at dberlin.org>
Cc: anemet at apple.com; Hal Finkel <hfinkel at anl.gov>; Zaks, Ayal <ayal.zaks at intel.com>; Renato Golin <renato.golin at linaro.org>; mkuper at google.com; Mehdi Amini...
2015 Mar 19
2
[LLVMdev] RFC: Loop versioning for LICM
Hi Ashutosh,
> On Mar 16, 2015, at 9:06 PM, Nema, Ashutosh <Ashutosh.Nema at amd.com> wrote:
>
> Hi Adam,
>
> From: Adam Nemet [mailto:anemet at apple.com <mailto:anemet at apple.com>]
> Sent: Wednesday, March 11, 2015 10:48 AM
> To: Nema, Ashutosh
> Cc: llvmdev at cs.uiuc.edu <mailto:llvmdev at cs.uiuc.edu...
2016 Feb 25
0
how to force llvm generate gather intrinsic
...er are completed.
>
>
>
> - * Elena*
>
>
>
> *From:* zhi chen [mailto:zchenhn at gmail.com]
> *Sent:* Thursday, February 25, 2016 01:20
> *To:* Demikhovsky, Elena <elena.demikhovsky at intel.com>
> *Cc:* Sanjay Patel <spatel at rotateright.com>; Nema, Ashutosh <
> Ashutosh.Nema at amd.com>; llvm-dev <llvm-dev at lists.llvm.org>
>
> *Subject:* Re: [llvm-dev] how to force llvm generate gather intrinsic
>
>
>
> Hi Elena,
>
>
>
> Are the masked_load and gather working now?
>
>
>
> Best,
>...
2018 May 10
2
ScalarEvolution questions
Thanks Sanjoy, I'll look into this.
-----Original Message-----
From: Sanjoy Das [mailto:sanjoy at playingwithpointers.com]
Sent: Thursday, May 10, 2018 8:07 AM
To: Nema, Ashutosh <Ashutosh.Nema at amd.com>
Cc: llvm-dev at lists.llvm.org
Subject: Re: [llvm-dev] ScalarEvolution questions
Hi Ashutosh,
On Wed, May 9, 2018 at 3:28 AM, Nema, Ashutosh via llvm-dev <llvm-dev at lists.llvm.org> wrote:
> I’m new to ScalarEvolution and wanted to explore its...
2017 Mar 14
2
[Proposal][RFC] Epilog loop vectorization
On 03/14/2017 11:21 AM, Adam Nemet wrote:
>
>> On Mar 14, 2017, at 6:00 AM, Nema, Ashutosh <Ashutosh.Nema at amd.com
>> <mailto:Ashutosh.Nema at amd.com>> wrote:
>>
>> Summarizing the discussion on the implementation approaches.
>> Discussed about two approaches, first running ‘InnerLoopVectorizer’
>> again on the epilog loop immediat...
2017 Feb 28
3
[Proposal][RFC] Epilog loop vectorization
...the remainder loop with metadata to limit the vectorization factor for epilog vector loop.
Regards,
Ashutosh
From: anemet at apple.com [mailto:anemet at apple.com]
Sent: Tuesday, February 28, 2017 1:33 AM
To: Hal Finkel <hfinkel at anl.gov>
Cc: Daniel Berlin <dberlin at dberlin.org>; Nema, Ashutosh <Ashutosh.Nema at amd.com>; Zaks, Ayal <ayal.zaks at intel.com>; Renato Golin <renato.golin at linaro.org>; mkuper at google.com; Mehdi Amini <mehdi.amini at apple.com>; llvm-dev <llvm-dev at lists.llvm.org>
Subject: Re: [llvm-dev] [Proposal][RFC] Epilog loop...
2017 Mar 14
2
[Proposal][RFC] Epilog loop vectorization
...be easy to fix, but I'm
> not sure whether the way this is going to handle (2) is sufficient.
> If I'm the only one that this bothers, I won't stand in the way, but
> I'd like to at least make sure we've fully considered this.
>
> On Mar 14, 2017 06:00, "Nema, Ashutosh" <Ashutosh.Nema at amd.com
> <mailto:Ashutosh.Nema at amd.com>> wrote:
>
> Summarizing the discussion on the implementation approaches.
>
> Discussed about two approaches, first running
> ‘InnerLoopVectorizer’ again on the epilog loop immedi...
2016 Jan 23
2
how to force llvm generate gather intrinsic
Ø Can we legalize the same set of masked load/store operations for AVX1 as AVX2?
Yes, of course.
- Elena
From: Sanjay Patel [mailto:spatel at rotateright.com]
Sent: Saturday, January 23, 2016 18:42
To: Nema, Ashutosh <Ashutosh.Nema at amd.com>
Cc: Demikhovsky, Elena <elena.demikhovsky at intel.com>; zhi chen <zchenhn at gmail.com>; llvm-dev <llvm-dev at lists.llvm.org>
Subject: Re: [llvm-dev] how to force llvm generate gather intrinsic
On Sat, Jan 23, 2016 at 6:45 AM, Nema, A...