search for: spatel

Displaying 20 results from an estimated 294 matches for "spatel".

Did you mean: patel
2016 Feb 26
0
how to force llvm generate gather intrinsic
...It would be great if we have profitability mode to see the necessity to use gathers. Or it also would be good if there is a compiler option for the users to enable LLVM to generate the gather instructions no matter it is faster or slow. Best, Zhi On Fri, Feb 26, 2016 at 12:49 PM, Sanjay Patel <spatel at rotateright.com> wrote: > If I'm understanding correctly, you're saying that vgather* is slow on all > of Excavator, Haswell, Broadwell, and Skylake (client). Therefore, we will > not generate it for any of those machines. > > Even if that's true, we should not def...
2016 Feb 26
2
how to force llvm generate gather intrinsic
...ikhovsky, Elena < elena.demikhovsky at intel.com> wrote: > No. Gather operation is slow on AVX2 processors. > > > > - * Elena* > > > > *From:* zhi chen [mailto:zchenhn at gmail.com] > *Sent:* Thursday, February 25, 2016 20:48 > *To:* Sanjay Patel <spatel at rotateright.com> > *Cc:* Demikhovsky, Elena <elena.demikhovsky at intel.com>; Nema, Ashutosh < > Ashutosh.Nema at amd.com>; llvm-dev <llvm-dev at lists.llvm.org> > > *Subject:* Re: [llvm-dev] how to force llvm generate gather intrinsic > > > > It seem...
2016 Feb 25
2
how to force llvm generate gather intrinsic
It seems that http://reviews.llvm.org/D15690 only implemented gather/scatter for AVX-512, but not for AVX/AVX2. Is there any plan to enable gather for AVX/2? Thanks. Best, Zhi On Thu, Feb 25, 2016 at 8:28 AM, Sanjay Patel <spatel at rotateright.com> wrote: > I don't think gather has been enabled for AVX2 as of r261875. > Masked load/store were enabled for AVX with: > http://reviews.llvm.org/D16528 / http://reviews.llvm.org/rL258675 > > On Wed, Feb 24, 2016 at 11:39 PM, Demikhovsky, Elena < > ele...
2016 Feb 26
0
how to force llvm generate gather intrinsic
No. Gather operation is slow on AVX2 processors. - Elena From: zhi chen [mailto:zchenhn at gmail.com] Sent: Thursday, February 25, 2016 20:48 To: Sanjay Patel <spatel at rotateright.com> Cc: Demikhovsky, Elena <elena.demikhovsky at intel.com>; Nema, Ashutosh <Ashutosh.Nema at amd.com>; llvm-dev <llvm-dev at lists.llvm.org> Subject: Re: [llvm-dev] how to force llvm generate gather intrinsic It seems that http://reviews.llvm.org/D15690 only i...
2016 Feb 01
2
Redundant promotion of integer values in x86 target
...vin, I wonder if you are still working on it and have a plan to submit your changes for the review. Thanks, Taewook From: "Smith, Kevin B" <kevin.b.smith at intel.com<mailto:kevin.b.smith at intel.com>> Date: Monday, February 1, 2016 at 3:30 PM To: 'Sanjay Patel' <spatel at rotateright.com<mailto:spatel at rotateright.com>>, Taewook Oh <twoh at fb.com<mailto:twoh at fb.com>> Cc: "llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org>" <llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org>> Subject: RE:...
2015 Oct 01
2
Fwd: buildbot failure in LLVM on llvm-mips-linux
....org>, Dehao Chen <dehao at google.com>, Evgeniy Stepanov <eugeni.stepanov at gmail.com>, Hal Finkel <hfinkel at anl.gov>, Kostya Serebryany <kcc at google.com>, Michael Zolotukhin <mzolotukhin at apple.com>, Reid Kleckner <rnk at google.com>, Sanjay Patel <spatel at rotateright.com> Cc: gkistanova at gmail.com The Buildbot has detected a new failure on builder llvm-mips-linux while building llvm. Full details are available at: http://lab.llvm.org:8011/builders/llvm-mips-linux/builds/14754 Buildbot URL: http://lab.llvm.org:8011/ Buildslave for this B...
2018 Jan 15
2
GEP transformation by InstCombiner
...t. It contains pointer size, but how can I conclude that the GEP index can't be widened? - Elena From: Hal Finkel [mailto:hfinkel at anl.gov] Sent: Monday, January 15, 2018 20:34 To: Demikhovsky, Elena <elena.demikhovsky at intel.com>; llvm-dev at lists.llvm.org; Sanjay Patel (spatel at rotateright.com) <spatel at rotateright.com>; Chandler Carruth (chandlerc at gmail.com) <chandlerc at gmail.com>; Quentin Colombet (qcolombet at apple.com) <qcolombet at apple.com>; Craig Topper (craig.topper at gmail.com) <craig.topper at gmail.com> Cc: Breger, Igor <...
2018 Jan 15
0
GEP transformation by InstCombiner
...already in DataLayout.  -Hal >   > > -          */ Elena/* > >   > > *From:*Hal Finkel [mailto:hfinkel at anl.gov] > *Sent:* Monday, January 15, 2018 20:34 > *To:* Demikhovsky, Elena <elena.demikhovsky at intel.com>; > llvm-dev at lists.llvm.org; Sanjay Patel (spatel at rotateright.com) > <spatel at rotateright.com>; Chandler Carruth (chandlerc at gmail.com) > <chandlerc at gmail.com>; Quentin Colombet (qcolombet at apple.com) > <qcolombet at apple.com>; Craig Topper (craig.topper at gmail.com) > <craig.topper at gmail.com> &...
2016 Nov 20
5
FMA canonicalization in IR
...like they do today to fuse them back together again? On Sat, Nov 19, 2016 at 8:29 PM Hal Finkel <hfinkel at anl.gov> wrote: > ----- Original Message ----- > > From: "Hal J. via llvm-dev Finkel" <llvm-dev at lists.llvm.org> > > To: "Sanjay Patel" <spatel at rotateright.com> > > Cc: "llvm-dev" <llvm-dev at lists.llvm.org> > > Sent: Saturday, November 19, 2016 10:58:27 AM > > Subject: Re: [llvm-dev] FMA canonicalization in IR > > > > > > Sent from my Verizon Wireless 4G LTE DROID > > On Nov...
2016 Jan 23
3
how to force llvm generate gather intrinsic
Thanks for your response, Sanjay. I know there are intrinsics available in C/C++. But the problem is that I want to instrument my code at the IR level and generate those instructions. I don't want to touch the source code. Best, Zhi On Fri, Jan 22, 2016 at 4:54 PM, Sanjay Patel <spatel at rotateright.com> wrote: > I was just looking at the related masked load/store operations, and I > think there are at least 2 bugs: > > 1. X86TTIImpl::isLegalMaskedLoad/Store() should be legal for FP types with > AVX1 (not just AVX2). > 2. X86TTIImpl::isLegalMaskedGather/Sca...
2017 Jan 24
3
[InstCombine] rL292492 affected LoopVectorizer and caused 17.30%/11.37% perf regressions on Cortex-A53/Cortex-A15 LNT machines
> On Jan 24, 2017, at 7:18 AM, Sanjay Patel <spatel at rotateright.com> wrote: > > > > On Mon, Jan 23, 2017 at 10:53 PM, Mehdi Amini <mehdi.amini at apple.com <mailto:mehdi.amini at apple.com>> wrote: > >> On Jan 23, 2017, at 3:48 PM, Sanjay Patel via llvm-dev <llvm-dev at lists.llvm.org <mailto:llvm-dev...
2016 Feb 25
2
how to force llvm generate gather intrinsic
Yes, masked load/store/gather/scatter are completed. - Elena From: zhi chen [mailto:zchenhn at gmail.com] Sent: Thursday, February 25, 2016 01:20 To: Demikhovsky, Elena <elena.demikhovsky at intel.com> Cc: Sanjay Patel <spatel at rotateright.com>; Nema, Ashutosh <Ashutosh.Nema at amd.com>; llvm-dev <llvm-dev at lists.llvm.org> Subject: Re: [llvm-dev] how to force llvm generate gather intrinsic Hi Elena, Are the masked_load and gather working now? Best, Zhi On Sat, Jan 23, 2016 at 12:06 PM, Demikhovsky,...
2015 Oct 01
2
Fwd: buildbot failure in LLVM on llvm-mips-linux
...ov<mailto:hfinkel at anl.gov>>, Kostya Serebryany <kcc at google.com<mailto:kcc at google.com>>, Michael Zolotukhin <mzolotukhin at apple.com<mailto:mzolotukhin at apple.com>>, Reid Kleckner <rnk at google.com<mailto:rnk at google.com>>, Sanjay Patel <spatel at rotateright.com<mailto:spatel at rotateright.com>> Cc: gkistanova at gmail.com<mailto:gkistanova at gmail.com> The Buildbot has detected a new failure on builder llvm-mips-linux while building llvm. Full details are available at: http://lab.llvm.org:8011/builders/llvm-mips-linu...
2017 Jul 13
2
failing to optimize boolean ops on cmps
This can't be an instsimplify though? The values we want in these cases do not exist already: %res = or i8 %b, %a %res = or i1 %cmp, %c On Thu, Jul 13, 2017 at 5:10 PM, Daniel Berlin <dberlin at dberlin.org> wrote: > > > On Thu, Jul 13, 2017 at 2:12 PM, Sanjay Patel <spatel at rotateright.com> > wrote: > >> We have several optimizations in InstCombine for bitwise logic ops >> (and/or/xor) that fail to handle compare patterns with the equivalent >> bitwise logic. Example: >> >> define i8 @or_and_not(i8 %a, i8 %b) { >> %not...
2018 Oct 01
6
[FPEnv] FNEG instruction
...nd, you don't want to over-constrain allowable optimizations. Fneg folds shouldn't be disabled just because we changed the FP exception state? On Mon, Oct 1, 2018 at 12:20 PM Cameron McInally <cameron.mcinally at nyu.edu> wrote: > On Thu, Sep 27, 2018 at 10:14 AM Sanjay Patel <spatel at rotateright.com> > wrote: > >> Regarding non-IEEE targets: yes, we definitely support those, so we do >> have to be careful about not breaking them. I know because I have broken >> them. :) >> See the discussion and related links here: >> https://reviews.ll...
2018 Jul 03
2
Question about canonicalizing cmp+select
I linked the wrong patch review. Here's the patch that was actually committed: https://reviews.llvm.org/D48508 https://reviews.llvm.org/rL335433 On Tue, Jul 3, 2018 at 4:39 PM, Sanjay Patel <spatel at rotateright.com> wrote: > [adding back llvm-dev and cc'ing Craig] > > I think you are asking if we are missing a fold (or your target is missing > enabling another hook) to transform the sext+add into shift+or? I think the > answer is 'yes'. We probably should add...
2016 Nov 19
2
FMA canonicalization in IR
Sent from my Verizon Wireless 4G LTE DROID On Nov 19, 2016 10:26 AM, Sanjay Patel <spatel at rotateright.com<mailto:spatel at rotateright.com>> wrote: > > If I have my FMA intrinsics story straight now (thanks for the explanation, Hal!), I think it raises another question about IR canonicalization (and may affect the proposed revision to IR FMF): No, I think that we spec...
2020 Sep 14
2
Invalid transformation in LibCallSimplifier::replacePowWithSqrt?
...- I misread your example and the problem. I see now where LibCallSimplifier creates the select...but we are immediately erasing that select with the code from the godbolt example. Does the real motivating case have no uses of the pow() result value? On Mon, Sep 14, 2020 at 1:03 PM Sanjay Patel <spatel at rotateright.com> wrote: > Yes, I mean just bail out on the transform in > LibCallSimplifier::replacePowWithSqrt() -> getSqrtCall(). If we can't prove > the call behaves the same with errno, then give up. > I'm not sure where the select / branching happens, but I don...
2016 Sep 01
2
enabling interleaved access loop vectorization
..., August 18, 2016 03:57 To: Zaks, Ayal <ayal.zaks at intel.com> Cc: Demikhovsky, Elena <elena.demikhovsky at intel.com>; Renato Golin <renato.golin at linaro.org>; Matthew Simpson <mssimpso at codeaurora.org>; Nema, Ashutosh <Ashutosh.Nema at amd.com>; Sanjay Patel <spatel at rotateright.com>; llvm-dev <llvm-dev at lists.llvm.org> Subject: Re: [llvm-dev] enabling interleaved access loop vectorization So, at least for this example, it looks like we actually want to vectorize with -enable-interleaved-mem-accesses, we just need the backend to generate good cod...
2016 Jan 12
2
greendragon build noisy due to mmap_stress.cc
...# 9557 - Failure! To: Roman Gareev <gareevroman at gmail.com <mailto:gareevroman at gmail.com>>, Simon Atanasyan <simon at atanasyan.com <mailto:simon at atanasyan.com>>, Teresa Johnson <tejohnson at google.com <mailto:tejohnson at google.com>>, Sanjay Patel <spatel at rotateright.com <mailto:spatel at rotateright.com>> Cc: Mikhail Zolotukhin <mzolotukhin at apple.com <mailto:mzolotukhin at apple.com>> __ [FAILURE] clang-stage1-cmake-RA_check [#9557] Build URL: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA_check/9557/ Pr...