Displaying 20 results from an estimated 1300 matches similar to: "Data structure improvement for the SLP vectorizer"
2017 Mar 15
2
Data structure improvement for the SLP vectorizer
Maybe it would illustrative to give an IR example of the case I'm
interested in. Consider
define void @"julia_transform_bvn_derivs_hessian!"(double* %data,
double* %data2, double *%data3, double *%out) {
%element11 = getelementptr inbounds double, double* %data, i32 1
%load10 = load double, double* %data
%load11 = load double, double* %element11
%element21 =
2014 Nov 11
3
[LLVMdev] supporting SAD in loop vectorizer
----- Original Message -----
> From: "Dibyendu Das" <Dibyendu.Das at amd.com>
> To: "Hal Finkel" <hfinkel at anl.gov>, "Renato Golin" <renato.golin at linaro.org>
> Cc: llvmdev at cs.uiuc.edu
> Sent: Tuesday, November 4, 2014 12:15:12 PM
> Subject: RE: [LLVMdev] supporting SAD in loop vectorizer
>
> Here's the simple SAD
2014 Nov 04
3
[LLVMdev] supporting SAD in loop vectorizer
----- Original Message -----
> From: "Renato Golin" <renato.golin at linaro.org>
> To: "Dibyendu Das" <Dibyendu.Das at amd.com>
> Cc: llvmdev at cs.uiuc.edu
> Sent: Tuesday, November 4, 2014 5:23:30 AM
> Subject: Re: [LLVMdev] supporting SAD in loop vectorizer
>
> On 4 November 2014 11:06, Das, Dibyendu <Dibyendu.Das at amd.com> wrote:
2014 Nov 11
4
[LLVMdev] supporting SAD in loop vectorizer
----- Original Message -----
> From: "James Molloy" <james at jamesmolloy.co.uk>
> To: "Hal Finkel" <hfinkel at anl.gov>
> Cc: "Dibyendu Das" <Dibyendu.Das at amd.com>, llvmdev at cs.uiuc.edu
> Sent: Tuesday, November 11, 2014 8:21:37 AM
> Subject: Re: [LLVMdev] supporting SAD in loop vectorizer
>
>
> If you'd like to
2018 Jan 24
2
Panic: data stack: Out of memory when allocating bytes
On Wed, Jan 24, 2018 at 18:55:47 +0100, Thomas Robers wrote:
> Am 23.01.2018 um 20:07 schrieb Josef 'Jeff' Sipek:
> > On Tue, Jan 23, 2018 at 14:03:27 -0500, Josef 'Jeff' Sipek wrote:
> > > On Tue, Jan 23, 2018 at 18:21:38 +0100, Thomas Robers wrote:
...
> > > 1. Do you have any idea what the imap process was doing at the time of the
> > >
2015 Nov 09
2
[RFC][SLP] Let's turn -slp-vectorize-hor on by default
I've done compile-time experiments for AArch64 over SPEC{2000,2006}
and of course the test-suite. I measure no significant compile-time
impact of enabling this feature by default.
I also ran the test-suite on an X86-64 machine. I can't imagine any
other targets being uniquely effected in terms of compile-time by
turning this on after testing both AArch64 and X86-64. I also timed
running
2015 Nov 09
3
[RFC][SLP] Let's turn -slp-vectorize-hor on by default
I have not. I could feasibly do this, but I'm not set up to perform
good experiments on X86-64 hardware. Furthermore, if I do it for
X86-64, it only seems fair I should do it for the other backends as
well, which is much less feasible for me. I'm reaching out the
community to see if there's any objection based on their own
measurements of this feature about defaulting it to on.
Please
2009 Apr 22
1
[PATCH] allow to disable SLP with runtime option
Hi everyone,
I'd like to propose a patch for review. It enhances rsync when
patched and compiled with slp support.
It adds a new global boolean option, 'disable slp', which can be used to disable
SLP advertisements at runtime. The idea behind this patch is to allow
distributors to build rsync with SLP support compiled in, but to allow
the users to turn it off without recompiling
2013 Oct 25
1
[PATCH] rsync-patches/slp.diff: use lp_num_modules instead of the removed lp_numserv
Hello,
rsync-patches/slp.diff is still using lp_numserv which was removed by
commit b583594ac7d2f8a38aca85c1bfa4b1487122377a
Signed-off-by: Vitezslav Cizek <vcizek at suse.cz>
---
slp.diff | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/slp.diff b/slp.diff
index a9703f1..953b400 100644
--- a/slp.diff
+++ b/slp.diff
@@ -479,7 +479,7 @@ new file mode 100644
+
2020 Sep 28
2
PSLP: Padded SLP Automatic Vectorization
Hey, I noticed this talk from the EuroLLVM 2015
(https://llvm.org/devmtg/2015-04/slides/pslp_slides_EUROLLVM2015.pdf) on
the PSLP vectorization algorithm (CGO 2015 paper:
http://vporpo.me/papers/pslp_cgo2015.pdf).
Is anyone working on implementing it?
If so, are there Phab reviews I can subscribe to?
Best,
Matt
2015 Nov 10
4
[RFC][SLP] Let's turn -slp-vectorize-hor on by default
I will try to get some spec cpu 2006 rate runs done under -O3 -flto with and without -slp-vectorize-hor and let you know.
-Thx
-----Original Message-----
From: nrotem at apple.com [mailto:nrotem at apple.com]
Sent: Tuesday, November 10, 2015 3:33 AM
To: Charlie Turner
Cc: Das, Dibyendu; llvm-dev at lists.llvm.org
Subject: Re: [llvm-dev] [RFC][SLP] Let's turn -slp-vectorize-hor on by default
2015 Nov 11
2
[RFC][SLP] Let's turn -slp-vectorize-hor on by default
We have started this. Since there are some holidays expect a small delay. Will let you know by Friday.
Thx
Sent from my Windows Phone
________________________________
From: Charlie Turner<mailto:charlesturner7c5 at gmail.com>
Sent: 11/11/2015 6:34 PM
To: Das, Dibyendu<mailto:Dibyendu.Das at amd.com>
Cc: nrotem at apple.com<mailto:nrotem at apple.com>; llvm-dev at
2013 Nov 07
1
[LLVMdev] SLP vectorizer turned on in commit r190916 which says nothing about it - how to turn it off?
Revision 190916
Commit message:
"Lift alignment restrictions for load/store folding on
VINSERTF128/VEXTRACTF128. Fixes PR17268."
Actual contents of the commit includes
Index: tools/opt/opt.cpp
===================================================================
--- tools/opt/opt.cpp (revision 190915)
+++ tools/opt/opt.cpp (revision 190916)
@@ -462,6 +462,7 @@
2013 Jul 29
0
[LLVMdev] Enabling the SLP-vectorizer by default for -O3
Cool. Thanks!
-Jim
On Jul 29, 2013, at 1:07 PM, Renato Golin <renato.golin at linaro.org> wrote:
> On 29 July 2013 20:39, Jim Grosbach <grosbach at apple.com> wrote:
> These results are really excellent. They’re on Intel, I assume, right? What do the ARM numbers look like? Before enabling by default, we should make sure that the results are comparable there as well.
>
>
2015 Jul 01
3
[LLVMdev] SLP vectorizer on AVX feature
On 1 July 2015 at 21:22, Frank Winter <fwinter at jlab.org> wrote:
> there were two follow-up emails.
I only got one... weird...
> The issue is solved. The SLP vectorizer has
> a magic number built into the code which determines the max. vector length
> to search for. That was set to 128 bits. Increasing it to 256 bits solved
> the issue.
That looks like a simple fix. Is
2014 Aug 07
3
[LLVMdev] How to broaden the SLP vectorizer's search
The BB vectorizer has an option 'bb-vectorizer-search-limit'. Is there a
similar option for the SLP vectorizer? Maybe an analysis pass' scope
that can be widen?
I have large basic blocks with instructions that should be merged into
packed versions. However, the blocks are optimized independently from
each other. Now, if the instructions to be merged aren't too far apart
the
2013 Jul 29
2
[LLVMdev] Enabling the SLP-vectorizer by default for -O3
On 29 July 2013 20:39, Jim Grosbach <grosbach at apple.com> wrote:
> These results are really excellent. They’re on Intel, I assume, right?
> What do the ARM numbers look like? Before enabling by default, we should
> make sure that the results are comparable there as well.
>
Hi Jim,
I'll have a look.
--renato
-------------- next part --------------
An HTML attachment was
2013 Jul 14
0
[LLVMdev] Enabling the SLP vectorizer by default for -O3
Cool!
What changes have you seen to generated code size?
I'll take it for a spin on our benchmarks.
On Sat, Jul 13, 2013 at 11:30 PM, Nadav Rotem <nrotem at apple.com> wrote:
> Hi,
>
> LLVM’s SLP-vectorizer is a new pass that combines similar independent
> instructions in a straight-line code. It is currently not enabled by
> default, and people who want to
2018 Feb 08
2
[RFC] Make LoopVectorize Aware of SLP Operations
Hi,
On 08/02/2018 04:22, Caballero, Diego wrote:
> Hi Florian!
>
> This proposal sounds pretty exciting! Integrating SLP-aware loop vectorization (or the other way around) and SLP into the VPlan framework is definitely aligned with the long term vision and we would prefer this approach to the LoopReroll and InstCombine alternatives that you mentioned. We prefer a generic implementation
2017 Mar 24
2
SLP regression on SystemZ
Hi,
I have come across a major regression resulting after SLP vectorization
(+18% on SystemZ, just for enabling SLP). This all relates to one
particular very hot loop.
Scalar code:
%conv252 = zext i16 %110 to i64
%conv254 = zext i16 %111 to i64
%sub255 = sub nsw i64 %conv252, %conv254
... repeated
SLP output:
%101 = zext <16 x i16> %100 to <16 x i64>
%104 = zext