Displaying 20 results from an estimated 238 matches for "autovectorized".
Did you mean:
autovectorizer
2011 Jun 24
2
[LLVMdev] LLVM autovectorization support
I would like to know the status of the autovectorization support in LLVM.
does LLVM have a loop dependence analysis, does LLVM have a infrastructure
for autovectorization ? etc.
Kind Regards
Xin Tong
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20110624/3dc35318/attachment.html>
2011 Jun 24
0
[LLVMdev] LLVM autovectorization support
On 24 June 2011 21:13, Xin Tong Utoronto <x.tong at utoronto.ca> wrote:
> I would like to know the status of the autovectorization support in LLVM.
> does LLVM have a loop dependence analysis, does LLVM have a infrastructure
> for autovectorization ? etc.
Not yet, but it's getting there...
http://polly.grosser.es/
cheers,
--renato
2015 Jul 06
4
[LLVMdev] SPMD Autovectorizer
Hi,
Are there any plans to integrate an autovectorizer for SPMD programs into
LLVM? For example, there were previous discussions about integrating the
whole function vectorizer (WFV) from Ralf Karrenberg into LLVM.
Thanks,
Zack
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
2015 Jul 07
2
[LLVMdev] SPMD Autovectorizer
...its barrier synchronization semantics
etc.
That is, the output of this pass could be also converted to
OpenMP SIMD constructs, if wanted. In pocl's case the output
is simply a new kernel function (we call "work group function")
that executes all WIs using parallel loops (which can be
autovectorized more easily, or even multithreaded if seeing fit,
or both).
--
--Pekka
2013 Jan 24
3
[LLVMdev] LoopVectorizer in OpenCL C work group autovectorization
Hi,
I started to play with the LoopVectorizer of LLVM trunk
on the work-item loops produced by pocl's OpenCL C
kernel compiler, in hopes of implementing multi-work-item
work group autovectorization in a modular manner.
The vectorizer seems to refuse to vectorize the loop if it sees
multiple writes to the same memory object within the
same iteration. In case of parallel loops such as
the
2013 Feb 01
0
[LLVMdev] LoopVectorizer in OpenCL C work group autovectorization
----- Original Message -----
> From: "Ralf Karrenberg" <Chareos at gmx.de>
> To: "Hal Finkel" <hfinkel at anl.gov>
> Cc: "Pekka Jääskeläinen" <pekka.jaaskelainen at tut.fi>, "LLVM Developers Mailing List" <llvmdev at cs.uiuc.edu>, "Nadav
> Rotem" <nrotem at apple.com>
> Sent: Friday, February 1, 2013
2013 Feb 01
1
[LLVMdev] LoopVectorizer in OpenCL C work group autovectorization
Hi Hal,
On 1/31/13 6:47 PM, Hal Finkel wrote:
>>> In any case, since our own OpenCL driver is more of a
>>> proof-of-concept
>>> implementation and not very robust, I'd be willing to give it a try
>>> to
>>> integrate the current libWFV into pocl. This should boost
>>> performance
>>> quite a bit for many kernels without too much
2013 Jan 25
0
[LLVMdev] LoopVectorizer in OpenCL C work group autovectorization
Hi Pekka,
> Hi,
>
> I started to play with the LoopVectorizer of LLVM trunk
> on the work-item loops produced by pocl's OpenCL C
> kernel compiler, in hopes of implementing multi-work-item
> work group autovectorization in a modular manner.
>
Thanks for checking the Loop Vectorizer, I am interested in hearing your feedback. The Loop Vectorizer does not fit here.
2015 Jul 07
2
[LLVMdev] SPMD Autovectorizer
On 07/07/2015 09:30 PM, C Bergström wrote:
> If you're going to "autopar" (turn a loop into a threads which run on
> many cores or something) then please don't add a dependency on OMP.
I wouldn't, but simply utilize the parallel loop metadata that
was originally designed for this purpose. What is done with
that MD is up to other passes.
--
--Pekka
2013 Jan 25
0
[LLVMdev] LoopVectorizer in OpenCL C work group autovectorization
----- Original Message -----
> From: "Pekka Jääskeläinen" <pekka.jaaskelainen at tut.fi>
> To: "Hal Finkel" <hfinkel at anl.gov>
> Cc: "LLVM Developers Mailing List" <llvmdev at cs.uiuc.edu>, "Nadav Rotem" <nrotem at apple.com>
> Sent: Friday, January 25, 2013 8:14:57 AM
> Subject: Re: [LLVMdev] LoopVectorizer in OpenCL
2009 Apr 01
2
[LLVMdev] GSoC 2009: Auto-vectorization
Nick Lewycky wrote:
> Andreas Bolka wrote:
>> Hi all,
>>
>> I'd like to have a first stab at a loop-based auto-vectorization pass as
>> part of 2009's Google Summer of Code program. As far as I can tell from
>> searching the mailing list archives, no work on such an auto-vectorizer
>> seems to be currently in progress.
>
> Hi Andreas,
>
2013 Jan 31
0
[LLVMdev] LoopVectorizer in OpenCL C work group autovectorization
----- Original Message -----
> From: "Pekka Jääskeläinen" <pekka.jaaskelainen at tut.fi>
> To: "Ralf Karrenberg" <Chareos at gmx.de>
> Cc: "LLVM Developers Mailing List" <llvmdev at cs.uiuc.edu>
> Sent: Thursday, January 31, 2013 11:15:43 AM
> Subject: Re: [LLVMdev] LoopVectorizer in OpenCL C work group autovectorization
>
> Hi
2011 Nov 28
1
[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass
This is very interesting work. Thanks. Quick questions / comments.
1. What's the algorithm complexity?
2. Please rename -vectorize to -bb-vectorize since this is a specific kind of vectorization.
3. Have you tried non-x86 targets? Does it work? If not, what would it take to make it work?
I am concerned that LLVM currently does not have a cost model for vectorization. That's necessary for
2011 Oct 29
0
[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass
On Sat, Oct 29, 2011 at 12:30:12PM -0500, Hal Finkel wrote:
> Also, when using clang, I had to pass -Dinline= on the command line:
> when using -emit-llvm, clang appears not to emit code for functions
> declared inline. This is a bug, but I've not yet tracked it down.
http://clang.llvm.org/compatibility.html#inline
Thanks,
--
Peter
2011 Nov 17
2
[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass
Hello Hal,
> MultiSource/Applications/ClamAV - fails to compile shared_sha256.c with
> an error: error in backend: Cannot select: 0x4fbcb40: v2i64 =
> X86ISD::MOVLPD 0x4149e00, 0x418d930 [ID=596]
Please report this as a PR regardless of the pass. Bugs in the backend
should be fixed.
--
With best regards, Anton Korobeynikov
Faculty of Mathematics and Mechanics, Saint Petersburg State
2011 Dec 20
2
[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass
Hi,
I see that there are two functions in your code that are O(n^2) in
number of instructions of the program: getCandidatePairs and
buildDepMap. I think that you could make these two functions faster
if you work on some form of factored def-use chains for memory, like
the VUSE/VDEFs of GCC.
I was trying to find a similar representation in LLVM: isn't there already
a virtual SSA
2011 Dec 20
0
[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass
On Tue, 2011-12-20 at 13:57 -0600, Sebastian Pop wrote:
> Hi,
>
> I see that there are two functions in your code that are O(n^2) in
> number of instructions of the program: getCandidatePairs and
> buildDepMap. I think that you could make these two functions faster
> if you work on some form of factored def-use chains for memory, like
> the VUSE/VDEFs of GCC.
Thanks for the
2012 Jan 26
0
[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass
On Thu, Jan 26, 2012 at 2:49 PM, Hal Finkel <hfinkel at anl.gov> wrote:
> Thanks! Did you compile with any non-default flags other than -mllvm
> -vectorize?
I used -O3 and -vectorize, no other non-default flags.
Sebastian
--
Qualcomm Innovation Center, Inc is a member of Code Aurora Forum
2015 Jul 07
2
[LLVMdev] SPMD Autovectorizer
Hi Renato,
On 07/07/2015 10:57 PM, Renato Golin wrote:
> Now, IIRC, OpenCL had a lot of trouble from getting odd-sized vector
> types in IR that the middle end would not understand, especially the
> vectorizers. The solution, at least as of 2 years ago, was to
> serialise everything and let the CL back-end to vectorize it.
Perhaps you are referring to the problem of autovectorizing
2012 Feb 03
0
[LLVMdev] Vectorization: Next Steps
Hi Hal,
> As some of you may know, I committed my basic-block autovectorization
> pass a few days ago. I encourage anyone interested to try it out (pass
> -vectorize to opt or -mllvm -vectorize to clang) and provide feedback.
> Especially in combination with -unroll-allow-partial, I have observed
> some significant benchmark speedups, but, I have also observed some
> significant