thr3ads.net - similar to: "LAA behavior on Incorrect #pragma omp simd."

Displaying 20 results from an estimated 400 matches similar to: "LAA behavior on Incorrect #pragma omp simd."

LLVM's loop unroller & llvm.loop.parallel_accesses

2020 May 14

LLVM's loop unroller & llvm.loop.parallel_accesses

This is interesting! So are you saying that loop.parallel_accesses strictly loop parallel, and says nothing about aliasing? I see, I guess we may have been "abusing" the hint and re-purposed it. But isn't llvm's vectorizer using loop.parallel_accesses to vectorize loops including vectorize memory accesses that if you ignore loop-carried dependencies, usually means effectively

LLVM's loop unroller & llvm.loop.parallel_accesses

2020 May 18

LLVM's loop unroller & llvm.loop.parallel_accesses

Would you guys be open to supporting a new hint with the right semantics, like e.g. llvm.loop.noalias_accesses?! I would need to find support in clang however and the main point of support would be the loop unroller behaving as stated in the OP. On Thu, May 14, 2020 at 3:04 PM Michael Kruse <llvmdev at meinersbur.de> wrote: > Trivial example: > > #pragma clang loop

[RFC] Compiled regression tests.

2020 Jul 01

[RFC] Compiled regression tests.

On 7/1/20 12:40 AM, Michael Kruse via llvm-dev wrote: > To illustrate some difficulties with FileCheck, lets make a > non-semantic change in LLVM: > > --- a/llvm/lib/Analysis/VectorUtils.cpp > +++ b/llvm/lib/Analysis/VectorUtils.cpp > @@ -642,8 +642,8 @@ MDNode *llvm::uniteAccessGroups(MDNode > *AccGroups1, MDNode *AccGroups2) { > return AccGroups1;

LLVM's loop unroller & llvm.loop.parallel_accesses

2020 May 19

LLVM's loop unroller & llvm.loop.parallel_accesses

Skipping the clang question for now, this had to be a loop pragma of some kind. One step back: what we really need is a way to express that memory accesses between iterations can be re-ordered. The code that's being compiled _is_ noalias, but we don't _have_ to use noalias semantics, e.g. loop parallel semantics are sufficient. What's missing is a way to express that past the llvm

LLVM's loop unroller & llvm.loop.parallel_accesses

2020 May 14

LLVM's loop unroller & llvm.loop.parallel_accesses

Hi, in our backend, which is unfortunately not upstreamed, we are relying on llvm.loop.parallel_accesses metadata for certain passes like software pipelining so we can re-order instructions. Ideally, we would want the loop unroller to support the notion of the loop's parallelism in its pre-unrolled version. This probably should happen by propagating !alias.scope and !alias metadata. Is there

RFC: LoopIDs are not identifiers (and better loop-parallel metadata)

2018 Dec 05

RFC: LoopIDs are not identifiers (and better loop-parallel metadata)

Dear LLVM community, LLVM IR has a concept of 'LoopID' [1] which is a misnomer: (a) LoopIDs are not unique: Any pass that duplicates IR will do it including its metadata (e.g. LoopVersioning) such that thereafter multiple loops are linked with the same LoopID. There is even a test case (Transforms/LoopUnroll/unroll-pragmas-disabled.ll) for multiple loops with the same LoopID. (b)

Glusterfs could not open spec file

2008 Jun 27

Glusterfs could not open spec file

Dear Team, I have installed and configured gluster in one server and client. one time it was worked fine, again later it is not working. my configuration files. server [root at rhel2 ~]# cat /etc/glusterfs/glusterfs-server.vol volume rhel2 type storage/posix # POSIX FS translator option directory /opt # Export this directory end-volume volume rhel2 type

[RFC] Compiled regression tests.

2020 Jun 24

[RFC] Compiled regression tests.

Am Mi., 24. Juni 2020 um 10:12 Uhr schrieb David Blaikie <dblaikie at gmail.com>: > > As mentioned in the Differential, generating the tests automatically > > will lose information about what actually is intended to be tested, > > Agreed - and I didn't mean to suggest tests should be automatically > generated. I work pretty hard in code reviews to encourage tests to

[RFC] Compiled regression tests.

2020 Jul 01

[RFC] Compiled regression tests.

The test as written is fragile because it requires a certain ordering. If the output order is not important, use CHECK-DAG rather than CHECK. This would be a failure to understand the testing tool. My experience, over a 40-year career, is that good software developers are generally not very good test-writers. These are different skills and good testing is frequently not taught. It’s easy to

RFC: Interface user provided vector functions with the vectorizer.

2019 Jun 17

RFC: Interface user provided vector functions with the vectorizer.

I agree with Simon. This looks good conceptually. I have minor implementation comments but that can wait till the code reviews. Sorry for the delay and thanks for working on this. Get Outlook for Android<https://aka.ms/ghei36> ________________________________ From: Simon Moll <moll at cs.uni-saarland.de> Sent: Monday, June 17, 2019 10:02:58 AM To: Francesco Petrogalli; LLVM

RFC: Interface user provided vector functions with the vectorizer.

2019 Jun 11

RFC: Interface user provided vector functions with the vectorizer.

Dear all, I have re-written the proposal for interfacing user provided vector functions, originally posted in both llvm-dev and cfe-dev mailing list: "[RFC] Expose user provided vector function for auto-vectorization." The proposal looks quite different from the original submission, therefore I took the liberty to start a new thread. The original thread generated some good

RFC: Interface user provided vector functions with the vectorizer.

2019 Jun 21

RFC: Interface user provided vector functions with the vectorizer.

>In all cases, the IR type of the parameters in `foo` is i64, therefore is not possible to distinguish what C type generated the signature of `foo`. Ouch. >I don’t know if this is going to be a problem for other architectures I haven't checked what IA-32/Intel64 should do for type 2, but I fully agree that this needs to be done properly according to the ABI. >Therefore, I would

RFC: Interface user provided vector functions with the vectorizer.

2019 Jun 24

RFC: Interface user provided vector functions with the vectorizer.

I have an RFC for first-class complex types in LLVM IR pending for some internal review. I hope to post it soon. That should help address this problem. Then the vector function signature generation could stay in LLVM, if I'm understanding the issue correctly. -David Francesco Petrogalli via llvm-dev <llvm-dev at lists.llvm.org> writes: > Hi all - I am

RFC: Interface user provided vector functions with the vectorizer.

2019 Jun 24

RFC: Interface user provided vector functions with the vectorizer.

@Xinmin, Saito: If Clang/the frontend generates the version there is no problem, or is there? The frontend knows about the original source type and it's ABI specific lowering already. @Francesco, we should even consider putting the generating capabilities outside of the OpenMP code generation (in the future). That could allow easier reuse by other frontends. Get Outlook for

[cfe-dev] [RFC] Expose user provided vector function for auto-vectorization.

2019 Jun 04

[cfe-dev] [RFC] Expose user provided vector function for auto-vectorization.

Hi Francesco, On 06/03, Francesco Petrogalli wrote: > > On Jun 3, 2019, at 1:43 PM, Andrea Bocci <andrea.bocci at cern.ch> wrote: > > as a candidate future user of the proposed extension, I think I like the simplified proposal better than the original RFC. > > > > The only part of the syntax that I would find not very much user-friendly is having to mangle the

RFC: Interface user provided vector functions with the vectorizer.

2019 Jun 24

RFC: Interface user provided vector functions with the vectorizer.

>Thank you everybody for their input, and for your patience. This is proving harder than expected! :) Thank you for doing the hard part of the work. Hideki -----Original Message----- From: Francesco Petrogalli [mailto:Francesco.Petrogalli at arm.com] Sent: Monday, June 24, 2019 11:26 AM To: Saito, Hideki <hideki.saito at intel.com> Cc: Doerfert, Johannes <jdoerfert at anl.gov>;

[RFC] Enable "#pragma omp declare simd" in the LoopVectorizer

2016 Dec 12

[RFC] Enable "#pragma omp declare simd" in the LoopVectorizer

)On 12 December 2016 at 13:44, Francesco Petrogalli <Francesco.Petrogalli at arm.com> wrote: > I am using “D” for 64-bit NEON and “Q” for 128-bit NEON, which makes NEON > vector symbols look as follows: > > _ZVGQN2v__Z1fd > _ZVGDN2v__Z1ff > _ZVGQN4v__Z1ff Hi Francesco, The ARM AAPCS (A.2.1) says: "For C++ the mangled name for parameters is as though the equivalent

[RFC] Enable "#pragma omp declare simd" in the LoopVectorizer

2016 Dec 12

[RFC] Enable "#pragma omp declare simd" in the LoopVectorizer

On 12/12/2016 14:32, "Renato Golin" <renato.golin at linaro.org> wrote: >Clang is already able to mangle NEON vectors of any length >(CXXNameMangler::mangleNeonVectorType), you should use that, as this >is very likely to be compatible with other compilers as well. Hi Renato, The “u”, “v”, “l” mangling of parameters has already been agreed internally at ARM with the gcc

[RFC] Enable "#pragma omp declare simd" in the LoopVectorizer

2016 Dec 12

[RFC] Enable "#pragma omp declare simd" in the LoopVectorizer

On 12 December 2016 at 16:49, Francesco Petrogalli <Francesco.Petrogalli at arm.com> wrote: > I am not sure I understand here. In my patch, all I am doing is “vector > symbol awareness generation”. There are no globals that are generated in > the final object file, it is just the TargetLibraryInfoImpl that is being > populated with the info needed by the vectorizer. The

[RFC] Enable "#pragma omp declare simd" in the LoopVectorizer

2016 Dec 12

[RFC] Enable "#pragma omp declare simd" in the LoopVectorizer

Xinmin, Allow me to share a couple of comments about what Renato is saying. On 08/12/2016 22:08, "Renato Golin" <renato.golin at linaro.org> wrote: >I'm still unsure how the simplistic mangling we have today will work >around the multiple versions we could have with NEON (and in the >future, SVE) without polluting the mangling quite a lot (have you seen

similar to: LAA behavior on Incorrect #pragma omp simd.