search for: 027540

Displaying 2 results from an estimated 2 matches for "027540".

2013 Mar 03
[LLVMdev] parallel loop metadata simplification
On 03/02/2013 08:44 PM, Tobias Grosser wrote:
> If the use of ivdep is correct, it seems necessary to _not_ annotate the loads
> and stores from and to 't'. Only after 't' is moved into a register, the loop is
> actually parallel on the IR level.

I didn't realize this is a problem in general because in pocl we explicitly "privatize" the OpenCL C kernel
2013 Mar 02
[LLVMdev] parallel loop metadata simplification
On 03/01/2013 10:05 PM, Redmond, Paul wrote:
[...]
> I have discovered that you can provide a custom inserter to IRBuilder (who knew!). This has basically solved all my problems and allowed me to generate the proper metadata with minimal changes to clang codegen. Currently it adds the metadata to all loads and stores, but I don't think this is a problem, and it can be refined later if necessary.