Displaying 3 results from an estimated 3 matches for "sequecnce".
Did you mean:
asequence
2017 Oct 23
3
Jacobi 5 Point Stencil Code not Vectorizing
...n't work in this case).<br /><br />Minor note: You're using 'int' for data, than multiply by 0.25 (divide by 4) and than write it back to 'int'. This will cost you 2 conversion to/from double while you may just place (...) / 4 which should be optimized to simple sequecnce with shifts (not to single shift due to signedness, but still better than conversions with changes of element size 4->8->4 and data size INT->FP->INT).<br /><br />And by the way why do you divide by 4, not by 5 as number of points suggest?<br /><br />Serge Preis&...
2017 Oct 24
3
Jacobi 5 Point Stencil Code not Vectorizing
...ase).
>>
>> Minor note: You're using 'int' for data, than multiply by 0.25 (divide by
>> 4) and than write it back to 'int'. This will cost you 2 conversion to/from
>> double while you may just place (...) / 4 which should be optimized to
>> simple sequecnce with shifts (not to single shift due to signedness, but
>> still better than conversions with changes of element size 4->8->4 and data
>> size INT->FP->INT).
>>
>> And by the way why do you divide by 4, not by 5 as number of points
>> suggest?
>>
>...
2017 Jul 01
3
Jacobi 5 Point Stencil Code not Vectorizing
Does it happen due to loop carried dependence? if yes what is the solution
to vectorize such codes?
please reply. i m waiting.
On Jul 1, 2017 12:30 PM, "hameeza ahmed" <hahmed2305 at gmail.com> wrote:
> I even tried polly but still my llvm IR does not contain vector
> instructions. i used the following command;
>
> clang -S -emit-llvm stencil.c -march=knl -O3