2014 Feb 18
[LLVMdev] Optimizing math code
...ang doesn’t vectorize it. Is there a simple way to get it to vectorize portably? I’d like to port to ARM NEON as well as SSE and AVX, and I’d like to be compatible with GCC, so I’d rather not use ext_vector_length if I can avoid it. If I can’t avoid it, I can cobble something together with a big vector_intrinsics.h file or something. You might think that vectorizing a 4-long loop wouldn’t matter, but of course field addition is inlined and called all the time. GCC 4.7.2 vectorizes the above code, but bails to scalar code if a==b or a==c, which isn’t necessary. It doesn’t check this if you declare the arr...
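For illustration only, here is a minimal sketch of one way the portable route could look. It uses the GCC-style `vector_size` attribute, which clang also accepts, instead of a hand-written per-target intrinsics header. The name `field_add`, the 4 x 64-bit limb layout, and the typedef `v4u64` are assumptions; the excerpt does not show the original code. Both inputs are loaded in full before the result is stored, so calling it with `out == a` or `out == b` stays well defined and needs no alias check.

```c
#include <stdint.h>
#include <string.h>

#define LIMBS 4  /* assumed: the "4-long loop" mentioned in the message */

/* Four 64-bit lanes (32 bytes). The compiler lowers this to whatever the
 * target offers (SSE2, AVX2, NEON) or to scalar code if nothing fits. */
typedef uint64_t v4u64 __attribute__((vector_size(32)));

/* Hypothetical limb-wise field addition with no carry propagation. */
static inline void field_add(uint64_t out[LIMBS],
                             const uint64_t a[LIMBS],
                             const uint64_t b[LIMBS])
{
    v4u64 va, vb, vr;

    /* memcpy avoids alignment and strict-aliasing problems; compilers
     * normally turn these fixed-size copies into plain vector loads. */
    memcpy(&va, a, sizeof va);
    memcpy(&vb, b, sizeof vb);

    vr = va + vb;                /* element-wise add on all four lanes */

    memcpy(out, &vr, sizeof vr); /* store only after both loads are done */
}
```

An in-place call such as `field_add(x, x, y)` works with this shape, which is the case where the message reports GCC falling back to scalar code.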