2020 Sep 01
Vector evolution?
...z, I have AVX2. So should the compiler understand the loop and upgrade the vector width?

On the other hand, if I do the following loop:

    void fct7(float *x) {
    #pragma clang loop vectorize(enable)
      for (int i = 0; i < 4 * 256; ++i)
        x[i] = 7 * x[i];
    }

It compiles it to:

    00000000000001e0 <_Z4fct7Pf>:
     1e0: 31 c0                  xor    %eax,%eax
     1e2: c4 e2 7d 18 05 00 00   vbroadcastss 0x0(%rip),%ymm0  # 1eb <_Z4fct7Pf+0xb>
     1e9: 00 00
     1eb: 0f 1f 44 00 00         nopl   0x0(%rax,%rax,1)
     1f0: c5 fc 59 0c 87         vmulps (%rdi,%rax,4),%ymm0,%ymm1
     1f5: c5 fc 59 54 87 20      vmulps...
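For anyone who wants to reproduce this, here is a minimal self-contained sketch built around the loop quoted above. The fct7 body is taken from the post; the count_mismatches helper is hypothetical and only checks that the vectorized loop still produces the scalar result. The build flags are not given in the excerpt, so something like `clang++ -O2 -mavx2 -c` followed by `objdump -d` is an assumption, not the poster's actual command line.

```cpp
#include <cstddef>

// Loop from the post. The pragma is Clang-specific; other compilers
// will typically ignore it (possibly with a warning).
void fct7(float *x) {
#pragma clang loop vectorize(enable)
  for (int i = 0; i < 4 * 256; ++i)
    x[i] = 7 * x[i];
}

// Hypothetical helper (not in the post): fills a buffer with 0..1023,
// runs fct7 on it, and counts elements that differ from 7 * i.
// All values involved are small integers, so they are exactly
// representable in float and an exact comparison is safe here.
int count_mismatches() {
  constexpr std::size_t n = 4 * 256;
  alignas(32) static float data[n];
  for (std::size_t i = 0; i < n; ++i)
    data[i] = static_cast<float>(i);

  fct7(data);

  int bad = 0;
  for (std::size_t i = 0; i < n; ++i)
    if (data[i] != 7.0f * static_cast<float>(i))
      ++bad;
  return bad;
}
```

Calling count_mismatches() from a small main and getting 0 back confirms the transformation is value-preserving; whether the disassembly uses %ymm or %xmm registers then depends on the target flags passed to the compiler.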