Displaying 1 result from an estimated 1 matches for "_z4fct7pf".
Did you mean:
_z4fct3pf
2020 Sep 01
2
Vector evolution?
...z, I have AVX2. So
should the compiler understand the loop and upgrade the vector width?
On the other hand if I do the following loop:
void fct7(float *x)
{
#pragma clang loop vectorize(enable)
for (int i = 0; i < 4 * 256; ++i)
x[i] = 7 * x[i];
}
It compiles it to:
00000000000001e0 <_Z4fct7Pf>:
1e0: 31 c0 xor %eax,%eax
1e2: c4 e2 7d 18 05 00 00 vbroadcastss 0x0(%rip),%ymm0 # 1eb
<_Z4fct7Pf+0xb>
1e9: 00 00
1eb: 0f 1f 44 00 00 nopl 0x0(%rax,%rax,1)
1f0: c5 fc 59 0c 87 vmulps (%rdi,%rax,4),%ymm0,%ymm1
1f5: c5 fc 59 54 87 20 vmulps...