2017 Jun 21
AVX 512 Assembly Code Generation issues
When I generate code for a loop with 72 iterations, the compiler uses
AVX-512 zmm operations 4 times (16 x 4 = 64 iterations) and handles the
remaining 8 iterations with ordinary scalar mov operations through the
EAX register. Wouldn't it be better to use ymm for the remaining 8
iterations, as it does when the iteration count is between 8 and 15? The
same applies to xmm for smaller remainders, and so on.
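
To make the arithmetic concrete, here is a minimal sketch in C of the kind of loop in question and of the remainder split being suggested. It assumes 32-bit elements (16 per zmm, 8 per ymm, 4 per xmm), which matches the 16 x 4 = 64 figure above. The names `add_arrays`, `tail_split` and `split_trip_count` are hypothetical and only illustrate the idea; they do not describe what any particular compiler actually emits.

```c
#include <stddef.h>

/* The kind of fixed-trip-count loop being discussed (assumed float data). */
void add_arrays(float *dst, const float *a, const float *b)
{
    for (int i = 0; i < 72; ++i)
        dst[i] = a[i] + b[i];
}

/* Hypothetical breakdown of a trip count across vector widths,
   assuming 32-bit elements. */
typedef struct {
    size_t zmm_iters;    /* 512-bit steps, 16 elements each */
    size_t ymm_iters;    /* 256-bit steps, 8 elements each  */
    size_t xmm_iters;    /* 128-bit steps, 4 elements each  */
    size_t scalar_iters; /* leftover single elements        */
} tail_split;

/* Greedily use the widest register first, then narrower ones
   for whatever remains. */
tail_split split_trip_count(size_t n)
{
    tail_split s;
    s.zmm_iters = n / 16; n %= 16;
    s.ymm_iters = n / 8;  n %= 8;
    s.xmm_iters = n / 4;  n %= 4;
    s.scalar_iters = n;
    return s;
}
```

Under this split, `split_trip_count(72)` gives 4 zmm steps plus 1 ymm step and no scalar iterations, which is the code shape being asked for, instead of 4 zmm steps followed by 8 scalar iterations.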
Please correct me if I am wrong.
Thanks