Displaying 1 result from an estimated 1 matches for "_z7loopdacllpjl".
2020 Sep 05
2
Possible AVX512 codegen bug in LLVM 10.0.1?
...epeating pattern of 4 correct results
followed by 4 incorrect results. (There are also some commented-out lines
in the LLVM file, from my own testing of alternative implementations to
confirm that the recurisve-function code is otherwise correct.)
The crux seems to be that the recursive function, _Z7loopdacllPjl, takes a
vector of 8 64-bit integers as one of its arguments. There's no issue with
such an argument in LLVM IR, but the generated assembly seems to be
incorrect. Examining the assembly file, it seems that _Z7loopdacllPjl
loads this vector argument off the stack with a 64-byte reload (notably...