Displaying 2 results from an estimated 2 matches for "nvq86w".
2020 Oct 01
3
A 4x slower initialization loop in LLVM vs GCC and MSVC
...ually interchanging the loops, which gives a ~3x speedup).
Alternatively, if we created vector stores instead of the small memcpy calls, we would probably get a better result overall. Using Clang's Matrix Types extension effectively does so, and with that version https://godbolt.org/z/nvq86W I get the same speed as when disabling SROA (although the code is not as nice as it could be right now, as there's no syntax for constant initializers for matrix types yet).
2020 Sep 27
2
A 4x slower initialization loop in LLVM vs GCC and MSVC
Hi everyone,
I was watching this video [1]. There's an example of an initialization loop for which Clang unfortunately generates really bad code [2]. On my machine, the Clang version is 4x slower than the GCC version. I have not tested the MSVC version, but it should be around the same.
In case anyone's interested, in the video [1] Casey explains why this code is bad (around 59:39).
So,