thr3ads.net - search: "nvq86w"

Displaying 2 results from an estimated 2 matches for "nvq86w".

Did you mean: nv86

A 4x slower initialization loop in LLVM vs GCC and MSVC

2020 Oct 01

A 4x slower initialization loop in LLVM vs GCC and MSVC

...ually interchanging the loops, which gives a ~3x speedup). Alternatively, if we we would create vector stores instead of the small memcpy calls, we probably would get a better result overall. Using Clang's Matrix Types extensions effectively does so, and with that version https://godbolt.org/z/nvq86W I get the same speed as if disabling SROA (although the code is not as nice as it code be right now, as there's no syntax for constant initializers for matrix types yet) -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/...

A 4x slower initialization loop in LLVM vs GCC and MSVC

2020 Sep 27

A 4x slower initialization loop in LLVM vs GCC and MSVC

Hi everyone, I was watching this video [1]. There's an example of an initialization loop for which Clang unfortunately generates really bad code [2]. In my machine, the Clang version is 4x slower than the GCC version. I have not tested the MSVC version, but it should be around the same. In case anyone's interested, in the video [1] Casey explains why this code is bad (around 59:39). So,

search for: nvq86w