thr3ads.net - search: "call

Displaying 2 results from an estimated 2 matches for "call_llvm".

LLVM 11 and trunk selecting 4 wide instead of 8 wide loop vectorization for AVX-enabled target

2020 Jul 17

LLVM 11 and trunk selecting 4 wide instead of 8 wide loop vectorization for AVX-enabled target

...e great if you could share the IR with the attributes, as they depend on > the downstream TLI. > > I am also CC’ing Francesco, who might be able to help you pinning down > where exactly things go wrong with the mapping. > > Cheers, > Florian > > —— > > define float @call_llvm.log10.f32(float %in) { > %call = tail call float @llvm.log10.f32(float %in) > ret float %call > } > > declare float @llvm.log10.f32(float) > -- Neil Henning Senior Software Engineer Compiler unity.com -------------- next part -------------- An HTML attachment was scrubbed.....

LLVM 11 and trunk selecting 4 wide instead of 8 wide loop vectorization for AVX-enabled target

2020 Jul 16

LLVM 11 and trunk selecting 4 wide instead of 8 wide loop vectorization for AVX-enabled target

So for us we use SLEEF to actually implement the libcalls (LLVM intrinsics) that LLVM by default would generate - and since SLEEF has highly optimal 8-wide pow, optimized for AVX and AVX2, we really want to use that. So we would not see 4/8 libcalls and instead see 1 call to something that lights up the ymm registers. I guess the problem then is that the default expectation is that pow would be

search for: call_llvm