thr3ads.net - llvm dev - [LLVMdev] Aggressive FMA fusion for NVPTX [Jan 2015]

Olivier H Sallenave

2015-Jan-13 22:14 UTC

[LLVMdev] Aggressive FMA fusion for NVPTX

Hi,

I propose to override the TLI callback enableAggressiveFMAFusion for the
NVPTX backend and return true instead of false. The reason is the same as
for PPC: fmul, fmadd and fadd nodes cost the same number of cycles (see
http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#arithmetic-instructions),
so we can enable more combining heuristics to produce more FMAs. For
instance, this pattern would be considered:

// fold (fadd (fma x, y, (fmul u, v)), z) -> (fma x, y, (fma u, v, z))

cf. commits:
http://llvm.org/viewvc/llvm-project?view=revision&revision=218120
http://llvm.org/viewvc/llvm-project?view=revision&revision=225380
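
To make the fold pattern above concrete, here is a minimal numeric sketch of what the rewrite does at the value level. It is not LLVM code: the function names `unfused` and `fused` are made up for illustration, and `std::fma` from the C++ standard library stands in for the fma node.

```cpp
#include <cmath>

// Before the fold: (fadd (fma x, y, (fmul u, v)), z)
// Three operations: one multiply, one FMA, one add.
double unfused(double x, double y, double u, double v, double z) {
    double uv = u * v;              // fmul u, v
    double t = std::fma(x, y, uv);  // fma x, y, uv
    return t + z;                   // fadd t, z
}

// After the fold: (fma x, y, (fma u, v, z))
// Two operations: both are FMAs.
double fused(double x, double y, double u, double v, double z) {
    double inner = std::fma(u, v, z);  // fma u, v, z
    return std::fma(x, y, inner);      // fma x, y, inner
}
```

Both functions compute x*y + u*v + z. For inputs whose intermediate results are exactly representable (for example 2, 3, 4, 5, 6) they agree exactly; in general the fused form rounds twice instead of three times, so the last bits can differ. The trade is only a win when an FMA costs no more than a separate multiply or add, which is the premise of the proposal.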

Please tell me what you think.

Olivier

Justin Holewinski

2015-Jan-14 01:57 UTC

Looks good to me!  Thanks!

On Tue, Jan 13, 2015 at 5:14 PM, Olivier H Sallenave <ohsallen at us.ibm.com> wrote:
> Hi,
>
> I propose to override the TLI callback enableAggressiveFMAFusion for the
> NVPTX backend and return true instead of false. The reason is the same as
> for PPC: fmul, fmadd and fadd nodes cost the same number of cycles (see
> http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#arithmetic-instructions),
> so we can enable more combining heuristics to produce more FMAs. For
> instance, this pattern would be considered:
>
> // fold (fadd (fma x, y, (fmul u, v)), z) -> (fma x, y, (fma u, v, z))
>
> cf. commits:
> http://llvm.org/viewvc/llvm-project?view=revision&revision=218120
> http://llvm.org/viewvc/llvm-project?view=revision&revision=225380
>
> Please tell me what you think.
>
> Olivier

-- 

Thanks,

Justin Holewinski
