Displaying 2 results from an estimated 2 matches for "__builtin_astype".
2016 Feb 11
2
Vectorization with fast-math on irregular ISA sub-sets
...oke our implementation of 'nextafterf'. The desired code fragment (FP32) is:
float xAbs = fabsf(x);
Since we know our instruction for this does not handle denormals, and the algorithm is sensitive to correct denormal handling, the code was written to avoid this issue as follows:
float xAbs = __builtin_astype(__builtin_astype(x, unsigned) & 0x7FFFFFFF, float);
But the v3.8 FP optimiser now recognises this pattern and replaces it with an ISD::FABS node, which breaks our workaround :-) It's a great optimisation and I have no problem with its correctness, but I was thinking that perhaps I might see w...
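For readers without the OpenCL-style `__builtin_astype`, the sign-bit masking idiom described in the snippet can be sketched in portable C roughly as follows. This is only an illustration: the helper names `bits_of`, `float_from_bits`, and `abs_via_mask` are hypothetical and do not come from the original post, and the sketch assumes `float` is a 32-bit IEEE-754 type. As the post reports for v3.8, an optimiser may still recognise this pattern and lower it to a native absolute-value operation, so writing it this way does not by itself guarantee that integer instructions are used.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: reinterpret a float's bits as a 32-bit unsigned
 * integer. memcpy is used to avoid undefined behaviour from pointer casts. */
static uint32_t bits_of(float x) {
    uint32_t bits;
    memcpy(&bits, &x, sizeof bits);
    return bits;
}

/* Hypothetical helper: reinterpret a 32-bit pattern as a float. */
static float float_from_bits(uint32_t bits) {
    float x;
    memcpy(&x, &bits, sizeof x);
    return x;
}

/* Absolute value by clearing the IEEE-754 sign bit (bit 31). At the source
 * level no floating-point arithmetic is involved, so the exponent and
 * mantissa bits, including those of denormals, are passed through as-is. */
static float abs_via_mask(float x) {
    return float_from_bits(bits_of(x) & 0x7FFFFFFFu);
}
```

At the source level this matches the one-line form in the snippet: `abs_via_mask(x)` corresponds to `__builtin_astype(__builtin_astype(x, unsigned) & 0x7FFFFFFF, float)`.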
2016 Feb 11
4
Vectorization with fast-math on irregular ISA sub-sets
----- Original Message -----
> From: "Renato Golin" <renato.golin at linaro.org>
> To: "Hal Finkel" <hfinkel at anl.gov>
> Cc: "James Molloy" <James.Molloy at arm.com>, "Nadav Rotem" <nrotem at apple.com>, "Arnold Schwaighofer"
> <aschwaighofer at apple.com>, "LLVM Dev" <llvm-dev at