thr3ads.net - llvm dev - [llvm-dev] [cfe-dev] Intrinsic llvm::isnan [Aug 2021]

If this information is useful, please help other people find it:
Share via:

Sanjay Patel via llvm-dev

2021-Aug-23 19:42 UTC

[llvm-dev] [cfe-dev] Intrinsic llvm::isnan

Ok, does this edit to the LangRef make sense for the definition of
"ignore":
"optimization passes may assume that the exception status flags will not be
read and that floating-point exceptions **will** be masked" -->
"optimization passes may assume that the exception status flags will not be
read and that floating-point exceptions **may** be masked"

It's still not clear to me if there's a benefit from having an intrinsic
vs. one more exception mode ("none" or "off").
That answer might depend on how many more of these intrinsics we'll need?
<cmath> has these:
isfinite()
isinf()
isnormal()
signbit()

others?

On Mon, Aug 23, 2021 at 2:25 PM Kaylor, Andrew <andrew.kaylor at
intel.com>
wrote:
> > Why would a target be allowed to lower the constrained fcmp intrinsic
> with "ignore" to an operation that might raise an exception?
>
>
>
> Because the “fpexcept.ignore” argument means that exceptions are being
> ignored. That is, it doesn’t matter whether exceptions are raised or not.
> It’s a promise that exceptions are not unmasked or being explicitly checked
> in this part of the code.
>
>
>
> The “fpexcept.ignore” argument isn’t used for the fully strict mode. It is
> used for two cases:
>
>
>
> 1) To get back to the “default” state (in combination with
> “fpround.tonearest”) when a non-constrained operation is inlined into a
> function that has constrained operations.
>
> 2) To enable dynamic rounding (-frounding-math) without requiring strict
> exception semantics.
>
>
>
>
>
> Regarding NaN comparison support in the presence of fast-math flags, I
> think this is an excellent reason to have the intrinsic. I needed this for
> a (non-C/C++) downstream compiler before the llvm.isnan intrinsic was
> introduced and found that the only ways to achieve it were either to
> introduce an intrinsic for the comparison or to remove the ‘nnan’ flag
> everywhere.
>
>
>
> -Andy
>
>
>
>
>
> *From:* llvm-dev <llvm-dev-bounces at lists.llvm.org> *On Behalf Of
*Sanjay
> Patel via llvm-dev
> *Sent:* Monday, August 23, 2021 10:56 AM
> *To:* Wang, Pengfei <Pengfei.Wang at intel.com>
> *Cc:* llvm-dev at lists.llvm.org; Clang Dev <cfe-dev at
lists.llvm.org>
> *Subject:* Re: [llvm-dev] [cfe-dev] Intrinsic llvm::isnan
>
>
>
> Why would a target be allowed to lower the constrained fcmp intrinsic with
> "ignore" to an operation that might raise an exception?
>
> If that scenario does not exist, then make the definition of fcmp
"ignore"
> stronger by saying that it *requires* a lowering that does not raise an
> exception.
>
> If refining the definition of "ignore" in this case is too hard
to
> reconcile with other FP ops, how about adding another exception mode
> ("none") to specify that exceptions are guaranteed not to be
raised?
>
>
>
> On Mon, Aug 23, 2021 at 11:04 AM Wang, Pengfei <pengfei.wang at
intel.com>
> wrote:
>
> AFAIU, the fpexcept metadata is just a hint for optimizations, which have
> to respect the sequence of FP instructions. “ignore” is not a command to
> ask backend to mask the exception. It just tells optimizations the
> exceptions are not concerned by user, so that the FP instructions can be
> scheduled.
>
> I think an non exception intrinsic is needed, because it is a commend to
> request backend not to emit instructions that may raise exception.
>
>
>
> Thanks
>
> Pengfei
>
>
>
> *From:* llvm-dev <llvm-dev-bounces at lists.llvm.org> *On Behalf Of
*Sanjay
> Patel via llvm-dev
> *Sent:* Monday, August 23, 2021 10:13 PM
> *To:* Serge Pavlov <sepavloff at gmail.com>
> *Cc:* LLVM Developers <llvm-dev at lists.llvm.org>; Clang Dev <
> cfe-dev at lists.llvm.org>
> *Subject:* Re: [llvm-dev] [cfe-dev] Intrinsic llvm::isnan
>
>
>
> If the FP state is not made invisible/irrelevant by
"fpexcept.ignore",
> then why is that argument on this intrinsic call at all?
>
> Ie, why is that argument not used by the backend to decide to lower to a
> CMP instruction or special isnan instruction or library call?
>
>
>
> An example would be helpful - in what case would these two lower
> differently given that SNAN always raises a visible exception?
>
> call i1 @llvm.experimental.constrained.fcmp.f64(double %x, double %x,
> metadata !"uno", metadata !"fpexcept.ignore") ;
"floating-point exceptions
> will be masked"
>
> call i1 @llvm.experimental.constrained.fcmp.f64(double %x, double %x,
> metadata !"uno", metadata !"fpexcept.strict")  ;
"this mode can also be
> used with code that unmasks FP exceptions"
>
>
>
>
>
> On Mon, Aug 23, 2021 at 9:50 AM Serge Pavlov <sepavloff at gmail.com>
wrote:
>
> When codegen lowers this call, it does not know if this is a regular
> compare and it may create CMP instruction, as for default environment, or
> this is 'isnan' for which it should generate different code, which
does not
> influence FP state.
>
>
>
> On most architectures compare instructions change FP state, in default
> environment it is just ignored, but actually hardware registers can be
> modified, For 'isnan' instructions must actually leave FP state
intact.
>
>
> Thanks,
> --Serge
>
>
>
>
>
> On Mon, Aug 23, 2021 at 8:43 PM Sanjay Patel <spatel at
rotateright.com>
> wrote:
>
> You're saying that the function definition text overrides the argument
> definition text. Why are we choosing that interpretation rather than the
> inverse (and documenting it one way or the other)?
>
>
>
> On Mon, Aug 23, 2021 at 9:38 AM Serge Pavlov <sepavloff at gmail.com>
wrote:
>
> > How is this call in LLVM different than the semantics of
"isnan(x)" that
> is required by IEEE-754 or the C standard?
>
>
>
> If either of the arguments of `llvm.experimental.constrained.fcmp` is
> signaling NaN, this function should raise an 'Invalid' exception.
'isnan'
> never raises exceptions.
>
>
> Thanks,
> --Serge
>
>
>
>
>
> On Mon, Aug 23, 2021 at 8:10 PM Sanjay Patel <spatel at
rotateright.com>
> wrote:
>
> I'm confused about the definition of:
>
>
>
https://llvm.org/docs/LangRef.html#llvm-experimental-constrained-fcmp-and-llvm-experimental-constrained-fcmps-intrinsics
>
>
>
> These intrinsics require an "exception behavior" argument. That
argument
> can take the value “fpexcept.ignore” which is defined as:
>
> "optimization passes may assume that the exception status flags will
not
> be read and that floating-point exceptions will be masked"
>
>
>
> i1 @llvm.experimental.constrained.fcmp.f64(double %x, double %x, metadata
> !"uno", metadata !"fpexcept.ignore")
>
>
>
> How is this call in LLVM different than the semantics of
"isnan(x)" that
> is required by IEEE-754 or the C standard?
>
>
>
> On Mon, Aug 23, 2021 at 9:00 AM Serge Pavlov via cfe-dev <
> cfe-dev at lists.llvm.org> wrote:
>
>
> On Mon, Aug 23, 2021 at 6:12 PM Roman Lebedev <lebedev.ri at
gmail.com>
> wrote:
>
> Thank you for posting the RFC!
>
> I do not believe we should conflate StrictFP support, and
> `-ffast-math` handling, these are two separate/separatable concerns.
>
>
>
> You are right, they are separate, but they originate from the
> implementation of the same function and can be solved with the same
> solution.
>
>
>
>
> As for the latter, right now i'm not convinced that we should
> second-guess/override explicit user request.
> This is inconsistent, and does not match how at least the GCC deals with
> it.
> I think changing the status-quo (before said patch) should be a separate
> RFC,
> and that change should be undone until after that RFC is accepted.
>
>
>
> Actually we have two explicit user requests, a call of 'isnan' and
an
> option '-ffast-math'. IMHO they do not contradict each other as
'isnan' is
> not an arithmetic operation. There is a discussion in
> https://reviews.llvm.org/D18513#387418, which also expresses the opinion
> that limitations imposed by '-ffast-math' should be applied only to
'math'
> functions but not to 'tests'. As for GCC behavior, they agree that
this
> behavior is a bag: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84949.
> Intel and Microsoft compilers do not replace 'isnan' with assumed
value.
>
>
>
> As for the latter, the main point of confusion is,
> why is `@llvm.isnan` still used in non-StrictFP code?
>
>
>
> We have to introduce an intrinsic to represent `isnan` in strictfp
> environment. It is natural to use it for the default environment as
> well. Besides, a target may have a more efficient way to represent `isnan`
> than unordered comparison.
>
>
>
> The argument that we need `@llvm.isnan` because we *might* transition
> in and out of StrictFP section does not seem to hold for me, because
> https://llvm.org/docs/LangRef.html#constrainedfp says:
>
> > If any FP operation in a function is constrained then they all must be
> constrained. This is required for correct LLVM IR.
>
>
>
> There was no such intention. The primary motivation was strict fp
> exceptions.
>
>
>
>
> So presumably when codegen'ing a function, we already know that we
> will use StrictFP ops, and that should be the knob to use `@llvm.isnan`,
> i think.
>
>
> Roman
>
>
>
>
> On Mon, Aug 23, 2021 at 1:57 PM Serge Pavlov via cfe-dev
> <cfe-dev at lists.llvm.org> wrote:
> >
> > Hi all,
> >
> > Some time ago a new intrinsic `llvm.isnan` was introduced, which is
> intended to represent IEEE-754 operation `isNaN` as well as a family of C
> library functions `isnan*`. Recently during post-commit review concern was
> raised (see  https://reviews.llvm.org/D104854) that this functionality
> must have had RFC to make sure there is consensus on semantics.
> >
> > Previously the frontend intrinsic `__builtin_isnan` was converted into
> `cmp uno` during IR generation in clang codegen. There are two main reasons
> why this solution is not satisfactory.
> >
> > 1.  Strict floating-point semantics.
> >
> > If FP exceptions are not ignored, `cmp uno` must be replaced with its
> constrained counterpart, namely `llvm.experimental.constrained.fcmp` or
> `llvm.experimental.constrained.fcmps`. None of them is compatible with the
> semantics of `isnan`. Both IEEE-754 (5.7.2) an C standard  (
> http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2596.pdf, F.3p6) demand
> that this function does not raise floating point exceptions. Both the
> constrained compare intrinsics raise an exception if either operand is a
> SNAN (https://llvm.org/docs/LangRef.html#id1131). So there was no
> target-independent IR construct that could express `isnan`.
> >
> > This drawback was significant enough and some attempts to alleviate it
> were undertaken. In https://reviews.llvm.org/D95948 `isnan` was
> implemented using integer operations in strictfp functions. It however is
> not suitable for targets where a more efficient way exists, like dedicated
> instruction. Another solution was implemented in
> https://reviews.llvm.org/D96568, where a hook
> 'clang::TargetCodeGenInfo::testFPKind' was introduced, which
injects target
> specific code into IR. Such a solution makes IR more target-dependent and
> prevents some IR-level optimizations.
> >
> > 2. Compilation with -ffast-math
> >
> > The option '-ffast-math' is often used for performance
critical code, as
> it can produce faster code. In this case the user must ensure that NaNs are
> not used as operand values. `isnan` is just proposed for such checks, but
> it was unusable when `isnan` was represented by compare instruction,
> because the latter may be optimized out. One of use cases is data in
> memory, which is processed by a function compiled with `-ffast-math`. Some
> items in the data are NaNs to denote absence of values.
> >
> > This point requires some remarks about using NaNs when a function is
> compiled with `-ffast-math`. GCC manual does not specify how this option
> works, it only states about `-ffinite-math-only` (
>
https://gcc.gnu.org/onlinedocs/gcc-11.2.0/gcc/Optimize-Options.html#Optimize-Options
> ):
> >
> > `Allow optimizations for floating-point arithmetic that assume that
> arguments and results are not NaNs or +-Infs.`
> >
> > `isnan` does not do any arithmetic, only check, so this statement
> apparently does not apply to it. There is a GCC bug report
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84949, where investigation
> conforms that std::isnan() and std::fpclassify() should works with NaNs as
> specified even in -ffast-math mode.
> >
> > Extending NaN restrictions in -ffast-math mode to functions like
`isnan`
> does not make code faster, but is a source of broken user expectations. If
> a user writes `isnan` they usually expect an actual check. Silently
> removing the check is a stronger action than assuming that float value
> contains only real numbers.
> >
> > Intrinsic `llvm.isnan` solves these problems. It
> > - represents the check throughout the IR pipeline and saves it from
> undesired optimizations,
> > - is lowered in selector, which can choose the most suitable
> implementation for particular target,
> > - helps keeping IR target-independent,
> > - facilitates program analysis as the operation is presented
explicitly
> and is not hidden behind general nodes.
> >
> > Note that `llvm.isnan` is optimized out if its argument is an
operation
> with `nnan` flag, this behavior agrees with the definition of this flag in
> LLVM documentation.
> >
> > Any feedback is welcome.
> >
> > Thanks,
> > --Serge
> > _______________________________________________
> > cfe-dev mailing list
> > cfe-dev at lists.llvm.org
> > https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>
> _______________________________________________
> cfe-dev mailing list
> cfe-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20210823/1b963cee/attachment.html>

James Y Knight via llvm-dev

2021-Aug-23 20:46 UTC

head link

[llvm-dev] [cfe-dev] Intrinsic llvm::isnan

According to IEEE 754, the following operations are
"non-computational",
and should not signal if provided with an sNAN argument.
I've annotated each with -> C function/macro equivalent, taken from the
table in the C standard draft.

enum class(source) -> fpclassify, signbit, issignaling
boolean isSignMinus(source) -> signbit
boolean isNormal(source) -> isnormal
boolean isFinite(source) -> isfinite
boolean isZero(source) -> iszero
boolean isSubnormal(source) -> issubnormal
boolean isInfinite(source) -> isinf
boolean isNaN(source) -> isnan
boolean isSignaling(source) -> issignaling
boolean isCanonical(source) -> iscanonical
enum radix(source) -> FLT_RADIX constant
boolean totalOrder(source, source) -> totalorder
boolean totalOrderMag(source, source) -> totalordermag

(Decimal numbers:)
boolean sameQuantum(source, source) -> samequantum

The following list are "quiet-computational" operations, which also
should
not signal when provided with an sNAN:

sourceFormat copy(source) -> memcpy, memmove,+(x) -- (Somewhat
surprisingly, "x = y" is implementation-defined to be either a
"copy"
(non-signaling) or a "convertFormat" (signaling), even when x and y
are the
same type.)
sourceFormat negate(source) -> -(x)
sourceFormat abs(source) -> fabs

(Decimal numbers):

decimalEncoding encodeDecimal(decimal) -> encodedec
decimal decodeDecimal(decimalEncoding) -> decodedec
binaryEncoding encodeBinary(decimal) -> encodebin
decimal decodeBinary(binaryEncoding) -> decodebin




On Mon, Aug 23, 2021 at 3:42 PM Sanjay Patel via cfe-dev <
cfe-dev at lists.llvm.org> wrote:
> Ok, does this edit to the LangRef make sense for the definition of
> "ignore":
> "optimization passes may assume that the exception status flags will
not
> be read and that floating-point exceptions **will** be masked" -->
> "optimization passes may assume that the exception status flags will
not
> be read and that floating-point exceptions **may** be masked"
>
> It's still not clear to me if there's a benefit from having an
intrinsic
> vs. one more exception mode ("none" or "off").
> That answer might depend on how many more of these intrinsics we'll
need?
> <cmath> has these:
> isfinite()
> isinf()
> isnormal()
> signbit()
>
> others?
>
> On Mon, Aug 23, 2021 at 2:25 PM Kaylor, Andrew <andrew.kaylor at
intel.com>
> wrote:
>
>> > Why would a target be allowed to lower the constrained fcmp
intrinsic
>> with "ignore" to an operation that might raise an exception?
>>
>>
>>
>> Because the “fpexcept.ignore” argument means that exceptions are being
>> ignored. That is, it doesn’t matter whether exceptions are raised or
not.
>> It’s a promise that exceptions are not unmasked or being explicitly
checked
>> in this part of the code.
>>
>>
>>
>> The “fpexcept.ignore” argument isn’t used for the fully strict mode. It
>> is used for two cases:
>>
>>
>>
>> 1) To get back to the “default” state (in combination with
>> “fpround.tonearest”) when a non-constrained operation is inlined into a
>> function that has constrained operations.
>>
>> 2) To enable dynamic rounding (-frounding-math) without requiring
strict
>> exception semantics.
>>
>>
>>
>>
>>
>> Regarding NaN comparison support in the presence of fast-math flags, I
>> think this is an excellent reason to have the intrinsic. I needed this
for
>> a (non-C/C++) downstream compiler before the llvm.isnan intrinsic was
>> introduced and found that the only ways to achieve it were either to
>> introduce an intrinsic for the comparison or to remove the ‘nnan’ flag
>> everywhere.
>>
>>
>>
>> -Andy
>>
>>
>>
>>
>>
>> *From:* llvm-dev <llvm-dev-bounces at lists.llvm.org> *On Behalf
Of *Sanjay
>> Patel via llvm-dev
>> *Sent:* Monday, August 23, 2021 10:56 AM
>> *To:* Wang, Pengfei <Pengfei.Wang at intel.com>
>> *Cc:* llvm-dev at lists.llvm.org; Clang Dev <cfe-dev at
lists.llvm.org>
>> *Subject:* Re: [llvm-dev] [cfe-dev] Intrinsic llvm::isnan
>>
>>
>>
>> Why would a target be allowed to lower the constrained fcmp intrinsic
>> with "ignore" to an operation that might raise an exception?
>>
>> If that scenario does not exist, then make the definition of fcmp
>> "ignore" stronger by saying that it *requires* a lowering
that does not
>> raise an exception.
>>
>> If refining the definition of "ignore" in this case is too
hard to
>> reconcile with other FP ops, how about adding another exception mode
>> ("none") to specify that exceptions are guaranteed not to be
raised?
>>
>>
>>
>> On Mon, Aug 23, 2021 at 11:04 AM Wang, Pengfei <pengfei.wang at
intel.com>
>> wrote:
>>
>> AFAIU, the fpexcept metadata is just a hint for optimizations, which
have
>> to respect the sequence of FP instructions. “ignore” is not a command
to
>> ask backend to mask the exception. It just tells optimizations the
>> exceptions are not concerned by user, so that the FP instructions can
be
>> scheduled.
>>
>> I think an non exception intrinsic is needed, because it is a commend
to
>> request backend not to emit instructions that may raise exception.
>>
>>
>>
>> Thanks
>>
>> Pengfei
>>
>>
>>
>> *From:* llvm-dev <llvm-dev-bounces at lists.llvm.org> *On Behalf
Of *Sanjay
>> Patel via llvm-dev
>> *Sent:* Monday, August 23, 2021 10:13 PM
>> *To:* Serge Pavlov <sepavloff at gmail.com>
>> *Cc:* LLVM Developers <llvm-dev at lists.llvm.org>; Clang Dev
<
>> cfe-dev at lists.llvm.org>
>> *Subject:* Re: [llvm-dev] [cfe-dev] Intrinsic llvm::isnan
>>
>>
>>
>> If the FP state is not made invisible/irrelevant by
"fpexcept.ignore",
>> then why is that argument on this intrinsic call at all?
>>
>> Ie, why is that argument not used by the backend to decide to lower to
a
>> CMP instruction or special isnan instruction or library call?
>>
>>
>>
>> An example would be helpful - in what case would these two lower
>> differently given that SNAN always raises a visible exception?
>>
>> call i1 @llvm.experimental.constrained.fcmp.f64(double %x, double %x,
>> metadata !"uno", metadata !"fpexcept.ignore") ;
"floating-point exceptions
>> will be masked"
>>
>> call i1 @llvm.experimental.constrained.fcmp.f64(double %x, double %x,
>> metadata !"uno", metadata !"fpexcept.strict")  ;
"this mode can also be
>> used with code that unmasks FP exceptions"
>>
>>
>>
>>
>>
>> On Mon, Aug 23, 2021 at 9:50 AM Serge Pavlov <sepavloff at
gmail.com> wrote:
>>
>> When codegen lowers this call, it does not know if this is a regular
>> compare and it may create CMP instruction, as for default environment,
or
>> this is 'isnan' for which it should generate different code,
which does not
>> influence FP state.
>>
>>
>>
>> On most architectures compare instructions change FP state, in default
>> environment it is just ignored, but actually hardware registers can be
>> modified, For 'isnan' instructions must actually leave FP state
intact.
>>
>>
>> Thanks,
>> --Serge
>>
>>
>>
>>
>>
>> On Mon, Aug 23, 2021 at 8:43 PM Sanjay Patel <spatel at
rotateright.com>
>> wrote:
>>
>> You're saying that the function definition text overrides the
argument
>> definition text. Why are we choosing that interpretation rather than
the
>> inverse (and documenting it one way or the other)?
>>
>>
>>
>> On Mon, Aug 23, 2021 at 9:38 AM Serge Pavlov <sepavloff at
gmail.com> wrote:
>>
>> > How is this call in LLVM different than the semantics of
"isnan(x)"
>> that is required by IEEE-754 or the C standard?
>>
>>
>>
>> If either of the arguments of `llvm.experimental.constrained.fcmp` is
>> signaling NaN, this function should raise an 'Invalid'
exception. 'isnan'
>> never raises exceptions.
>>
>>
>> Thanks,
>> --Serge
>>
>>
>>
>>
>>
>> On Mon, Aug 23, 2021 at 8:10 PM Sanjay Patel <spatel at
rotateright.com>
>> wrote:
>>
>> I'm confused about the definition of:
>>
>>
>>
https://llvm.org/docs/LangRef.html#llvm-experimental-constrained-fcmp-and-llvm-experimental-constrained-fcmps-intrinsics
>>
>>
>>
>> These intrinsics require an "exception behavior" argument.
That argument
>> can take the value “fpexcept.ignore” which is defined as:
>>
>> "optimization passes may assume that the exception status flags
will not
>> be read and that floating-point exceptions will be masked"
>>
>>
>>
>> i1 @llvm.experimental.constrained.fcmp.f64(double %x, double %x,
metadata
>> !"uno", metadata !"fpexcept.ignore")
>>
>>
>>
>> How is this call in LLVM different than the semantics of
"isnan(x)" that
>> is required by IEEE-754 or the C standard?
>>
>>
>>
>> On Mon, Aug 23, 2021 at 9:00 AM Serge Pavlov via cfe-dev <
>> cfe-dev at lists.llvm.org> wrote:
>>
>>
>> On Mon, Aug 23, 2021 at 6:12 PM Roman Lebedev <lebedev.ri at
gmail.com>
>> wrote:
>>
>> Thank you for posting the RFC!
>>
>> I do not believe we should conflate StrictFP support, and
>> `-ffast-math` handling, these are two separate/separatable concerns.
>>
>>
>>
>> You are right, they are separate, but they originate from the
>> implementation of the same function and can be solved with the same
>> solution.
>>
>>
>>
>>
>> As for the latter, right now i'm not convinced that we should
>> second-guess/override explicit user request.
>> This is inconsistent, and does not match how at least the GCC deals
with
>> it.
>> I think changing the status-quo (before said patch) should be a
separate
>> RFC,
>> and that change should be undone until after that RFC is accepted.
>>
>>
>>
>> Actually we have two explicit user requests, a call of 'isnan'
and an
>> option '-ffast-math'. IMHO they do not contradict each other as
'isnan' is
>> not an arithmetic operation. There is a discussion in
>> https://reviews.llvm.org/D18513#387418, which also expresses the
opinion
>> that limitations imposed by '-ffast-math' should be applied
only to 'math'
>> functions but not to 'tests'. As for GCC behavior, they agree
that this
>> behavior is a bag: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84949.
>> Intel and Microsoft compilers do not replace 'isnan' with
assumed value.
>>
>>
>>
>> As for the latter, the main point of confusion is,
>> why is `@llvm.isnan` still used in non-StrictFP code?
>>
>>
>>
>> We have to introduce an intrinsic to represent `isnan` in strictfp
>> environment. It is natural to use it for the default environment as
>> well. Besides, a target may have a more efficient way to represent
`isnan`
>> than unordered comparison.
>>
>>
>>
>> The argument that we need `@llvm.isnan` because we *might* transition
>> in and out of StrictFP section does not seem to hold for me, because
>> https://llvm.org/docs/LangRef.html#constrainedfp says:
>>
>> > If any FP operation in a function is constrained then they all
must be
>> constrained. This is required for correct LLVM IR.
>>
>>
>>
>> There was no such intention. The primary motivation was strict fp
>> exceptions.
>>
>>
>>
>>
>> So presumably when codegen'ing a function, we already know that we
>> will use StrictFP ops, and that should be the knob to use
`@llvm.isnan`,
>> i think.
>>
>>
>> Roman
>>
>>
>>
>>
>> On Mon, Aug 23, 2021 at 1:57 PM Serge Pavlov via cfe-dev
>> <cfe-dev at lists.llvm.org> wrote:
>> >
>> > Hi all,
>> >
>> > Some time ago a new intrinsic `llvm.isnan` was introduced, which
is
>> intended to represent IEEE-754 operation `isNaN` as well as a family of
C
>> library functions `isnan*`. Recently during post-commit review concern
was
>> raised (see  https://reviews.llvm.org/D104854) that this functionality
>> must have had RFC to make sure there is consensus on semantics.
>> >
>> > Previously the frontend intrinsic `__builtin_isnan` was converted
into
>> `cmp uno` during IR generation in clang codegen. There are two main
reasons
>> why this solution is not satisfactory.
>> >
>> > 1.  Strict floating-point semantics.
>> >
>> > If FP exceptions are not ignored, `cmp uno` must be replaced with
its
>> constrained counterpart, namely `llvm.experimental.constrained.fcmp` or
>> `llvm.experimental.constrained.fcmps`. None of them is compatible with
the
>> semantics of `isnan`. Both IEEE-754 (5.7.2) an C standard  (
>> http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2596.pdf, F.3p6)
demand
>> that this function does not raise floating point exceptions. Both the
>> constrained compare intrinsics raise an exception if either operand is
a
>> SNAN (https://llvm.org/docs/LangRef.html#id1131). So there was no
>> target-independent IR construct that could express `isnan`.
>> >
>> > This drawback was significant enough and some attempts to
alleviate it
>> were undertaken. In https://reviews.llvm.org/D95948 `isnan` was
>> implemented using integer operations in strictfp functions. It however
is
>> not suitable for targets where a more efficient way exists, like
dedicated
>> instruction. Another solution was implemented in
>> https://reviews.llvm.org/D96568, where a hook
>> 'clang::TargetCodeGenInfo::testFPKind' was introduced, which
injects target
>> specific code into IR. Such a solution makes IR more target-dependent
and
>> prevents some IR-level optimizations.
>> >
>> > 2. Compilation with -ffast-math
>> >
>> > The option '-ffast-math' is often used for performance
critical code,
>> as it can produce faster code. In this case the user must ensure that
NaNs
>> are not used as operand values. `isnan` is just proposed for such
checks,
>> but it was unusable when `isnan` was represented by compare
instruction,
>> because the latter may be optimized out. One of use cases is data in
>> memory, which is processed by a function compiled with `-ffast-math`.
Some
>> items in the data are NaNs to denote absence of values.
>> >
>> > This point requires some remarks about using NaNs when a function
is
>> compiled with `-ffast-math`. GCC manual does not specify how this
option
>> works, it only states about `-ffinite-math-only` (
>>
https://gcc.gnu.org/onlinedocs/gcc-11.2.0/gcc/Optimize-Options.html#Optimize-Options
>> ):
>> >
>> > `Allow optimizations for floating-point arithmetic that assume
that
>> arguments and results are not NaNs or +-Infs.`
>> >
>> > `isnan` does not do any arithmetic, only check, so this statement
>> apparently does not apply to it. There is a GCC bug report
>> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84949, where investigation
>> conforms that std::isnan() and std::fpclassify() should works with NaNs
as
>> specified even in -ffast-math mode.
>> >
>> > Extending NaN restrictions in -ffast-math mode to functions like
>> `isnan` does not make code faster, but is a source of broken user
>> expectations. If a user writes `isnan` they usually expect an actual
check.
>> Silently removing the check is a stronger action than assuming that
float
>> value contains only real numbers.
>> >
>> > Intrinsic `llvm.isnan` solves these problems. It
>> > - represents the check throughout the IR pipeline and saves it
from
>> undesired optimizations,
>> > - is lowered in selector, which can choose the most suitable
>> implementation for particular target,
>> > - helps keeping IR target-independent,
>> > - facilitates program analysis as the operation is presented
explicitly
>> and is not hidden behind general nodes.
>> >
>> > Note that `llvm.isnan` is optimized out if its argument is an
operation
>> with `nnan` flag, this behavior agrees with the definition of this flag
in
>> LLVM documentation.
>> >
>> > Any feedback is welcome.
>> >
>> > Thanks,
>> > --Serge
>> > _______________________________________________
>> > cfe-dev mailing list
>> > cfe-dev at lists.llvm.org
>> > https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>>
>> _______________________________________________
>> cfe-dev mailing list
>> cfe-dev at lists.llvm.org
>> https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>>
>> _______________________________________________
> cfe-dev mailing list
> cfe-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20210823/3be5f0e8/attachment.html>

Arthur O'Dwyer via llvm-dev

2021-Aug-23 21:36 UTC

head link

[llvm-dev] [cfe-dev] Intrinsic llvm::isnan

On Mon, Aug 23, 2021 at 3:42 PM Sanjay Patel via cfe-dev <
cfe-dev at lists.llvm.org> wrote:
> Ok, does this edit to the LangRef make sense for the definition of
> "ignore":
> "optimization passes may assume that the exception status flags will
not
> be read and that floating-point exceptions **will** be masked" -->
> "optimization passes may assume that the exception status flags will
not
> be read and that floating-point exceptions **may** be masked"
>
I haven't been following the technical details, but in terms of the English
documentation, it makes no sense to say that someone "may assume that [X]
*may* happen." Either [X] always happens, in which case optimization passes
may safely assume that it happens; or [X] never happens, in which case
optimization passes may safely assume that it does not happen; or else [X]
sometimes happens and sometimes doesn't, in which case optimizations
passes *must
not assume anything* about [X].

So you might say: "optimization passes may assume that the exception status
flags will not be read. Floating-point exceptions might or might not be
masked, depending on [____]" (and then mention the relevant variable, such
as "instruction set" or "optimization level" or whatever).

HTH,
Arthur
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20210823/2426dbf4/attachment.html>

llvm dev - Aug 2021 - [cfe-dev] Intrinsic llvm::isnan

[llvm-dev] [cfe-dev] Intrinsic llvm::isnan

[llvm-dev] [cfe-dev] Intrinsic llvm::isnan

[llvm-dev] [cfe-dev] Intrinsic llvm::isnan