Sanjay Patel via llvm-dev
2021-Aug-23 19:42 UTC
[llvm-dev] [cfe-dev] Intrinsic llvm::isnan
Ok, does this edit to the LangRef make sense for the definition of "ignore": "optimization passes may assume that the exception status flags will not be read and that floating-point exceptions **will** be masked" --> "optimization passes may assume that the exception status flags will not be read and that floating-point exceptions **may** be masked" It's still not clear to me if there's a benefit from having an intrinsic vs. one more exception mode ("none" or "off"). That answer might depend on how many more of these intrinsics we'll need? <cmath> has these: isfinite() isinf() isnormal() signbit() others? On Mon, Aug 23, 2021 at 2:25 PM Kaylor, Andrew <andrew.kaylor at intel.com> wrote:> > Why would a target be allowed to lower the constrained fcmp intrinsic > with "ignore" to an operation that might raise an exception? > > > > Because the “fpexcept.ignore” argument means that exceptions are being > ignored. That is, it doesn’t matter whether exceptions are raised or not. > It’s a promise that exceptions are not unmasked or being explicitly checked > in this part of the code. > > > > The “fpexcept.ignore” argument isn’t used for the fully strict mode. It is > used for two cases: > > > > 1) To get back to the “default” state (in combination with > “fpround.tonearest”) when a non-constrained operation is inlined into a > function that has constrained operations. > > 2) To enable dynamic rounding (-frounding-math) without requiring strict > exception semantics. > > > > > > Regarding NaN comparison support in the presence of fast-math flags, I > think this is an excellent reason to have the intrinsic. I needed this for > a (non-C/C++) downstream compiler before the llvm.isnan intrinsic was > introduced and found that the only ways to achieve it were either to > introduce an intrinsic for the comparison or to remove the ‘nnan’ flag > everywhere. > > > > -Andy > > > > > > *From:* llvm-dev <llvm-dev-bounces at lists.llvm.org> *On Behalf Of *Sanjay > Patel via llvm-dev > *Sent:* Monday, August 23, 2021 10:56 AM > *To:* Wang, Pengfei <Pengfei.Wang at intel.com> > *Cc:* llvm-dev at lists.llvm.org; Clang Dev <cfe-dev at lists.llvm.org> > *Subject:* Re: [llvm-dev] [cfe-dev] Intrinsic llvm::isnan > > > > Why would a target be allowed to lower the constrained fcmp intrinsic with > "ignore" to an operation that might raise an exception? > > If that scenario does not exist, then make the definition of fcmp "ignore" > stronger by saying that it *requires* a lowering that does not raise an > exception. > > If refining the definition of "ignore" in this case is too hard to > reconcile with other FP ops, how about adding another exception mode > ("none") to specify that exceptions are guaranteed not to be raised? > > > > On Mon, Aug 23, 2021 at 11:04 AM Wang, Pengfei <pengfei.wang at intel.com> > wrote: > > AFAIU, the fpexcept metadata is just a hint for optimizations, which have > to respect the sequence of FP instructions. “ignore” is not a command to > ask backend to mask the exception. It just tells optimizations the > exceptions are not concerned by user, so that the FP instructions can be > scheduled. > > I think an non exception intrinsic is needed, because it is a commend to > request backend not to emit instructions that may raise exception. > > > > Thanks > > Pengfei > > > > *From:* llvm-dev <llvm-dev-bounces at lists.llvm.org> *On Behalf Of *Sanjay > Patel via llvm-dev > *Sent:* Monday, August 23, 2021 10:13 PM > *To:* Serge Pavlov <sepavloff at gmail.com> > *Cc:* LLVM Developers <llvm-dev at lists.llvm.org>; Clang Dev < > cfe-dev at lists.llvm.org> > *Subject:* Re: [llvm-dev] [cfe-dev] Intrinsic llvm::isnan > > > > If the FP state is not made invisible/irrelevant by "fpexcept.ignore", > then why is that argument on this intrinsic call at all? > > Ie, why is that argument not used by the backend to decide to lower to a > CMP instruction or special isnan instruction or library call? > > > > An example would be helpful - in what case would these two lower > differently given that SNAN always raises a visible exception? > > call i1 @llvm.experimental.constrained.fcmp.f64(double %x, double %x, > metadata !"uno", metadata !"fpexcept.ignore") ; "floating-point exceptions > will be masked" > > call i1 @llvm.experimental.constrained.fcmp.f64(double %x, double %x, > metadata !"uno", metadata !"fpexcept.strict") ; "this mode can also be > used with code that unmasks FP exceptions" > > > > > > On Mon, Aug 23, 2021 at 9:50 AM Serge Pavlov <sepavloff at gmail.com> wrote: > > When codegen lowers this call, it does not know if this is a regular > compare and it may create CMP instruction, as for default environment, or > this is 'isnan' for which it should generate different code, which does not > influence FP state. > > > > On most architectures compare instructions change FP state, in default > environment it is just ignored, but actually hardware registers can be > modified, For 'isnan' instructions must actually leave FP state intact. > > > Thanks, > --Serge > > > > > > On Mon, Aug 23, 2021 at 8:43 PM Sanjay Patel <spatel at rotateright.com> > wrote: > > You're saying that the function definition text overrides the argument > definition text. Why are we choosing that interpretation rather than the > inverse (and documenting it one way or the other)? > > > > On Mon, Aug 23, 2021 at 9:38 AM Serge Pavlov <sepavloff at gmail.com> wrote: > > > How is this call in LLVM different than the semantics of "isnan(x)" that > is required by IEEE-754 or the C standard? > > > > If either of the arguments of `llvm.experimental.constrained.fcmp` is > signaling NaN, this function should raise an 'Invalid' exception. 'isnan' > never raises exceptions. > > > Thanks, > --Serge > > > > > > On Mon, Aug 23, 2021 at 8:10 PM Sanjay Patel <spatel at rotateright.com> > wrote: > > I'm confused about the definition of: > > > https://llvm.org/docs/LangRef.html#llvm-experimental-constrained-fcmp-and-llvm-experimental-constrained-fcmps-intrinsics > > > > These intrinsics require an "exception behavior" argument. That argument > can take the value “fpexcept.ignore” which is defined as: > > "optimization passes may assume that the exception status flags will not > be read and that floating-point exceptions will be masked" > > > > i1 @llvm.experimental.constrained.fcmp.f64(double %x, double %x, metadata > !"uno", metadata !"fpexcept.ignore") > > > > How is this call in LLVM different than the semantics of "isnan(x)" that > is required by IEEE-754 or the C standard? > > > > On Mon, Aug 23, 2021 at 9:00 AM Serge Pavlov via cfe-dev < > cfe-dev at lists.llvm.org> wrote: > > > On Mon, Aug 23, 2021 at 6:12 PM Roman Lebedev <lebedev.ri at gmail.com> > wrote: > > Thank you for posting the RFC! > > I do not believe we should conflate StrictFP support, and > `-ffast-math` handling, these are two separate/separatable concerns. > > > > You are right, they are separate, but they originate from the > implementation of the same function and can be solved with the same > solution. > > > > > As for the latter, right now i'm not convinced that we should > second-guess/override explicit user request. > This is inconsistent, and does not match how at least the GCC deals with > it. > I think changing the status-quo (before said patch) should be a separate > RFC, > and that change should be undone until after that RFC is accepted. > > > > Actually we have two explicit user requests, a call of 'isnan' and an > option '-ffast-math'. IMHO they do not contradict each other as 'isnan' is > not an arithmetic operation. There is a discussion in > https://reviews.llvm.org/D18513#387418, which also expresses the opinion > that limitations imposed by '-ffast-math' should be applied only to 'math' > functions but not to 'tests'. As for GCC behavior, they agree that this > behavior is a bag: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84949. > Intel and Microsoft compilers do not replace 'isnan' with assumed value. > > > > As for the latter, the main point of confusion is, > why is `@llvm.isnan` still used in non-StrictFP code? > > > > We have to introduce an intrinsic to represent `isnan` in strictfp > environment. It is natural to use it for the default environment as > well. Besides, a target may have a more efficient way to represent `isnan` > than unordered comparison. > > > > The argument that we need `@llvm.isnan` because we *might* transition > in and out of StrictFP section does not seem to hold for me, because > https://llvm.org/docs/LangRef.html#constrainedfp says: > > > If any FP operation in a function is constrained then they all must be > constrained. This is required for correct LLVM IR. > > > > There was no such intention. The primary motivation was strict fp > exceptions. > > > > > So presumably when codegen'ing a function, we already know that we > will use StrictFP ops, and that should be the knob to use `@llvm.isnan`, > i think. > > > Roman > > > > > On Mon, Aug 23, 2021 at 1:57 PM Serge Pavlov via cfe-dev > <cfe-dev at lists.llvm.org> wrote: > > > > Hi all, > > > > Some time ago a new intrinsic `llvm.isnan` was introduced, which is > intended to represent IEEE-754 operation `isNaN` as well as a family of C > library functions `isnan*`. Recently during post-commit review concern was > raised (see https://reviews.llvm.org/D104854) that this functionality > must have had RFC to make sure there is consensus on semantics. > > > > Previously the frontend intrinsic `__builtin_isnan` was converted into > `cmp uno` during IR generation in clang codegen. There are two main reasons > why this solution is not satisfactory. > > > > 1. Strict floating-point semantics. > > > > If FP exceptions are not ignored, `cmp uno` must be replaced with its > constrained counterpart, namely `llvm.experimental.constrained.fcmp` or > `llvm.experimental.constrained.fcmps`. None of them is compatible with the > semantics of `isnan`. Both IEEE-754 (5.7.2) an C standard ( > http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2596.pdf, F.3p6) demand > that this function does not raise floating point exceptions. Both the > constrained compare intrinsics raise an exception if either operand is a > SNAN (https://llvm.org/docs/LangRef.html#id1131). So there was no > target-independent IR construct that could express `isnan`. > > > > This drawback was significant enough and some attempts to alleviate it > were undertaken. In https://reviews.llvm.org/D95948 `isnan` was > implemented using integer operations in strictfp functions. It however is > not suitable for targets where a more efficient way exists, like dedicated > instruction. Another solution was implemented in > https://reviews.llvm.org/D96568, where a hook > 'clang::TargetCodeGenInfo::testFPKind' was introduced, which injects target > specific code into IR. Such a solution makes IR more target-dependent and > prevents some IR-level optimizations. > > > > 2. Compilation with -ffast-math > > > > The option '-ffast-math' is often used for performance critical code, as > it can produce faster code. In this case the user must ensure that NaNs are > not used as operand values. `isnan` is just proposed for such checks, but > it was unusable when `isnan` was represented by compare instruction, > because the latter may be optimized out. One of use cases is data in > memory, which is processed by a function compiled with `-ffast-math`. Some > items in the data are NaNs to denote absence of values. > > > > This point requires some remarks about using NaNs when a function is > compiled with `-ffast-math`. GCC manual does not specify how this option > works, it only states about `-ffinite-math-only` ( > https://gcc.gnu.org/onlinedocs/gcc-11.2.0/gcc/Optimize-Options.html#Optimize-Options > ): > > > > `Allow optimizations for floating-point arithmetic that assume that > arguments and results are not NaNs or +-Infs.` > > > > `isnan` does not do any arithmetic, only check, so this statement > apparently does not apply to it. There is a GCC bug report > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84949, where investigation > conforms that std::isnan() and std::fpclassify() should works with NaNs as > specified even in -ffast-math mode. > > > > Extending NaN restrictions in -ffast-math mode to functions like `isnan` > does not make code faster, but is a source of broken user expectations. If > a user writes `isnan` they usually expect an actual check. Silently > removing the check is a stronger action than assuming that float value > contains only real numbers. > > > > Intrinsic `llvm.isnan` solves these problems. It > > - represents the check throughout the IR pipeline and saves it from > undesired optimizations, > > - is lowered in selector, which can choose the most suitable > implementation for particular target, > > - helps keeping IR target-independent, > > - facilitates program analysis as the operation is presented explicitly > and is not hidden behind general nodes. > > > > Note that `llvm.isnan` is optimized out if its argument is an operation > with `nnan` flag, this behavior agrees with the definition of this flag in > LLVM documentation. > > > > Any feedback is welcome. > > > > Thanks, > > --Serge > > _______________________________________________ > > cfe-dev mailing list > > cfe-dev at lists.llvm.org > > https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev > > _______________________________________________ > cfe-dev mailing list > cfe-dev at lists.llvm.org > https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev > >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20210823/1b963cee/attachment.html>
James Y Knight via llvm-dev
2021-Aug-23 20:46 UTC
[llvm-dev] [cfe-dev] Intrinsic llvm::isnan
According to IEEE 754, the following operations are "non-computational", and should not signal if provided with an sNAN argument. I've annotated each with -> C function/macro equivalent, taken from the table in the C standard draft. enum class(source) -> fpclassify, signbit, issignaling boolean isSignMinus(source) -> signbit boolean isNormal(source) -> isnormal boolean isFinite(source) -> isfinite boolean isZero(source) -> iszero boolean isSubnormal(source) -> issubnormal boolean isInfinite(source) -> isinf boolean isNaN(source) -> isnan boolean isSignaling(source) -> issignaling boolean isCanonical(source) -> iscanonical enum radix(source) -> FLT_RADIX constant boolean totalOrder(source, source) -> totalorder boolean totalOrderMag(source, source) -> totalordermag (Decimal numbers:) boolean sameQuantum(source, source) -> samequantum The following list are "quiet-computational" operations, which also should not signal when provided with an sNAN: sourceFormat copy(source) -> memcpy, memmove,+(x) -- (Somewhat surprisingly, "x = y" is implementation-defined to be either a "copy" (non-signaling) or a "convertFormat" (signaling), even when x and y are the same type.) sourceFormat negate(source) -> -(x) sourceFormat abs(source) -> fabs (Decimal numbers): decimalEncoding encodeDecimal(decimal) -> encodedec decimal decodeDecimal(decimalEncoding) -> decodedec binaryEncoding encodeBinary(decimal) -> encodebin decimal decodeBinary(binaryEncoding) -> decodebin On Mon, Aug 23, 2021 at 3:42 PM Sanjay Patel via cfe-dev < cfe-dev at lists.llvm.org> wrote:> Ok, does this edit to the LangRef make sense for the definition of > "ignore": > "optimization passes may assume that the exception status flags will not > be read and that floating-point exceptions **will** be masked" --> > "optimization passes may assume that the exception status flags will not > be read and that floating-point exceptions **may** be masked" > > It's still not clear to me if there's a benefit from having an intrinsic > vs. one more exception mode ("none" or "off"). > That answer might depend on how many more of these intrinsics we'll need? > <cmath> has these: > isfinite() > isinf() > isnormal() > signbit() > > others? > > On Mon, Aug 23, 2021 at 2:25 PM Kaylor, Andrew <andrew.kaylor at intel.com> > wrote: > >> > Why would a target be allowed to lower the constrained fcmp intrinsic >> with "ignore" to an operation that might raise an exception? >> >> >> >> Because the “fpexcept.ignore” argument means that exceptions are being >> ignored. That is, it doesn’t matter whether exceptions are raised or not. >> It’s a promise that exceptions are not unmasked or being explicitly checked >> in this part of the code. >> >> >> >> The “fpexcept.ignore” argument isn’t used for the fully strict mode. It >> is used for two cases: >> >> >> >> 1) To get back to the “default” state (in combination with >> “fpround.tonearest”) when a non-constrained operation is inlined into a >> function that has constrained operations. >> >> 2) To enable dynamic rounding (-frounding-math) without requiring strict >> exception semantics. >> >> >> >> >> >> Regarding NaN comparison support in the presence of fast-math flags, I >> think this is an excellent reason to have the intrinsic. I needed this for >> a (non-C/C++) downstream compiler before the llvm.isnan intrinsic was >> introduced and found that the only ways to achieve it were either to >> introduce an intrinsic for the comparison or to remove the ‘nnan’ flag >> everywhere. >> >> >> >> -Andy >> >> >> >> >> >> *From:* llvm-dev <llvm-dev-bounces at lists.llvm.org> *On Behalf Of *Sanjay >> Patel via llvm-dev >> *Sent:* Monday, August 23, 2021 10:56 AM >> *To:* Wang, Pengfei <Pengfei.Wang at intel.com> >> *Cc:* llvm-dev at lists.llvm.org; Clang Dev <cfe-dev at lists.llvm.org> >> *Subject:* Re: [llvm-dev] [cfe-dev] Intrinsic llvm::isnan >> >> >> >> Why would a target be allowed to lower the constrained fcmp intrinsic >> with "ignore" to an operation that might raise an exception? >> >> If that scenario does not exist, then make the definition of fcmp >> "ignore" stronger by saying that it *requires* a lowering that does not >> raise an exception. >> >> If refining the definition of "ignore" in this case is too hard to >> reconcile with other FP ops, how about adding another exception mode >> ("none") to specify that exceptions are guaranteed not to be raised? >> >> >> >> On Mon, Aug 23, 2021 at 11:04 AM Wang, Pengfei <pengfei.wang at intel.com> >> wrote: >> >> AFAIU, the fpexcept metadata is just a hint for optimizations, which have >> to respect the sequence of FP instructions. “ignore” is not a command to >> ask backend to mask the exception. It just tells optimizations the >> exceptions are not concerned by user, so that the FP instructions can be >> scheduled. >> >> I think an non exception intrinsic is needed, because it is a commend to >> request backend not to emit instructions that may raise exception. >> >> >> >> Thanks >> >> Pengfei >> >> >> >> *From:* llvm-dev <llvm-dev-bounces at lists.llvm.org> *On Behalf Of *Sanjay >> Patel via llvm-dev >> *Sent:* Monday, August 23, 2021 10:13 PM >> *To:* Serge Pavlov <sepavloff at gmail.com> >> *Cc:* LLVM Developers <llvm-dev at lists.llvm.org>; Clang Dev < >> cfe-dev at lists.llvm.org> >> *Subject:* Re: [llvm-dev] [cfe-dev] Intrinsic llvm::isnan >> >> >> >> If the FP state is not made invisible/irrelevant by "fpexcept.ignore", >> then why is that argument on this intrinsic call at all? >> >> Ie, why is that argument not used by the backend to decide to lower to a >> CMP instruction or special isnan instruction or library call? >> >> >> >> An example would be helpful - in what case would these two lower >> differently given that SNAN always raises a visible exception? >> >> call i1 @llvm.experimental.constrained.fcmp.f64(double %x, double %x, >> metadata !"uno", metadata !"fpexcept.ignore") ; "floating-point exceptions >> will be masked" >> >> call i1 @llvm.experimental.constrained.fcmp.f64(double %x, double %x, >> metadata !"uno", metadata !"fpexcept.strict") ; "this mode can also be >> used with code that unmasks FP exceptions" >> >> >> >> >> >> On Mon, Aug 23, 2021 at 9:50 AM Serge Pavlov <sepavloff at gmail.com> wrote: >> >> When codegen lowers this call, it does not know if this is a regular >> compare and it may create CMP instruction, as for default environment, or >> this is 'isnan' for which it should generate different code, which does not >> influence FP state. >> >> >> >> On most architectures compare instructions change FP state, in default >> environment it is just ignored, but actually hardware registers can be >> modified, For 'isnan' instructions must actually leave FP state intact. >> >> >> Thanks, >> --Serge >> >> >> >> >> >> On Mon, Aug 23, 2021 at 8:43 PM Sanjay Patel <spatel at rotateright.com> >> wrote: >> >> You're saying that the function definition text overrides the argument >> definition text. Why are we choosing that interpretation rather than the >> inverse (and documenting it one way or the other)? >> >> >> >> On Mon, Aug 23, 2021 at 9:38 AM Serge Pavlov <sepavloff at gmail.com> wrote: >> >> > How is this call in LLVM different than the semantics of "isnan(x)" >> that is required by IEEE-754 or the C standard? >> >> >> >> If either of the arguments of `llvm.experimental.constrained.fcmp` is >> signaling NaN, this function should raise an 'Invalid' exception. 'isnan' >> never raises exceptions. >> >> >> Thanks, >> --Serge >> >> >> >> >> >> On Mon, Aug 23, 2021 at 8:10 PM Sanjay Patel <spatel at rotateright.com> >> wrote: >> >> I'm confused about the definition of: >> >> >> https://llvm.org/docs/LangRef.html#llvm-experimental-constrained-fcmp-and-llvm-experimental-constrained-fcmps-intrinsics >> >> >> >> These intrinsics require an "exception behavior" argument. That argument >> can take the value “fpexcept.ignore” which is defined as: >> >> "optimization passes may assume that the exception status flags will not >> be read and that floating-point exceptions will be masked" >> >> >> >> i1 @llvm.experimental.constrained.fcmp.f64(double %x, double %x, metadata >> !"uno", metadata !"fpexcept.ignore") >> >> >> >> How is this call in LLVM different than the semantics of "isnan(x)" that >> is required by IEEE-754 or the C standard? >> >> >> >> On Mon, Aug 23, 2021 at 9:00 AM Serge Pavlov via cfe-dev < >> cfe-dev at lists.llvm.org> wrote: >> >> >> On Mon, Aug 23, 2021 at 6:12 PM Roman Lebedev <lebedev.ri at gmail.com> >> wrote: >> >> Thank you for posting the RFC! >> >> I do not believe we should conflate StrictFP support, and >> `-ffast-math` handling, these are two separate/separatable concerns. >> >> >> >> You are right, they are separate, but they originate from the >> implementation of the same function and can be solved with the same >> solution. >> >> >> >> >> As for the latter, right now i'm not convinced that we should >> second-guess/override explicit user request. >> This is inconsistent, and does not match how at least the GCC deals with >> it. >> I think changing the status-quo (before said patch) should be a separate >> RFC, >> and that change should be undone until after that RFC is accepted. >> >> >> >> Actually we have two explicit user requests, a call of 'isnan' and an >> option '-ffast-math'. IMHO they do not contradict each other as 'isnan' is >> not an arithmetic operation. There is a discussion in >> https://reviews.llvm.org/D18513#387418, which also expresses the opinion >> that limitations imposed by '-ffast-math' should be applied only to 'math' >> functions but not to 'tests'. As for GCC behavior, they agree that this >> behavior is a bag: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84949. >> Intel and Microsoft compilers do not replace 'isnan' with assumed value. >> >> >> >> As for the latter, the main point of confusion is, >> why is `@llvm.isnan` still used in non-StrictFP code? >> >> >> >> We have to introduce an intrinsic to represent `isnan` in strictfp >> environment. It is natural to use it for the default environment as >> well. Besides, a target may have a more efficient way to represent `isnan` >> than unordered comparison. >> >> >> >> The argument that we need `@llvm.isnan` because we *might* transition >> in and out of StrictFP section does not seem to hold for me, because >> https://llvm.org/docs/LangRef.html#constrainedfp says: >> >> > If any FP operation in a function is constrained then they all must be >> constrained. This is required for correct LLVM IR. >> >> >> >> There was no such intention. The primary motivation was strict fp >> exceptions. >> >> >> >> >> So presumably when codegen'ing a function, we already know that we >> will use StrictFP ops, and that should be the knob to use `@llvm.isnan`, >> i think. >> >> >> Roman >> >> >> >> >> On Mon, Aug 23, 2021 at 1:57 PM Serge Pavlov via cfe-dev >> <cfe-dev at lists.llvm.org> wrote: >> > >> > Hi all, >> > >> > Some time ago a new intrinsic `llvm.isnan` was introduced, which is >> intended to represent IEEE-754 operation `isNaN` as well as a family of C >> library functions `isnan*`. Recently during post-commit review concern was >> raised (see https://reviews.llvm.org/D104854) that this functionality >> must have had RFC to make sure there is consensus on semantics. >> > >> > Previously the frontend intrinsic `__builtin_isnan` was converted into >> `cmp uno` during IR generation in clang codegen. There are two main reasons >> why this solution is not satisfactory. >> > >> > 1. Strict floating-point semantics. >> > >> > If FP exceptions are not ignored, `cmp uno` must be replaced with its >> constrained counterpart, namely `llvm.experimental.constrained.fcmp` or >> `llvm.experimental.constrained.fcmps`. None of them is compatible with the >> semantics of `isnan`. Both IEEE-754 (5.7.2) an C standard ( >> http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2596.pdf, F.3p6) demand >> that this function does not raise floating point exceptions. Both the >> constrained compare intrinsics raise an exception if either operand is a >> SNAN (https://llvm.org/docs/LangRef.html#id1131). So there was no >> target-independent IR construct that could express `isnan`. >> > >> > This drawback was significant enough and some attempts to alleviate it >> were undertaken. In https://reviews.llvm.org/D95948 `isnan` was >> implemented using integer operations in strictfp functions. It however is >> not suitable for targets where a more efficient way exists, like dedicated >> instruction. Another solution was implemented in >> https://reviews.llvm.org/D96568, where a hook >> 'clang::TargetCodeGenInfo::testFPKind' was introduced, which injects target >> specific code into IR. Such a solution makes IR more target-dependent and >> prevents some IR-level optimizations. >> > >> > 2. Compilation with -ffast-math >> > >> > The option '-ffast-math' is often used for performance critical code, >> as it can produce faster code. In this case the user must ensure that NaNs >> are not used as operand values. `isnan` is just proposed for such checks, >> but it was unusable when `isnan` was represented by compare instruction, >> because the latter may be optimized out. One of use cases is data in >> memory, which is processed by a function compiled with `-ffast-math`. Some >> items in the data are NaNs to denote absence of values. >> > >> > This point requires some remarks about using NaNs when a function is >> compiled with `-ffast-math`. GCC manual does not specify how this option >> works, it only states about `-ffinite-math-only` ( >> https://gcc.gnu.org/onlinedocs/gcc-11.2.0/gcc/Optimize-Options.html#Optimize-Options >> ): >> > >> > `Allow optimizations for floating-point arithmetic that assume that >> arguments and results are not NaNs or +-Infs.` >> > >> > `isnan` does not do any arithmetic, only check, so this statement >> apparently does not apply to it. There is a GCC bug report >> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84949, where investigation >> conforms that std::isnan() and std::fpclassify() should works with NaNs as >> specified even in -ffast-math mode. >> > >> > Extending NaN restrictions in -ffast-math mode to functions like >> `isnan` does not make code faster, but is a source of broken user >> expectations. If a user writes `isnan` they usually expect an actual check. >> Silently removing the check is a stronger action than assuming that float >> value contains only real numbers. >> > >> > Intrinsic `llvm.isnan` solves these problems. It >> > - represents the check throughout the IR pipeline and saves it from >> undesired optimizations, >> > - is lowered in selector, which can choose the most suitable >> implementation for particular target, >> > - helps keeping IR target-independent, >> > - facilitates program analysis as the operation is presented explicitly >> and is not hidden behind general nodes. >> > >> > Note that `llvm.isnan` is optimized out if its argument is an operation >> with `nnan` flag, this behavior agrees with the definition of this flag in >> LLVM documentation. >> > >> > Any feedback is welcome. >> > >> > Thanks, >> > --Serge >> > _______________________________________________ >> > cfe-dev mailing list >> > cfe-dev at lists.llvm.org >> > https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev >> >> _______________________________________________ >> cfe-dev mailing list >> cfe-dev at lists.llvm.org >> https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev >> >> _______________________________________________ > cfe-dev mailing list > cfe-dev at lists.llvm.org > https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20210823/3be5f0e8/attachment.html>
Arthur O'Dwyer via llvm-dev
2021-Aug-23 21:36 UTC
[llvm-dev] [cfe-dev] Intrinsic llvm::isnan
On Mon, Aug 23, 2021 at 3:42 PM Sanjay Patel via cfe-dev < cfe-dev at lists.llvm.org> wrote:> Ok, does this edit to the LangRef make sense for the definition of > "ignore": > "optimization passes may assume that the exception status flags will not > be read and that floating-point exceptions **will** be masked" --> > "optimization passes may assume that the exception status flags will not > be read and that floating-point exceptions **may** be masked" >I haven't been following the technical details, but in terms of the English documentation, it makes no sense to say that someone "may assume that [X] *may* happen." Either [X] always happens, in which case optimization passes may safely assume that it happens; or [X] never happens, in which case optimization passes may safely assume that it does not happen; or else [X] sometimes happens and sometimes doesn't, in which case optimizations passes *must not assume anything* about [X]. So you might say: "optimization passes may assume that the exception status flags will not be read. Floating-point exceptions might or might not be masked, depending on [____]" (and then mention the relevant variable, such as "instruction set" or "optimization level" or whatever). HTH, Arthur -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20210823/2426dbf4/attachment.html>