thr3ads.net - search: "bb509624"

Displaying 2 results from an estimated 2 matches for "bb509624".

[LLVMdev] [PATCH][RFC]: Add fmin/fmax intrinsics

2014 Sep 13

[LLVMdev] [PATCH][RFC]: Add fmin/fmax intrinsics

...not generally true. HLSL (DirectX), CUDA, OpenCL, and Metal all > have defined semantics for NaNs which include not propagating them through > min/max. GLSL (OpenGL) is the odd one out in this area. > > HLSL leaves it undefined: http://msdn.microsoft.com/en-us/library/windows/desktop/bb509624%28v=vs.85%29.aspx I guess Metal and others only have a "fast-math" flag which (among other things) makes behavior on NaN undefined, but it's my impression that it's a popular flag. > Also, as a practical issues, many GPUs have ISA-level support for the > IEEE-conforming v...

[LLVMdev] [PATCH][RFC]: Add fmin/fmax intrinsics

2014 Sep 12

[LLVMdev] [PATCH][RFC]: Add fmin/fmax intrinsics

> On Sep 12, 2014, at 10:27 AM, Dan Gohman <dan433584 at gmail.com> wrote: > > > More generally, I don’t see a compelling reason for LLVM to add intrinsic support for the version you’re proposing. Your choice can easily be expanded into IR, and does not have the wide hardware support (particularly in GPUs) that the IEEE version does. > > The IEEE version can also be

search for: bb509624