Codestin Search App

adelejjeh · 2026-06-01T18:53:42Z

Guard the (int)fn & 0x3 quadrant index computation in trig reduction
functions against NaN input. fptosi NaN is UB in C and produces
poison in LLVM IR, which the compiler exploits during constant-folding
to return garbage from cos(inf), sin(inf), etc.

Fix: ret.i = BUILTIN_ISNAN(fn) ? 0 : ((int)fn & 0x3);

Applied to 7 locations across 6 files (trigredsmall F/D, trigred H,
trigpired F/D/H). Upstream PR llvm#201435 pattern matches the isnan
guard replacing it with the saturating intrinsic which removes the UB.

Verified: identical instruction count, all reproducer variants pass.

Fixes: LCOMPILER-2150

Guard the `(int)fn & 0x3` quadrant index computation in trig reduction functions against NaN input. `fptosi NaN` is UB in C and produces `poison` in LLVM IR, which the compiler exploits during constant-folding to return garbage from `cos(inf)`, `sin(inf)`, etc. Fix by adding an isnan check: `isnan(fn) ? 0 : ((int)fn & 0x3)`. The AMDGPU backend folds away the guard at codegen since v_cvt_i32_f32 already returns 0 for NaN (see llvm#200960). Fixes: LCOMPILER-2150 Co-Authored-By: Claude Opus 4.6 <[email protected]>

arsenm · 2026-06-02T17:14:50Z

    struct redret ret;
    ret.hi = MATH_MAD(t, -0.5, x);
-    ret.i = (int)t & 0x3;
+    ret.i = BUILTIN_ISNAN_F64(t) ? 0 : ((int)t & 0x3);


Can you rewrite this as is-inf-or-nan(x)? It's harder to prove that t isn't a nan based on the input, but only inf or nan inputs should result in nan results

I don't want this statement to result in any instructions besides the cvt_i32_f64 and similarly for the other types.

@b-sumner The upstream PR handles pattern matching the generated LLVM IR and replacing it with a single llvm.fptosi.sat

@arsenm if we change the check to check x instead of t it would make it harder to pattern match and replace with the saturating intrinsic.

ultimately, the pattern matches in instcombine will fold the resulting checks and we will result in the same, just an fptosi.sat instead of fptosi.

adelejjeh force-pushed the amd/dev/aejjeh/device-libs/fix-trig-nan-ub branch from 69edd27 to cf5e0cb Compare June 1, 2026 22:12

adelejjeh changed the title ~~device-libs: Use saturating float-to-int casts to avoid NaN UB~~ device-libs: Guard trig reduction quadrant index against NaN UB Jun 1, 2026

adelejjeh marked this pull request as ready for review June 1, 2026 22:22

adelejjeh requested review from b-sumner and lamb-j as code owners June 1, 2026 22:22

arsenm added the device-libs Related to Device Libraries label Jun 2, 2026

arsenm reviewed Jun 2, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

device-libs: Guard trig reduction quadrant index against NaN UB#2752

device-libs: Guard trig reduction quadrant index against NaN UB#2752
adelejjeh wants to merge 1 commit into
amd-stagingfrom
amd/dev/aejjeh/device-libs/fix-trig-nan-ub

adelejjeh commented Jun 1, 2026 •

edited

Loading

Uh oh!

arsenm Jun 2, 2026

Uh oh!

b-sumner Jun 2, 2026

Uh oh!

adelejjeh Jun 2, 2026 •

edited

Loading

Uh oh!

adelejjeh Jun 2, 2026

Uh oh!

adelejjeh Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

adelejjeh commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arsenm Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

b-sumner Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

adelejjeh Jun 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

adelejjeh Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

adelejjeh Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

adelejjeh commented Jun 1, 2026 •

edited

Loading

adelejjeh Jun 2, 2026 •

edited

Loading