[clang][BoundsSafety][UBSan] Fix false-positive UBSan pointer overflow by usama54321 · Pull Request #12846 · swiftlang/llvm-project

usama54321 · 2026-04-27T21:06:23Z

check for pointer subtraction

When -fbounds-safety and -fsanitize=pointer-overflow are both enabled, pointer subtraction unconditionally triggers a UBSan pointer overflow false positive. EmitCheckedBoundPointerArithmetic was passing the raw positive index to EmitCheckedInBoundsGEP without negating it first, causing the check to verify (base + offset) ule base instead of (base - offset) ule base, which is always false for any positive offset.

All other callers of EmitCheckedInBoundsGEP pre-negate the index for subtraction. Fix this by negating the index with CreateNeg before passing it to EmitCheckedInBoundsGEP when IsSub is true.

rdar://173944149

usama54321 · 2026-04-27T21:06:41Z

@swift-ci please test

check for pointer subtraction When -fbounds-safety and -fsanitize=pointer-overflow are both enabled, pointer subtraction unconditionally triggers a UBSan pointer overflow false positive. EmitCheckedBoundPointerArithmetic was passing the raw positive index to EmitCheckedInBoundsGEP without negating it first, causing the check to verify (base + offset) ule base instead of (base - offset) ule base, which is always false for any positive offset. All other callers of EmitCheckedInBoundsGEP pre-negate the index for subtraction. Fix this by negating the index with CreateNeg before passing it to EmitCheckedInBoundsGEP when IsSub is true. rdar://173944149

usama54321 · 2026-04-27T21:08:05Z

@swift-ci please test

rapidsna

Thanks! LGTM

rapidsna · 2026-04-27T22:36:48Z

@swift-ci test llvm

rapidsna · 2026-04-27T22:38:54Z

FYI. For llvm changes in next, we just run @swift-ci test llvm, which I just invoked. swift-ci test doesn't make sense for next anyway because there's no matching swift tag for it.

hnrklssn · 2026-04-28T19:04:32Z

+// CHECK: %[[OFFSET:[a-z0-9]+]] = load i32
+// CHECK: %[[NEG:[a-z0-9.]+]] = sub i32 0, %[[OFFSET]]
+// CHECK: getelementptr{{.*}} %[[NEG]]
+// CHECK: %[[EXT:[a-z0-9.]+]] = sext i32 %[[NEG]] to i64
+// CHECK: call { i64, i1 } @llvm.smul.with.overflow.i64(i64 1, i64 %[[EXT]])


Do we want to use update_cc_test_checks?

I think this would be a good idea so that this test is easier to update when upstream inevitably changes the IR.

Note that this is an -O0 test, so the fragility is lower than the -O2 tests

I would rather keep this as is. I have not used that script in the past, but it is creating around 30 CHECK lines which I think is much more brittle than this.

delcypher

All other callers of EmitCheckedInBoundsGEP pre-negate the index for subtraction. Fix this by negating the index with CreateNeg before passing it to EmitCheckedInBoundsGEP when IsSub is true.

Do you have concrete examples? When I look over the code it's not obvious to me that this is true.

E.g:

    // For everything else, we can just do a simple increment.
    } else {
      llvm::Value *amt = Builder.getInt32(amount);
      llvm::Type *elemTy = CGF.ConvertTypeForMem(type);
      if (CGF.getLangOpts().PointerOverflowDefined)
        value = Builder.CreateGEP(elemTy, value, amt, "incdec.ptr");
      else
        value = CGF.EmitCheckedInBoundsGEP(
            elemTy, value, amt, /*SignedIndices=*/false, isSubtraction,
            E->getExprLoc(), "incdec.ptr");
    }

delcypher · 2026-04-29T18:42:58Z

  EmitBoundPointerArithmetic(DestLV, BaseLV, Idx, IsSigned, IsSub);

+  if (IsSub) {
+    Idx = Builder.CreateNeg(Idx, "idx.neg");


So I'm a little confused by this. IsSub is already being passed to the IsSubtraction parameter of EmitCheckedInBoundsGEP so the method knows we are doing a subtraction so it seems surprising that we would need to negate Idx.

Are we passing the wrong value to the SignedIndices parameter of EmitCheckedInBoundsGEP?

I was pretty confused by this as well and looked at a number of callsites. Looking at the complete code you pasted above, it negates the index in the first case, does not in the second case (which might be a bug?). The last case is just a plain increment at least according to the comment.

I see the negation in other places as well.

if (const VariableArrayType *vla = CGF.getContext().getAsVariableArrayType(type)) { llvm::Value *numElts = CGF.getVLASize(vla).NumElts; if (!isInc) numElts = Builder.CreateNSWNeg(numElts, "vla.negsize"); llvm::Type *elemTy = CGF.ConvertTypeForMem(vla->getElementType()); if (CGF.getLangOpts().PointerOverflowDefined) value = Builder.CreateGEP(elemTy, value, numElts, "vla.inc"); else value = CGF.EmitCheckedInBoundsGEP( elemTy, value, numElts, /*SignedIndices=*/false, isSubtraction, E->getExprLoc(), "vla.inc"); // Arithmetic on function pointers (!) is just +-1. } else if (type->isFunctionType()) { llvm::Value *amt = Builder.getInt32(amount); if (CGF.getLangOpts().PointerOverflowDefined) value = Builder.CreateGEP(CGF.Int8Ty, value, amt, "incdec.funcptr"); else value = CGF.EmitCheckedInBoundsGEP(CGF.Int8Ty, value, amt, /*SignedIndices=*/false, isSubtraction, E->getExprLoc(), "incdec.funcptr"); // For everything else, we can just do a simple increment. } else { llvm::Value *amt = Builder.getInt32(amount); llvm::Type *elemTy = CGF.ConvertTypeForMem(type); if (CGF.getLangOpts().PointerOverflowDefined) value = Builder.CreateGEP(elemTy, value, amt, "incdec.ptr"); else value = CGF.EmitCheckedInBoundsGEP( elemTy, value, amt, /*SignedIndices=*/false, isSubtraction, E->getExprLoc(), "incdec.ptr"); }

SignedIndices is independent of this. For example in the test I wrote, SignedIndices = false and IsSub = true, while if i replace ptr - offset; with ptr - 3; SignedIndices = true and IsSub = false (which makes sense).

I will double check this again and add some more tests

The last case is just a plain increment at least according to the comment.

I think that comment is misleading because
isSubtraction is bool isSubtraction = !isInc; on that path I think so this can still be a decrement.

SignedIndices = false and IsSub = true, while if i replace ptr - offset; with ptr - 3; SignedIndices = true and IsSub = false (which makes sense).

That interesting. Sounds like when we have a constant we effectively are treating this as ptr + (-3).

I'm still very suspicious because given that we are telling EmitCheckedInBoundsGEP:

We are doing subtraction

The index is an unsigned value

would suggest there isn't a need to negate the index at all. However a quick glance at the implementation shows:

Value * CodeGenFunction::EmitCheckedInBoundsGEP(llvm::Type *ElemTy, Value *Ptr, ArrayRef<Value *> IdxList, bool SignedIndices, bool IsSubtraction, SourceLocation Loc, const Twine &Name) { llvm::Type *PtrTy = Ptr->getType(); llvm::GEPNoWrapFlags NWFlags = llvm::GEPNoWrapFlags::inBounds(); if (!SignedIndices && !IsSubtraction) NWFlags |= llvm::GEPNoWrapFlags::noUnsignedWrap(); Value *GEPVal = Builder.CreateGEP(ElemTy, Ptr, IdxList, Name, NWFlags); // stuff that does use `IsSubtraction` for computing overflow checking. return GEPVal;

It seems at least for the computed value we return, the IsSubtraction isn't really used other than setting the wrapping flags. This makes it seem like IsSubtraction is a bit of a footgun.

Yes this seems to be bad API design, and fixing it would mean updating all the callsites. If you want I can create a radar and fix that upstream but we would also need a separate patch in this repo for bounds-safety usages. What would you recommend?

The origin of this seems to be from https://reviews.llvm.org/D34121.

So perhaps this is expected behavior because the doxygen comments say

/// Same as IRBuilder::CreateInBoundsGEP, but additionally emits a check to /// detect undefined behavior when the pointer overflow sanitizer is enabled. /// \p SignedIndices indicates whether any of the GEP indices are signed. /// \p IsSubtraction indicates whether the expression used to form the GEP /// is a subtraction. llvm::Value *EmitCheckedInBoundsGEP(llvm::Type *ElemTy, llvm::Value *Ptr, ArrayRef<llvm::Value *> IdxList, bool SignedIndices, bool IsSubtraction, SourceLocation Loc, const Twine &Name = "");

and IRBuilder::CreateInBoundsGEP doesn't have any notion of subtraction. Everything is an addition and I think you have to manually negate any value in the IdxList first if you want to do negative indexing. Also because IdxList is a list and not a single llvm::Value* which of those should be subtracted?

That being said this is pretty confusing. I think we should at the least file an issue upstream and try to clean this up in a separate PR.

I don't want to block fixing the -fbounds-safety/ubsan bug on that though.

Oh something to think about here too is how this interacts with SanitizerKind::ArrayBounds and if that expects Idx to be negated or not.

Yep that also expects the index to be negated. I took a look at the code/few callsites.

I will update with more tests and resolve this conversation afterwards

usama54321 requested review from danliew-apple and rapidsna April 27, 2026 21:06

usama54321 force-pushed the eng/173944149 branch from 6ccae21 to 74becf4 Compare April 27, 2026 21:07

rapidsna requested review from hnrklssn and ojhunt April 27, 2026 21:10

rapidsna approved these changes Apr 27, 2026

View reviewed changes

hnrklssn reviewed Apr 28, 2026

View reviewed changes

delcypher self-requested a review April 29, 2026 18:14

delcypher requested changes Apr 29, 2026

View reviewed changes

delcypher added the clang:bounds-safety Issue relating to the experimental -fbounds-safety feature in Clang label Apr 29, 2026

Conversation

usama54321 commented Apr 27, 2026

Uh oh!

usama54321 commented Apr 27, 2026

Uh oh!

usama54321 commented Apr 27, 2026

Uh oh!

rapidsna left a comment

Choose a reason for hiding this comment

Uh oh!

rapidsna commented Apr 27, 2026

Uh oh!

rapidsna commented Apr 27, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

delcypher left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

delcypher Apr 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

delcypher left a comment •

edited

Loading

delcypher Apr 29, 2026 •

edited

Loading