fix(security): neutralize delimiter injection bypasses by gustavomenani · Pull Request #2395 · gsd-build/get-shit-done

gustavomenani · 2026-04-18T04:12:28Z

Fix PR

Linked Issue

Issue #2394 was created through the public bug flow and currently has the repository-default needs-triage label. A maintainer still needs to apply confirmed-bug to satisfy the repo's fix-template gate.

What was broken

scanForInjection() already classified delimiter-style prompt injection markers as malicious, but sanitizeForPrompt() did not neutralize the same family of markers. Raw <user> tags, whitespace-padded delimiter tags, and closing [/SYSTEM] / <</SYS>> markers could survive sanitization.

What this fix does

This makes sanitizeForPrompt() neutralize the full delimiter family that detection already recognizes:

adds user tags to the neutralizer
accepts whitespace before > in delimiter tags
rewrites closing [SYSTEM] / [INST] markers too
rewrites closing <</SYS>> markers too
adds regression coverage for each bypass case

Root cause

Detection and sanitization had drifted apart. The scanner accepted a broader delimiter family than the sanitizer rewrote, so callers that relied on sanitization alone could still persist prompt-boundary markers in user-controlled text.

Testing

How I verified the fix

Reproduced on main with a direct Node snippet: scanForInjection('<user>override</user>') returned clean: false while sanitizeForPrompt('<user>override</user>') returned the raw string unchanged.
Re-ran the focused suites after the fix:
- node --test tests/security.test.cjs tests/prompt-injection-scan.test.cjs
Re-ran full npm test on Windows to confirm no regressions from this patch. The full suite still has pre-existing Windows failures unrelated to this change:
- tests/few-shot-calibration.test.cjs
- tests/prune-orphaned-worktrees.test.cjs
- tests/skill-manifest.test.cjs

Regression test added?

Yes — added tests covering <user> tags, closing [SYSTEM] markers, and closing <</SYS>> markers

Platforms tested

Windows (including backslash path handling)
macOS
Linux
N/A (not platform-specific)

Runtimes tested

Checklist

Issue linked above with Fixes #2394
Linked issue has the confirmed-bug label
Fix is scoped to the reported bug — no unrelated changes included
Regression test added
All existing tests pass (npm test) — unrelated pre-existing Windows failures remain on main
CHANGELOG.md updated if this is a user-facing fix
No unnecessary dependencies added

Breaking changes

None

Summary by CodeRabbit

Release Notes

Bug Fixes
- Enhanced security protection against prompt-injection attacks by strengthening delimiter detection to catch a broader range of obfuscation techniques, including whitespace-padded tags and closing marker variants.
Tests
- Added and updated test coverage to verify the expanded prompt-injection safeguards.

coderabbitai · 2026-04-18T04:12:42Z

📝 Walkthrough

Walkthrough

This PR enhances security in the sanitizeForPrompt() function by expanding detection and neutralization of prompt-injection delimiter markers. Updates include handling <user> tags, whitespace-padded delimiters, and closing forms of bracket and angle-bracket markers. The changelog is updated and tests are added to verify the enhanced sanitization.

Changes

Cohort / File(s)	Summary
Documentation `CHANGELOG.md`	Added "Fixed" entry documenting the expanded `sanitizeForPrompt()` behavior to neutralize a broader set of delimiter markers including `<user>` tags, whitespace-padded variants, and closing markers.
Security Logic `get-shit-done/bin/lib/security.cjs`	Broadened sanitization regex patterns in `sanitizeForPrompt()`: extended XML/HTML tag neutralization to include `user` role, updated bracket marker handling to support optional slashes, and expanded angle-bracket marker matching for optional forward slashes. Updated obfuscation pattern messaging.
Test Coverage `tests/security.test.cjs`	Added test for whitespace-padded `<user>` tag neutralization; strengthened `[SYSTEM]` marker assertions to verify both opening and closing forms; expanded `<<SYS>>` test with closing marker variant and dual occurrence validation.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Possibly related issues

sanitizeForPrompt leaves delimiter injection markers intact for <user>, spaced tags, and closing markers #2394: This PR directly implements the requested security enhancements—expanding sanitizeForPrompt() regexes to neutralize <user> tags, whitespace-padded delimiters, and closing [\/SYSTEM]/[\/INST]/<</SYS>> markers with comprehensive test coverage.

Poem

🐰 Prompts were slipping through the gate,
User tags and slashes bold,
But now our regex stands so straight,
Injections neutralized, controlled!
The bunny hops with glee—secure at last! ✨

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title concisely and accurately describes the main change: broadening delimiter sanitization to prevent injection bypasses in sanitizeForPrompt().
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Description check	✅ Passed	The PR description uses the correct fix template with all required sections: linked issue, problem statement, solution details, root cause, testing verification with platform coverage, regression tests, and completed checklist.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Copilot

Pull request overview

This PR closes a security gap where sanitizeForPrompt() did not neutralize several delimiter-style prompt injection markers that scanForInjection() already flags, allowing prompt-boundary tokens to persist in user-controlled text (Fixes #2394).

Changes:

Expand sanitizeForPrompt() neutralization to cover <user> tags, whitespace-padded delimiter tags, closing [/SYSTEM]/[/INST], and closing <</SYS>>.
Update injection finding messaging and add targeted regression tests for the bypass cases.
Document the security fix in CHANGELOG.md.

Reviewed changes

Copilot reviewed 2 out of 3 changed files in this pull request and generated 1 comment.

File	Description
`tests/security.test.cjs`	Adds regression tests covering `<user>` tags, closing bracket markers, and closing `<</SYS>>`.
`get-shit-done/bin/lib/security.cjs`	Extends delimiter neutralization patterns in `sanitizeForPrompt()` and adjusts an injection finding message.
`CHANGELOG.md`	Adds an Unreleased “Fixed” entry describing the prompt sanitization hardening.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-18T04:15:50Z


 ### Fixed
 - **Installer now installs `@gsd-build/sdk` automatically** so `gsd-sdk` lands on PATH. Resolves `command not found: gsd-sdk` errors that affected every `/gsd-*` command after a fresh install or `/gsd-update` to 1.36+. Adds `--no-sdk` to opt out and `--sdk` to force reinstall. Implements the `--sdk` flag that was previously documented in README but never wired up (#2385)
+- **Prompt sanitization now neutralizes the full delimiter family it already classifies as injection** — `<user>` tags, whitespace-padded delimiter tags, closing `[\/SYSTEM]` / `[\/INST]` markers, and closing `<</SYS>>` markers no longer pass through `sanitizeForPrompt()`. Fixes a security gap where malicious commit messages or other user-controlled text could retain prompt-boundary markers despite being "sanitized" (#2394)


The changelog text includes bracket markers as [\/SYSTEM] / [\/INST]. In Markdown this likely renders with the backslashes visible (since / doesn’t need escaping), which makes the example tokens inaccurate. Consider changing these to [/SYSTEM] / [/INST] (and similarly for any other escaped slashes) so the entry matches the actual markers being neutralized.

Suggested change

- **Prompt sanitization now neutralizes the full delimiter family it already classifies as injection** — `<user>` tags, whitespace-padded delimiter tags, closing `[\/SYSTEM]` / `[\/INST]` markers, and closing `<</SYS>>` markers no longer pass through `sanitizeForPrompt()`. Fixes a security gap where malicious commit messages or other user-controlled text could retain prompt-boundary markers despite being "sanitized" (#2394)

- **Prompt sanitization now neutralizes the full delimiter family it already classifies as injection** — `<user>` tags, whitespace-padded delimiter tags, closing `[/SYSTEM]` / `[/INST]` markers, and closing `<</SYS>>` markers no longer pass through `sanitizeForPrompt()`. Fixes a security gap where malicious commit messages or other user-controlled text could retain prompt-boundary markers despite being "sanitized" (#2394)

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (2)

tests/security.test.cjs (1)

229-236: Add an explicit [/INST] regression assertion to match the new sanitizer scope.

sanitizeForPrompt() now rewrites both SYSTEM and INST closing forms, but this block only asserts SYSTEM closings.

✅ Suggested test extension

   test('neutralizes [SYSTEM] markers', () => {
     const input = 'Text [SYSTEM] override [/SYSTEM]';
     const result = sanitizeForPrompt(input);
     assert.ok(!result.includes('[SYSTEM]'));
     assert.ok(result.includes('[SYSTEM-TEXT]'));
     assert.ok(!result.includes('[/SYSTEM]'));
     assert.ok(result.includes('[/SYSTEM-TEXT]'));
   });
+
+  test('neutralizes [INST] markers including closing form', () => {
+    const input = 'Text [INST] override [/INST]';
+    const result = sanitizeForPrompt(input);
+    assert.ok(!result.includes('[INST]'));
+    assert.ok(result.includes('[INST-TEXT]'));
+    assert.ok(!result.includes('[/INST]'));
+    assert.ok(result.includes('[/INST-TEXT]'));
+  });

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/security.test.cjs` around lines 229 - 236, The test "neutralizes
[SYSTEM] markers" needs an additional assertion for the rewritten
installer-style closing token: update the test that calls
sanitizeForPrompt(input) (the neutralizes [SYSTEM] markers case) to also assert
that the sanitized result no longer contains '[/INST]' and does contain the
rewritten '[/INST-TEXT]' form; this uses the same sanitizeForPrompt call and
mirrors the existing checks for '[/SYSTEM]'/'[/SYSTEM-TEXT]'.

get-shit-done/bin/lib/security.cjs (1)

248-258: Consider centralizing delimiter definitions to avoid detection/sanitization drift.

The delimiter family is still spread across multiple regex sets (e.g., Line 142, Line 145, Line 165, and these sanitizer replacements). A shared source of truth would reduce future bypass regressions.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@get-shit-done/bin/lib/security.cjs` around lines 248 - 258, Centralize the
delimiter/marker patterns used by the sanitizer by extracting the various
literals and regexes (e.g., the patterns used in the sanitized.replace calls for
roles /<(\/?)(system|assistant|human|user)\s*>/, bracket markers
/\[(\/?)(SYSTEM|INST)\]/, and SYS markers /<<\s*\/?\s*SYS\s*>>/) into a single
shared set of constants or a factory (e.g., DELIMITER_PATTERNS or
buildDelimiterRegexes) and have the replacement logic in sanitize() reference
those constants instead of inline regexes; update all usages (including the
existing sanitized.replace calls and the other places noted in the review) to
pull from that shared source so tests and future changes only need to update one
canonical definition to avoid drift.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@get-shit-done/bin/lib/security.cjs`:
- Line 166: The detection message for the delimiter-injection rule is missing
the word "human" even though the regex still matches it; update the message
string (the object property named message in get-shit-done/bin/lib/security.cjs
that currently reads 'Delimiter injection pattern: system/assistant/user-style
tag detected') to include "human" (e.g., 'Delimiter injection pattern:
system/assistant/user/human-style tag detected' or similar) so the message
accurately reflects all matched tags.

---

Nitpick comments:
In `@get-shit-done/bin/lib/security.cjs`:
- Around line 248-258: Centralize the delimiter/marker patterns used by the
sanitizer by extracting the various literals and regexes (e.g., the patterns
used in the sanitized.replace calls for roles
/<(\/?)(system|assistant|human|user)\s*>/, bracket markers
/\[(\/?)(SYSTEM|INST)\]/, and SYS markers /<<\s*\/?\s*SYS\s*>>/) into a single
shared set of constants or a factory (e.g., DELIMITER_PATTERNS or
buildDelimiterRegexes) and have the replacement logic in sanitize() reference
those constants instead of inline regexes; update all usages (including the
existing sanitized.replace calls and the other places noted in the review) to
pull from that shared source so tests and future changes only need to update one
canonical definition to avoid drift.

In `@tests/security.test.cjs`:
- Around line 229-236: The test "neutralizes [SYSTEM] markers" needs an
additional assertion for the rewritten installer-style closing token: update the
test that calls sanitizeForPrompt(input) (the neutralizes [SYSTEM] markers case)
to also assert that the sanitized result no longer contains '[/INST]' and does
contain the rewritten '[/INST-TEXT]' form; this uses the same sanitizeForPrompt
call and mirrors the existing checks for '[/SYSTEM]'/'[/SYSTEM-TEXT]'.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro Plus

Run ID: 003cae10-1d16-439e-8639-d8bb1ef300c5

📥 Commits

Reviewing files that changed from the base of the PR and between 28d6649 and ec753f7.

📒 Files selected for processing (3)

CHANGELOG.md
get-shit-done/bin/lib/security.cjs
tests/security.test.cjs

coderabbitai · 2026-04-18T04:16:32Z

  {
    pattern: /<\/?(system|human|assistant|user)\s*>/i,
-    message: 'Delimiter injection pattern: <system>/<assistant>/<user> tag detected',
+    message: 'Delimiter injection pattern: system/assistant/user-style tag detected',


⚠️ Potential issue | 🟡 Minor

Detection message is now missing human although the regex still matches it.

This can mislead findings triage and test expectations.

💡 Suggested text fix

- message: 'Delimiter injection pattern: system/assistant/user-style tag detected', + message: 'Delimiter injection pattern: system/assistant/human/user-style tag detected',

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

message: 'Delimiter injection pattern: system/assistant/user-style tag detected',

message: 'Delimiter injection pattern: system/assistant/human/user-style tag detected',

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@get-shit-done/bin/lib/security.cjs` at line 166, The detection message for the delimiter-injection rule is missing the word "human" even though the regex still matches it; update the message string (the object property named message in get-shit-done/bin/lib/security.cjs that currently reads 'Delimiter injection pattern: system/assistant/user-style tag detected') to include "human" (e.g., 'Delimiter injection pattern: system/assistant/user/human-style tag detected' or similar) so the message accurately reflects all matched tags.

trek-e

The fix is correct and complete. Verified by running the sanitizer against all reported payloads — all four bypass cases are neutralized.

The detection/sanitization gap in #2394 is real and the approach taken (extending the regex patterns to cover the full delimiter family the scanner already recognizes) is the right fix. Regression tests cover each of the four bypass variants.

One minor note on the test coverage: the new [/SYSTEM] test checks SYSTEM-TEXT but the existing test for [SYSTEM] (line 221-226, before the regression block) was not updated to assert [/SYSTEM] is also neutralized. The regression block at line 249 covers this separately. No action required — coverage is there, just not co-located with the original test.

The detection message update ("system/assistant/user-style tag detected" — omitting "human") is a minor inaccuracy flagged by CodeRabbit. The scanner regex still catches human, only the message text is incomplete. Not a blocker.

Approve.

trek-e · 2026-04-20T03:43:34Z

CodeRabbit finding audit — findings evaluated.

Three CodeRabbit findings on this PR:

Finding 1 — Detection message omits "human" tag (VALID, minor)

The regex at line 165 matches system|human|assistant|user but the message string reads "system/assistant/user-style tag detected" — omitting "human". The scanner correctly catches <human> tags but the message misleads when that specific tag fires. This is a documentation accuracy issue, not a detection gap. The fix is one word: change "user-style" to "human/user-style" in the message string.

Finding 2 — Missing [/INST] test assertion (VALID, minor)

The test at line 229-236 asserts [SYSTEM] and [/SYSTEM] neutralization. sanitizeForPrompt() now also handles [/INST] (closing form), but there is no test asserting that [/INST] is rewritten to [/INST-TEXT]. The existing detection test at line 149 covers [INST] opening form for scanning, but not sanitization of the closing form. A test that inputs 'Text [INST] override [/INST]' and asserts both forms are neutralized is missing.

Finding 3 — Centralize delimiter patterns (NITPICK, optional)

The suggestion to extract shared delimiter constants is a valid refactoring direction but not a correctness issue. The current approach (inline regexes) is functional. This would be a follow-up refactor, not a blocker.

trek-e adversarial review approved this PR. Findings 1 and 2 are small fixes that can be applied in a follow-up commit before merge.

gustavomenani · 2026-04-20T04:16:02Z

Thanks for the detailed audit.

Agreed. Findings 1 and 2 are valid minor fixes and should be addressed before merge. Finding 3 makes sense as a follow-up refactor, but should not block the security fix.

fix(security): neutralize delimiter injection bypasses

ec753f7

Copilot AI review requested due to automatic review settings April 18, 2026 04:12

gustavomenani requested a review from glittercowboy as a code owner April 18, 2026 04:12

gustavomenani mentioned this pull request Apr 18, 2026

sanitizeForPrompt leaves delimiter injection markers intact for <user>, spaced tags, and closing markers #2394

Closed

Copilot started reviewing on behalf of gustavomenani April 18, 2026 04:12 View session

Copilot AI reviewed Apr 18, 2026

View reviewed changes

coderabbitai bot reviewed Apr 18, 2026

View reviewed changes

trek-e approved these changes Apr 20, 2026

View reviewed changes

trek-e added review: approved (merge conflict) PR approved but has merge conflicts — author must rebase area: core PROJECT.md, REQUIREMENTS.md, templates bug Something isn't working labels Apr 20, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(security): neutralize delimiter injection bypasses#2395

fix(security): neutralize delimiter injection bypasses#2395
gustavomenani wants to merge 1 commit intogsd-build:mainfrom
gustavomenani:fix/sanitize-delimiter-bypass

gustavomenani commented Apr 18, 2026 •

edited

Loading

Uh oh!

coderabbitai bot commented Apr 18, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Possibly related issues

Poem

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 18, 2026

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Apr 18, 2026

Uh oh!

trek-e left a comment

Uh oh!

trek-e commented Apr 20, 2026

Uh oh!

gustavomenani commented Apr 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	message: 'Delimiter injection pattern: system/assistant/user-style tag detected',
	message: 'Delimiter injection pattern: system/assistant/human/user-style tag detected',

Uh oh!

Conversation

gustavomenani commented Apr 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Fix PR

Linked Issue

What was broken

What this fix does

Root cause

Testing

How I verified the fix

Regression test added?

Platforms tested

Runtimes tested

Checklist

Breaking changes

Summary by CodeRabbit

Release Notes

Uh oh!

coderabbitai bot commented Apr 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related issues

Poem

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Apr 18, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Apr 18, 2026

Choose a reason for hiding this comment

Uh oh!

trek-e left a comment

Choose a reason for hiding this comment

Uh oh!

trek-e commented Apr 20, 2026

Uh oh!

gustavomenani commented Apr 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gustavomenani commented Apr 18, 2026 •

edited

Loading

coderabbitai bot commented Apr 18, 2026 •

edited

Loading