feat(tools): #1246 #1247 #1248 — coding-agent feedback surface (research, tool guidance, ruff diagnostics) by ohdearquant · Pull Request #1261 · ohdearquant/lionagi

ohdearquant · 2026-06-03T18:32:37Z

Coding-harness feedback-surface slice from the lionagi-sweep show — the SWE-bench engagement thesis (harness > model). Critic APPROVE (CRIT:0 MAJ:0 MIN:3), claims verified against source; 63 tests pass; pre-commit ruff green.

Shipped

research: study open-source coding harnesses (OpenCode et al.) for harness-engineering wins #1246 — docs/research/coding-harnesses-2026-06.md: concrete findings from OpenCode + mini-swe-agent (loop, tool set, permission model, persistence) + a prioritized "fold into lionagi" list.
feat: tune reader/editor/bash tools — add the guidance our primitive tools lack #1248 — reader/editor/bash tool guidance: descriptions + error messages upgraded with recovery hints (failed edit → why + how to fix; bash error → next step), modeled on CC's Read/Edit prompts. (tools/file/reader.py, tools/file/editor.py, tools/code/bash.py)
feat: AST/static-analysis coding tools — give agents IDE-grade feedback #1247 (first slice) — code_check: a ruff-as-feedback diagnostic tool returning structured file:line:diagnostic an agent can act on, composable with the editor. (tools/code/check.py)

Deferred (drafted in the research doc)

feat: AST/static-analysis coding tools — give agents IDE-grade feedback #1247 remainder — ast-grep structural search, outline/navigation, parse-validation are designed in the research doc (:118-162) but not yet implemented. Leaving feat: AST/static-analysis coding tools — give agents IDE-grade feedback #1247 open for that follow-up.
3 MINOR parity/test-hardening items (non-blocking).

Closes #1246
Closes #1248
Refs #1247 (first slice shipped; AST remainder drafted)

🤖 Generated with Claude Code

…nostics Give the coding agent an IDE-grade feedback loop across reader/editor/bash and a new static-analysis check tool. - #1246: research OSS harnesses (OpenCode, mini-swe-agent) — agent loop, tool set, permission model, persistence. docs/research/coding-harnesses-2026-06.md with pinned commit SHAs, file:line citations, and a prioritized fold-in list. Drafts the remaining AST surface for the #1247 follow-up. - #1248: tune reader/editor/bash guidance — error messages now explain why a failure happened and how to recover (failed edit → line-prefix/whitespace diagnostics + re-read; bash → cwd= over `cd &&`, PATH, timeout, truncation). Tests assert the recovery text and that recovery paths work. - #1247: AST/static-analysis feedback tool — code_check shells out to ruff (optional dep, shutil.which guard, never raises) returning structured file:line:col diagnostics. Composes with the editor (edit → check). Test exercises a known-bad snippet end to end. Tests: uv run pytest tests/tools/test_check.py tests/tools/test_guidance.py tests/tools/test_reader.py — 63 passed. Critic verdict: APPROVE (CRIT:0 MAJ:0 MIN:3). Refs #1246 #1247 #1248 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

Sweep results: research tool, tool guidance, ruff diagnostics modules plus codebase-wide UP038/formatting fixes applied by the sweep. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

ohdearquant and others added 2 commits June 3, 2026 14:30

feat(tools): coding-agent feedback surface + codebase-wide lint fixes

4672027

Sweep results: research tool, tool guidance, ruff diagnostics modules plus codebase-wide UP038/formatting fixes applied by the sweep. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(tools): #1246 #1247 #1248 — coding-agent feedback surface (research, tool guidance, ruff diagnostics)#1261

feat(tools): #1246 #1247 #1248 — coding-agent feedback surface (research, tool guidance, ruff diagnostics)#1261
ohdearquant wants to merge 2 commits into
mainfrom
show/lionagi-sweep/coding-tools

ohdearquant commented Jun 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ohdearquant commented Jun 3, 2026

Shipped

Deferred (drafted in the research doc)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant