Commit ef8c8ad

Merge pull request #5 from 8thlight/eval/improve-rpi-skills
Evaluate and improve RPI pipeline skills
2 parents 9beb3e7 + 51e5ddf commit ef8c8ad

59 files changed: +519 -3795 lines changed

.github/workflows/bump-versions.yml

Lines changed: 1 addition & 1 deletion
@@ -25,7 +25,7 @@ jobs:
 run: |
 git config user.name "github-actions[bot]"
 git config user.email "github-actions[bot]@users.noreply.github.com"
-git add .claude-plugin/marketplace.json
+git add .claude-plugin/marketplace.json plugins/*/.claude-plugin/plugin.json
 git diff --staged --quiet && echo "No version changes" && exit 0
 git commit -m "Bump marketplace and plugin versions"
 git push
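
To check which files the widened `git add` pattern would pick up, here is a minimal Python sketch that mimics the glob expansion in a throwaway directory. It is an illustration only: the directory layout, the plugin names (`praxis`, `demo`), and the helpers `build_demo_repo` and `matching_manifests` are assumptions, and Python's `glob` stands in for the shell's expansion rather than reproducing git's own pathspec handling.

```python
import glob
import os
import tempfile

# Pattern copied from the workflow's `git add` line.
PATTERN = "plugins/*/.claude-plugin/plugin.json"


def matching_manifests(repo_root):
    """Return repo-relative paths matched by PATTERN, sorted, with forward slashes."""
    hits = glob.glob(os.path.join(repo_root, PATTERN))
    return sorted(os.path.relpath(h, repo_root).replace(os.sep, "/") for h in hits)


def build_demo_repo(root):
    """Create a hypothetical layout: two plugin manifests plus unrelated files."""
    for rel in (
        ".claude-plugin/marketplace.json",
        "plugins/praxis/.claude-plugin/plugin.json",
        "plugins/demo/.claude-plugin/plugin.json",
        "plugins/praxis/README.md",
    ):
        path = os.path.join(root, rel)
        os.makedirs(os.path.dirname(path), exist_ok=True)
        with open(path, "w") as fh:
            fh.write("{}\n")


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as root:
        build_demo_repo(root)
        for rel in matching_manifests(root):
            print(rel)
```

Only the per-plugin `plugin.json` files match the pattern; the marketplace manifest is still staged by its explicit path in the same command.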

AGENTS.md

Lines changed: 9 additions & 8 deletions
@@ -4,7 +4,7 @@ This file provides guidance to Claude Code when working with this repository.
 
 This project is a Claude Code plugin marketplace containing **skills** — structured markdown workflow documents. There is no application code or build system. Skills are validated by a three-layer test pipeline (see Testing Skills below).
 
-For project architecture, key concepts (RPI methodology, testing philosophy, hexagonal architecture), artifact lifecycle, and installation commands, see `docs/architecture.md`.
+The core workflow follows the **RPI methodology** (Research → Plan → Implement) for non-trivial features. Skills are validated by a three-layer test pipeline (see Testing Skills below).
 
 ## Skill Authoring Guidelines
 
@@ -32,16 +32,18 @@ allowed-tools: Read Glob Write
 - `triggers` — List of phrases that activate the skill. Use specific multi-word phrases; avoid bare single-word triggers that risk false activation.
 - `allowed-tools` — Space-separated list of Claude Code tools the skill may use.
 
-## Core Workflow (Plan Mode Native)
+## Core Workflow (RPI)
 
-The primary workflow aligns with Claude Code's native plan mode:
+The primary workflow flows seamlessly through three phases, with artifacts at `.light/sessions/` as handoff points:
 
-1. **Research** (outside plan mode) — Assess complexity, dispatch parallel subagents if warranted, write temporary artifact to `.light/sessions/`
-2. **Plan** (inside plan mode) — Draft behavior activates, produces plan summarizing research with Agent Context blocks
-3. **Execute** (`/implement`) — Creates task graph from approved plan, runs three-agent TDD orchestration, writes session artifact to `.light/sessions/`
+1. **Research** (`/research`) — Assess complexity, dispatch parallel subagents if warranted, write research artifact to `.light/sessions/`
+2. **Plan** (`/plan-tasks`) — Consumes research artifact, produces plan with Agent Context blocks + task graph
+3. **Execute** (`/implement`) — Executes task graph with three-agent TDD orchestration, writes session artifact to `.light/sessions/`
 4. **Post-execution** — code-review, simplify, reflect recommendations
 5. **Reflect** (`/reflect`) — Optional post-session learning loop
 
+Each phase flows directly into the next. Context clearing is only suggested when the conversation is extensive — the artifacts carry all needed context forward.
+
 ## Creating New Skills
 
 Use the Anthropic `/skill-creator` skill to guide you through creating new skills. It provides an interactive workflow for designing effective skill documents.
@@ -51,8 +53,7 @@ Before creating a skill, review the Skill Authoring Guidelines and Frontmatter F
 ### Where Skills Live
 
 ```
-plugins/praxis/skills/ — research, plan-tasks, implement, tdd, refactor, reflect
-plugins/harness/skills/ — adr, diagram, distill, hexagonal-architecture, scaffold
+plugins/praxis/skills/ — research, plan-tasks, implement, tdd, adr, reflect
 ```
 
 Each skill directory contains:

README.md

Lines changed: 17 additions & 21 deletions
@@ -9,10 +9,9 @@ A Claude Code plugin providing skills for agentic engineering patterns and pract
 ```
 /plugin marketplace add 8thlight/lightfactory
 /plugin install lightfactory@praxis
-/plugin install lightfactory@harness
 ```
 
-After installation, skills are available as `praxis:skill-name`, `harness:skill-name` and activate automatically when relevant to your task.
+After installation, skills are available as `praxis:skill-name` and activate automatically when relevant to your task.
 
 ### Install from Local Clone
 
@@ -23,48 +22,47 @@ git clone https://github.com/8thlight/lightfactory
 ```
 /plugin marketplace add /path/to/lightfactory
 /plugin install lightfactory@praxis
-/plugin install lightfactory@harness
 ```
 
 ### Update
 
 ```
 /plugin marketplace update lightfactory@praxis
-/plugin marketplace update lightfactory@harness
 ```
 
 ## Available Skills
 
-### Praxis Plugin: used for agentic engineering flow
+### Praxis Plugin
 
 | Skill | Command | Description |
 |-------|---------|-------------|
-| **tdd** | `/tdd` | Boundary-focused TDD workflow enforcing L3/L4 altitude testing and property-based tests |
 | **research** | `/research` | Spawns parallel subagents to explore a codebase and produce a compact research artifact |
 | **plan-tasks** | `/plan-tasks` | Consumes research artifact and produces a compact implementation plan with L3/L4 test specs |
 | **implement** | `/implement` | Executes an implementation plan phase by phase with strict test-first discipline |
-| **refactor** | `/refactor` | Refactoring process with test safety and incremental commits |
+| **tdd** | `/tdd` | Boundary-focused TDD workflow enforcing L3/L4 altitude testing and property-based tests |
+| **adr** | `/adr` | Guides writing minimal Architecture Decision Records |
 | **reflect** | `/reflect` | Post-session reflection that mines git history and artifacts to produce improvement proposals |
 
-#### RPI Methodology (Research → Plan → Implement)
+### RPI Methodology (Research → Plan → Implement)
 
-The light factory praxis plugin's core workflow for non-trivial features:
+The praxis plugin's core workflow for non-trivial features follows three phases:
 
-1. **Research** (`/research`) — Explore the codebase with parallel subagents, output a compact research artifact
-2. **Plan** (`/plan-tasks`) — Consume the research artifact, produce a compact implementation plan with test specs
+1. **Research** (`/research`) — Explore the codebase with parallel subagents, output a compact research artifact to `.light/sessions/`
+2. **Plan** (`/plan-tasks`) — Consume the research artifact, produce a compact implementation plan with test specs and Agent Context blocks
 3. **Implement** (`/implement`) — Execute the plan phase by phase with strict RED → GREEN → VALIDATE discipline
 
+#### Key Concepts
 
-### Harness Plugin: used for harness engineering flow
+- **Plan Mode Native** — The workflow aligns with Claude Code's native plan mode. Research happens outside plan mode; planning activates draft behavior inside plan mode; implementation executes the approved plan.
+- **Three-Agent TDD Isolation** — During implementation, each TDD phase dispatches isolated agents: `agent-test` (writes failing tests), `agent-impl` (writes minimal code to pass), and `agent-validate` (runs full test suite). Agents never modify each other's artifacts.
+- **Context Compaction** — Research produces a ~200-line artifact that carries forward into planning, replacing unbounded codebase exploration with a focused summary. Plan mode entry is the compaction boundary.
+- **Tracker Detection Chain** — The implement and plan-tasks skills auto-detect available task trackers: yaks (preferred) → beads (fallback) → native tasks (last resort).
 
+#### Standalone Skills
 
-| Skill | Command | Description |
-|-------|---------|-------------|
-| **diagram** | `/diagram` | Creates architecture diagrams (C4 structural, dynamic flows, data flow) with subtype dispatch |
-| **scaffold** | `/scaffold` | Scaffolds DDD projects from Gherkin feature files with language subtype dispatch |
-| **hexagonal-architecture** | `/hexagonal-architecture` | Applies hexagonal (ports & adapters) architecture with domain-first design |
-| **adr** | `/adr` | Guides writing minimal Architecture Decision Records |
-| **distill** | `/distill` | Distills an agentic engineering harness from a project with DDD structures and fitness tests |
+- **TDD** (`/tdd`) — For interactive, human-in-the-loop test-driven development outside the full RPI flow. Enforces boundary-focused testing at the L3/L4 altitude with ZOMBIES progression.
+- **ADR** (`/adr`) — Guides writing Architecture Decision Records following the Harmel-Law signal check pattern. Ensures decisions are actually made before documenting them.
+- **Reflect** (`/reflect`) — Post-session learning loop that mines git history and session artifacts to produce improvement proposals for skills, CLAUDE.md, and hooks.
 
 ## Testing
 
@@ -87,5 +85,3 @@ cd tests/evals && promptfoo eval
 ## Contributing
 
 See `AGENTS.md` for skill authoring guidelines, testing workflows, and the pre-ship checklist.
-
-See `docs/architecture.md` for project structure and key concepts.
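
The tracker detection chain named in the README's Key Concepts (yaks preferred, then beads, then native tasks) amounts to a first-available fallback. The sketch below is hypothetical: the diff does not show how the skills actually detect a tracker, so the executable names, the `native-tasks` label, and the PATH lookup via `shutil.which` are all assumptions made for illustration.

```python
import shutil
from typing import Callable, Optional

# Preference order taken from the README: yaks, then beads, then native tasks.
# The executable names on the right are assumptions, not taken from the skills.
TRACKER_EXECUTABLES = [("yaks", "yaks"), ("beads", "beads")]
FALLBACK = "native-tasks"  # hypothetical label for the last-resort option


def detect_tracker(which: Callable[[str], Optional[str]] = shutil.which) -> str:
    """Return the first tracker whose executable is found, else the fallback."""
    for tracker, executable in TRACKER_EXECUTABLES:
        if which(executable) is not None:
            return tracker
    return FALLBACK


if __name__ == "__main__":
    print(detect_tracker())
```

Passing the lookup function as a parameter keeps the preference order testable without installing any tracker, for example `detect_tracker(lambda name: None)` exercises the last-resort branch.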

docs/TODO.md

Lines changed: 0 additions & 11 deletions
This file was deleted.

docs/architecture.md

Lines changed: 0 additions & 57 deletions
This file was deleted.

plugins/harness/.claude-plugin/plugin.json

Lines changed: 0 additions & 9 deletions
This file was deleted.

plugins/harness/hooks/hooks.json

Lines changed: 0 additions & 6 deletions
This file was deleted.

plugins/harness/skills/README.md

Lines changed: 0 additions & 13 deletions
This file was deleted.
