Add orch skill forward test evidence
@@ -122,6 +122,26 @@ Use these defaults unless a case file explicitly overrides them:
- keep the temporary DB, repo fixture, and working directory on failure for debugging
- clean up the temporary working directory on success only if the caller does not need replay artifacts
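
The two cleanup defaults above can be sketched as a small context manager. This is only an illustration of the policy, not code from the repository: the names `replay_workdir` and `keep_artifacts` and the directory prefix are hypothetical.

```python
import contextlib
import shutil
import sys
import tempfile
from pathlib import Path


@contextlib.contextmanager
def replay_workdir(keep_artifacts=False, prefix="orch-forward-test."):
    """Yield a temporary working directory (hypothetical helper).

    On failure the directory is always kept for debugging.
    On success it is removed unless the caller asked for replay artifacts.
    """
    workdir = Path(tempfile.mkdtemp(prefix=prefix))
    try:
        yield workdir
    except BaseException:
        # Failure path: leave the DB, repo fixture, and files in place.
        print(f"kept failed run at {workdir}", file=sys.stderr)
        raise
    else:
        # Success path: clean up only when nobody needs the artifacts.
        if not keep_artifacts:
            shutil.rmtree(workdir)
```

A case that wants to inspect its output afterwards would pass `keep_artifacts=True` and remove the directory itself once it is done.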
## Direct CLI Replay
The repository also includes a reusable direct replay runner at `scripts/run_orch_skill_forward_tests.sh`.

This runner executes the bundled `skills/orch/assets/orch` and `skills/inbox/assets/inbox` binaries against temporary SQLite DBs and Git fixtures without spawning Codex role agents.

Use it to validate packaged CLI behavior and record concrete evidence quickly, but do not treat it as a full replacement for the real subagent-forward model described above.

The case files in this directory now include recorded example runs captured through that direct replay path on `2026-03-19`.
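
As a rough illustration of what "recording concrete evidence" for one direct replay step could look like, the sketch below runs a single command in a working directory and appends the result to an evidence file. It does not reproduce the actual runner script: the helper name `record_replay_step` and the JSON Lines evidence format are assumptions, and the caller supplies the full command line, so no `orch` or `inbox` subcommands or flags are assumed here.

```python
import json
import subprocess
import sys
from datetime import datetime, timezone
from pathlib import Path


def record_replay_step(argv, workdir, evidence_path):
    """Run one CLI step and append its outcome to an evidence file.

    Hypothetical helper: the evidence layout (one JSON object per line)
    is an assumption, not the format used by the real runner.
    """
    result = subprocess.run(argv, cwd=workdir, capture_output=True, text=True)
    entry = {
        "recorded_at": datetime.now(timezone.utc).isoformat(),
        "argv": [str(a) for a in argv],
        "exit_code": result.returncode,
        "stdout": result.stdout,
        "stderr": result.stderr,
    }
    # Append so that a multi-step case accumulates one line per step.
    with open(evidence_path, "a", encoding="utf-8") as fh:
        fh.write(json.dumps(entry) + "\n")
    return entry
```

A non-zero `exit_code` is recorded rather than raised, so a case that expects a command to fail can still capture that failure as evidence.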
## Real Subagent Forward Runs
The five cases in this directory were also executed with real spawned role agents on `2026-03-19`.

That run used injected project-local `skills/orch/` and `skills/inbox/` bundles with a narrow-context fallback (`fork_context: false`) after an earlier wider-context attempt proved unreliable for this repo.

The successful evidence root for those runs was `/tmp/orch-skill-subagents.J1XWgs`.

Some longer cases used staged leader progression while keeping the same leader agent active across phases, so the run still exercised real agent-driven `orch` control flow instead of a main-thread direct replay.
## Per-Case Template
Each case file should use this structure: