Add orch skill forward test evidence

2026-03-19 18:36:31 +08:00
parent d17b5ebfbd
commit e9cbb15c2d
10 changed files with 1036 additions and 0 deletions
@@ -122,6 +122,26 @@ Use these defaults unless a case file explicitly overrides them:
 - keep the temporary DB, repo fixture, and working directory on failure for debugging
 - cleanup the temporary working directory on success only if the caller does not need replay artifacts

+## Direct CLI Replay
+
+The repository also includes a reusable direct replay runner at `scripts/run_orch_skill_forward_tests.sh`.
+
+This runner executes the bundled `skills/orch/assets/orch` and `skills/inbox/assets/inbox` binaries against temporary SQLite DBs and Git fixtures without spawning Codex role agents.
+
+Use it to validate packaged CLI behavior and record concrete evidence quickly, but do not treat it as a full replacement for the real subagent-forward model described above.
+
+The case files in this directory now include recorded example runs captured through that direct replay path on `2026-03-19`.
+
+## Real Subagent Forward Runs
+
+The five cases in this directory were also executed with real spawned role agents on `2026-03-19`.
+
+That run used injected project-local `skills/orch/` and `skills/inbox/` bundles with a narrow-context fallback (`fork_context: false`) after an earlier wider-context attempt proved unreliable for this repo.
+
+The successful evidence root for those runs was `/tmp/orch-skill-subagents.J1XWgs`.
+
+Some longer cases used staged leader progression while keeping the same leader agent active across phases so the run still exercised real agent-driven `orch` control flow instead of a main-thread direct replay.
+
 ## Per-Case Template

 Each case file should use this structure: