gstack/test
Garry Tan 219599076f
feat: Codex E2E test harness — verify skills work on Codex CLI
New test infrastructure:
- CodexSessionRunner: spawns codex exec, parses JSONL stream, returns
  structured results (output, reasoning, toolCalls, tokens)
- JSONL parser ported from Python (codex/SKILL.md.tmpl) to TypeScript
- Temp HOME skill installation for Codex discovery testing

E2E tests (gated behind EVALS=1 + codex + OPENAI_API_KEY):
- codex-discover-skill: installs skill, verifies Codex finds it
- codex-review-findings: runs gstack-review via Codex, validates output

Integrates with existing eval infrastructure:
- Diff-based test selection via touchfiles
- Eval persistence via EvalCollector
- bun run test:codex / test:codex:all convenience scripts

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-19 01:10:51 -07:00
..
fixtures feat: design review lite in /review and /ship + gstack-diff-scope (v0.6.3) (#142) 2026-03-17 20:12:55 -05:00
helpers feat: Codex E2E test harness — verify skills work on Codex CLI 2026-03-19 01:10:51 -07:00
analytics.test.ts feat: safety hook skills + skill usage telemetry (v0.7.1) (#189) 2026-03-18 23:57:59 -05:00
codex-e2e.test.ts feat: Codex E2E test harness — verify skills work on Codex CLI 2026-03-19 01:10:51 -07:00
gen-skill-docs.test.ts test: Codex generation tests + CI + docs for multi-agent support 2026-03-19 01:04:21 -07:00
hook-scripts.test.ts feat: safety hook skills + skill usage telemetry (v0.7.1) (#189) 2026-03-18 23:57:59 -05:00
skill-e2e.test.ts feat: /codex skill — multi-AI second opinion + proactive suggestions (#197) 2026-03-19 00:22:52 -05:00
skill-llm-eval.test.ts fix: /qa never refuses browser testing on backend-only changes (#202) 2026-03-19 00:31:26 -05:00
skill-parser.test.ts feat: SKILL.md template system, 3-tier testing, DX tools (v0.3.3) (#41) 2026-03-13 21:08:12 -07:00
skill-routing-e2e.test.ts feat: natural language skill routing + proactive suggestions (v0.7.1) (#195) 2026-03-18 23:08:04 -05:00
skill-validation.test.ts test: Codex generation tests + CI + docs for multi-agent support 2026-03-19 01:04:21 -07:00
touchfiles.test.ts fix: /qa never refuses browser testing on backend-only changes (#202) 2026-03-19 00:31:26 -05:00