gstack/test
Garry Tan c9cead34e2
test: codex skill validation (12 stub tests) + E2E eval test
Stub tests (free tier): verify template content — three modes, gate verdict,
session continuity, cost tracking, cross-model comparison, binary discovery,
error handling, mktemp usage, and integrations into /review, /ship, /plan-eng-review.

E2E test (paid tier): runs /codex review on vulnerable fixture repo via
session-runner, verifies output contains findings and GATE verdict.
2026-03-18 21:21:02 -07:00
..
fixtures feat: design review lite in /review and /ship + gstack-diff-scope (v0.6.3) (#142) 2026-03-17 20:12:55 -05:00
helpers feat: /codex skill — multi-AI second opinion (review, challenge, consult) 2026-03-18 21:11:42 -07:00
gen-skill-docs.test.ts feat: interactive /plan-design-review + CEO invokes designer + 100% coverage (v0.6.4) (#149) 2026-03-17 22:48:48 -05:00
skill-e2e.test.ts test: codex skill validation (12 stub tests) + E2E eval test 2026-03-18 21:21:02 -07:00
skill-llm-eval.test.ts feat: interactive /plan-design-review + CEO invokes designer + 100% coverage (v0.6.4) (#149) 2026-03-17 22:48:48 -05:00
skill-parser.test.ts feat: SKILL.md template system, 3-tier testing, DX tools (v0.3.3) (#41) 2026-03-13 21:08:12 -07:00
skill-validation.test.ts test: codex skill validation (12 stub tests) + E2E eval test 2026-03-18 21:21:02 -07:00
touchfiles.test.ts feat: interactive /plan-design-review + CEO invokes designer + 100% coverage (v0.6.4) (#149) 2026-03-17 22:48:48 -05:00