gstack/test/helpers
Garry Tan 2769cd043d
Merge branch 'main' into garrytan/team-supabase-store
Resolved 4 conflicts:
- scripts/gen-skill-docs.ts: kept ARTIFACT_SETUP + added main's new
  resolvers (SPEC_REVIEW_LOOP, DESIGN_SKETCH, BENEFITS_FROM,
  CODEX_REVIEW_STEP). Updated codex review-log to use new paths.
- ship/SKILL.md.tmpl: adopted {{CODEX_REVIEW_STEP}} macro from main
- test/skill-e2e.test.ts: added main's new E2E tests (office-hours
  spec review, plan-ceo benefits-from) + kept our E2E isolation cleanup

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-21 09:29:05 -07:00
..
codex-session-runner.ts feat: multi-agent support — gstack works on Codex, Gemini CLI, and Cursor (v0.9.0) (#226) 2026-03-19 18:20:50 -07:00
eval-store.test.ts merge: integrate origin/main (v0.4.0, v0.4.1) into team-supabase-store 2026-03-16 07:49:27 -05:00
eval-store.ts merge: integrate origin/main (v0.4.0, v0.4.1) into team-supabase-store 2026-03-16 07:49:27 -05:00
gemini-session-runner.test.ts feat: Gemini CLI E2E tests (v0.9.2.0) (#252) 2026-03-20 08:30:09 -07:00
gemini-session-runner.ts feat: Gemini CLI E2E tests (v0.9.2.0) (#252) 2026-03-20 08:30:09 -07:00
llm-judge.test.ts feat: wire eval-cache + eval-tier into LLM judge, pin E2E model 2026-03-15 16:47:35 -05:00
llm-judge.ts feat: wire eval-cache + eval-tier into LLM judge, pin E2E model 2026-03-15 16:47:35 -05:00
observability.test.ts fix: never clean up observability artifacts — partial file persists after finalize 2026-03-14 12:37:38 -05:00
session-runner.test.ts feat: wire costs[] from modelUsage into eval results 2026-03-15 16:47:27 -05:00
session-runner.ts feat: wire costs[] from modelUsage into eval results 2026-03-15 16:47:27 -05:00
skill-parser.ts feat: 3-tier eval suite with planted-bug outcome testing (EVALS=1) 2026-03-14 01:17:36 -05:00
touchfiles.ts feat: Gemini CLI E2E tests (v0.9.2.0) (#252) 2026-03-20 08:30:09 -07:00