gstack/test/helpers
Garry Tan f9cfabeda8
feat: add E2E observability — heartbeat, progress.log, NDJSON persistence, savePartial()
session-runner: atomic heartbeat file (e2e-live.json), per-run log directory
(~/.gstack-dev/e2e-runs/{runId}/), progress.log + per-test NDJSON persistence,
failure transcripts to persistent run dir instead of tmpdir.

eval-store: 3 new diagnostic fields (exit_reason, timeout_at_turn, last_tool_call),
savePartial() writes _partial-e2e.json after each addTest() for crash resilience,
finalize() cleans up partial file.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-14 11:04:16 -05:00
..
eval-store.test.ts feat: eval persistence with auto-compare against previous run 2026-03-14 03:49:47 -05:00
eval-store.ts feat: add E2E observability — heartbeat, progress.log, NDJSON persistence, savePartial() 2026-03-14 11:04:16 -05:00
llm-judge.ts feat: 3-tier eval suite with planted-bug outcome testing (EVALS=1) 2026-03-14 01:17:36 -05:00
session-runner.test.ts feat: stream-json NDJSON parser for real-time E2E progress 2026-03-14 03:49:36 -05:00
session-runner.ts feat: add E2E observability — heartbeat, progress.log, NDJSON persistence, savePartial() 2026-03-14 11:04:16 -05:00
skill-parser.ts feat: 3-tier eval suite with planted-bug outcome testing (EVALS=1) 2026-03-14 01:17:36 -05:00