gstack/test/helpers
Garry Tan 029a7c2a37
feat: eval-watch dashboard + observability unit tests (15 tests, 11 codepaths)
eval-watch: live terminal dashboard reads heartbeat + partial file every 1s,
shows completed/running tests, stale detection (>10min), --tail flag for
progress.log tail. Pure renderDashboard() function for testability.
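
A minimal sketch of what a pure render function with stale detection could look like, assuming the heartbeat carries a test name and a last-update timestamp. The names `Heartbeat`, `PartialResults`, `isStale`, and `renderDashboardSketch`, and all field names, are hypothetical and are not taken from the repository. Only the 10-minute threshold comes from the commit message.

```typescript
// Hypothetical shapes; the real heartbeat and partial-file schemas may differ.
interface Heartbeat {
  currentTest: string;
  lastUpdateMs: number; // epoch milliseconds of the last heartbeat write (assumed field)
}

interface PartialResults {
  completed: { name: string; passed: boolean }[];
}

// The ">10min" stale threshold mentioned in the commit message.
const STALE_AFTER_MS = 10 * 60 * 1000;

// True when the heartbeat is strictly older than the stale threshold.
function isStale(hb: Heartbeat, nowMs: number): boolean {
  return nowMs - hb.lastUpdateMs > STALE_AFTER_MS;
}

// Pure: takes already-loaded data plus a clock value and returns the text to print.
// No file reads, timers, or console output happen here, which keeps it unit-testable.
function renderDashboardSketch(
  hb: Heartbeat | null,
  partial: PartialResults | null,
  nowMs: number
): string {
  const lines: string[] = [];
  const completed = partial?.completed ?? [];
  lines.push(`Completed: ${completed.length}`);
  for (const t of completed) {
    lines.push(`  ${t.passed ? "PASS" : "FAIL"} ${t.name}`);
  }
  if (hb === null) {
    lines.push("Running: (no heartbeat)");
  } else {
    const ageSec = Math.floor((nowMs - hb.lastUpdateMs) / 1000);
    const flag = isStale(hb, nowMs) ? " [STALE]" : "";
    lines.push(`Running: ${hb.currentTest} (${ageSec}s since heartbeat)${flag}`);
  }
  return lines.join("\n");
}
```

Passing the clock in as `nowMs` rather than calling `Date.now()` inside the function is what lets a test exercise the stale path without waiting; a 1s polling loop would call the function with fresh inputs each tick.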

observability.test.ts: unit tests for sanitizeTestName, heartbeat schema,
progress.log format, NDJSON file naming, savePartial() with _partial flag,
finalize() cleanup, diagnostic fields, watcher rendering, stale detection,
and non-fatal I/O guarantees.
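
As an illustration of the `_partial` flag and the non-fatal I/O guarantee, here is a rough sketch under stated assumptions. `EvalResultsSketch`, `Writer`, and `savePartialSketch` are hypothetical names; the actual `savePartial()` in eval-store.ts may have a different signature and write path. The writer is injected so the example stays self-contained.

```typescript
// Hypothetical result shape; only the `_partial` flag name comes from the commit message.
interface EvalResultsSketch {
  tests: { name: string; passed: boolean }[];
  _partial?: boolean;
}

// Injected sink standing in for a file write (assumption for illustration).
type Writer = (json: string) => void;

// Writes an in-progress snapshot marked with `_partial: true`.
// Any write error is swallowed and reported as `false`, so a failed
// snapshot never aborts the eval run itself (the "non-fatal I/O" idea).
function savePartialSketch(results: EvalResultsSketch, write: Writer): boolean {
  try {
    const snapshot: EvalResultsSketch = { ...results, _partial: true };
    write(JSON.stringify(snapshot, null, 2));
    return true;
  } catch {
    return false;
  }
}
```

A unit test in this style can pass a writer that throws and assert that the call returns `false` instead of propagating the error.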

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-14 11:04:40 -05:00
File | Last commit | Date
eval-store.test.ts | feat: eval persistence with auto-compare against previous run | 2026-03-14 03:49:47 -05:00
eval-store.ts | feat: add E2E observability — heartbeat, progress.log, NDJSON persistence, savePartial() | 2026-03-14 11:04:16 -05:00
llm-judge.ts | feat: 3-tier eval suite with planted-bug outcome testing (EVALS=1) | 2026-03-14 01:17:36 -05:00
observability.test.ts | feat: eval-watch dashboard + observability unit tests (15 tests, 11 codepaths) | 2026-03-14 11:04:40 -05:00
session-runner.test.ts | feat: stream-json NDJSON parser for real-time E2E progress | 2026-03-14 03:49:36 -05:00
session-runner.ts | feat: add E2E observability — heartbeat, progress.log, NDJSON persistence, savePartial() | 2026-03-14 11:04:16 -05:00
skill-parser.ts | feat: 3-tier eval suite with planted-bug outcome testing (EVALS=1) | 2026-03-14 01:17:36 -05:00