gstack/test
Garry Tan e7347c2f8f
feat: stream-json NDJSON parser for real-time E2E progress
Switch session-runner from buffered `--output-format json` to streaming
`--output-format stream-json --verbose`. Parses NDJSON line-by-line for
real-time tool-by-tool progress on stderr during 3-5 min E2E runs.

- Extract testable `parseNDJSON()` function (pure, no I/O)
- Count turns per assistant event (not per text block)
- Add `transcript: any[]` to SkillTestResult, remove dead `messages` field
- Reconstruct allText from transcript for browse error scanning
- 8 unit tests for parser (malformed lines, empty input, turn counting)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-14 03:49:36 -05:00
..
fixtures fix: rewrite session-runner to claude -p subprocess, lower flaky baselines 2026-03-14 02:34:10 -05:00
helpers feat: stream-json NDJSON parser for real-time E2E progress 2026-03-14 03:49:36 -05:00
gen-skill-docs.test.ts fix: enrich SKILL.md docs to pass LLM evals, upgrade judge to Sonnet 4.6 (#43) 2026-03-13 22:14:14 -07:00
skill-e2e.test.ts Merge remote-tracking branch 'origin/main' into v0.3.6-qa-upgrades 2026-03-14 02:35:48 -05:00
skill-llm-eval.test.ts Merge remote-tracking branch 'origin/main' into v0.3.6-qa-upgrades 2026-03-14 02:35:48 -05:00
skill-parser.test.ts feat: SKILL.md template system, 3-tier testing, DX tools (v0.3.3) (#41) 2026-03-13 21:08:12 -07:00
skill-validation.test.ts feat: 3-tier eval suite with planted-bug outcome testing (EVALS=1) 2026-03-14 01:17:36 -05:00