gstack

History

Garry Tan 29b948bd90 test(diagram): paid E2E pair — gate triplet contract + periodic authoring judge diagram-triplet (gate, deterministic functional): a fresh claude -p agent following the skill extract must emit a parseable triplet — graph LR/TD in .mmd, excalidraw scene with >3 elements, SVG markup, PNG magic bytes. Verified live: pass, $0.17, 58s. diagram-authoring-quality (periodic, LLM-judged): faithfulness/labels/size rubric with a diagnostic-path cap, floor 6/10. Verified live: pass at exactly 6 with substantive critique. Touchfiles select both on diagram/ and lib/diagram-render/ changes; tier split per E2E_TIERS rules (eng-review D5). Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>		2026-06-12 00:32:37 -07:00
..
providers	v1.57.2.0 feat: AskUserQuestion prose fallback when the tool fails at runtime (#1908 )	2026-06-07 21:38:21 -07:00
agent-sdk-runner.ts	v1.57.2.0 feat: AskUserQuestion prose fallback when the tool fails at runtime (#1908 )	2026-06-07 21:38:21 -07:00
auq-sdk-capture.ts	v1.56.0.0 Token-reduction Phase B + AUQ paranoid safety net (#1849 )	2026-06-04 11:14:43 -07:00
benchmark-judge.ts	feat(v1.3.0.0): open agents learnings + cross-model benchmark skill (#1040 )	2026-04-19 17:50:31 +08:00
benchmark-runner.ts	feat(v1.3.0.0): open agents learnings + cross-model benchmark skill (#1040 )	2026-04-19 17:50:31 +08:00
budget-override.test.ts	v1.46.0.0 feat: gstack v2 foundation — catalog tokens drop 56%, eval-first floor covers all 51 skills (#1712 )	2026-05-26 16:50:03 -07:00
budget-override.ts	v1.46.0.0 feat: gstack v2 foundation — catalog tokens drop 56%, eval-first floor covers all 51 skills (#1712 )	2026-05-26 16:50:03 -07:00
capture-parity-baseline.test.ts	v1.46.0.0 feat: gstack v2 foundation — catalog tokens drop 56%, eval-first floor covers all 51 skills (#1712 )	2026-05-26 16:50:03 -07:00
capture-parity-baseline.ts	v1.46.0.0 feat: gstack v2 foundation — catalog tokens drop 56%, eval-first floor covers all 51 skills (#1712 )	2026-05-26 16:50:03 -07:00
carve-guard-checks.ts	v1.57.0.0 feat: carve-guard system + carve cso/document-release/design-consultation (#1907 )	2026-06-07 19:13:24 -07:00
carve-guards.ts	v1.57.10.0 feat: Codex review default-on across review/ship/plan/docs (#1966 )	2026-06-10 21:14:58 -07:00
claude-pty-runner.ts	v1.31.0.0 fix: delete AskUserQuestion fallback (root cause of forever war) + harness primitives (#1390 )	2026-05-09 17:01:13 -07:00
claude-pty-runner.unit.test.ts	v1.31.0.0 fix: delete AskUserQuestion fallback (root cause of forever war) + harness primitives (#1390 )	2026-05-09 17:01:13 -07:00
codex-session-runner.ts	fix: enforce Codex 1024-char description limit + auto-heal stale installs (v0.11.9.0) (#391 )	2026-03-23 08:44:08 -07:00
e2e-helpers.ts	v1.39.2.0 feat: GSTACK_* env-shim for Conductor + gbrain/gstack setup docs (#1534 )	2026-05-16 12:32:33 -07:00
eval-store.test.ts	feat: QA restructure, browser ref staleness, eval efficiency metrics (v0.4.0) (#83 )	2026-03-15 23:55:39 -05:00
eval-store.ts	v1.32.0.0 fix wave: 7 community PRs + 5 gate-eval hardenings (#1431 )	2026-05-11 12:16:26 -07:00
gemini-session-runner.test.ts	feat: Gemini CLI E2E tests (v0.9.2.0) (#252 )	2026-03-20 08:30:09 -07:00
gemini-session-runner.ts	feat: Gemini CLI E2E tests (v0.9.2.0) (#252 )	2026-03-20 08:30:09 -07:00
llm-judge.ts	v1.25.1.0 fix: office-hours Phase 4 STOP gate + AskUserQuestion recommendation judge (#1296 )	2026-05-01 19:51:51 -07:00
observability.test.ts	fix: never clean up observability artifacts — partial file persists after finalize	2026-03-14 12:37:38 -05:00
parity-harness.ts	v1.57.10.0 feat: Codex review default-on across review/ship/plan/docs (#1966 )	2026-06-10 21:14:58 -07:00
pricing.ts	feat(v1.3.0.0): open agents learnings + cross-model benchmark skill (#1040 )	2026-04-19 17:50:31 +08:00
required-reads.ts	v1.54.0.0 feat: carve /ship into skeleton + on-demand sections (-59% always-loaded) (#1806 )	2026-05-30 12:09:10 -07:00
secret-sink-harness.ts	v1.12.0.0 feat: /setup-gbrain — coding-agent onboarding for gbrain (#1183 )	2026-04-24 01:38:21 -07:00
session-runner.test.ts	feat: stream-json NDJSON parser for real-time E2E progress	2026-03-14 03:49:36 -05:00
session-runner.ts	v1.57.2.0 feat: AskUserQuestion prose fallback when the tool fails at runtime (#1908 )	2026-06-07 21:38:21 -07:00
skill-parser.ts	feat: content security — 4-layer prompt injection defense for pair-agent (#815 )	2026-04-06 14:41:06 -07:00
tool-map.ts	feat(v1.3.0.0): open agents learnings + cross-model benchmark skill (#1040 )	2026-04-19 17:50:31 +08:00
touchfiles.ts	test(diagram): paid E2E pair — gate triplet contract + periodic authoring judge	2026-06-12 00:32:37 -07:00
transcript-section-logger.ts	v1.54.0.0 feat: carve /ship into skeleton + on-demand sections (-59% always-loaded) (#1806 )	2026-05-30 12:09:10 -07:00