gstack/lib
Garry Tan daea165333
feat: add eval:trend CLI for per-test pass rate tracking
computeTrends() classifies tests as stable-pass/stable-fail/flaky/
improving/degrading based on pass rate, flip count, and recent streak.
gstack eval trend shows sparkline table with --limit, --tier, --test
filters. Guard CLI main block with import.meta.main to prevent
execution on import.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-15 16:47:41 -05:00
..
auth.ts feat: add team sync infrastructure (config, auth, push/pull, CLI) 2026-03-15 02:02:40 -05:00
cli-eval.ts feat: add eval:trend CLI for per-test pass rate tracking 2026-03-15 16:47:41 -05:00
cli-sync.ts feat: add team sync infrastructure (config, auth, push/pull, CLI) 2026-03-15 02:02:40 -05:00
eval-cache.ts feat: add SHA-based eval caching with EVAL_CACHE=0 bypass 2026-03-15 09:39:26 -05:00
eval-cost.ts feat: wire costs[] from modelUsage into eval results 2026-03-15 16:47:27 -05:00
eval-format.ts feat: wire costs[] from modelUsage into eval results 2026-03-15 16:47:27 -05:00
eval-tier.ts feat: add eval format validation, tier selection, cost tracking 2026-03-15 09:39:18 -05:00
sync-config.ts feat: add team sync infrastructure (config, auth, push/pull, CLI) 2026-03-15 02:02:40 -05:00
sync.ts feat: add team sync infrastructure (config, auth, push/pull, CLI) 2026-03-15 02:02:40 -05:00
util.ts feat: add listEvalFiles, loadEvalResults, formatTimestamp to lib/util.ts 2026-03-15 09:39:09 -05:00