gstack

History

Garry Tan 1626d4857b v1.57.7.0 feat: GSTACK REVIEW REPORT always declares unresolved decisions (#1916 ) * fix(plan-devex-review): add missing gstack-review-log step plan-devex-review carried the EXIT PLAN MODE GATE but never wrote a review-log entry, so the gate's 'review log was called' check was structurally unsatisfiable and the Review Readiness Dashboard / GSTACK REVIEW REPORT had no plan-devex-review data to read. Add a Review Log section before the dashboard read, logging the devex fields the report parser already expects (status, scores, product_type, tthw, persona, competitive_tier, unresolved, commit). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * feat(review): make unresolved-decisions status mandatory in GSTACK REVIEW REPORT The report's UNRESOLVED line was optional ('omit if empty') and the EXIT PLAN MODE GATE only checked it 'if applicable', so a plan could ship with no statement about open decisions at all — a missed ambiguity read identically to a clean plan. Now every report ends with a mandatory unresolved-decisions status as its final line: either the exact unbolded sentinel 'NO UNRESOLVED DECISIONS', or a 'UNRESOLVED DECISIONS:' block of bullets. The gate blocks ExitPlanMode unless that final line is present. generatePlanFileReviewReport: current-review items are listed from context; prior reviews contribute an aggregate count computed as latest-fresh-row- per-skill minus the current run (no double-count, dashboard 7-day window). generateExitPlanModeGate: check #3 is now blocking with no 'if applicable' escape; bolded sentinel does not satisfy it. Tests: static guard in gen-skill-docs.test.ts asserts the mandatory status across all six report consumers and the gate across gate-bearing skills; skill-e2e-plan.test.ts asserts the written report's final line is the status (and fixes a stale 'four review rows' -> five-row prompt). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * refactor(review): compress unresolved-status prose to fit parity budget After merging origin/main (v1.57.3.0), plan-devex-review exceeded the 1.05x parity ratio vs the v1.53.0.0 baseline. Rather than rebase the baseline, compressed the new prose to stay under the cap honestly: the report's unresolved-status block (~32 -> ~9 lines) and the EXIT PLAN MODE GATE's final-line check (~7 -> ~5 lines), plus the plan-devex-review review-log step. All load-bearing rules and the exact gate-checkable tokens are preserved; the static guards in gen-skill-docs.test.ts still pass. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test: regenerate stale ship golden fixtures (#1909 follow-up) #1909 (v1.57.3.0) added the always-loaded PR-title-version rule to ship's template and committed the regenerated ship/SKILL.md, but did not refresh the three ship golden fixtures, leaving the golden-file regression test red on main. Regenerate them from current output. The diff is purely #1909 content: the PR-title invariant line plus a previously-unresolved ${ctx.paths.binDir} placeholder that current generation correctly resolves. No feature content from this branch leaks into ship (ship does not consume the review report resolvers). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(plan-devex-review): restore TIMESTAMP fill instruction in review-log Adversarial review caught that compressing the devex review-log block dropped the TIMESTAMP substitution guidance the three sibling plan-review skills carry. A literal "timestamp":"TIMESTAMP" parses as JSON but is an unparseable date, so the Review Readiness Dashboard's 7-day freshness window silently drops the plan-devex-review row (and the report's prior-review aggregation loses it). Restore the one-line instruction. Also: the plan-review-report E2E now derives its last-line check from the report slice, not the whole file, so a mis-placed report surfaces the real trailing content in the failure message. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test(parity): rebase parity baseline v1.53.0.0 -> v1.57.7.0 The v1.53 anchor is four minor versions stale. v1.54-v1.57 (ship/plan carving, carve-guards, AUQ prose fallback, the cross-session decision-log preamble) plus this branch's mandatory unresolved-decisions status line pushed the three plan-review skills past the 5% ratchet even after exhaustive compression. The new baseline captures current UNION sizes (skeleton + sections/.md, matching what parity-harness measures) so the per-skill 1.05 ratio keeps catching future bloat. The frozen v1.44.1 integrity anchor and the v1.47 size-budget baseline are untouched. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> chore: bump version and changelog (v1.57.7.0) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>		2026-06-08 21:17:18 -07:00
..
preamble	v1.57.5.0 feat: cross-session decision memory + gbrain dream-stage call graph (#1910 )	2026-06-08 06:20:58 -07:00
browse.ts	fix: avoid tilde-in-assignment to silence Claude Code permission prompts (#993 )	2026-04-16 14:49:56 -07:00
codex-helpers.ts	feat: Factory Droid compatibility — works across Claude Code, Codex, and Factory (v0.13.5.0) (#621 )	2026-03-29 08:57:34 -07:00
composition.ts	v1.57.4.0 refactor(ethos): rename Boil the Lake principle to Boil the Ocean (#1912 )	2026-06-08 05:41:07 -07:00
confidence.ts	v1.43.2.0 fix wave: post-Daegu paper-cut — 18 fixes, 28 bisect commits (#1642 )	2026-05-21 21:21:07 -07:00
constants.ts	feat(v1.3.0.0): open agents learnings + cross-model benchmark skill (#1040 )	2026-04-19 17:50:31 +08:00
design.ts	v1.45.0.0 feat(design): persistent board daemon — 24h boards, one tab, board history (#1710 )	2026-05-25 20:45:12 -07:00
dx.ts	feat: /plan-devex-review + /devex-review — DX review skills (v0.15.3.0) (#784 )	2026-04-03 16:22:57 -07:00
gbrain.ts	v1.52.1.0 feat: brain-aware planning — 5 skills read structured gbrain context before asking (#1742 )	2026-05-29 08:35:00 -07:00
index.ts	v1.54.0.0 feat: carve /ship into skeleton + on-demand sections (-59% always-loaded) (#1806 )	2026-05-30 12:09:10 -07:00
learnings.ts	v1.33.1.0 fix(learnings): token-OR query + task-shaped retrieval in 3 long skills (#1442 )	2026-05-11 19:34:33 -07:00
make-pdf.ts	feat(v1.4.0.0): /make-pdf — markdown to publication-quality PDFs (#1086 )	2026-04-20 13:20:30 +08:00
model-overlay.ts	feat(v1.10.1.0): overlay efficacy harness + Opus 4.7 fanout nudge removal (#1166 )	2026-04-23 18:42:58 -07:00
preamble.ts	v1.46.0.0 feat: gstack v2 foundation — catalog tokens drop 56%, eval-first floor covers all 51 skills (#1712 )	2026-05-26 16:50:03 -07:00
question-tuning.ts	v1.52.0.0 feat(plan-tune): explicit consent + first-run setup wizard for contributors (#1741 )	2026-05-28 18:21:09 -07:00
redact-doc.ts	v1.53.0.0 feat: smarter redaction — PII/secrets/legal guard across /spec, /ship, /cso, /document-* (#1797 )	2026-05-30 08:54:46 -07:00
review-army.ts	v1.42.0.0 Daegu wave: 23 community-filed bugs + PTY classifier enforcement (24 bisect commits) (#1594 )	2026-05-20 07:35:01 -07:00
review.ts	v1.57.7.0 feat: GSTACK REVIEW REPORT always declares unresolved decisions (#1916 )	2026-06-08 21:17:18 -07:00
sections.ts	v1.54.0.0 feat: carve /ship into skeleton + on-demand sections (-59% always-loaded) (#1806 )	2026-05-30 12:09:10 -07:00
tasks-section.ts	v1.38.1.0 fix wave: surrogate-safe page captures (#1440 ), Implementation Tasks across review skills (#1454 ), root-level artifact patterns (#1452 ) (#1504 )	2026-05-14 21:46:50 -07:00
testing.ts	feat(v1.3.0.0): open agents learnings + cross-model benchmark skill (#1040 )	2026-04-19 17:50:31 +08:00
types.ts	v1.46.0.0 feat: gstack v2 foundation — catalog tokens drop 56%, eval-first floor covers all 51 skills (#1712 )	2026-05-26 16:50:03 -07:00
utility.ts	feat(v1.5.2.0): Opus 4.7 migration — model overlay, voice, routing (#1117 )	2026-04-22 01:06:22 -07:00