gstack

History

Garry Tan 9562ad4e70 v1.53.1.0 fix: non-interactive-safe plan-tune hook install (flags + smart defaults) (#1805 ) * feat(config): add plan_tune_hooks setting (prompt\|yes\|no) Registers a new gstack-config key controlling whether ./setup installs the plan-tune Claude Code hooks. Default "prompt". Documented in the config header and surfaced in `gstack-config defaults` / `list`. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(setup): make plan-tune hook install non-interactive-safe The plan-tune consent prompt used a blocking `read -r` with no timeout. Under a forwarded/automated TTY (conductor workspace setup, CI with a pty) it hung setup forever. Move the decision into flags + env + saved config with a smart default: --plan-tune-hooks / --no-plan-tune-hooks / --plan-tune-hooks=yes\|no\|prompt > GSTACK_PLAN_TUNE_HOOKS env > plan_tune_hooks config > prompt-on-real-TTY. Explicit yes/no act non-interactively. The remaining interactive branch is gated on a real (non-quiet) TTY and uses a time-bounded `read -t 10 </dev/tty` that defaults to skip, so it can never hang. A timeout no longer persists a decline marker, so a later hands-on run can still offer the install. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(dev-setup): run setup non-interactively in dev/workspace mode Conductor runs bin/dev-setup under a forwarded pty, so any setup prompt (skill-prefix, plan-tune consent) would hang the workspace. Detach stdin (`setup </dev/null`) so every prompt takes its smart non-interactive default: flat skill names, skip the global plan-tune hook install without writing a decline marker. Saved prefix/config preferences are still honored, and a dev workspace no longer silently mutates ~/.claude/settings.json. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test(setup): guard plan-tune hooks stay non-interactive Static + binary-level regression test (free, <1s): asserts the flags are wired, the plan-tune read is time-bounded (no bare blocking read), explicit yes/no decisions short-circuit before the prompt, and gstack-config knows the plan_tune_hooks key. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(setup,config): harden plan-tune decision against bad input Review follow-ups to the non-interactive plan-tune work: - setup now lowercases + whitespace-strips the resolved decision before the case match, so an explicit opt-in via flag/env ("YES", "Yes", " yes") is honored instead of silently falling through to "prompt"/skip. Also accepts on/off and 1/0. - gstack-config rejects out-of-domain plan_tune_hooks values (anything but prompt\|yes\|no) with a warning + fallback to prompt, matching the existing value-whitelist pattern for explain_level / artifacts_sync_mode. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(dev-setup): never mutate global hooks during workspace setup Closing stdin alone only suppresses the prompt branch; a saved `plan_tune_hooks: yes` or exported GSTACK_PLAN_TUNE_HOOKS=yes would still resolve to "install" and rewrite the user's global ~/.claude/settings.json to point at THIS ephemeral worktree — which breaks once the workspace is deleted. Pass --plan-tune-hooks=prompt (highest precedence) so dev-setup pins resolution to prompt-mode; with stdin closed that is a guaranteed no-op skip (no install, no decline marker). To install the hooks, run ./setup --plan-tune-hooks directly. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test(setup): isolate config tests from host + cover new guards - Point gstack-config tests at a temp GSTACK_HOME so `get plan_tune_hooks` reads the built-in default, not whatever the host machine has in ~/.gstack/config.yaml (the prior test was non-deterministic). - Add behavioral coverage: yes/no/prompt round-trip, out-of-domain rejection. - Add a normalization guard (decision input is lowercased/trimmed) and a dev-setup guard (runs setup with --plan-tune-hooks=prompt + stdin detached). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test: rebaseline parity-suite v1.44.1 -> v1.53.0.0 The frozen v1.44.1 anchor went stale: five planning skills (plan-ceo-review, plan-eng-review, plan-design-review, investigate, office-hours) crept past the 1.05x ceiling via legitimate v1.49-v1.53 growth (brain-aware planning + the v1.53 redaction guard), so `bun test` was red on a clean checkout of main. Capture a fresh baseline at HEAD (bun run scripts/capture-baseline.ts --tag v1.53.0.0) and re-point the test at it. The per-skill 1.05 ratio is kept, so future bloat is still caught; only the anchor moved. Mirrors the earlier skill-size-budget rebase (v1.44.1 -> v1.47.0.0). Historical v1.44.1 / v1.46.0.0 / v1.47.0.0 baselines are retained for the v1->v2 audit trail. The captured skill bytes equal origin/main exactly (this branch left every SKILL.md untouched). Clears the pre-existing failures noted in the v1.53.0.0 CHANGELOG. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test(plan-tune): de-flake "derive pushes scope_appetite up" The test was ~25-50% flaky (worse on main). gstack-question-log fires a fire-and-forget background `--derive` after every write; the 5 rapid log writes spawned 5 racing background derives that collided with the test's explicit --derive — a late one that only saw 3 entries could clobber developer-profile.json after the explicit one wrote sample_size=5. Set GSTACK_QUESTION_LOG_NO_DERIVE=1 (the flag the binary documents for exactly this case) so the writes don't spawn background derives. The explicit --derive still runs, so real derive behavior is still asserted. 20/20 green after. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * chore: bump version and changelog (v1.53.1.0) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * docs: document non-interactive dev-setup + plan-tune hook flags (v1.53.1.0) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>		2026-05-30 11:42:13 -07:00
..
golden	v1.53.0.0 feat: smarter redaction — PII/secrets/legal guard across /spec, /ship, /cso, /document-* (#1797 )	2026-05-30 08:54:46 -07:00
ios-qa/FixtureApp	v1.43.0.0 feat: iOS device-farm (5 skills, Mac daemon, Tailscale) (#1574 )	2026-05-21 16:09:26 -07:00
mode-posture	feat: mode-posture energy fix for /plan-ceo-review and /office-hours (v1.1.2.0) (#1065 )	2026-04-19 05:44:39 +08:00
office-hours-brain-writeback	v1.52.1.0 feat: brain-aware planning — 5 skills read structured gbrain context before asking (#1742 )	2026-05-29 08:35:00 -07:00
plans	v1.15.0.0 feat: slim preamble + real-PTY plan-mode E2E harness (#1215 )	2026-04-26 13:55:13 -07:00
coverage-audit-fixture.ts	feat: test coverage catalog — shared audit across plan/ship/review (v0.10.1.0) (#259 )	2026-03-22 11:28:16 -07:00
eval-baselines.json	fix: rewrite session-runner to claude -p subprocess, lower flaky baselines	2026-03-14 02:34:10 -05:00
forcing-finding-seeds.ts	v1.48.0.0 feat: AskUserQuestion split rule + runtime AUTO_DECIDE carve-out (#1740 )	2026-05-26 23:43:07 -07:00
golden-ship-claude.md	fix: community security wave — 8 PRs, 4 contributors (v0.15.13.0) (#847 )	2026-04-06 00:47:04 -07:00
overlay-nudges.ts	feat(v1.10.1.0): overlay efficacy harness + Opus 4.7 fanout nudge removal (#1166 )	2026-04-23 18:42:58 -07:00
parity-baseline-v1.44.1.json	v1.46.0.0 feat: gstack v2 foundation — catalog tokens drop 56%, eval-first floor covers all 51 skills (#1712 )	2026-05-26 16:50:03 -07:00
parity-baseline-v1.46.0.0.json	v1.46.0.0 feat: gstack v2 foundation — catalog tokens drop 56%, eval-first floor covers all 51 skills (#1712 )	2026-05-26 16:50:03 -07:00
parity-baseline-v1.47.0.0.json	v1.52.0.0 feat(plan-tune): explicit consent + first-run setup wizard for contributors (#1741 )	2026-05-28 18:21:09 -07:00
parity-baseline-v1.53.0.0.json	v1.53.1.0 fix: non-interactive-safe plan-tune hook install (flags + smart defaults) (#1805 )	2026-05-30 11:42:13 -07:00
qa-eval-checkout-ground-truth.json	fix: 100% E2E pass — isolate test dirs, restart server, relax FP thresholds	2026-03-14 07:17:17 -05:00
qa-eval-ground-truth.json	fix: 100% E2E pass — isolate test dirs, restart server, relax FP thresholds	2026-03-14 07:17:17 -05:00
qa-eval-spa-ground-truth.json	fix: 100% E2E pass — isolate test dirs, restart server, relax FP thresholds	2026-03-14 07:17:17 -05:00
review-army-migration.sql	feat: Review Army — parallel specialist reviewers for /review (v0.14.3.0) (#692 )	2026-03-30 22:07:50 -06:00
review-army-n-plus-one.rb	feat: Review Army — parallel specialist reviewers for /review (v0.14.3.0) (#692 )	2026-03-30 22:07:50 -06:00
review-eval-design-slop.css	feat: design review lite in /review and /ship + gstack-diff-scope (v0.6.3) (#142 )	2026-03-17 20:12:55 -05:00
review-eval-design-slop.html	feat: design review lite in /review and /ship + gstack-diff-scope (v0.6.3) (#142 )	2026-03-17 20:12:55 -05:00
review-eval-enum-diff.rb	feat: contributor mode, session awareness, recommendation format (#90 )	2026-03-16 01:45:50 -05:00
review-eval-enum.rb	feat: contributor mode, session awareness, recommendation format (#90 )	2026-03-16 01:45:50 -05:00
review-eval-vuln.rb	feat: 3-tier eval suite with planted-bug outcome testing (EVALS=1)	2026-03-14 01:17:36 -05:00