gstack

Garry Tan 46c1fae7f1	2026-05-30 12:09:10 -07:00

v1.54.0.0 feat: carve /ship into skeleton + on-demand sections (-59% always-loaded) (#1806)

* feat(test): transcript-section-logger + ship-action fingerprint (T10)

Pure-analysis module over a SkillTestResult/NDJSON transcript:
- extractSectionReads(): which sections/.md a run opened (post-carve check)
- extractShipActions(): observable action fingerprint (merge/test/bump/changelog/commit/push/pr) that works on the MONOLITH too, so a baseline captured before the carve can detect a sectioned-ship regression
- baseline read/write + compareShipActions() for baseline-first dogf(T10)

Baseline-first answers the Codex outside-voice critique that a logger in the same PR as the carve is post-failure telemetry without a pre-carve reference. 11 unit tests, all green. Paid monolith baseline capture runs separately.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* feat(pipeline): section discovery + generation machinery (T9)

- discover-skills.ts: discoverSectionTemplates() scans <skill>/sections/.md.tmpl
- gen-skill-docs.ts: extract resolvePlaceholders + applyHostRewrites + buildContext as shared helpers (processTemplate and the new processSectionTemplate both call them, so a sanitization/rewrite fix can't miss sections) [C1]
- processSectionTemplate: body-fragment generation (no frontmatter/catalog/voice), parent-skill TemplateContext (skillName pinned to parent, not 'sections', so appliesTo gating + tier behave identically), per-host output routing
- --host all now fails the build on ANY host failure, not just claude, so a stale external-host output can't slip the freshness gate [Codex outside-voice #9]

Inert until a skill is carved (no sections/ dirs exist yet). Refactor is output-neutral: gen:skill-docs --dry-run --host all reports 0 STALE. 5 discovery unit tests + 389 gen-skill-docs tests green.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* feat(setup): install sections/ for cherry-pick targets (claude + kiro) (T9)

Two install targets cherry-pick SKILL.md and would leave a carved skill's sections/ behind, 404ing a runtime 'Read sections/<name>.md':
- link_claude_skill_dirs: link the sections/ subdir via _link_or_copy (windows gets a fresh copy on every ./setup)
- kiro per-skill loop: sed-rewrite + copy each sections/* so paths resolve under ~/.kiro, not ~/.codex/~/.claude

codex/factory/opencode link the whole generated dir, so sections ride free. Addresses Codex outside-voice #4/#6 (runtime pathing landmine). Inert until a skill is carved. Static-tripwire test + windows-fallback invariant green.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* feat(ship): gstack-version-bump CLI — tested idempotency classify + write (T9)

Hybrid CLI extraction (CM1): the deterministic core of ship Step 12 becomes a tested CLI instead of bash prose the agent re-derives each run.
- classify: FRESH/ALREADY_BUMPED/DRIFT_STALE_PKG/DRIFT_UNEXPECTED from VERSION vs origin/<base>:VERSION vs package.json.version (pure reader)
- write: validated dual-write to VERSION + package.json (FRESH bump)
- repair: DRIFT_STALE_PKG sync, no re-bump

Bump-LEVEL choice + queue collision stay agent judgment; slot pick stays bin/gstack-next-version. This removes the re-bump-a-shipped-branch footgun from skippable prose into code that can't be skipped or misread. 15 tests (exhaustive state matrix + write/repair fs + real-git classify).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* test(parity): sectioned-skill parity capability — guards the carve (T9)

Carved skills (skeleton + sections/.md) need parity checks that see relocated content, or moving a phrase into a section reads as 'lost':
- readSkillForParity(): union skeleton + all sections/.md
- checkSkillParity sectioned mode: content checks against the union; minBytes/maxSizeRatio against union bytes (total behavior preserved); maxSkeletonBytes asserts the always-loaded skeleton actually shrank. Lowering minBytes to fit a small skeleton would otherwise make the size floor toothless [Codex #12].

Built + tested BEFORE the carve so ship's invariant can flip to sectioned in the same commit it lands. Monolith path byte-identical (verified: pre-existing investigate 1.053 ratio drift fails the same with this change stashed). 7 sectioned-parity tests + existing parity tests green.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* refactor(ship): carve into skeleton + on-demand sections (Claude) (T9)

ship/SKILL.md drops 167KB → 68.7KB (~59% of the always-loaded skill) by moving 8 prose-heavy steps into ship/sections/.md, read on demand: tests, test-coverage, plan-completion, review-army, greptile, adversarial, changelog, pr-body. Step 12's version logic now calls the tested gstack-version-bump CLI instead of inline bash.

Claude-first (S2): {{SECTION:id}} emits a STOP-Read pointer on Claude (skeleton + generated section files) and INLINES the content on every other host, so external hosts keep the full monolith — verified factory at 162KB with no sections dir. {{SECTION_INDEX:ship}} renders the situation→section table from the PASSIVE manifest (CM2 / v2_PLAN.md:663); required-reads live only in test fixtures. Multi-pass resolve expands inlined sections' own resolvers.

Parity: ship invariant flipped to sectioned (union content checks + maxSkeletonBytes asserts the shrink). Carve-fallout fixed across gen-skill-docs/skill-validation/golden/plan-completion/#1539/size-budget tests via skeleton+sections union reads. Free suite green except the pre-existing investigate parity drift.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* test(ship): manifest-consistency + context-parity + requiredReads helper (T9)

Free deterministic guards for the carve:
- required-reads.ts + unit test: assertRequiredReads(run, requiredFiles) — the mechanical layer-5 check that the agent Read the sections its situation needs (required set comes from the fixture, not the passive manifest)
- section-manifest-consistency: 3-tier orphan classification (generated orphan + hand-edited generated file → FAIL; manifest orphan → WARN per v2_PLAN.md) and pins the PASSIVE-manifest contract (no applies_when/required_for)
- template-context-parity: generated sections have zero unresolved placeholders and gated resolvers (ADVERSARIAL_STEP/CONFIDENCE_CALIBRATION/CHANGELOG_WORKFLOW) rendered — proving sections resolve with the parent skillName, not 'sections'

16 tests, all green.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* test(ship): section-loading E2E + idempotency CLI detection (T9)

- skill-e2e-ship-section-loading.test.ts (new, periodic): runs real /ship in plan mode against a fresh version-changing fixture and asserts the agent Read the required sections (review-army + changelog). Runs against the INSTALLED skill (~/.claude/skills/gstack/ship), not repo paths, so install-layout 404s surface [Codex outside-voice #5]. Layer-5 mechanical guard against silent section-skip.
- skill-e2e-ship-idempotency.test.ts: detection updated for the carve — Step 12 now runs gstack-version-bump classify (JSON "state":"ALREADY_BUMPED") instead of the inline bash echo (STATE: ALREADY_BUMPED). Accept both; add a gstack-version-bump-write re-bump regression signal.
- touchfiles: register ship-section-loading (periodic) + extend idempotency deps with bin/gstack-version-bump + scripts/resolvers/sections.ts.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* test(ship): union-read redaction wiring test for the carve (T9)

main's PR-body redaction-at-sink lives in sections/pr-body.md.tmpl after the carve, not the skeleton template. Read skeleton + section templates union so the redaction-wiring assertions follow the relocated content. 9/9 green.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* v1.54.0.0 feat: carve /ship into skeleton + on-demand sections (-59% always-loaded)

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

app	feat: GStack Browser — double-click AI browser with anti-bot stealth (#695)	2026-04-04 10:17:05 -07:00
host-adapters	feat: declarative multi-host platform + OpenCode, Slate, Cursor, OpenClaw (v0.15.5.0) (#793)	2026-04-04 15:32:20 -07:00
resolvers	v1.54.0.0 feat: carve /ship into skeleton + on-demand sections (-59% always-loaded) (#1806)	2026-05-30 12:09:10 -07:00
analytics.ts	feat: safety hook skills + skill usage telemetry (v0.7.1) (#189)	2026-03-18 23:57:59 -05:00
archetypes.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039)	2026-04-18 15:05:42 +08:00
brain-cache-spec.ts	v1.52.1.0 feat: brain-aware planning — 5 skills read structured gbrain context before asking (#1742)	2026-05-29 08:35:00 -07:00
build-app.sh	v1.41.1.0 fix wave: 7 HIGH bugs from external audit + regression tests (PR #1169 follow-up) (#1592)	2026-05-20 06:56:41 -07:00
build.sh	v1.42.0.0 Daegu wave: 23 community-filed bugs + PTY classifier enforcement (24 bisect commits) (#1594)	2026-05-20 07:35:01 -07:00
capture-baseline.ts	v1.46.0.0 feat: gstack v2 foundation — catalog tokens drop 56%, eval-first floor covers all 51 skills (#1712)	2026-05-26 16:50:03 -07:00
compare-pr-version.ts	v1.16.0.0 feat: tunnel allowlist 17→26 + canDispatchOverTunnel pure function (#1253)	2026-04-28 00:57:28 -07:00
declared-annotation.ts	v1.52.0.0 feat(plan-tune): explicit consent + first-run setup wizard for contributors (#1741)	2026-05-28 18:21:09 -07:00
detect-bump.ts	v1.11.0.0 feat(ship): workspace-aware version allocation (#1168)	2026-04-23 23:03:27 -07:00
dev-skill.ts	feat: Wave 3 — community bug fixes & platform support (v0.11.6.0) (#359)	2026-03-23 22:15:23 -07:00
discover-skills.ts	v1.54.0.0 feat: carve /ship into skeleton + on-demand sections (-59% always-loaded) (#1806)	2026-05-30 12:09:10 -07:00
eval-compare.ts	feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)	2026-03-23 23:05:22 -07:00
eval-list.ts	feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)	2026-03-23 23:05:22 -07:00
eval-select.ts	feat: diff-based test selection for E2E and LLM-judge evals (v0.6.1.0) (#139)	2026-03-17 18:45:41 -05:00
eval-summary.ts	feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)	2026-03-23 23:05:22 -07:00
eval-watch.ts	feat: /land-and-deploy, /canary, /benchmark + perf review (v0.7.0) (#183)	2026-03-21 14:31:36 -07:00
garry-output-comparison.ts	fix: remove hardcoded author emails from throughput script	2026-04-18 15:36:50 +08:00
gen-llms-txt.ts	v1.28.0.0 feat: browse --headed/--proxy/--navigate + gstack/llms.txt + webdriver-only stealth (#1363)	2026-05-07 20:14:59 -07:00
gen-skill-docs.ts	v1.54.0.0 feat: carve /ship into skeleton + on-demand sections (-59% always-loaded) (#1806)	2026-05-30 12:09:10 -07:00
gstack-schema-pack.ts	v1.52.1.0 feat: brain-aware planning — 5 skills read structured gbrain context before asking (#1742)	2026-05-29 08:35:00 -07:00
host-config-export.ts	feat: declarative multi-host platform + OpenCode, Slate, Cursor, OpenClaw (v0.15.5.0) (#793)	2026-04-04 15:32:20 -07:00
host-config.ts	feat: OpenClaw integration v2 — prompt is the bridge (v0.15.9.0) (#816)	2026-04-05 02:23:59 -07:00
jargon-list.json	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039)	2026-04-18 15:05:42 +08:00
models.ts	feat(v1.5.2.0): Opus 4.7 migration — model overlay, voice, routing (#1117)	2026-04-22 01:06:22 -07:00
one-way-doors.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039)	2026-04-18 15:05:42 +08:00
preflight-agent-sdk.ts	v1.39.2.0 feat: GSTACK_* env-shim for Conductor + gbrain/gstack setup docs (#1534)	2026-05-16 12:32:33 -07:00
proactive-suggestions.json	v1.47.0.0 feat: /spec — author backlog-ready spec in 5 phases + optional agent spawn (#1698) (#1733)	2026-05-26 21:36:53 -07:00
psychographic-signals.ts	v1.52.0.0 feat(plan-tune): explicit consent + first-run setup wizard for contributors (#1741)	2026-05-28 18:21:09 -07:00
question-registry.ts	v1.52.0.0 feat(plan-tune): explicit consent + first-run setup wizard for contributors (#1741)	2026-05-28 18:21:09 -07:00
setup-scc.sh	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039)	2026-04-18 15:05:42 +08:00
skill-check.ts	v1.15.0.0 feat: slim preamble + real-PTY plan-mode E2E harness (#1215)	2026-04-26 13:55:13 -07:00
slop-diff.ts	security: tunnel dual-listener + SSRF + envelope + path wave (v1.6.0.0) (#1137)	2026-04-21 21:58:27 -07:00
task-emission-schema.ts	v1.38.1.0 fix wave: surrogate-safe page captures (#1440), Implementation Tasks across review skills (#1454), root-level artifact patterns (#1452) (#1504)	2026-05-14 21:46:50 -07:00
test-free-shards.ts	v1.24.0.0 feat: cross-platform hardening — curated Windows lane + Bun.which resolver + path-portability helper (#1252)	2026-05-01 07:21:28 -07:00
update-readme-throughput.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039)	2026-04-18 15:05:42 +08:00
write-version-files.sh	v1.42.0.0 Daegu wave: 23 community-filed bugs + PTY classifier enforcement (24 bisect commits) (#1594)	2026-05-20 07:35:01 -07:00