gstack

History

Garry Tan e22ce8e28e test: collision sentinel covers every gstack skill across every host Universal insurance policy against upstream slash-command shadowing. The /checkpoint bug (Claude Code shipped /checkpoint as a /rewind alias, silently shadowing the gstack skill) cost us weeks of user confusion before we realized. This test is the "never again" check: enumerate every gstack skill name and cross-check against a per-host list of known built-in slash commands. Architecture: - KNOWN_BUILTINS per host. Currently Claude Code: 23 built-ins (checkpoint, rewind, compact, plan, cost, stats, context, usage, help, clear, quit, exit, agents, mcp, model, permissions, config, init, review, security-review, continue, bare, model). Sourced from docs + live skill-list dumps + claude --help output. - KNOWN_COLLISIONS_TOLERATED: skill names that DO collide but we've consciously decided to live with. Mandatory justification comment per entry. - GENERIC_VERB_WATCHLIST: advisory list of names at higher risk of future collision (save, load, run, deploy, start, stop, etc.). Prints a warning but doesn't fail. Tests (6 total, 26ms, free tier): 1. At least one skill discovered (enumerator sanity) 2. No duplicate skill names within gstack 3. No skill name collides with any claude-code built-in (with KNOWN_COLLISIONS_TOLERATED escape hatch) 4. KNOWN_COLLISIONS_TOLERATED entries are all still live collisions (prevents stale exceptions rotting after a rename) 5. The /checkpoint rename actually landed (checkpoint not in skills, context-save and context-restore are) 6. Advisory: generic-verb watchlist (informational only) Current real collisions: - /review — gstack pre-dates Claude Code's /review. Tolerated with written justification (track user confusion, rename to /diff-review if it bites). The rest of gstack is collision-free. Maintenance: when a host ships a new built-in, add the name to the host's KNOWN_BUILTINS list. If a gstack skill needs to coexist with a built-in, add an entry to KNOWN_COLLISIONS_TOLERATED with a written justification. Blind additions fail code review. TODO: add codex/kiro/opencode/slate/cursor/openclaw/hermes/factory/ gbrain built-in lists as we encounter collisions. Claude Code is the primary shadow risk (biggest audience, fastest release cadence). Note: bun's parser chokes on backticks inside block comments (spec- legal but regex-breaking in @oven/bun-parser). Workaround: avoid them.		2026-04-19 05:47:41 +08:00
..
fixtures	merge: origin/main v1.1.1.0 into garrytan/fix-checkpoints	2026-04-19 00:02:07 +08:00
helpers	test: tier-1 live-fire E2E for context-save + context-restore	2026-04-19 05:41:49 +08:00
analytics.test.ts	feat: safety hook skills + skill usage telemetry (v0.7.1) (#189 )	2026-03-18 23:57:59 -05:00
audit-compliance.test.ts	fix: security audit round 2 (v0.13.4.0) (#640 )	2026-03-29 22:46:33 -06:00
builder-profile.test.ts	feat: relationship closing — office-hours adapts to repeat users (v0.16.2.0) (#937 )	2026-04-08 22:21:28 -10:00
codex-e2e.test.ts	feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425 )	2026-03-23 23:05:22 -07:00
codex-hardening.test.ts	codex + Apple Silicon hardening wave (v0.18.4.0) (#1056 )	2026-04-18 12:30:54 +08:00
context-save-hardening.test.ts	test: tier-2 hardening tests for context-save + context-restore	2026-04-19 05:38:16 +08:00
diff-scope.test.ts	feat: Review Army — parallel specialist reviewers for /review (v0.14.3.0) (#692 )	2026-03-30 22:07:50 -06:00
explain-level-config.test.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039 )	2026-04-18 15:05:42 +08:00
gemini-e2e.test.ts	feat: Confusion Protocol, Hermes + GBrain hosts, brain-first resolver (v0.18.0.0) (#1005 )	2026-04-16 10:41:38 -07:00
gen-skill-docs.test.ts	codex + Apple Silicon hardening wave (v0.18.4.0) (#1056 )	2026-04-18 12:30:54 +08:00
global-discover.test.ts	fix: close redundant PRs + friendly error on all design commands (v0.15.8.1) (#817 )	2026-04-05 02:02:06 -07:00
gstack-developer-profile.test.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039 )	2026-04-18 15:05:42 +08:00
gstack-question-log.test.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039 )	2026-04-18 15:05:42 +08:00
gstack-question-preference.test.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039 )	2026-04-18 15:05:42 +08:00
hook-scripts.test.ts	feat: safety hook skills + skill usage telemetry (v0.7.1) (#189 )	2026-03-18 23:57:59 -05:00
host-config.test.ts	community wave: 6 PRs + hardening (v0.18.1.0) (#1028 )	2026-04-17 00:45:13 -07:00
jargon-list.test.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039 )	2026-04-18 15:05:42 +08:00
learnings-injection.test.ts	fix: community security wave — 8 PRs, 4 contributors (v0.15.13.0) (#847 )	2026-04-06 00:47:04 -07:00
learnings.test.ts	feat: GStack Learns — per-project self-learning infrastructure (v0.13.4.0) (#622 )	2026-03-29 17:02:01 -06:00
migration-checkpoint-ownership.test.ts	merge: origin/main v1.1.1.0 into garrytan/fix-checkpoints	2026-04-19 00:02:07 +08:00
openclaw-native-skills.test.ts	community wave: 6 PRs + hardening (v0.18.1.0) (#1028 )	2026-04-17 00:45:13 -07:00
plan-tune.test.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039 )	2026-04-18 15:05:42 +08:00
readme-throughput.test.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039 )	2026-04-18 15:05:42 +08:00
relink.test.ts	fix: headed browser auto-shutdown + disconnect cleanup (v0.18.1.0) (#1025 )	2026-04-16 15:39:44 -07:00
review-log.test.ts	fix: community PRs + security hardening + E2E stability (v0.12.7.0) (#552 )	2026-03-26 23:21:27 -06:00
setup-codesign.test.ts	codex + Apple Silicon hardening wave (v0.18.4.0) (#1056 )	2026-04-18 12:30:54 +08:00
ship-version-sync.test.ts	fix(ship): detect + repair VERSION/package.json drift in Step 12 (v1.1.1.0) (#1063 )	2026-04-18 23:58:59 +08:00
skill-collision-sentinel.test.ts	test: collision sentinel covers every gstack skill across every host	2026-04-19 05:47:41 +08:00
skill-e2e-autoplan-dual-voice.test.ts	security: harden migration + context-save after adversarial review	2026-04-18 23:28:12 +08:00
skill-e2e-bws.test.ts	fix: cookie picker auth token leak (v0.15.17.0) (#904 )	2026-04-08 10:10:13 -07:00
skill-e2e-context-skills.test.ts	test: tier-1 live-fire E2E for context-save + context-restore	2026-04-19 05:41:49 +08:00
skill-e2e-cso.test.ts	feat: /cso v2 — infrastructure-first security audit (v0.11.6.0) (#384 )	2026-03-23 06:57:22 -07:00
skill-e2e-deploy.test.ts	feat: /land-and-deploy first-run dry run + staging-first + trust ladder (v0.12.2.0) (#518 )	2026-03-26 11:08:31 -07:00
skill-e2e-design.test.ts	feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360 )	2026-03-23 10:17:33 -07:00
skill-e2e-learnings.test.ts	feat: recursive self-improvement — operational learning + full skill wiring (v0.13.8.0) (#647 )	2026-03-31 23:08:22 -06:00
skill-e2e-plan-tune.test.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039 )	2026-04-18 15:05:42 +08:00
skill-e2e-plan.test.ts	test: E2E tests for plan review report and Codex offering (v0.11.15.0) (#449 )	2026-03-24 07:30:24 -07:00
skill-e2e-qa-bugs.test.ts	feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360 )	2026-03-23 10:17:33 -07:00
skill-e2e-qa-workflow.test.ts	feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360 )	2026-03-23 10:17:33 -07:00
skill-e2e-review-army.test.ts	feat: Review Army — parallel specialist reviewers for /review (v0.14.3.0) (#692 )	2026-03-30 22:07:50 -06:00
skill-e2e-review.test.ts	feat: Confusion Protocol, Hermes + GBrain hosts, brain-first resolver (v0.18.0.0) (#1005 )	2026-04-16 10:41:38 -07:00
skill-e2e-session-intelligence.test.ts	tests: split checkpoint-save-resume into context-save + context-restore E2Es	2026-04-18 16:42:52 +08:00
skill-e2e-sidebar.test.ts	feat: declarative multi-host platform + OpenCode, Slate, Cursor, OpenClaw (v0.15.5.0) (#793 )	2026-04-04 15:32:20 -07:00
skill-e2e-workflow.test.ts	refactor: extract TabSession for per-tab state isolation (v0.15.16.0) (#873 )	2026-04-07 00:23:36 -07:00
skill-e2e.test.ts	feat: recursive self-improvement — operational learning + full skill wiring (v0.13.8.0) (#647 )	2026-03-31 23:08:22 -06:00
skill-llm-eval.test.ts	feat: voice directive for all skills (v0.12.3.0) (#520 )	2026-03-26 17:31:53 -06:00
skill-parser.test.ts	feat: SKILL.md template system, 3-tier testing, DX tools (v0.3.3) (#41 )	2026-03-13 21:08:12 -07:00
skill-routing-e2e.test.ts	feat: Confusion Protocol, Hermes + GBrain hosts, brain-first resolver (v0.18.0.0) (#1005 )	2026-04-16 10:41:38 -07:00
skill-validation.test.ts	feat: context rot defense for /ship — subagent isolation + clean step numbering (v0.18.1.0) (#1030 )	2026-04-16 23:14:03 -07:00
team-mode.test.ts	feat: Confusion Protocol, Hermes + GBrain hosts, brain-first resolver (v0.18.0.0) (#1005 )	2026-04-16 10:41:38 -07:00
telemetry.test.ts	feat: community wave — 7 fixes, relink, sidebar Write, discoverability (v0.13.5.0) (#641 )	2026-03-29 21:43:36 -06:00
timeline.test.ts	feat: Session Intelligence Layer — /checkpoint + /health + context recovery (v0.15.0.0) (#733 )	2026-04-01 00:50:42 -06:00
touchfiles.test.ts	feat: recursive self-improvement — operational learning + full skill wiring (v0.13.8.0) (#647 )	2026-03-31 23:08:22 -06:00
uninstall.test.ts	feat: community PRs — faster install, skill namespacing, uninstall, Codex fallback, Windows fix, Python patterns (v0.12.9.0) (#561 )	2026-03-27 00:44:37 -06:00
upgrade-migration-v1.test.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039 )	2026-04-18 15:05:42 +08:00
v0-dormancy.test.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039 )	2026-04-18 15:05:42 +08:00
worktree.test.ts	feat: content security — 4-layer prompt injection defense for pair-agent (#815 )	2026-04-06 14:41:06 -07:00
writing-style-resolver.test.ts	feat: gstack v1 — simpler prompts + real LOC receipts (v1.0.0.0) (#1039 )	2026-04-18 15:05:42 +08:00