claw-code

Commit Graph

Author	SHA1	Message	Date
YeonGyu-Kim	aee5263aef	test(tools): prove recovery loop against .claw/worker-state.json directly recovery_loop_state_file_reflects_transitions reads the actual state file after each transition to verify the canonical observability surface reflects the full stall->resolve->ready progression: spawning (state file exists, seconds_since_update present) -> trust_required (is_ready=false, trust_gate_cleared=false in file) -> spawning (trust_gate_cleared=true after WorkerResolveTrust) -> ready_for_prompt (is_ready=true after ready screen observe) This is the end-to-end proof gaebal-gajae called for: clawhip polling .claw/worker-state.json will see truthful state at every step of the recovery loop, including the seconds_since_update staleness signal. 90 tool tests passing, 0 failing.	2026-04-08 04:38:38 +09:00
YeonGyu-Kim	9461522af5	feat(tools): expose WorkerObserveCompletion tool; add provider-degraded classification tests observe_completion() on WorkerRegistry classifies finish_reason into Finished vs Failed (finish='unknown' + 0 tokens = provider degraded). This logic existed in the runtime but had no tool wrapper — clawhip could not call it. Added WorkerObserveCompletion as a first-class tool. Tool schema: { worker_id, finish_reason: string, tokens_output: integer } Handler: run_worker_observe_completion -> global_worker_registry().observe_completion() Tests added: - worker_observe_completion_success_finish_sets_finished_status finish=end_turn + tokens=512 -> status=finished - worker_observe_completion_degraded_provider_sets_failed_status finish=unknown + tokens=0 -> status=failed, last_error populated 89 tool tests passing, 0 failing.	2026-04-08 04:35:05 +09:00
YeonGyu-Kim	c08f060ca1	test(tools): end-to-end stall-detect and recovery loop coverage Proves the clawhip restart/recover flow that gaebal-gajae flagged: 1. stall_detect_and_resolve_trust_end_to_end - Worker created without trusted_roots -> trust_auto_resolve=false - WorkerObserve with trust-prompt text -> status=trust_required, gate cleared=false - WorkerResolveTrust -> status=spawning, trust_gate_cleared=true - WorkerObserve with ready text -> status=ready_for_prompt Full resolve path verified end-to-end. 2. stall_detect_and_restart_recovery_end_to_end - Worker stalls at trust_required - WorkerRestart resets to spawning, trust_gate_cleared=false Documents the restart-then-re-acquire-trust flow. Note: seconds_since_update is in .claw/worker-state.json (state file), not in the Worker tool output struct. Staleness detection via state file is covered by emit_state_file_writes_worker_status_on_transition in worker_boot.rs tests. 87 tool tests passing, 0 failing.	2026-04-08 04:09:55 +09:00
YeonGyu-Kim	aa37dc6936	test(tools): add coverage for WorkerRestart and WorkerTerminate tools WorkerRestart and WorkerTerminate had zero test coverage despite being public tools in the tool spec. Also confirms one design decision worth noting: restart resets trust_gate_cleared=false, so an allowlisted worker that gets restarted must re-acquire trust via the normal observe flow (by design — trust is per-session, not per-CWD). Tests added: - worker_terminate_sets_finished_status - worker_restart_resets_to_spawning (verifies status=spawning, prompt_in_flight=false, trust_gate_cleared=false) - worker_terminate_on_unknown_id_returns_error - worker_restart_on_unknown_id_returns_error 85 tool tests passing, 0 failing.	2026-04-08 03:33:05 +09:00
YeonGyu-Kim	6ddfa78b7c	feat(tools): wire config.trusted_roots into WorkerCreate tool Previously WorkerCreate passed trusted_roots directly to spawn_worker with no config-level default. Any batch script omitting the field stalled all workers at TrustRequired with no recovery path. Now run_worker_create loads RuntimeConfig from the worker CWD before spawning and merges config.trusted_roots() with per-call overrides. Per-call overrides still take effect; config provides the default. Add test: worker_create_merges_config_trusted_roots_without_per_call_override - writes .claw/settings.json with trustedRoots=[<os-temp-dir>] in a temp worktree - calls WorkerCreate with no trusted_roots field - asserts trust_auto_resolve=true (config roots matched the CWD) 81 tool tests passing, 0 failing.	2026-04-08 03:08:13 +09:00
YeonGyu-Kim	0f2f02af2d	feat: b6-http-proxy-v2 follow-up work — batch 6	2026-04-07 16:11:51 +09:00
YeonGyu-Kim	18d3c1918b	feat: b6-http-proxy-v2 — batch 6	2026-04-07 15:52:30 +09:00
YeonGyu-Kim	d509f16b5a	feat: b5-skip-perms-flag — batch 5 upstream parity	2026-04-07 14:51:27 +09:00
Yeachan-Heo	549ad7c3af	Restore compatibility skill lookup fallback	2026-04-06 09:11:27 +00:00
Yeachan-Heo	421ead7dba	Remove orphaned skill lookup helpers	2026-04-06 07:56:50 +00:00
Yeachan-Heo	f9cb42fb44	Resolve claw-code main merge conflicts	2026-04-06 07:16:57 +00:00
Yeachan-Heo	01b263c838	Let /skills invocations reach the prompt skill path The CLI still treated every /skills payload other than list/install/help as local usage text, so skills that appeared in /skills could not actually be invoked. This restores prompt dispatch for /skills <skill> [args], keeps list/install on the local path, and shares skill resolution with the Skill tool so project-local and legacy /commands entries resolve consistently. Constraint: --resume local slash execution still only supports local commands without provider turns Rejected: Implement full resumed prompt-turn execution for /skills \| larger behavior change outside this bugfix Rejected: Keep separate skill lookups in tools and commands \| drift already caused listing/invocation mismatches Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep /skills discovery, CLI prompt dispatch, and Tool Skill resolution on the same registry semantics Tested: cargo fmt --all; cargo clippy -p commands -p tools -p rusty-claude-cli --all-targets -- -D warnings; cargo test --workspace -- --nocapture Not-tested: Live provider-backed /skills invocation against external skill packs in an interactive REPL	2026-04-06 06:43:31 +00:00
Yeachan-Heo	831d8a2d4b	Classify quiet agent states before they look stale Persist derived machine states for agent manifests so downstream monitors can distinguish working, blocked, degraded, and finished-cleanable lanes without inferring everything from prose. This also records commit provenance in terminal-state manifests and marks the new session-state classification roadmap item as done. Constraint: Keep the change scoped to manifest persistence and tests without introducing a new monitoring service layer Rejected: Leave state classification as downstream text scraping only \| repeated dogfood runs showed quiet/finished lanes being misreported as stale Confidence: medium Scope-risk: narrow Directive: Reuse derived_state + commit provenance from manifests before adding any new stale-session heuristics elsewhere Tested: python .github/scripts/check_doc_source_of_truth.py Tested: cd rust && cargo fmt --all --check Tested: cd rust && cargo test -q -p tools Tested: cd rust && cargo clippy -p tools --all-targets --no-deps -- -D warnings Not-tested: full cargo clippy --workspace --all-targets -- -D warnings still fails on unrelated pre-existing runtime lint debt	2026-04-05 18:47:23 +00:00
Yeachan-Heo	19c6b29524	Close the clawability backlog with deterministic CLI output and lane lineage Finish the remaining roadmap work by making direct CLI JSON output deterministic across the non-interactive surface, restoring the degraded-startup MCP test as a real workspace test, and adding branch-lock plus commit-lineage primitives so downstream lane consumers can distinguish superseded worktree commits from canonical lineage. Constraint: Keep the user-facing config namespace centered on .claw while preserving legacy fallback discovery for compatibility Constraint: Verification needed to stay clean-room and reproducible from the checked-in workspace alone Rejected: Leave the output-format contract implied by ad-hoc smoke runs only \| too easy for direct CLI regressions to slip back into prose-only output Rejected: Keep commit provenance as free-form detail text \| downstream consumers need structured branch/worktree/supersession metadata Confidence: medium Scope-risk: moderate Directive: Extend the JSON contract through the same direct CLI entrypoints instead of adding one-off serializers on parallel code paths Tested: python .github/scripts/check_doc_source_of_truth.py Tested: cd rust && cargo fmt --all --check Tested: cd rust && cargo test --workspace Tested: cd rust && cargo clippy -p commands -p tools -p rusty-claude-cli --all-targets --no-deps -- -D warnings Not-tested: full cargo clippy --workspace --all-targets -- -D warnings still reports unrelated pre-existing runtime lint debt outside this change set	2026-04-05 18:41:02 +00:00
Yeachan-Heo	2dd05bfcef	Make .claw the only user-facing config namespace Agents, skills, and init output were still surfacing .codex/.claude paths even though the runtime already treats .claw as the canonical config home. This updates help text, reports, skill install defaults, and repo bootstrap output to present a single .claw namespace while keeping legacy discovery fallbacks in place for existing setups. Constraint: Existing .codex/.claude agent and skill directories still need to load for compatibility Rejected: Remove legacy discovery entirely \| would break existing user setups instead of just cleaning up surfaced output Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep future user-facing config, agent, and skill path copy aligned to .claw and even when legacy fallbacks remain supported internally Tested: cargo fmt --all --check; cargo test --workspace --exclude compat-harness Not-tested: cargo clippy --workspace --all-targets -- -D warnings \| fails in pre-existing unrelated runtime files (for example mcp_lifecycle_hardened.rs, mcp_tool_bridge.rs, lsp_client.rs, permission_enforcer.rs, recovery_recipes.rs, stale_branch.rs, task_registry.rs, team_cron_registry.rs, worker_boot.rs)	2026-04-05 18:11:25 +00:00
Yeachan-Heo	31163be347	style: cargo fmt	2026-04-05 16:56:48 +00:00
Yeachan-Heo	cd1ee43f33	fix: suppress dead_code warnings for unused provider and lane completion items	2026-04-05 03:22:32 +00:00
Yeachan-Heo	dbfc9d521c	Track runtime tasks with structured task packets Replace the oversized packet model with the requested JSON-friendly packet shape and thread it through the in-memory task registry. Add the RunTaskPacket tool so callers can launch packet-backed tasks directly while preserving existing task creation flows. Constraint: The existing task system and tool surface had to keep TaskCreate behavior intact while adding packet-backed execution Rejected: Add a second parallel packet registry \| would duplicate task lifecycle state Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep TaskPacket aligned with the tool schema and task registry serialization when extending the packet contract Tested: cargo build --workspace; cargo test --workspace Not-tested: live end-to-end invocation of RunTaskPacket through an interactive CLI session	2026-04-04 15:11:26 +00:00
Yeachan-Heo	784f07abfa	Harden worker boot recovery before task dispatch The worker boot registry now exposes the requested lifecycle states, emits structured trust and prompt-delivery events, and recovers from shell or wrong-target prompt delivery by replaying the last prompt. Supporting fixes keep MCP remote config parsing backwards-compatible and make CLI argument parsing less dependent on ambient config and cwd state so the workspace stays green under full parallel test runs. Constraint: Worker prompts must not be dispatched before a confirmed ready_for_prompt handshake Constraint: Prompt misdelivery recovery must stay minimal and avoid new dependencies Rejected: Keep prompt_accepted and blocked as public lifecycle states \| user requested the narrower explicit state set Rejected: Treat url-only MCP server configs as invalid \| existing CLI/runtime tests still rely on that shorthand Confidence: high Scope-risk: moderate Reversibility: clean Directive: Preserve prompt_in_flight semantics when extending worker boot; misdelivery detection depends on it Tested: cargo build --workspace; cargo test --workspace Not-tested: Live tmux worker delivery against a real external coding agent pane	2026-04-04 14:50:43 +00:00
Yeachan-Heo	8a9ea1679f	feat(mcp+lifecycle): MCP degraded-startup reporting, lane event schema, lane completion hardening Add MCP structured degraded-startup classification (P2.10): - classify MCP failures as startup/handshake/config/partial - expose failed_servers + recovery_recommendations in tool output - add mcp_degraded output field with server_name, failure_mode, recoverable Canonical lane event schema (P2.7): - add LaneEventName variants for all lifecycle states - wire LaneEvent::new with full 3-arg signature (event, status, emitted_at) - emit typed events for Started, Blocked, Failed, Finished Fix let mut executor for search test binary Fix lane_completion unused import warnings Note: mcp_stdio::manager_discovery_report test has pre-existing failure on clean main, unrelated to this commit.	2026-04-04 14:31:56 +00:00
Yeachan-Heo	639a54275d	Stop stale branches from polluting workspace test signals Workspace-wide verification now preflights the current branch against main so stale or diverged branches surface missing commits before broad cargo tests run. The lane failure taxonomy is also collapsed to the blocker classes the roadmap lane needs so automation can branch on a smaller, stable set of categories. Constraint: Broad workspace tests should not run when main is ahead and would produce stale-branch noise Rejected: Run workspace tests unconditionally \| makes stale-branch failures indistinguishable from real regressions Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Keep workspace-test preflight scoped to broad test commands until command classification grows more precise Tested: cargo test -p runtime stale_branch -- --nocapture; cargo test -p tools lane_failure_taxonomy_normalizes_common_blockers -- --nocapture; cargo test -p tools bash_workspace_tests_are_blocked_when_branch_is_behind_main -- --nocapture; cargo test -p tools bash_targeted_tests_skip_branch_preflight -- --nocapture Not-tested: clean worktree cargo test --workspace still fails on pre-existing rusty-claude-cli tests default_permission_mode_uses_project_config_when_env_is_unset and single_word_slash_command_names_return_guidance_instead_of_hitting_prompt_mode	2026-04-04 14:01:31 +00:00
Jobdori	fc675445e6	feat(tools): add lane_completion module (P1.3) Implement automatic lane completion detection: - detect_lane_completion(): checks session-finished + tests-green + pushed - evaluate_completed_lane(): triggers CloseoutLane + CleanupSession actions - 6 tests covering all conditions Bridges the gap where LaneContext::completed was a passive bool that nothing automatically set. Now completion is auto-detected. ROADMAP P1.3 marked done.	2026-04-04 22:05:49 +09:00
Jobdori	2dfda31b26	feat(tools): wire SummaryCompressor into lane.finished event detail The SummaryCompressor (runtime::summary_compression) was exported but called nowhere. Lane events emitted a Finished variant with detail: None even when the agent produced a result string. Wire compress_summary_text() into the Finished event detail field so that: - result prose is compressed to ≤1200 chars / 24 lines before storage - duplicate lines and whitespace noise are removed - the event detail is machine-readable, not raw prose blob - None is still emitted when result is empty/None (no regression) This is the P1.4 wiring item from ROADMAP: 'Wire SummaryCompressor into the lane event pipeline — exported but called nowhere; LaneEvent stream never fed through compressor.' cargo test --workspace: 643 pass (1 pre-existing flaky), fmt clean.	2026-04-04 16:35:33 +09:00
Jobdori	d558a2d7ac	feat(policy): add lane reconciliation events and policy support Add terminal lane states for when a lane discovers its work is already landed in main, superseded by another lane, or has an empty diff: LaneEventName: - lane.reconciled — branch already merged, no action needed - lane.merged — work successfully merged - lane.superseded — work replaced by another lane/commit - lane.closed — lane manually closed PolicyAction::Reconcile with ReconcileReason enum: - AlreadyMerged — branch tip already in main - Superseded — another lane landed the same work - EmptyDiff — PR would be empty - ManualClose — operator closed the lane PolicyCondition::LaneReconciled — matches lanes that reached a no-action-required terminal state. LaneContext::reconciled() constructor for lanes that discovered they have nothing to do. This closes the gap where lanes like 9404-9410 could discover 'nothing to do' but had no typed terminal state to express it. The policy engine can now auto-closeout reconciled lanes instead of leaving them in limbo. Addresses ROADMAP P1.3 (lane-completion emitter) groundwork. Tests: 4 new tests covering reconcile rule firing, context defaults, non-reconciled lanes not triggering reconcile rules, and reason variant distinctness. Full workspace suite: 643 pass, 0 fail.	2026-04-04 16:12:06 +09:00
Jobdori	13015f6428	feat(runtime): hardened MCP lifecycle with phase tracking and degraded-mode reporting	2026-04-04 00:42:43 +09:00
Yeachan-Heo	f76311f9d6	Prevent worker prompts from outrunning boot readiness Add a foundational worker_boot control plane and tool surface for reliable startup. The new registry tracks trust gates, ready-for-prompt handshakes, prompt delivery attempts, and shell misdelivery recovery so callers can coordinate worker boot above raw terminal transport. Constraint: Current main has no tmux-backed worker control API to extend directly Constraint: First slice must stay deterministic and fully testable in-process Rejected: Wire the first implementation straight to tmux panes \| would couple transport details to unfinished state semantics Rejected: Ship parser helpers without control tools \| would not enforce the ready-before-prompt contract end to end Confidence: high Scope-risk: moderate Reversibility: clean Directive: Treat WorkerObserve heuristics as a temporary transport adapter and replace them with typed runtime events before widening automation policy Tested: cargo test -p runtime worker_boot Tested: cargo test -p tools worker_tools Tested: cargo check -p runtime -p tools Not-tested: Real tmux/TTY trust prompts and live worker boot on an actual coding session Not-tested: Full cargo clippy -p runtime -p tools --all-targets -- -D warnings (fails on pre-existing warnings outside this slice)	2026-04-03 15:20:22 +00:00
Yeachan-Heo	56ee33e057	Make agent lane state machine-readable The background Agent tool already persisted lane-adjacent state via a JSON manifest and a markdown transcript, making it the smallest viable vertical slice for the ROADMAP lane-event work. This change adds canonical typed lane events to the manifest and normalizes terminal blockers into the shared failure taxonomy so downstream clawhip-style consumers can branch on structured state instead of scraping prose alone. The slice is intentionally narrow: it covers agent start, finish, blocked, and failed transitions plus blocker classification, while leaving broader lane orchestration and external consumers for later phases. Tests lock the manifest schema and taxonomy mapping so future extensions can add events without regressing the typed baseline. Constraint: Land a fresh-main vertical slice without inventing a larger lane framework first Rejected: Add a brand-new lane subsystem across crates \| too broad for one verified slice Rejected: Only add markdown log annotations \| still log-shaped and not machine-first Confidence: high Scope-risk: narrow Reversibility: clean Directive: Extend the same event names and failure classes before adding any alternate manifest schema for lane reporting Tested: cargo test -p tools agent_persists_handoff_metadata -- --nocapture Tested: cargo test -p tools agent_fake_runner_can_persist_completion_and_failure -- --nocapture Tested: cargo test -p tools lane_failure_taxonomy_normalizes_common_blockers -- --nocapture Not-tested: Full clawhip consumer integration or multi-crate event plumbing	2026-04-03 15:20:22 +00:00
Yeachan-Heo	bf5eb8785e	Recover the MCP lane on top of current main This resolves the stale-branch merge against origin/main, keeps the MCP runtime wiring, and preserves prompt-approved CLI tool execution after the mock parity harness additions landed upstream. Constraint: Branch had to absorb origin/main changes through a contentful merge before more MCP work Constraint: Prompt-approved runtime tool execution must continue working with new CLI/mock parity coverage Rejected: Keep permission enforcer attached inside CliToolExecutor for conversation turns \| caused prompt-approved bash parity flow to fail as a tool error Rejected: Defer the merge and continue on stale history \| would leave the lane red against current main Confidence: high Scope-risk: moderate Reversibility: clean Directive: Runtime permission policy and executor-side permission enforcement are separate layers; do not reapply executor enforcement to conversation turns without revalidating mock parity harness approval flows Tested: cargo test -p rusty-claude-cli --test mock_parity_harness -- --nocapture; cargo test -p rusty-claude-cli -- --nocapture; cargo test --workspace -- --nocapture Not-tested: Additional live remote/provider scenarios beyond the existing workspace suite	2026-04-03 14:51:18 +00:00
Yeachan-Heo	b3fe057559	Close the MCP lifecycle gap from config to runtime tool execution This wires configured MCP servers into the CLI/runtime path so discovered MCP tools, resource wrappers, search visibility, shutdown handling, and best-effort discovery all work together instead of living as isolated runtime primitives. Constraint: Keep non-MCP startup flows working without new required config Constraint: Preserve partial availability when one configured MCP server fails discovery Rejected: Fail runtime startup on any MCP discovery error \| too brittle for mixed healthy/broken server configs Rejected: Keep MCP support runtime-only without registry wiring \| left discovery and invocation unreachable from the CLI tool lane Confidence: high Scope-risk: moderate Reversibility: clean Directive: Runtime MCP tools are registry-backed but executed through CliToolExecutor state; keep future tool-registry changes aligned with that split Tested: cargo test -p runtime mcp -- --nocapture; cargo test -p tools -- --nocapture; cargo test -p rusty-claude-cli -- --nocapture; cargo test --workspace -- --nocapture Not-tested: Live remote MCP transports (http/sse/ws/sdk) remain unsupported in the CLI execution path	2026-04-03 14:31:25 +00:00
Jobdori	80ad9f4195	feat(tools): replace AskUserQuestion + RemoteTrigger stubs with real implementations - AskUserQuestion: interactive stdin/stdout prompt with numbered options - RemoteTrigger: real HTTP client (GET/POST/PUT/DELETE/PATCH/HEAD) with custom headers, body, 30s timeout, response truncation - All 480+ tests green	2026-04-03 19:37:34 +09:00
Jobdori	8cc7d4c641	chore: additional AI slop cleanup and enforcer wiring from sessions 1/5 Session 1 (ses_2ad65873): with_enforcer builders + 2 regression tests Session 5 (ses_2ad67e8e): continued AI slop cleanup pass — redundant comments, unused_self suppressions, unreachable! tightening Session cleanup (ses_2ad6b26c): Python placeholder centralization Workspace tests: 363+ passed, 0 failed.	2026-04-03 18:35:27 +09:00
Jobdori	618a79a9f4	feat: ultraclaw session outputs — registry tests, MCP bridge, PARITY.md, cleanup Ultraclaw mode results from 10 parallel opencode sessions: - PARITY.md: Updated both copies with all 9 landed lanes, commit hashes, line counts, and test counts. All checklist items marked complete. - MCP bridge: McpToolRegistry.call_tool now wired to real McpServerManager via async JSON-RPC (discover_tools -> tools/call -> shutdown) - Registry tests: Added coverage for TaskRegistry, TeamRegistry, CronRegistry, PermissionEnforcer, LspRegistry (branch-focused tests) - Permissions refactor: Simplified authorize_with_context, extracted helpers, added characterization tests (185 runtime tests pass) - AI slop cleanup: Removed redundant comments, unused_self suppressions, tightened unreachable branches - CLI fixes: Minor adjustments in main.rs and hooks.rs All 363+ tests pass. Workspace compiles clean.	2026-04-03 18:23:03 +09:00
Jobdori	f25363e45d	fix(tools): wire PermissionEnforcer into execute_tool dispatch path The review correctly identified that enforce_permission_check() was defined but never called. This commit: - Adds enforcer: Option<PermissionEnforcer> field to GlobalToolRegistry and SubagentToolExecutor - Adds set_enforcer() method for runtime configuration - Gates both execute() paths through enforce_permission_check() when an enforcer is configured - Default: None (Allow-all, matching existing behavior) Resolves the dead-code finding from ultraclaw review sessions 3 and 8.	2026-04-03 18:18:19 +09:00
Jobdori	66283f4dc9	feat(runtime+tools): PermissionEnforcer — permission mode enforcement layer Add PermissionEnforcer in crates/runtime/src/permission_enforcer.rs and wire enforce_permission_check() into crates/tools/src/lib.rs. Runtime additions: - PermissionEnforcer: wraps PermissionPolicy with enforcement API - check(tool, input): validates tool against active mode via policy.authorize() - check_file_write(path, workspace_root): workspace boundary enforcement - ReadOnly: deny all writes - WorkspaceWrite: allow within workspace, deny outside - DangerFullAccess/Allow: permit all - Prompt: deny (no prompter available) - check_bash(command): read-only command heuristic (60+ safe commands) - Detects -i/--in-place/redirect operators as non-read-only - is_within_workspace(): string-prefix boundary check - is_read_only_command(): conservative allowlist of safe CLI commands Tool wiring: - enforce_permission_check() public API for gating execute_tool() calls - Maps EnforcementResult::Denied to Err(reason) for tool dispatch 9 new tests covering all permission modes + workspace boundary + bash heuristic.	2026-04-03 17:55:04 +09:00
Jobdori	2d665039f8	feat(runtime+tools): LspRegistry — LSP client dispatch for tool surface Add LspRegistry in crates/runtime/src/lsp_client.rs and wire it into run_lsp() tool handler in crates/tools/src/lib.rs. Runtime additions: - LspRegistry: register/get servers by language, find server by file extension, manage diagnostics, dispatch LSP actions - LspAction enum (Diagnostics/Hover/Definition/References/Completion/Symbols/Format) - LspServerStatus enum (Connected/Disconnected/Starting/Error) - Diagnostic/Location/Hover/CompletionItem/Symbol types for structured responses - Action dispatch validates server status and path requirements Tool wiring: - run_lsp() maps LspInput to LspRegistry.dispatch() - Supports dynamic server lookup by file extension (rust/ts/js/py/go/java/c/cpp/rb/lua) - Caches diagnostics across servers 8 new tests covering registration, lookup, diagnostics, and dispatch paths. Bridges to existing LSP process manager for actual JSON-RPC execution.	2026-04-03 17:46:13 +09:00
Jobdori	730667f433	feat(runtime+tools): McpToolRegistry — MCP lifecycle bridge for tool surface Add McpToolRegistry in crates/runtime/src/mcp_tool_bridge.rs and wire it into all 4 MCP tool handlers in crates/tools/src/lib.rs. Runtime additions: - McpToolRegistry: register/get/list servers, list/read resources, call tools, set auth status, disconnect - McpConnectionStatus enum (Disconnected/Connecting/Connected/AuthRequired/Error) - Connection-state validation (reject ops on disconnected servers) - Resource URI lookup, tool name validation before dispatch Tool wiring: - ListMcpResources: queries registry for server resources - ReadMcpResource: looks up specific resource by URI - McpAuth: returns server auth/connection status - MCP (tool proxy): validates + dispatches tool calls through registry 8 new tests covering all lifecycle paths + error cases. Bridges to existing McpServerManager for actual JSON-RPC execution.	2026-04-03 17:39:35 +09:00
Jobdori	c486ca6692	feat(runtime+tools): TeamRegistry and CronRegistry — replace team/cron stubs Add TeamRegistry and CronRegistry in crates/runtime/src/team_cron_registry.rs and wire them into the 5 team+cron tool handlers in crates/tools/src/lib.rs. Runtime additions: - TeamRegistry: create/get/list/delete(soft)/remove(hard), task_ids tracking, TeamStatus (Created/Running/Completed/Deleted) - CronRegistry: create/get/list(enabled_only)/delete/disable/record_run, CronEntry with run_count and last_run_at tracking Tool wiring: - TeamCreate: creates team in registry, assigns team_id to tasks via TaskRegistry - TeamDelete: soft-deletes team with status transition - CronCreate: creates cron entry with real cron_id - CronDelete: removes entry, returns deleted schedule info - CronList: returns full entry list with run history 8 new tests (team + cron) — all passing.	2026-04-03 17:32:57 +09:00
Jobdori	e8692e45c4	feat(tools): wire TaskRegistry into task tool dispatch Replace all 6 task tool stubs (TaskCreate/Get/List/Stop/Update/Output) with real TaskRegistry-backed implementations: - TaskCreate: creates task in global registry, returns real task_id - TaskGet: retrieves full task state (status, messages, timestamps) - TaskList: lists all tasks with metadata - TaskStop: transitions task to stopped state with validation - TaskUpdate: appends user messages to task message history - TaskOutput: returns accumulated task output Global registry uses OnceLock<TaskRegistry> singleton per process. All existing tests pass (37 tools, 149 runtime, 102 CLI).	2026-04-03 17:26:26 +09:00
Jobdori	b9d0d45bc4	feat: add MCPTool + TestingPermissionTool — tool surface 40/40 Close the final tool parity gap: - MCP: dynamic tool proxy for connected MCP servers - TestingPermission: test-only permission enforcement verification Tool surface now matches upstream: 40/40. All stubs, fmt/clippy/tests green.	2026-04-03 07:50:51 +09:00
Jobdori	9b2d187655	feat: add remaining tool specs — Team, Cron, LSP, MCP, RemoteTrigger Port 10 more missing tool definitions from upstream parity audit: - TeamCreate, TeamDelete: parallel sub-agent team management - CronCreate, CronDelete, CronList: scheduled recurring tasks - LSP: Language Server Protocol code intelligence queries - ListMcpResources, ReadMcpResource, McpAuth: MCP server resource access - RemoteTrigger: remote action/webhook triggers All tools have full ToolSpec schemas and stub execute functions. Tool surface now 38/40 (was 28/40). Remaining: MCPTool (dynamic tool proxy) and TestingPermissionTool (test-only). fmt/clippy/tests all green.	2026-04-03 07:42:16 +09:00
Jobdori	64f4ed0ad8	feat: add AskUserQuestion + Task tool specs and stubs Port 7 missing tool definitions from upstream parity audit: - AskUserQuestionTool: ask user a question with optional choices - TaskCreate: create background sub-agent task - TaskGet: get task status by ID - TaskList: list all background tasks - TaskStop: stop a running task - TaskUpdate: send message to a running task - TaskOutput: retrieve task output All tools have full ToolSpec schemas registered in mvp_tool_specs() and stub execute functions wired into execute_tool(). Stubs return structured JSON responses; real sub-agent runtime integration is the next step. Closes parity gap: 21 -> 28 tools (upstream has 40). fmt/clippy/tests all green.	2026-04-03 07:39:21 +09:00
Jobdori	fbafb9cffc	fix: post-merge clippy/fmt cleanup (9407-9410 integration)	2026-04-03 05:12:51 +09:00
Yeachan-Heo	5c845d582e	Close the plan-mode parity gap for worktree-local tool flows PARITY.md still flags missing plan/worktree entry-exit tools. This change adds EnterPlanMode and ExitPlanMode to the Rust tool registry, stores reversible worktree-local state under .claw/tool-state, and restores or clears the prior local permission override on exit. The round-trip tests cover both restoring an existing local override and cleaning up a tool-created override from an empty local state. Constraint: Must keep the override worktree-local and reversible without mutating higher-scope settings Rejected: Reuse Config alone with no state file \| exit could not safely restore absent-vs-local overrides Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep plan-mode state tracking aligned with settings.local.json precedence before adding worktree enter/exit tools Tested: cargo test -p tools Not-tested: interactive CLI prompt-mode invocation of the new tools	2026-04-02 10:01:33 +00:00
YeonGyu-Kim	73187de6ea	feat(tools): error propagation, REPL timeout, edge-case validation - Replace NotebookEdit expect() with Result-based error propagation - Add 5-minute guard to Sleep duration - Reject empty StructuredOutput payloads - Enforce timeout_ms in REPL via spawn+try_wait+kill - Add edge-case tests: excessive/zero sleep, empty output, REPL timeout - Verified: cargo test -p tools 35 passed, clippy clean	2026-04-02 18:24:39 +09:00
YeonGyu-Kim	f5fa3e26c8	refactor(tools): replace panic paths with proper error handling - Convert permission_mode_from_plugin panic to Result-based error - Add input validation for tool dispatch edge cases - Propagate signature changes to main.rs caller - 29 tools tests pass, clippy clean	2026-04-02 18:04:55 +09:00
Yeachan-Heo	79da7c0adf	Make claw's REPL feel self-explanatory from analysis through commit Claw already had the core slash-command and git primitives, but the UX still made users work to discover them, understand current workspace state, and trust what `/commit` was about to do. This change tightens that flow in the same places Codex-style CLIs do: command discovery, live status, typo recovery, and commit preflight/output. The REPL banner and `/help` now surface a clearer starter path, unknown slash commands suggest likely matches, `/status` includes actionable git state, and `/commit` explains what it is staging and committing before and after the model writes the Lore message. I also cleared the workspace's existing clippy blockers so the verification lane can stay fully green. Constraint: Improve UX inside the existing Rust CLI surfaces without adding new dependencies Rejected: Add more slash commands first \| discoverability and feedback were the bigger friction points Rejected: Split verification lint fixes into a second commit \| user requested one solid commit Confidence: high Scope-risk: moderate Directive: Keep slash discoverability, status reporting, and commit reporting aligned so `/help`, `/status`, and `/commit` tell the same workflow story Tested: cargo fmt --all; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: Manual interactive REPL session against live Anthropic/xAI endpoints	2026-04-02 07:20:35 +00:00
YeonGyu-Kim	de228ee5a6	fix: forward prompt cache events through clients Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-02 11:38:24 +09:00
YeonGyu-Kim	0bd0914347	fix: stabilize merge fallout test fixtures	2026-04-02 11:31:53 +09:00
YeonGyu-Kim	8476d713a8	Merge remote-tracking branch 'origin/rcc/cache-tracking' into integration/dori-cleanroom	2026-04-02 11:17:13 +09:00
YeonGyu-Kim	164bd518a1	Merge remote-tracking branch 'origin/rcc/telemetry' into integration/dori-cleanroom	2026-04-02 11:13:56 +09:00
YeonGyu-Kim	1d4c8a8f50	Merge remote-tracking branch 'origin/rcc/sandbox' into integration/dori-cleanroom # Conflicts: # rust/crates/commands/src/lib.rs # rust/crates/runtime/src/config.rs # rust/crates/runtime/src/lib.rs # rust/crates/rusty-claude-cli/src/main.rs	2026-04-02 10:42:15 +09:00
YeonGyu-Kim	2929759ded	Merge remote-tracking branch 'origin/rcc/plugins' into integration/dori-cleanroom # Conflicts: # rust/crates/commands/src/lib.rs # rust/crates/claw-cli/src/main.rs	2026-04-01 19:13:53 +09:00
YeonGyu-Kim	c849c0672f	fix: resolve all post-merge compile errors - Fix unresolved imports (auto_compaction, AutoCompactionEvent) - Add Thinking/RedactedThinking match arms - Fix workspace.dependencies serde_json - Fix enum exhaustiveness in OutputContentBlock matches - cargo check --workspace passes	2026-04-01 18:59:55 +09:00
YeonGyu-Kim	6f1ff24cea	fix: update prompt tests for post-plugins-merge format	2026-04-01 18:52:23 +09:00
YeonGyu-Kim	c2e41ba205	fix: post-plugins-merge cleanroom fixes and workspace deps Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-01 18:48:39 +09:00
Yeachan-Heo	be2bce7f8e	Ignore reasoning blocks in runtime adapters without affecting tool/text flows After the parser can accept thinking-style blocks, the CLI and tools adapters must explicitly ignore them so only user-visible text and tool calls drive runtime behavior. This keeps reasoning metadata from surfacing as text or interfering with tool accumulation. Constraint: Runtime behavior must remain unchanged for normal text/tool streaming Rejected: Treat thinking blocks as assistant text \| would leak hidden reasoning into visible output and session flow Confidence: high Scope-risk: narrow Directive: If future features need persisted reasoning blocks, add a dedicated runtime representation instead of overloading text handling Tested: cargo test -p claw-cli response_to_events_ignores_thinking_blocks -- --nocapture; cargo test -p tools response_to_events_ignores_thinking_blocks -- --nocapture Not-tested: End-to-end interactive run against a live thinking-enabled model	2026-04-01 08:06:10 +00:00
Yeachan-Heo	aea2adb9c8	Allow subagent tool flows to reach plugin-provided tools The subagent runtime still advertised and executed only built-in tools, which left plugin-provided tools outside the Agent execution path. This change loads the same plugin-aware registry used by the CLI for subagent tool definitions, permission policy, and execution lookup so delegated runs can resolve plugin tools consistently. Constraint: Plugin tools must respect the existing runtime plugin config and enabled-plugin state Rejected: Thread plugin-specific exceptions through execute_tool directly \| would bypass registry validation and duplicate lookup rules Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Keep CLI and subagent registry construction aligned when plugin tool loading rules change Tested: cargo test -p tools -p claw-cli Not-tested: Live Anthropic subagent runs invoking plugin tools end-to-end	2026-04-01 07:36:05 +00:00
Yeachan-Heo	7c115d1e07	feat: plugin subsystem progress	2026-04-01 07:30:20 +00:00
Yeachan-Heo	b757e96c13	Keep plugin-aware CLI validation aligned with the shared registry The shared /plugins command flow already routes through the plugin registry, but allowed-tool normalization still fell back to builtin tools when registry construction failed. This keeps plugin-related validation errors visible at the CLI boundary and updates tools tests to use the enum-based plugin permission API so workspace verification remains green. Constraint: Plugin tool permissions are now strongly typed in the plugins crate Rejected: Restore string-based permission arguments in tests \| weakens the plugin API contract Rejected: Keep builtin fallback in normalize_allowed_tools \| masks plugin registry integration failures Confidence: high Scope-risk: narrow Reversibility: clean Directive: Do not silently bypass current_tool_registry() failures unless plugin-aware allowed-tool validation is intentionally being disabled Tested: cargo test -p commands -- --nocapture; cargo test --workspace Not-tested: Manual REPL /plugins interaction in a live session	2026-04-01 07:22:41 +00:00
Yeachan-Heo	5812c9bd9e	feat: plugin system follow-up progress	2026-04-01 07:20:13 +00:00
Yeachan-Heo	d7c943b78f	feat: plugin hooks + tool registry + CLI integration	2026-04-01 07:11:42 +00:00
Yeachan-Heo	ee0c4cd097	feat: plugin subsystem progress	2026-04-01 07:11:25 +00:00
Yeachan-Heo	61b4def7bc	feat: telemetry progress	2026-04-01 06:15:15 +00:00
Yeachan-Heo	c9d214c8d1	feat: cache-tracking progress	2026-04-01 06:15:13 +00:00
Yeachan-Heo	f92c9e962a	feat: grok provider tests + cargo fmt	2026-04-01 04:20:15 +00:00
Yeachan-Heo	5654efb7b2	feat: provider abstraction layer + Grok API support	2026-04-01 04:10:46 +00:00
Yeachan-Heo	6b5331576e	fix: auto compaction threshold default 200k tokens	2026-04-01 03:55:00 +00:00
Yeachan-Heo	1bd0eef368	Merge remote-tracking branch 'origin/rcc/subagent' into dev/rust	2026-04-01 03:12:25 +00:00
Yeachan-Heo	e95eb86d1b	Merge remote-tracking branch 'origin/rcc/subagent' into dev/rust	2026-04-01 03:12:25 +00:00
Yeachan-Heo	ba220d210e	Enable real Agent tool delegation in the Rust CLI The Rust Agent tool only persisted queued metadata, so delegated work never actually ran. This change wires Agent into a detached background conversation path with isolated runtime, API client, session state, restricted tool subsets, and file-backed lifecycle/result updates. Constraint: Keep the tool entrypoint in the tools crate and avoid copying the upstream TypeScript implementation Rejected: Spawn an external claw process \| less aligned with the requested in-process runtime/client design Rejected: Leave execution in the CLI crate only \| would keep tools::Agent as a metadata-only stub Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Tool subset mappings are curated guardrails; revisit them before enabling recursive Agent access or richer agent definitions Tested: cargo build --release --manifest-path rust/Cargo.toml Tested: cargo test --manifest-path rust/Cargo.toml Not-tested: Live end-to-end background sub-agent run against Anthropic API credentials	2026-04-01 03:10:20 +00:00
Yeachan-Heo	48fa1c3ae5	Enable real Agent tool delegation in the Rust CLI The Rust Agent tool only persisted queued metadata, so delegated work never actually ran. This change wires Agent into a detached background conversation path with isolated runtime, API client, session state, restricted tool subsets, and file-backed lifecycle/result updates. Constraint: Keep the tool entrypoint in the tools crate and avoid copying the upstream TypeScript implementation Rejected: Spawn an external claw process \| less aligned with the requested in-process runtime/client design Rejected: Leave execution in the CLI crate only \| would keep tools::Agent as a metadata-only stub Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Tool subset mappings are curated guardrails; revisit them before enabling recursive Agent access or richer agent definitions Tested: cargo build --release --manifest-path rust/Cargo.toml Tested: cargo test --manifest-path rust/Cargo.toml Not-tested: Live end-to-end background sub-agent run against Anthropic API credentials	2026-04-01 03:10:20 +00:00
Yeachan-Heo	ac95f0387c	feat: allow multiple in_progress todos for parallel workflows	2026-04-01 02:55:13 +00:00
Yeachan-Heo	7661af230c	feat: allow multiple in_progress todos for parallel workflows	2026-04-01 02:55:13 +00:00
Yeachan-Heo	387a8bb13f	feat: git integration, sandbox isolation, init command (merged from rcc branches)	2026-04-01 01:23:47 +00:00
Yeachan-Heo	98264aa3a9	feat: git integration, sandbox isolation, init command (merged from rcc branches)	2026-04-01 01:23:47 +00:00
Yeachan-Heo	583d191527	fix: resolve thinking/streaming/update merge conflicts	2026-04-01 01:15:30 +00:00
Yeachan-Heo	c04ad316d4	fix: resolve thinking/streaming/update merge conflicts	2026-04-01 01:15:30 +00:00
Yeachan-Heo	2d09bf9961	Make sandbox isolation behavior explicit and inspectable This adds a small runtime sandbox policy/status layer, threads sandbox options through the bash tool, and exposes `/sandbox` status reporting in the CLI. Linux namespace/network isolation is best-effort and intentionally reported as requested vs active so the feature does not overclaim guarantees on unsupported hosts or nested container environments. Constraint: No new dependencies for isolation support Constraint: Must keep filesystem restriction claims honest unless hard mount isolation succeeds Rejected: External sandbox/container wrapper \| too heavy for this workspace and request Rejected: Inline bash-only changes without shared status model \| weaker testability and poorer CLI visibility Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Treat this as observable best-effort isolation, not a hard security boundary, unless stronger mount enforcement is added later Tested: cargo fmt --all; cargo clippy --workspace --all-targets --all-features -- -D warnings; cargo test --workspace Not-tested: Manual `/sandbox` REPL run on a real nested-container host	2026-04-01 01:14:38 +00:00
Yeachan-Heo	770fb8d0e7	Merge remote-tracking branch 'origin/rcc/tools' into dev/rust	2026-04-01 01:00:37 +00:00
Yeachan-Heo	1e354521fb	Merge remote-tracking branch 'origin/rcc/tools' into dev/rust	2026-04-01 01:00:37 +00:00
Yeachan-Heo	2fd6241bd8	Enable Agent tool child execution with bounded recursion The Agent tool previously stopped at queued handoff metadata, so this change runs a real nested conversation, preserves artifact output, and guards recursion depth. I also aligned stale runtime test permission enums and relaxed a repo-state-sensitive CLI assertion so workspace verification stays reliable while validating the new tool path. Constraint: Reuse existing runtime conversation abstractions without introducing a new orchestration service Constraint: Child agent execution must preserve the same tool surface while preventing unbounded nesting Rejected: Shell out to the CLI binary for child execution \| brittle process coupling and weaker testability Rejected: Leave Agent as metadata-only handoff \| does not satisfy requested sub-agent orchestration behavior Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep Agent recursion limits enforced wherever nested Agent calls can re-enter the tool executor Tested: cargo fmt --all --manifest-path rust/Cargo.toml; cargo test --manifest-path rust/Cargo.toml; cargo clippy --manifest-path rust/Cargo.toml --workspace --all-targets -- -D warnings Not-tested: Live Anthropic-backed child agent execution against production credentials	2026-04-01 00:59:20 +00:00
Yeachan-Heo	6b84fcfaa0	Enable Agent tool child execution with bounded recursion The Agent tool previously stopped at queued handoff metadata, so this change runs a real nested conversation, preserves artifact output, and guards recursion depth. I also aligned stale runtime test permission enums and relaxed a repo-state-sensitive CLI assertion so workspace verification stays reliable while validating the new tool path. Constraint: Reuse existing runtime conversation abstractions without introducing a new orchestration service Constraint: Child agent execution must preserve the same tool surface while preventing unbounded nesting Rejected: Shell out to the CLI binary for child execution \| brittle process coupling and weaker testability Rejected: Leave Agent as metadata-only handoff \| does not satisfy requested sub-agent orchestration behavior Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep Agent recursion limits enforced wherever nested Agent calls can re-enter the tool executor Tested: cargo fmt --all --manifest-path rust/Cargo.toml; cargo test --manifest-path rust/Cargo.toml; cargo clippy --manifest-path rust/Cargo.toml --workspace --all-targets -- -D warnings Not-tested: Live Anthropic-backed child agent execution against production credentials	2026-04-01 00:59:20 +00:00
Yeachan-Heo	549deb9a89	Preserve local project context across compaction and todo updates This change makes compaction summaries durable under .claude/memory, feeds those saved memory files back into prompt context, updates /memory to report both instruction and project-memory files, and moves TodoWrite persistence to a human-readable .claude/todos.md file. Constraint: Reuse existing compaction, prompt loading, and slash-command plumbing rather than add a new subsystem Constraint: Keep persisted project state under Claude-local .claude/ paths Rejected: Introduce a dedicated memory service module \| larger diff with no clear user benefit for this task Confidence: high Scope-risk: moderate Reversibility: clean Directive: Project memory files are loaded as prompt context, so future format changes must preserve concise readable content Tested: cargo fmt --all --manifest-path rust/Cargo.toml Tested: cargo clippy --manifest-path rust/Cargo.toml --all-targets --all-features -- -D warnings Tested: cargo test --manifest-path rust/Cargo.toml --all Not-tested: Long-term retention/cleanup policy for .claude/memory growth	2026-04-01 00:58:36 +00:00
Yeachan-Heo	ec898b808f	Preserve local project context across compaction and todo updates This change makes compaction summaries durable under .claw/memory, feeds those saved memory files back into prompt context, updates /memory to report both instruction and project-memory files, and moves TodoWrite persistence to a human-readable .claw/todos.md file. Constraint: Reuse existing compaction, prompt loading, and slash-command plumbing rather than add a new subsystem Constraint: Keep persisted project state under Claw-local .claw/ paths Rejected: Introduce a dedicated memory service module \| larger diff with no clear user benefit for this task Confidence: high Scope-risk: moderate Reversibility: clean Directive: Project memory files are loaded as prompt context, so future format changes must preserve concise readable content Tested: cargo fmt --all --manifest-path rust/Cargo.toml Tested: cargo clippy --manifest-path rust/Cargo.toml --all-targets --all-features -- -D warnings Tested: cargo test --manifest-path rust/Cargo.toml --all Not-tested: Long-term retention/cleanup policy for .claw/memory growth	2026-04-01 00:58:36 +00:00
Yeachan-Heo	e2f061fd08	Enforce tool permissions before execution The Rust CLI/runtime now models permissions as ordered access levels, derives tool requirements from the shared tool specs, and prompts REPL users before one-off danger-full-access escalations from workspace-write sessions. This also wires explicit --permission-mode parsing and makes /permissions operate on the live session state instead of an implicit env-derived default. Constraint: Must preserve the existing three user-facing modes read-only, workspace-write, and danger-full-access Constraint: Must avoid new dependencies and keep enforcement inside the existing runtime/tool plumbing Rejected: Keep the old Allow/Deny/Prompt policy model \| could not represent ordered tool requirements across the CLI surface Rejected: Continue sourcing live session mode solely from RUSTY_CLAUDE_PERMISSION_MODE \| /permissions would not reliably reflect the current session state Confidence: high Scope-risk: moderate Reversibility: clean Directive: Add required_permission entries for new tools before exposing them to the runtime Tested: cargo fmt; cargo clippy --workspace --all-targets -- -D warnings; cargo test -q Not-tested: Manual interactive REPL approval flow in a live Anthropic session	2026-04-01 00:06:15 +00:00
Yeachan-Heo	3efb38cf99	Enforce tool permissions before execution The Rust CLI/runtime now models permissions as ordered access levels, derives tool requirements from the shared tool specs, and prompts REPL users before one-off danger-full-access escalations from workspace-write sessions. This also wires explicit --permission-mode parsing and makes /permissions operate on the live session state instead of an implicit env-derived default. Constraint: Must preserve the existing three user-facing modes read-only, workspace-write, and danger-full-access Constraint: Must avoid new dependencies and keep enforcement inside the existing runtime/tool plumbing Rejected: Keep the old Allow/Deny/Prompt policy model \| could not represent ordered tool requirements across the CLI surface Rejected: Continue sourcing live session mode solely from RUSTY_CLAUDE_PERMISSION_MODE \| /permissions would not reliably reflect the current session state Confidence: high Scope-risk: moderate Reversibility: clean Directive: Add required_permission entries for new tools before exposing them to the runtime Tested: cargo fmt; cargo clippy --workspace --all-targets -- -D warnings; cargo test -q Not-tested: Manual interactive REPL approval flow in a live Anthropic session	2026-04-01 00:06:15 +00:00
Yeachan-Heo	1f8cfbce38	Prevent tool regressions by locking down dispatch-level edge cases The tools crate already covered several higher-level commands, but the public dispatch surface still lacked direct tests for shell and file operations plus several error-path behaviors. This change expands the existing lib.rs unit suite to cover the requested tools through `execute_tool`, adds deterministic temp-path helpers, and hardens assertions around invalid inputs and tricky offset/background behavior. Constraint: No new dependencies; coverage had to stay within the existing crate test structure Rejected: Split coverage into new integration tests under tests/ \| would require broader visibility churn for little gain Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep future tool-coverage additions on the public dispatch surface unless a lower-level helper contract specifically needs direct testing Tested: cargo fmt --all; cargo clippy -p tools --all-targets --all-features -- -D warnings; cargo test -p tools Not-tested: Cross-platform shell/runtime differences beyond the current Linux-like CI environment	2026-03-31 23:33:05 +00:00
Yeachan-Heo	5e22d5ec99	Prevent tool regressions by locking down dispatch-level edge cases The tools crate already covered several higher-level commands, but the public dispatch surface still lacked direct tests for shell and file operations plus several error-path behaviors. This change expands the existing lib.rs unit suite to cover the requested tools through `execute_tool`, adds deterministic temp-path helpers, and hardens assertions around invalid inputs and tricky offset/background behavior. Constraint: No new dependencies; coverage had to stay within the existing crate test structure Rejected: Split coverage into new integration tests under tests/ \| would require broader visibility churn for little gain Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep future tool-coverage additions on the public dispatch surface unless a lower-level helper contract specifically needs direct testing Tested: cargo fmt --all; cargo clippy -p tools --all-targets --all-features -- -D warnings; cargo test -p tools Not-tested: Cross-platform shell/runtime differences beyond the current Linux-like CI environment	2026-03-31 23:33:05 +00:00
Yeachan-Heo	46581fe442	Close the Claude Code tools parity gap Implement the remaining long-tail tool surfaces needed for Claude Code parity in the Rust tools crate: SendUserMessage/Brief, Config, StructuredOutput, and REPL, plus tests that lock down their current schemas and basic behavior. A small runtime clippy cleanup in file_ops was required so the requested verification lane could pass without suppressing workspace warnings. Constraint: Match Claude Code tool names and input schemas closely enough for parity-oriented callers Constraint: No new dependencies for schema validation or REPL orchestration Rejected: Split runtime clippy fixes into a separate commit \| would block the required cargo clippy verification step for this delivery Rejected: Implement a stateful persistent REPL session manager \| unnecessary for current parity scope and would widen risk substantially Confidence: medium Scope-risk: moderate Reversibility: clean Directive: If upstream Claude Code exposes a concrete REPL tool schema later, reconcile this implementation against that source before expanding behavior Tested: cargo fmt --all; cargo clippy -p tools --all-targets --all-features -- -D warnings; cargo test -p tools Not-tested: End-to-end integration with non-Rust consumers; schema-level validation against upstream generated tool payloads	2026-03-31 22:53:20 +00:00
Yeachan-Heo	ba12e1e738	Close the Claw Code tools parity gap Implement the remaining long-tail tool surfaces needed for Claw Code parity in the Rust tools crate: SendUserMessage/Brief, Config, StructuredOutput, and REPL, plus tests that lock down their current schemas and basic behavior. A small runtime clippy cleanup in file_ops was required so the requested verification lane could pass without suppressing workspace warnings. Constraint: Match Claw Code tool names and input schemas closely enough for parity-oriented callers Constraint: No new dependencies for schema validation or REPL orchestration Rejected: Split runtime clippy fixes into a separate commit \| would block the required cargo clippy verification step for this delivery Rejected: Implement a stateful persistent REPL session manager \| unnecessary for current parity scope and would widen risk substantially Confidence: medium Scope-risk: moderate Reversibility: clean Directive: If upstream Claw Code exposes a concrete REPL tool schema later, reconcile this implementation against that source before expanding behavior Tested: cargo fmt --all; cargo clippy -p tools --all-targets --all-features -- -D warnings; cargo test -p tools Not-tested: End-to-end integration with non-Rust consumers; schema-level validation against upstream generated tool payloads	2026-03-31 22:53:20 +00:00
Yeachan-Heo	99b78d6ea4	Polish Agent defaults and ignore crate-local agent artifacts Move the default Agent artifact store out of rust/crates/tools so repeated Agent runs stop generating noisy crate-local files, normalize explicit Agent names through the existing slug path, and ignore any crate-local .clawd-agents residue defensively. Keep the slice limited to the tools crate and preserve the existing manifest-writing behavior. Constraint: Must not touch unrelated dirty api files in this worktree Constraint: Keep the change limited to rust/crates/tools Rejected: Add a broader agent runtime or execution model \| outside the final cleanup slice Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep Agent persistence defaults outside package directories so generated artifacts do not pollute crate working trees Tested: cargo test -p tools Not-tested: concurrent multi-process Agent writes to the default fallback store	2026-03-31 20:46:06 +00:00
Yeachan-Heo	1bcec35c6b	Polish Agent defaults and ignore crate-local agent artifacts Move the default Agent artifact store out of rust/crates/tools so repeated Agent runs stop generating noisy crate-local files, normalize explicit Agent names through the existing slug path, and ignore any crate-local .clawd-agents residue defensively. Keep the slice limited to the tools crate and preserve the existing manifest-writing behavior. Constraint: Must not touch unrelated dirty api files in this worktree Constraint: Keep the change limited to rust/crates/tools Rejected: Add a broader agent runtime or execution model \| outside the final cleanup slice Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep Agent persistence defaults outside package directories so generated artifacts do not pollute crate working trees Tested: cargo test -p tools Not-tested: concurrent multi-process Agent writes to the default fallback store	2026-03-31 20:46:06 +00:00
Yeachan-Heo	6e378185e9	Accept $skill invocation form in Skill tool Teach Skill path resolution to accept the common $skill invocation form in addition to bare names and /skill prefixes. Keep the behavior narrow and add regression coverage using the existing help skill fixture. Constraint: Must not touch unrelated dirty api files in this worktree Constraint: Keep the change limited to rust/crates/tools Rejected: Canonicalize the returned skill field to the resolved name \| would change caller-visible output semantics unnecessarily Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep invocation-prefix normalization aligned with how prompt and skill references are written elsewhere in the CLI Tested: cargo test -p tools Not-tested: CODEX_HOME layouts with unusual symlink arrangements	2026-03-31 20:28:50 +00:00
Yeachan-Heo	0b909ef177	Accept $skill invocation form in Skill tool Teach Skill path resolution to accept the common $skill invocation form in addition to bare names and /skill prefixes. Keep the behavior narrow and add regression coverage using the existing help skill fixture. Constraint: Must not touch unrelated dirty api files in this worktree Constraint: Keep the change limited to rust/crates/tools Rejected: Canonicalize the returned skill field to the resolved name \| would change caller-visible output semantics unnecessarily Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep invocation-prefix normalization aligned with how prompt and skill references are written elsewhere in the CLI Tested: cargo test -p tools Not-tested: CODEX_HOME layouts with unusual symlink arrangements	2026-03-31 20:28:50 +00:00
Yeachan-Heo	019e9900ed	Relax WebSearch domain filter inputs for parity Accept case-insensitive domain filters and URL-style allow/block list entries so WebSearch behaves more forgivingly for caller-provided domain constraints. Keep the change small and limited to host matching logic plus regression coverage.\n\nConstraint: Must not touch unrelated dirty api files in this worktree\nConstraint: Keep the change limited to rust/crates/tools\nRejected: Add full public suffix or hostname normalization logic \| too broad for this parity slice\nConfidence: high\nScope-risk: narrow\nReversibility: clean\nDirective: Preserve simple host matching semantics unless upstream parity proves a more exact domain model is required\nTested: cargo test -p tools\nNot-tested: internationalized domain names and punycode edge cases	2026-03-31 20:27:09 +00:00
Yeachan-Heo	be3aa9a53d	Relax WebSearch domain filter inputs for parity Accept case-insensitive domain filters and URL-style allow/block list entries so WebSearch behaves more forgivingly for caller-provided domain constraints. Keep the change small and limited to host matching logic plus regression coverage.\n\nConstraint: Must not touch unrelated dirty api files in this worktree\nConstraint: Keep the change limited to rust/crates/tools\nRejected: Add full public suffix or hostname normalization logic \| too broad for this parity slice\nConfidence: high\nScope-risk: narrow\nReversibility: clean\nDirective: Preserve simple host matching semantics unless upstream parity proves a more exact domain model is required\nTested: cargo test -p tools\nNot-tested: internationalized domain names and punycode edge cases	2026-03-31 20:27:09 +00:00
Yeachan-Heo	67423d005a	Improve WebFetch title prompts for HTML pages Make title-focused WebFetch prompts prefer the real HTML <title> value when present instead of always falling back to the first rendered text line. Keep the behavior narrow and preserve the existing summary path for non-title prompts.\n\nConstraint: Must not touch unrelated dirty api files in this worktree\nConstraint: Keep the change limited to rust/crates/tools\nRejected: Broader HTML parsing dependency \| not needed for this small parity slice\nConfidence: high\nScope-risk: narrow\nReversibility: clean\nDirective: Preserve lightweight HTML handling unless parity requires a materially more robust parser\nTested: cargo test -p tools\nNot-tested: malformed HTML with mixed-case or nested title edge cases	2026-03-31 20:26:06 +00:00
Yeachan-Heo	df40b4f60a	Improve WebFetch title prompts for HTML pages Make title-focused WebFetch prompts prefer the real HTML <title> value when present instead of always falling back to the first rendered text line. Keep the behavior narrow and preserve the existing summary path for non-title prompts.\n\nConstraint: Must not touch unrelated dirty api files in this worktree\nConstraint: Keep the change limited to rust/crates/tools\nRejected: Broader HTML parsing dependency \| not needed for this small parity slice\nConfidence: high\nScope-risk: narrow\nReversibility: clean\nDirective: Preserve lightweight HTML handling unless parity requires a materially more robust parser\nTested: cargo test -p tools\nNot-tested: malformed HTML with mixed-case or nested title edge cases	2026-03-31 20:26:06 +00:00
Yeachan-Heo	4db21e9595	Make PowerShell tool report backgrounding and missing shells clearly Tighten the PowerShell tool to surface a clear not-found error when neither pwsh nor powershell exists, and mark explicit background execution as user-requested in the returned metadata. Harden the PowerShell tests against PATH mutation races while keeping the change confined to the tools crate.\n\nConstraint: Must not touch unrelated dirty api files in this worktree\nConstraint: Keep the change limited to rust/crates/tools\nRejected: Broader shell abstraction cleanup \| not needed for this parity slice\nConfidence: high\nScope-risk: narrow\nReversibility: clean\nDirective: Keep PowerShell output metadata aligned with bash semantics when adding future shell parity improvements\nTested: cargo test -p tools\nNot-tested: real powershell.exe behavior on Windows hosts	2026-03-31 20:23:55 +00:00
Yeachan-Heo	d32edf13b1	Make PowerShell tool report backgrounding and missing shells clearly Tighten the PowerShell tool to surface a clear not-found error when neither pwsh nor powershell exists, and mark explicit background execution as user-requested in the returned metadata. Harden the PowerShell tests against PATH mutation races while keeping the change confined to the tools crate.\n\nConstraint: Must not touch unrelated dirty api files in this worktree\nConstraint: Keep the change limited to rust/crates/tools\nRejected: Broader shell abstraction cleanup \| not needed for this parity slice\nConfidence: high\nScope-risk: narrow\nReversibility: clean\nDirective: Keep PowerShell output metadata aligned with bash semantics when adding future shell parity improvements\nTested: cargo test -p tools\nNot-tested: real powershell.exe behavior on Windows hosts	2026-03-31 20:23:55 +00:00

1 2 3 4

166 Commits