claw-code

Commit Graph

Author	SHA1	Message	Date
YeonGyu-Kim	7587f2c1eb	fix(cli): JSON parity for /memory and /providers in resume mode Two gaps closed: 1. /memory (resume): json field was None, emitting prose regardless of --output-format json. Now emits: {kind:memory, cwd, instruction_files:N, files:[{path,lines,preview}...]} 2. /providers (resume): had a spec entry but no parse arm, producing the circular 'Unknown slash command: /providers — Did you mean /providers'. Added 'providers' as an alias for 'doctor' in the parse match so /providers dispatches to the same structured diagnostic output. 3. /doctor (resume): also wired json_value() so --output-format json returns the structured doctor report instead of None. Continues ROADMAP #26 resumed-command JSON parity track. 159 CLI tests pass, fmt clean.	2026-04-09 23:35:25 +09:00
YeonGyu-Kim	7ec6860d9a	fix(cli): emit JSON for /config in --output-format json --resume mode /config resumed returned json:None, falling back to prose output even in --output-format json mode. Adds render_config_json() that produces: { "kind": "config", "cwd": "...", "loaded_files": N, "merged_keys": N, "files": [{"path":"...","source":"user\|project\|local","loaded":true\|false}, ...] } Wires it into the SlashCommand::Config resume arm alongside the existing prose render. Continues the resumed-command JSON parity track (ROADMAP #26). 159 CLI tests pass, fmt clean.	2026-04-09 22:03:11 +09:00
YeonGyu-Kim	0e12d15daf	fix(cli): add --allow-broad-cwd; require confirmation or flag in broad-CWD mode	2026-04-09 21:55:22 +09:00
YeonGyu-Kim	60ec2aed9b	fix(cli): wire /tokens and /cache as aliases for /stats; implement /stats Dogfood found that /tokens and /cache had spec entries (resume_supported: true) but no parse arms in the command parser, resulting in: 'Unknown slash command: /tokens — Did you mean /tokens' (the suggestion engine found the spec entry but parsing always failed) Fix three things: 1. Add 'tokens' \| 'cache' as aliases for 'stats' in the parse match so the commands actually resolve to SlashCommand::Stats 2. Implement SlashCommand::Stats in the REPL dispatch — previously fell through to 'Command registered but not yet implemented'. Now shows cumulative token usage for the session. 3. Implement SlashCommand::Stats in run_resume_command — previously returned 'unsupported resumed slash command'. Now emits: text: Cost / Input tokens / Output tokens / Cache create / Cache read json: {kind:stats, input_tokens, output_tokens, cache_*, total_tokens} 159 CLI tests pass, fmt clean.	2026-04-09 21:34:36 +09:00
YeonGyu-Kim	5f6f453b8d	fix(cli): warn when launched from home dir or filesystem root Users launching claw from their home directory (or /) have no project boundary — the agent can read/search the entire machine, often far beyond the intended scope. kapcomunica in #claw-code reported exactly this: 'it searched my entire computer.' Add warn_if_broad_cwd() called at prompt and REPL startup: - checks if CWD == $HOME or CWD has no parent (fs root) - prints a clear warning to stderr: Warning: claw is running from a very broad directory (/home/user). The agent can read and search everything under this path. Consider running from inside your project: cd /path/to/project && claw Warning fires on both claw (REPL) and claw prompt '...' paths. Does not fire from project subdirectories. Uses std::env::var_os("HOME"), no extra deps. 159 CLI tests pass, fmt clean.	2026-04-09 21:26:51 +09:00
YeonGyu-Kim	da4242198f	fix(cli): emit JSON error for unsupported resumed slash commands in JSON mode When claw --output-format json --resume <session> /commit (or /plugins, etc.) encountered an 'unsupported resumed slash command' error, it called eprintln!() and exit(2) directly, bypassing both the main() JSON error handler and the output_format check. Fix: in both the slash-command parse-error path and the run_resume_command Err path, check output_format and emit a structured JSON error: {"type":"error","error":"unsupported resumed slash command","command":"/commit"} Text mode unchanged (still exits 2 with prose to stderr). Addresses the resumed-command parity gap (gaebal-gajae ROADMAP #26 track). 159 CLI tests pass, fmt clean.	2026-04-09 21:04:50 +09:00
YeonGyu-Kim	84b77ece4d	fix(cli): pipe stdin to prompt when no args given (suppress REPL on pipe) When stdin is not a terminal (pipe or redirect) and no prompt is given on the command line, claw was starting the interactive REPL and printing the startup banner, then consuming the pipe without sending anything to the API. Fix: in parse_args, when rest.is_empty() and stdin is not a terminal, read stdin synchronously and dispatch as CliAction::Prompt instead of Repl. Empty pipe still falls through to Repl (interactive launch with no input). Before: echo 'hello' \| claw -> startup banner + REPL start After: echo 'hello' \| claw -> dispatches as one-shot prompt 159 CLI tests pass, fmt clean.	2026-04-09 20:36:14 +09:00
YeonGyu-Kim	aef85f8af5	fix(cli): /diff shows clear error when not in a git repo Previously claw --resume <session> /diff would produce: 'git diff --cached failed: error: unknown option `cached\'' when the CWD was not inside a git project, because git falls back to --no-index mode which does not support --cached. Two fixes: 1. render_diff_report_for() checks 'git rev-parse --is-inside-work-tree' before running git diff, and returns a human-readable message if not in a git repo: 'Diff\n Result no git repository\n Detail <cwd> is not inside a git project' 2. resume /diff now uses std::env::current_dir() instead of the session file's parent directory as the CWD for the diff (session parent dir is the .claw/sessions/<id>/ directory, never a git repo). 159 CLI tests pass, fmt clean.	2026-04-09 20:04:21 +09:00
YeonGyu-Kim	3ed27d5cba	fix(cli): emit JSON for /history in --output-format json --resume mode Previously claw --output-format json --resume <session> /history emitted prose text regardless of the output format flag. Now emits structured JSON: {"kind":"history","total":N,"showing":M,"entries":[{"timestamp_ms":...,"text":"..."},...]} Mirrors the parity pattern established in ROADMAP #26 for other resume commands. 159 CLI tests pass, fmt clean.	2026-04-09 19:33:50 +09:00
YeonGyu-Kim	e1ed30a038	fix(cli): surface session_id in /status JSON output When running claw --output-format json --resume <session> /status, the JSON output had 'session' (full file path) but no 'session_id' field, making it impossible for scripts to extract the loaded session ID. Now extracts the session-id directory component from the session path (e.g. .claw/sessions/<session-id>/session-xxx.jsonl → session-id) and includes it as 'session_id' in the JSON status envelope. 159 CLI tests pass, fmt clean.	2026-04-09 19:06:36 +09:00
YeonGyu-Kim	54269da157	fix(cli): claw state exits 1 when no worker state file exists Previously 'claw state' printed an error message but exited 0, making it impossible for scripts/CI to detect the absence of state without parsing prose. Now propagates Err() to main() which exits 1 and formats the error correctly for both text and --output-format json modes. Text: 'error: no worker state file found at ... — run a worker first' JSON: {"type":"error","error":"no worker state file found at ..."}	2026-04-09 18:34:41 +09:00
YeonGyu-Kim	f741a42507	test(cli): add regression coverage for reasoning-effort validation and stub-command filtering 3 new tests in mod tests: - rejects_invalid_reasoning_effort_value: confirms 'turbo' etc rejected at parse time - accepts_valid_reasoning_effort_values: confirms low/medium/high accepted and threaded - stub_commands_absent_from_repl_completions: asserts STUB_COMMANDS are not in completions 156 -> 159 CLI tests pass.	2026-04-09 18:06:32 +09:00
YeonGyu-Kim	1a8f73da01	fix(cli): emit JSON error on --output-format json — ROADMAP #42 When claw --output-format json hits an error, the error was previously printed as plain prose to stderr, making it invisible to downstream tooling that parses JSON output. Now: {"type":"error","error":"api returned 401 ..."} Detection: scan argv at process exit for --output-format json or --output-format=json. Non-JSON error path unchanged. 156 CLI tests pass.	2026-04-09 16:33:20 +09:00
YeonGyu-Kim	8d0308eecb	fix(cli): dispatch bare skill names to skill invoker in REPL — ROADMAP #36 Users were typing skill names (e.g. 'caveman', 'find-skills') directly in the REPL and getting LLM responses instead of skill invocation. Only '/skills <name>' triggered dispatch; bare names fell through to run_turn. Fix: after slash-command parse returns None (bare text), check if the first token looks like a skill name (alphanumeric/dash/underscore, no slash). If resolve_skill_invocation() confirms the skill exists, dispatch the full input as a skill prompt. Unknown words fall through unchanged. 156 CLI tests pass, fmt clean.	2026-04-09 16:01:18 +09:00
YeonGyu-Kim	4d10caebc6	fix(cli): validate --reasoning-effort accepts only low\|medium\|high Previously any string was accepted and silently forwarded to the API, which would fail at the provider with an unhelpful error. Now invalid values produce a clear error at parse time: invalid value for --reasoning-effort: 'xyz'; must be low, medium, or high 156 CLI tests pass, fmt clean.	2026-04-09 15:03:36 +09:00
YeonGyu-Kim	414526c1bd	fix(cli): exclude stub slash commands from help output — ROADMAP #39 The --help slash-command section was listing ~35 unimplemented commands alongside working ones. Combined with the completions fix (`c55c510`), the discovery surface now consistently shows only implemented commands. Changes: - commands crate: add render_slash_command_help_filtered(exclude: &[&str]) - move STUB_COMMANDS to module-level const in main.rs (reused by both completions and help rendering) - replace render_slash_command_help() with filtered variant at all help-rendering call sites 156 CLI tests pass, fmt clean.	2026-04-09 14:36:00 +09:00
YeonGyu-Kim	2a2e205414	fix(cli): intercept --help for prompt/login/logout/version subcommands before API dispatch 'claw prompt --help' was triggering an API call instead of showing help because --help was parsed as part of the prompt args. Now '--help' after known pass-through subcommands (prompt, login, logout, version, state, init, export, commit, pr, issue) sets wants_help=true and shows the top-level help page. Subcommands that consume their own args (agents, mcp, plugins, skills) and local help-topic subcommands (status, sandbox, doctor) are excluded from this interception so their existing --help handling is preserved. 156 CLI tests pass, fmt clean.	2026-04-09 14:06:26 +09:00
YeonGyu-Kim	c55c510883	fix(cli): exclude stub slash commands from REPL completions — ROADMAP #39 Commands registered in the spec list but not yet implemented in this build were appearing in REPL tab-completions, making the discovery surface over-promise what actually works. Users (mezz2301) reported 'many features are not supported' after discovering these through completions. Add STUB_COMMANDS exclusion list in slash_command_completion_candidates_with_sessions. Excluded: login logout vim upgrade stats share feedback files fast exit summary desktop brief advisor stickers insights thinkback release-notes security-review keybindings privacy-settings plan review tasks theme voice usage rename copy hooks context color effort branch rewind ide tag output-style add-dir These commands still parse and run (with the 'not yet implemented' message for users who type them directly), but they no longer surface as tab-completion candidates.	2026-04-09 13:36:12 +09:00
YeonGyu-Kim	ca8950c26b	feat(cli): wire --reasoning-effort flag end-to-end — closes ROADMAP #34 Parse --reasoning-effort <low\|medium\|high> in parse_args, thread through CliAction::Prompt and CliAction::Repl, LiveCli::set_reasoning_effort(), AnthropicRuntimeClient.reasoning_effort field, and MessageRequest.reasoning_effort. Changes: - parse_args: new --reasoning-effort / --reasoning-effort=VAL flag arms - AnthropicRuntimeClient: new reasoning_effort field + set_reasoning_effort() method - LiveCli: new set_reasoning_effort() that reaches through BuiltRuntime -> ConversationRuntime -> api_client_mut() - runtime::ConversationRuntime: new pub api_client_mut() accessor - MessageRequest construction: reasoning_effort: self.reasoning_effort.clone() - run_repl(): accepts and applies reasoning_effort parameter - parse_direct_slash_cli_action(): propagates reasoning_effort All 156 CLI tests pass, all api tests pass, cargo fmt clean.	2026-04-09 11:08:00 +09:00
YeonGyu-Kim	c1b1ce465e	feat(cli): add reasoning_effort field to CliAction::Prompt/Repl variants — ROADMAP #34 struct groundwork Adds reasoning_effort: Option<String> to CliAction::Prompt and CliAction::Repl enum variants. All constructor and pattern sites updated. All test literals updated with reasoning_effort: None. 156 cli tests pass, fmt clean. The --reasoning-effort flag parse and propagation to AnthropicRuntimeClient remains as follow-up work.	2026-04-09 10:34:28 +09:00
Jobdori	275b58546d	feat(cli): populate Git SHA, target triple, and build date at compile time via build.rs Add rust/crates/rusty-claude-cli/build.rs that: - Captures git rev-parse --short HEAD at build time → GIT_SHA env - Reads Cargo's TARGET env var → TARGET env - Derives BUILD_DATE from SOURCE_DATE_EPOCH / BUILD_DATE env or the current date via `date +%Y-%m-%d` fallback - Registers rerun-if-changed on .git/HEAD and .git/refs so the SHA stays fresh across commits Update main.rs DEFAULT_DATE to pick up BUILD_DATE from option_env!() instead of the hardcoded 2026-03-31 static string. Before: `claw --version` always showed Git SHA: unknown, Target: unknown, Build date: 2026-03-31 in local builds. After: e.g. Git SHA: `7f53d82`, Target: aarch64-apple-darwin, Build date: 2026-04-08 Generated by droid (Kimi K2.5 Turbo) via acpx (wrote build.rs), cleaned up by Jobdori (added BUILD_DATE step, updated main.rs const). Co-Authored-By: Droid <noreply@factory.ai>	2026-04-08 18:11:46 +09:00
YeonGyu-Kim	8dc65805c1	fix(cli): dispatch to correct provider backend based on model prefix — closes ROADMAP #29 The CLI entry point (build_runtime_with_plugin_state in main.rs) was hardcoded to always instantiate AnthropicRuntimeClient with an AnthropicClient, regardless of what detect_provider_kind(model) returned. This meant `--model openai/gpt-4` with OPENAI_API_KEY set and no ANTHROPIC_* vars still failed with "missing Anthropic credentials" because the CLI never dispatched to the OpenAI-compat backend that already exists in the api crate. Root cause: AnthropicRuntimeClient.client was typed as AnthropicClient (concrete) rather than ApiProviderClient (enum). The api crate already had a ProviderClient enum with Anthropic / Xai / OpenAi variants that dispatches correctly via detect_provider_kind, plus a unified MessageStream enum that wraps both anthropic::MessageStream and openai_compat::MessageStream with the same next_event() -> StreamEvent interface. The CLI just wasn't using it. Changes (1 file, +59 -7): - Import api::ProviderClient as ApiProviderClient - Change AnthropicRuntimeClient.client from AnthropicClient to ApiProviderClient - In AnthropicRuntimeClient::new(), dispatch based on detect_provider_kind(&resolved_model): * Anthropic: build AnthropicClient directly with resolve_cli_auth_source() + api::read_base_url() + PromptCache (preserves ANTHROPIC_BASE_URL override for mock test harness and the session-scoped prompt cache) * xAI / OpenAi: delegate to ApiProviderClient::from_model_with_anthropic_auth which routes to OpenAiCompatClient::from_env with the matching config (reads OPENAI_API_KEY/XAI_API_KEY/DASHSCOPE_API_KEY and their BASE_URL overrides internally) - Change push_prompt_cache_record to take &ApiProviderClient (ProviderClient::take_last_prompt_cache_record returns None for non-Anthropic variants, so the helper is a no-op on OpenAI-compat providers without extra branching) What this unlocks for users: claw --model openai/gpt-4.1-mini prompt 'hello' # OpenAI claw --model grok-3 prompt 'hello' # xAI claw --model qwen-plus prompt 'hello' # DashScope OPENAI_BASE_URL=https://openrouter.ai/api/v1 \ claw --model openai/anthropic/claude-sonnet-4 prompt 'hello' # OpenRouter All previously broken, now routed correctly by prefix. Verification: - cargo build --release -p rusty-claude-cli: clean - cargo test --release -p rusty-claude-cli: 182 tests, 0 failures (including compact_output tests that exercise the Anthropic mock) - cargo fmt --all: clean - cargo clippy --workspace: warnings-only (pre-existing) - cargo test --release --workspace: all crates green except one pre-existing race in runtime::config::tests (passes in isolation) Source: live users nicma (1491342350960562277) and Jengro (1491345009021030533) in #claw-code on 2026-04-08.	2026-04-08 17:29:55 +09:00
YeonGyu-Kim	5851f2dee8	fix(cli): 6 cascading test regressions hidden behind client_integration gate - compact flag: was parsed then discarded (`compact: _`) instead of passed to `run_turn_with_output` — hardcoded `false` meant --compact never took effect - piped stdin vs permission prompter: `read_piped_stdin()` consumed all stdin before `CliPermissionPrompter::decide()` could read interactive approval answers; now only consumes stdin as prompt context when permission mode is `DangerFullAccess` (fully unattended) - session resolver: `resolve_managed_session_path` and `list_managed_sessions` now fall back to the pre-isolation flat `.claw/sessions/` layout so legacy sessions remain accessible - help assertion: match on stable prefix after `/session delete` was added in batch 5 - prompt shorthand: fix copy-paste that changed expected prompt from "help me debug" to "$help overview" - mock parity harness: filter captured requests to `/v1/messages` path only, excluding count_tokens preflight calls added by `be561bf` All 6 failures were pre-existing but masked because `client_integration` always failed first (fixed in `8c6dfe5`). Workspace: 810+ tests passing, 0 failing.	2026-04-08 14:54:10 +09:00
YeonGyu-Kim	006f7d7ee6	fix(test): add env_lock to plugin lifecycle test — closes ROADMAP #24 build_runtime_runs_plugin_lifecycle_init_and_shutdown was the only test that set/removed ANTHROPIC_API_KEY without holding the env_lock mutex. Under parallel workspace execution, other tests racing on the same env var could wipe the key mid-construction, causing a flaky credential error. Root cause: process-wide env vars are shared mutable state. All other tests that touch ANTHROPIC_API_KEY already use env_lock(). This test was the only holdout. Fix: add let _guard = env_lock(); at the top of the test.	2026-04-08 12:46:04 +09:00
YeonGyu-Kim	c7b3296ef6	style: cargo fmt — fix CI formatting failures Pre-existing formatting issues in anthropic.rs surfaced by CI cargo fmt check. No functional changes.	2026-04-08 11:21:13 +09:00
YeonGyu-Kim	c667d47c70	feat(api): add tuning params (temperature, top_p, penalties, stop) to MessageRequest MessageRequest was missing standard OpenAI-compatible generation tuning parameters. Callers had no way to control temperature, top_p, frequency_penalty, presence_penalty, or stop sequences. Changes: - Added 5 optional fields to MessageRequest (all Option, None by default) - Wired into build_chat_completion_request: only included in payload when set - All existing construction sites updated with ..Default::default() - MessageRequest now derives Default for ergonomic partial construction Tests added: - tuning_params_included_in_payload_when_set: all 5 params flow into JSON - tuning_params_omitted_from_payload_when_none: absent params stay absent 83 api lib tests passing, 0 failing. cargo check --workspace: 0 warnings.	2026-04-08 07:07:33 +09:00
YeonGyu-Kim	314f0c99fd	feat(worker_boot): emit .claw/worker-state.json on every status transition WorkerStatus is fully tracked in worker_boot.rs but was invisible to external observers (clawhip, orchestrators) because opencode serve's HTTP server is upstream and not ours to extend. Solution: atomic file-based observability. - emit_state_file() writes .claw/worker-state.json on every push_event() call (tmp write + rename for atomicity) - Snapshot includes: worker_id, status, is_ready, trust_gate_cleared, prompt_in_flight, last_event, updated_at - Add 'claw state' CLI subcommand to read and print the file - Add regression test: emit_state_file_writes_worker_status_on_transition verifies spawning→ready_for_prompt transition is reflected on disk This closes the /state dogfood gap without requiring any upstream opencode changes. Clawhip can now distinguish a truly stalled worker (status: trust_required or running with no recent updated_at) from a quiet-but-progressing one.	2026-04-08 00:37:44 +09:00
YeonGyu-Kim	092d8b6e21	fix(tests): add missing test imports for session/prompt history features Add missing imports to test module: - PromptHistoryEntry, render_prompt_history_report, parse_history_count - parse_export_args, render_session_markdown - summarize_tool_payload_for_markdown, short_tool_id Fixes test compilation errors introduced by new session and export features from batch 5/6 work.	2026-04-07 16:20:33 +09:00
YeonGyu-Kim	e51566c745	feat: b6-bridge-directory follow-up work — batch 6	2026-04-07 16:11:50 +09:00
YeonGyu-Kim	20f3a5932a	fix(cli): wire sessions_dir() through SessionStore::from_cwd() (#41 ) The CLI was using a flat cwd/.claw/sessions/ path without workspace fingerprinting, while SessionStore::from_cwd() adds a hash subdirectory. This mismatch meant the isolation machinery existed but wasn't actually used by the main session management codepath. Now sessions_dir() delegates to SessionStore::from_cwd(), ensuring all session operations use workspace-fingerprinted directories.	2026-04-07 16:03:44 +09:00
YeonGyu-Kim	f03b8dce17	feat: bridge directory metadata + stale-base preflight check - Add CWD to SSE session events (kills Directory: unknown) - Add stale-base preflight: verify HEAD matches expected base commit - Warn on divergence before session starts	2026-04-07 15:55:38 +09:00
YeonGyu-Kim	82f2e8e92b	feat: doctor-cmd implementation	2026-04-07 15:28:43 +09:00
YeonGyu-Kim	dab16c230a	feat: b5-session-export — batch 5 wave 2	2026-04-07 15:19:45 +09:00
YeonGyu-Kim	a46711779c	feat: b5-markdown-fence — batch 5 wave 2	2026-04-07 15:19:45 +09:00
YeonGyu-Kim	4557a81d2f	feat: b5-doctor-cmd — batch 5 wave 2	2026-04-07 15:19:45 +09:00
YeonGyu-Kim	86c3667836	feat: b5-context-compress — batch 5 wave 2	2026-04-07 15:19:45 +09:00
YeonGyu-Kim	8663751650	fix: resolve merge conflicts from batch 5 cherry-picks (compact field, run_turn_with_output arity)	2026-04-07 14:53:46 +09:00
YeonGyu-Kim	90f2461f75	feat: b5-tool-timeout — batch 5 upstream parity	2026-04-07 14:51:32 +09:00
YeonGyu-Kim	0d8fd51a6c	feat: b5-stdin-pipe — batch 5 upstream parity	2026-04-07 14:51:28 +09:00
YeonGyu-Kim	9105e0c656	feat: b5-openrouter-fix — batch 5 upstream parity	2026-04-07 14:51:26 +09:00
YeonGyu-Kim	b8f76442e2	feat: b5-multi-provider — batch 5 upstream parity	2026-04-07 14:51:26 +09:00
YeonGyu-Kim	b216f9ce05	feat: b5-max-token-plugin — batch 5 upstream parity	2026-04-07 14:51:26 +09:00
YeonGyu-Kim	4be4b46bd9	feat: b5-git-aware — batch 5 upstream parity	2026-04-07 14:51:26 +09:00
YeonGyu-Kim	506ff55e53	feat: b5-doctor-cmd — batch 5 upstream parity	2026-04-07 14:51:26 +09:00
YeonGyu-Kim	65f4c3ad82	feat: b5-cost-tracker — batch 5 upstream parity	2026-04-07 14:51:25 +09:00
YeonGyu-Kim	700534de41	feat: b5-context-compress — batch 5 upstream parity	2026-04-07 14:51:25 +09:00
YeonGyu-Kim	8d866073c5	feat(cli): show active model and provider in startup banner Prints 'Connected: <model> via <provider>' before REPL prompt.	2026-04-07 14:22:26 +09:00
YeonGyu-Kim	4251c85855	fix(cli): add section headers to OMC output for agent type grouping voloshko: flat wall of text. Now groups output with section separators by agent type (Explore, Implementation, Verification).	2026-04-07 14:22:06 +09:00
YeonGyu-Kim	cd83c0ff68	fix(cli): detect OPENAI_BASE_URL during claw login and emit clear error OAuth 401 was confusing. Now detects custom base URL and suggests ANTHROPIC_API_KEY instead of OAuth login.	2026-04-07 14:22:05 +09:00
Yeachan-Heo	c1883d0f66	Clarify heuristic context window estimates	2026-04-06 09:26:08 +00:00
Yeachan-Heo	1fc5a1c457	Fix slash skill invoke normalization	2026-04-06 09:24:06 +00:00
Yeachan-Heo	ecadc5554a	fix(auth): harden OAuth fallback and collapse thinking output	2026-04-06 09:02:21 +00:00
Yeachan-Heo	8ff9c1b15a	Preserve recovery guidance for retried context-window failures The CLI already reframes direct preflight and provider oversized-request errors, but retry-wrapped provider failures still fell back to the generic retry-exhausted surface because the user-visible formatter keyed off the safe failure class. Route formatting through nested context-window detection so wrapped provider failures keep the same compact/reduce-scope guidance. Constraint: Keep the fix UX-scoped without widening broader failure classification behavior Rejected: Reorder safe_failure_class for all RetriesExhausted errors \| broader semantic change than needed for this issue Confidence: high Scope-risk: narrow Directive: Keep context-window rendering keyed to nested error inspection so provider wrappers do not lose recovery guidance Tested: cargo fmt --check; cargo test -p rusty-claude-cli context_window; cargo test -p api oversized Not-tested: Full workspace test suite	2026-04-06 09:02:21 +00:00
Yeachan-Heo	f9cb42fb44	Resolve claw-code main merge conflicts	2026-04-06 07:16:57 +00:00
Yeachan-Heo	01b263c838	Let /skills invocations reach the prompt skill path The CLI still treated every /skills payload other than list/install/help as local usage text, so skills that appeared in /skills could not actually be invoked. This restores prompt dispatch for /skills <skill> [args], keeps list/install on the local path, and shares skill resolution with the Skill tool so project-local and legacy /commands entries resolve consistently. Constraint: --resume local slash execution still only supports local commands without provider turns Rejected: Implement full resumed prompt-turn execution for /skills \| larger behavior change outside this bugfix Rejected: Keep separate skill lookups in tools and commands \| drift already caused listing/invocation mismatches Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep /skills discovery, CLI prompt dispatch, and Tool Skill resolution on the same registry semantics Tested: cargo fmt --all; cargo clippy -p commands -p tools -p rusty-claude-cli --all-targets -- -D warnings; cargo test --workspace -- --nocapture Not-tested: Live provider-backed /skills invocation against external skill packs in an interactive REPL	2026-04-06 06:43:31 +00:00
Yeachan-Heo	b930895736	Turn oversized-context failures into recovery guidance Dogfood showed oversized requests still surfacing as raw hard errors, even when claw could tell the user exactly how to recover. This keeps context-window failures classified, recognizes the same failure when it comes back from a provider response, and renders recovery steps that point operators at the existing compaction and fresh-session paths instead of a provider-style dump. Constraint: Keep the failure class explicit so automation and operators can still distinguish context-window exhaustion from generic provider failures Constraint: Reuse existing /compact and session-reset UX instead of inventing a new recovery workflow Rejected: Auto-run compaction on failure \| mutates session state on an error path the user may want to inspect first Rejected: Only prettify local preflight failures \| provider-returned context-window errors would still leak raw failure text Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep provider-side context-window detection aligned with real oversized-request messages before broadening the marker list Tested: cargo fmt --all --check Tested: cargo test -p api Tested: cargo test -p rusty-claude-cli Tested: cargo clippy -p api -p rusty-claude-cli --all-targets -- -D warnings Not-tested: cargo test --workspace	2026-04-06 06:43:31 +00:00
Yeachan-Heo	fe4da2aa65	Keep resumed JSON command surfaces machine-readable Resumed slash dispatch was still dropping back to prose for several JSON-capable local commands, which forced automation to special-case direct CLI invocations versus --resume flows. This routes resumed local-command handlers through the same structured JSON payloads used by direct status, sandbox, inventory, version, and init commands, and records the inventory parity audit result in the roadmap. Constraint: Text-mode resumed output must stay unchanged for existing shell users Rejected: Teach callers to scrape resumed text output \| brittle and defeats the JSON contract Confidence: high Scope-risk: narrow Reversibility: clean Directive: When a direct local command has a JSON renderer, keep resumed slash dispatch on the same serializer instead of adding one-off format branches Tested: cargo fmt --check; cargo test --workspace; cargo clippy --workspace --all-targets -- -D warnings Not-tested: Live provider-backed REPL resume flows outside the local test harness	2026-04-06 02:00:33 +00:00
Yeachan-Heo	53d6909b9b	Emit structured doctor JSON diagnostics	2026-04-06 01:42:59 +00:00
Yeachan-Heo	ceaf9cbc23	Preserve structured JSON parity for `claw agents` `claw agents --output-format json` was still wrapping the text report, which meant automation could not distinguish empty inventories from populated agent definitions. Add a dedicated structured handler in the commands crate, wire the CLI to it, and extend the contracts to cover both empty and populated agent listings. Constraint: Keep text-mode `claw agents` output unchanged while aligning JSON behavior with existing structured inventory handlers Rejected: Parse the text report into JSON in the CLI layer \| brittle duplication and no reusable structured handler Confidence: high Scope-risk: narrow Directive: Keep inventory subcommands on dedicated structured handlers instead of serializing human-readable reports Tested: cargo test -p commands renders_agents_reports_as_json; cargo test -p rusty-claude-cli --test output_format_contract; cargo test --workspace; cargo fmt --check; cargo clippy --workspace --all-targets -- -D warnings Not-tested: Manual invocation of `claw agents --output-format json` outside automated tests	2026-04-06 01:42:59 +00:00
Yeachan-Heo	ee92f131b0	Stabilize plugin lifecycle temp dirs across parallel tests	2026-04-06 01:18:56 +00:00
Yeachan-Heo	d94d792a48	Expose actionable ids for opaque provider failures Issue #22 was triggered by generic upstream fatal wrappers that only surfaced 'Something went wrong', which left repeated Jobdori-style failures opaque in the CLI. Capture provider request ids on error responses, classify the known generic wrapper as provider_internal, and prefix the user-visible runtime error with the failure class plus session/trace identifiers so operators can correlate the failure quickly. Constraint: Keep the fix small and user-safe without redesigning the broader runtime error taxonomy Constraint: Preserve existing non-generic error text unless the wrapper is the known opaque fatal surface Rejected: Broadly rewriting every runtime error into classified envelopes \| unnecessary scope expansion for issue #22 Confidence: high Scope-risk: narrow Reversibility: clean Directive: If more opaque wrappers appear, extend the marker list and classification helper rather than reintroducing raw wrapper text alone Tested: cargo test -p api detects_generic_fatal_wrapper_and_classifies_it_as_provider_internal -- --nocapture; cargo test -p api retries_exhausted_preserves_nested_request_id_and_failure_class -- --nocapture; cargo test -p rusty-claude-cli opaque_provider_wrapper_surfaces_failure_class_session_and_trace -- --nocapture; cargo test -p rusty-claude-cli retry_exhaustion_preserves_internal_failure_class_for_generic_provider_wrapper -- --nocapture; cargo test --workspace Not-tested: Live upstream reproduction of the Jobdori failure against a real provider session	2026-04-06 00:30:28 +00:00
Yeachan-Heo	2bab4080d6	Keep resumed /status JSON aligned with live status output The resumed slash-command path built a reduced status JSON payload by hand, so it drifted from the fresh status schema and dropped metadata like model, permission mode, workspace counters, and sandbox details. Reuse a shared status JSON builder for both code paths and tighten the resume regression tests to lock parity in place. Constraint: Resume mode does not carry an active runtime model, so restored sessions continue to report the existing restored-session sentinel value Rejected: Copy the fresh status JSON shape into the resume path again \| would recreate the same schema drift risk Confidence: high Scope-risk: narrow Directive: Keep resumed and fresh /status JSON on the same helper so future schema changes stay in parity Tested: Reproduced failure in temporary HEAD worktree with strengthened resumed_status_command_emits_structured_json_when_requested Tested: cargo test -p rusty-claude-cli resumed_status_command_emits_structured_json_when_requested --test resume_slash_commands -- --exact --nocapture Tested: cargo test -p rusty-claude-cli doctor_and_resume_status_emit_json_when_requested --test output_format_contract -- --exact --nocapture Tested: cargo test --workspace Tested: cargo fmt --check Tested: cargo clippy --workspace --all-targets -- -D warnings	2026-04-05 23:30:39 +00:00
Yeachan-Heo	19c6b29524	Close the clawability backlog with deterministic CLI output and lane lineage Finish the remaining roadmap work by making direct CLI JSON output deterministic across the non-interactive surface, restoring the degraded-startup MCP test as a real workspace test, and adding branch-lock plus commit-lineage primitives so downstream lane consumers can distinguish superseded worktree commits from canonical lineage. Constraint: Keep the user-facing config namespace centered on .claw while preserving legacy fallback discovery for compatibility Constraint: Verification needed to stay clean-room and reproducible from the checked-in workspace alone Rejected: Leave the output-format contract implied by ad-hoc smoke runs only \| too easy for direct CLI regressions to slip back into prose-only output Rejected: Keep commit provenance as free-form detail text \| downstream consumers need structured branch/worktree/supersession metadata Confidence: medium Scope-risk: moderate Directive: Extend the JSON contract through the same direct CLI entrypoints instead of adding one-off serializers on parallel code paths Tested: python .github/scripts/check_doc_source_of_truth.py Tested: cd rust && cargo fmt --all --check Tested: cd rust && cargo test --workspace Tested: cd rust && cargo clippy -p commands -p tools -p rusty-claude-cli --all-targets --no-deps -- -D warnings Not-tested: full cargo clippy --workspace --all-targets -- -D warnings still reports unrelated pre-existing runtime lint debt outside this change set	2026-04-05 18:41:02 +00:00
Yeachan-Heo	f43375f067	Complete local claw-first CLI and config surface alignment	2026-04-05 18:11:25 +00:00
Yeachan-Heo	136cedf1cc	Honor JSON output for skills and MCP inventory commands The skills and mcp inventory handlers were still emitting prose tables even when the global --output-format json flag was set. This wires structured JSON renderers into the command handlers and CLI dispatch so direct invocations and resumed slash-command execution both return machine-readable payloads while preserving existing text output in the REPL path. Constraint: Must preserve existing text output and help behavior for interactive slash commands Rejected: Parse existing prose tables into JSON at the CLI edge \| brittle and loses structured fields Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep text and JSON variants driven by the same command parsing branches so --output-format stays deterministic across entry points Tested: cargo test -p commands Tested: cargo test -p rusty-claude-cli Not-tested: Manual invocation against a live user skills registry or external MCP services	2026-04-05 18:11:25 +00:00
Yeachan-Heo	2dd05bfcef	Make .claw the only user-facing config namespace Agents, skills, and init output were still surfacing .codex/.claude paths even though the runtime already treats .claw as the canonical config home. This updates help text, reports, skill install defaults, and repo bootstrap output to present a single .claw namespace while keeping legacy discovery fallbacks in place for existing setups. Constraint: Existing .codex/.claude agent and skill directories still need to load for compatibility Rejected: Remove legacy discovery entirely \| would break existing user setups instead of just cleaning up surfaced output Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep future user-facing config, agent, and skill path copy aligned to .claw and even when legacy fallbacks remain supported internally Tested: cargo fmt --all --check; cargo test --workspace --exclude compat-harness Not-tested: cargo clippy --workspace --all-targets -- -D warnings \| fails in pre-existing unrelated runtime files (for example mcp_lifecycle_hardened.rs, mcp_tool_bridge.rs, lsp_client.rs, permission_enforcer.rs, recovery_recipes.rs, stale_branch.rs, task_registry.rs, team_cron_registry.rs, worker_boot.rs)	2026-04-05 18:11:25 +00:00
Yeachan-Heo	9b156e21cf	Route nested CLI help requests to usage instead of operand fallthrough The direct CLI wrappers for agents, skills, and mcp treated nested help flags as ordinary operands. That made commands like `claw mcp show --help` report a missing server and `claw skills install --help` fall into filesystem install logic instead of surfacing usage. This change normalizes help-path arguments before dispatch so nested help stays on the help path. The regression tests cover both handler-level behavior and end-to-end CLI output for nested help and unknown subcommands with trailing help flags. Constraint: Keep the fix scoped to direct CLI slash-command wrappers without changing unrelated parser behavior Rejected: Rework top-level argument parsing for all subcommands \| broader risk than needed for the regression Confidence: high Scope-risk: narrow Reversibility: clean Directive: If more nested subcommands are added, extend the help-path normalization table before relying on raw operand dispatch Tested: cargo build -p commands -p rusty-claude-cli Tested: cargo test -p commands -p rusty-claude-cli Not-tested: cargo clippy -p commands -p rusty-claude-cli --all-targets --no-deps -- -D warnings (pre-existing warnings in untouched files block clean run)	2026-04-05 18:11:25 +00:00
Yeachan-Heo	f0d82a7cc0	Keep doctor and local help paths shell-native Promote doctor into a real top-level CLI action, reuse the same local report for resumed and REPL doctor invocations, and intercept doctor/status/sandbox help flags before prompt-mode dispatch. The parser change also closes the help fallthrough that previously wandered into runtime startup for local-info commands. Constraint: Preserve prompt shorthand for normal multi-word text input while fixing exact local subcommand help paths Rejected: Route \7[1G[2K[m⠋ 🦀 Thinking...[0m8[1G[2K[m✘ ❌ Request failed [0m through prompt/slash guidance \| still shells out through the wrong surface and keeps health checks hidden Rejected: Reuse the status report as doctor output \| status does not explain auth/config health or expose a dedicated diagnostic summary Confidence: high Scope-risk: narrow Directive: Keep doctor local-only unless an explicit network probe is intentionally added and separately tested Tested: cargo build -p rusty-claude-cli; cargo test -p rusty-claude-cli; cargo run -p rusty-claude-cli -- doctor --help; CLAW_CONFIG_HOME=/tmp/tmp.7pm9SVzOPN ANTHROPIC_API_KEY= ANTHROPIC_AUTH_TOKEN= cargo run -p rusty-claude-cli -- doctor Not-tested: direct /doctor outside the REPL remains interactive-only	2026-04-05 18:11:25 +00:00
Yeachan-Heo	c3b0e12164	Remove unshipped rusty-claude-cli prototype modules The shipped CLI surface lives in `src/main.rs`, which only wires `init`, `input`, and `render`. The legacy `app.rs` and `args.rs` prototypes were not in the module tree and had no inbound references, so this change deletes those orphaned files instead of widening scope into a larger refactor. It also aligns the TUI enhancement plan with that reality so the document no longer describes the removed prototypes as current tracked structure. Constraint: Must preserve shipped CLI parsing and slash-command behavior Rejected: Refactor main.rs into smaller modules now \| widens scope beyond behavior-safe cleanup Rejected: Leave TUI plan wording untouched \| leaves low-risk stale documentation behind Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep this slice deletion-first; do not reintroduce alternate CLI surfaces without wiring them into main.rs and its tests Tested: cargo test -p rusty-claude-cli defaults_to_repl_when_no_args Tested: cargo test -p rusty-claude-cli parses_login_and_logout_subcommands Tested: cargo test -p rusty-claude-cli parses_direct_agents_mcp_and_skills_slash_commands Tested: cargo test -p rusty-claude-cli direct_slash_commands_surface_shared_validation_errors Tested: cargo test -p rusty-claude-cli parses_resume_flag_with_multiple_slash_commands -- --nocapture Tested: cargo test -p rusty-claude-cli resumed_binary_accepts_slash_commands_with_arguments -- --nocapture Tested: cargo check -p rusty-claude-cli Tested: git diff --check Not-tested: cargo clippy -p rusty-claude-cli --all-targets -- -D warnings (pre-existing failures in rust/crates/runtime/* and existing warnings outside this diff)	2026-04-05 17:44:34 +00:00
Yeachan-Heo	eb4d3b11ee	merge fix/p2-19-subcommand-help-fallthrough	2026-04-05 16:54:59 +00:00
Yeachan-Heo	24d8f916c8	merge fix/p0-10-json-status	2026-04-05 16:54:38 +00:00
Yeachan-Heo	30883bddbd	Keep doctor and local help paths shell-native Promote doctor into a real top-level CLI action, reuse the same local report for resumed and REPL doctor invocations, and intercept doctor/status/sandbox help flags before prompt-mode dispatch. The parser change also closes the help fallthrough that previously wandered into runtime startup for local-info commands. Constraint: Preserve prompt shorthand for normal multi-word text input while fixing exact local subcommand help paths Rejected: Route \7[1G[2K[m⠋ 🦀 Thinking...[0m8[1G[2K[m✘ ❌ Request failed [0m through prompt/slash guidance \| still shells out through the wrong surface and keeps health checks hidden Rejected: Reuse the status report as doctor output \| status does not explain auth/config health or expose a dedicated diagnostic summary Confidence: high Scope-risk: narrow Directive: Keep doctor local-only unless an explicit network probe is intentionally added and separately tested Tested: cargo build -p rusty-claude-cli; cargo test -p rusty-claude-cli; cargo run -p rusty-claude-cli -- doctor --help; CLAW_CONFIG_HOME=/tmp/tmp.7pm9SVzOPN ANTHROPIC_API_KEY= ANTHROPIC_AUTH_TOKEN= cargo run -p rusty-claude-cli -- doctor Not-tested: direct /doctor outside the REPL remains interactive-only	2026-04-05 16:44:36 +00:00
Yeachan-Heo	1a2fa1581e	Keep status JSON machine-readable for automation The global --output-format json flag already reached prompt-mode responses, but status and sandbox still bypassed that path and printed human-readable tables. This change threads the selected output format through direct command aliases and resumed slash-command execution so status queries emit valid structured JSON instead of mixed prose. It also adds end-to-end regression coverage for direct status/sandbox JSON and resumed /status JSON so shell automation can rely on stable parsing. Constraint: Global output formatting must stay compatible with existing text-mode reports Rejected: Require callers to scrape text status tables \| fragile and breaks automation Confidence: high Scope-risk: narrow Directive: New direct commands that honor --output-format should thread the format through CliAction and resumed slash execution paths Tested: cargo build -p rusty-claude-cli Tested: cargo test -p rusty-claude-cli -- --nocapture Tested: cargo test --workspace Tested: cargo run -q -p rusty-claude-cli -- --output-format json status Tested: cargo run -q -p rusty-claude-cli -- --output-format json sandbox Not-tested: cargo clippy --workspace --all-targets -- -D warnings (fails in pre-existing runtime files unrelated to this change)	2026-04-05 16:41:02 +00:00
Yeachan-Heo	1f53d961ff	Route nested CLI help requests to usage instead of operand fallthrough The direct CLI wrappers for agents, skills, and mcp treated nested help flags as ordinary operands. That made commands like `claw mcp show --help` report a missing server and `claw skills install --help` fall into filesystem install logic instead of surfacing usage. This change normalizes help-path arguments before dispatch so nested help stays on the help path. The regression tests cover both handler-level behavior and end-to-end CLI output for nested help and unknown subcommands with trailing help flags. Constraint: Keep the fix scoped to direct CLI slash-command wrappers without changing unrelated parser behavior Rejected: Rework top-level argument parsing for all subcommands \| broader risk than needed for the regression Confidence: high Scope-risk: narrow Reversibility: clean Directive: If more nested subcommands are added, extend the help-path normalization table before relying on raw operand dispatch Tested: cargo build -p commands -p rusty-claude-cli Tested: cargo test -p commands -p rusty-claude-cli Not-tested: cargo clippy -p commands -p rusty-claude-cli --all-targets --no-deps -- -D warnings (pre-existing warnings in untouched files block clean run)	2026-04-05 16:38:43 +00:00
Yeachan-Heo	784f07abfa	Harden worker boot recovery before task dispatch The worker boot registry now exposes the requested lifecycle states, emits structured trust and prompt-delivery events, and recovers from shell or wrong-target prompt delivery by replaying the last prompt. Supporting fixes keep MCP remote config parsing backwards-compatible and make CLI argument parsing less dependent on ambient config and cwd state so the workspace stays green under full parallel test runs. Constraint: Worker prompts must not be dispatched before a confirmed ready_for_prompt handshake Constraint: Prompt misdelivery recovery must stay minimal and avoid new dependencies Rejected: Keep prompt_accepted and blocked as public lifecycle states \| user requested the narrower explicit state set Rejected: Treat url-only MCP server configs as invalid \| existing CLI/runtime tests still rely on that shorthand Confidence: high Scope-risk: moderate Reversibility: clean Directive: Preserve prompt_in_flight semantics when extending worker boot; misdelivery detection depends on it Tested: cargo build --workspace; cargo test --workspace Not-tested: Live tmux worker delivery against a real external coding agent pane	2026-04-04 14:50:43 +00:00
Yeachan-Heo	8a9ea1679f	feat(mcp+lifecycle): MCP degraded-startup reporting, lane event schema, lane completion hardening Add MCP structured degraded-startup classification (P2.10): - classify MCP failures as startup/handshake/config/partial - expose failed_servers + recovery_recommendations in tool output - add mcp_degraded output field with server_name, failure_mode, recoverable Canonical lane event schema (P2.7): - add LaneEventName variants for all lifecycle states - wire LaneEvent::new with full 3-arg signature (event, status, emitted_at) - emit typed events for Started, Blocked, Failed, Finished Fix let mut executor for search test binary Fix lane_completion unused import warnings Note: mcp_stdio::manager_discovery_report test has pre-existing failure on clean main, unrelated to this commit.	2026-04-04 14:31:56 +00:00
Yeachan-Heo	ac3ad57b89	fix(ci): apply rustfmt to main	2026-04-04 02:18:52 +00:00
Jobdori	3327d0e3fe	fix(tests): isolate render_diff_report tests from real working-tree state Replace with_current_dir+render_diff_report() with direct render_diff_report_for(&root) calls in the three diff-report tests. The env_lock mutex only serializes within one test binary; cargo test --workspace runs binaries in parallel, so set_current_dir races were possible across binaries. render_diff_report_for(cwd) accepts an explicit path and requires no global state mutation, making the tests reliably green under full workspace parallelism.	2026-04-04 05:33:18 +09:00
Jobdori	5ad05c68a3	merge: ultraclaw/mcp-lifecycle-harden into main	2026-04-04 00:45:12 +09:00
Jobdori	13015f6428	feat(runtime): hardened MCP lifecycle with phase tracking and degraded-mode reporting	2026-04-04 00:42:43 +09:00
Jobdori	e1db949353	feat(runtime): typed task packet format for structured claw dispatch	2026-04-04 00:40:20 +09:00
Yeachan-Heo	bf5eb8785e	Recover the MCP lane on top of current main This resolves the stale-branch merge against origin/main, keeps the MCP runtime wiring, and preserves prompt-approved CLI tool execution after the mock parity harness additions landed upstream. Constraint: Branch had to absorb origin/main changes through a contentful merge before more MCP work Constraint: Prompt-approved runtime tool execution must continue working with new CLI/mock parity coverage Rejected: Keep permission enforcer attached inside CliToolExecutor for conversation turns \| caused prompt-approved bash parity flow to fail as a tool error Rejected: Defer the merge and continue on stale history \| would leave the lane red against current main Confidence: high Scope-risk: moderate Reversibility: clean Directive: Runtime permission policy and executor-side permission enforcement are separate layers; do not reapply executor enforcement to conversation turns without revalidating mock parity harness approval flows Tested: cargo test -p rusty-claude-cli --test mock_parity_harness -- --nocapture; cargo test -p rusty-claude-cli -- --nocapture; cargo test --workspace -- --nocapture Not-tested: Additional live remote/provider scenarios beyond the existing workspace suite	2026-04-03 14:51:18 +00:00
Yeachan-Heo	b3fe057559	Close the MCP lifecycle gap from config to runtime tool execution This wires configured MCP servers into the CLI/runtime path so discovered MCP tools, resource wrappers, search visibility, shutdown handling, and best-effort discovery all work together instead of living as isolated runtime primitives. Constraint: Keep non-MCP startup flows working without new required config Constraint: Preserve partial availability when one configured MCP server fails discovery Rejected: Fail runtime startup on any MCP discovery error \| too brittle for mixed healthy/broken server configs Rejected: Keep MCP support runtime-only without registry wiring \| left discovery and invocation unreachable from the CLI tool lane Confidence: high Scope-risk: moderate Reversibility: clean Directive: Runtime MCP tools are registry-backed but executed through CliToolExecutor state; keep future tool-registry changes aligned with that split Tested: cargo test -p runtime mcp -- --nocapture; cargo test -p tools -- --nocapture; cargo test -p rusty-claude-cli -- --nocapture; cargo test --workspace -- --nocapture Not-tested: Live remote MCP transports (http/sse/ws/sdk) remain unsupported in the CLI execution path	2026-04-03 14:31:25 +00:00
Jobdori	a2351fe867	feat(harness+usage): add auto_compact and token_cost parity scenarios Two new mock parity harness scenarios: 1. auto_compact_triggered (session-compaction category) - Mock returns 50k input tokens, validates auto_compaction key is present in JSON output - Validates format parity; trigger behavior covered by conversation::tests::auto_compacts_when_cumulative_input_threshold_is_crossed 2. token_cost_reporting (token-usage category) - Mock returns known token counts (1k input, 500 output) - Validates input/output token fields present in JSON output Additional changes: - Add estimated_cost to JSON prompt output (format_usd + pricing_for_model) - Add final_text_sse_with_usage and text_message_response_with_usage helpers to mock-anthropic-service for parameterized token counts - Add ScenarioCase.extra_env and ScenarioCase.resume_session fields - Update mock_parity_scenarios.json: 10 -> 12 scenarios - Update harness request count assertion: 19 -> 21 cargo test --workspace: 558 passed, 0 failed	2026-04-03 22:41:42 +09:00
Jobdori	6325add99e	fix(tests): add env_lock to permission-sensitive CLI arg tests Tests relying on PermissionMode::DangerFullAccess as default were flaky under --workspace runs because other tests set RUSTY_CLAUDE_PERMISSION_MODE without cleanup. Added env_lock() and explicit env var removal to 7 affected tests. Fixes: workspace-level cargo test flake (1 random test fails per run)	2026-04-03 22:07:12 +09:00
Jobdori	0490636031	feat(commands): expand slash command surface 67 → 135 specs Add 68 new slash command specs covering: - Approval flow: approve/deny - Editing: undo, retry, paste, image, screenshot - Code ops: test, lint, build, run, fix, refactor, explain, docs, perf - Git: git, stash, blame, log - LSP: symbols, references, definition, hover, diagnostics, autofix - Navigation: focus/unfocus, web, map, search, workspace - Model: max-tokens, temperature, system-prompt, tool-details - Session: history, tokens, cache, pin/unpin, bookmarks, format - Infra: cron, team, parallel, multi, macro, alias - Config: api-key, language, profile, telemetry, env, project - Other: providers, notifications, changelog, templates, benchmark, migrate, reset Update tests: flexible assertions for expanded command surface	2026-04-03 19:52:40 +09:00
Jobdori	8cc7d4c641	chore: additional AI slop cleanup and enforcer wiring from sessions 1/5 Session 1 (ses_2ad65873): with_enforcer builders + 2 regression tests Session 5 (ses_2ad67e8e): continued AI slop cleanup pass — redundant comments, unused_self suppressions, unreachable! tightening Session cleanup (ses_2ad6b26c): Python placeholder centralization Workspace tests: 363+ passed, 0 failed.	2026-04-03 18:35:27 +09:00
Jobdori	618a79a9f4	feat: ultraclaw session outputs — registry tests, MCP bridge, PARITY.md, cleanup Ultraclaw mode results from 10 parallel opencode sessions: - PARITY.md: Updated both copies with all 9 landed lanes, commit hashes, line counts, and test counts. All checklist items marked complete. - MCP bridge: McpToolRegistry.call_tool now wired to real McpServerManager via async JSON-RPC (discover_tools -> tools/call -> shutdown) - Registry tests: Added coverage for TaskRegistry, TeamRegistry, CronRegistry, PermissionEnforcer, LspRegistry (branch-focused tests) - Permissions refactor: Simplified authorize_with_context, extracted helpers, added characterization tests (185 runtime tests pass) - AI slop cleanup: Removed redundant comments, unused_self suppressions, tightened unreachable branches - CLI fixes: Minor adjustments in main.rs and hooks.rs All 363+ tests pass. Workspace compiles clean.	2026-04-03 18:23:03 +09:00
Yeachan-Heo	85c5b0e01d	Expand parity harness coverage before behavioral drift lands The landed mock Anthropic harness now covers multi-tool turns, bash flows, permission prompt approve/deny paths, and an external plugin tool path. A machine-readable scenario manifest plus a diff/checklist runner keep the new scenarios tied back to PARITY.md so future additions stay honest. Constraint: Must build on the deterministic mock service and clean-environment CLI harness Rejected: Add an MCP tool scenario now \| current MCP tool surface is still stubbed, so plugin coverage is the real executable path Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep rust/mock_parity_scenarios.json, mock_parity_harness.rs, and PARITY.md refs in lockstep Tested: cargo fmt --all Tested: cargo clippy --workspace --all-targets -- -D warnings Tested: cargo test --workspace Tested: python3 rust/scripts/run_mock_parity_diff.py Not-tested: Real MCP lifecycle handshakes; remote plugin marketplace install flows	2026-04-03 04:00:33 +00:00
Yeachan-Heo	c2f1304a01	Lock down CLI-to-mock behavioral parity for Anthropic flows This adds a deterministic mock Anthropic-compatible /v1/messages service, a clean-environment CLI harness, and repo docs so the first parity milestone can be validated without live network dependencies. Constraint: First milestone must prove Rust claw can connect from a clean environment and cover streaming, tool assembly, and permission/tool flow Constraint: No new third-party dependencies; reuse the existing Rust workspace stack Rejected: Record/replay live Anthropic traffic \| nondeterministic and unsuitable for repeatable CI coverage Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep scenario markers and expected tool payload shapes synchronized between the mock service and the harness tests Tested: cargo fmt --all Tested: cargo clippy --workspace --all-targets -- -D warnings Tested: cargo test --workspace Tested: ./scripts/run_mock_parity_harness.sh Not-tested: Live Anthropic responses beyond the five scripted harness scenarios	2026-04-03 01:15:52 +00:00
Jobdori	03bd7f0551	feat: add 40 slash commands — command surface 67/141 Port 40 missing user-facing slash commands from upstream parity audit: Session: /doctor, /login, /logout, /usage, /stats, /rename, /privacy-settings Workspace: /branch, /add-dir, /files, /hooks, /release-notes Discovery: /context, /tasks, /doctor, /ide, /desktop Analysis: /review, /security-review, /advisor, /insights Appearance: /theme, /vim, /voice, /color, /effort, /fast, /brief, /output-style, /keybindings, /stickers Communication: /copy, /share, /feedback, /summary, /tag, /thinkback, /plan, /exit, /upgrade, /rewind All commands have full SlashCommandSpec, enum variant, parse arm, and stub handler. Category system expanded with two new categories. Tests updated for new counts (67 specs, 39 resume-supported). fmt/clippy/tests all green.	2026-04-03 08:09:14 +09:00
Jobdori	06151c57f3	fix: make startup_banner test credential-free Remove the #[ignore] gate from startup_banner_mentions_workflow_completions by injecting a dummy ANTHROPIC_API_KEY. The test exercises LiveCli banner rendering, not API calls. Cleanup env var after test. Test suite now 102/102 in CLI crate (was 101 + 1 ignored).	2026-04-03 07:04:30 +09:00
Jobdori	08ed9a7980	fix: make plugin lifecycle test credential-free Inject a dummy ANTHROPIC_API_KEY for build_runtime_runs_plugin_lifecycle_init_and_shutdown so the test exercises plugin init/shutdown without requiring real credentials. The API client is constructed but never used for streaming. Clean up the env var after the test to avoid polluting parallel tests.	2026-04-03 05:53:18 +09:00
Jobdori	fbafb9cffc	fix: post-merge clippy/fmt cleanup (9407-9410 integration)	2026-04-03 05:12:51 +09:00
Jobdori	06a93a57c7	merge: clawcode-issue-9410-cli-ux-progress-status-clear into main	2026-04-03 05:08:19 +09:00
Jobdori	698ce619ca	merge: clawcode-issue-9409-config-env-project-permissions into main	2026-04-03 05:08:08 +09:00
Jobdori	bf848a43ce	merge: clawcode-issue-9407-cli-agents-mcp-config into main	2026-04-03 05:07:56 +09:00
Yeachan-Heo	8805386bea	merge: clawcode-issue-9406-commands-skill-install into main	2026-04-02 13:55:42 +00:00
Yeachan-Heo	5d8e131c14	Wire plugin hooks and lifecycle into runtime startup PARITY.md is stale relative to the current Rust plugin pipeline: plugin manifests, tool loading, and lifecycle primitives already exist, but runtime construction only consumed plugin tools. This change routes enabled plugin hooks into the runtime feature config, initializes plugin lifecycle commands when a runtime is built, and shuts plugins down when runtimes are replaced or dropped.\n\nThe test coverage exercises the new runtime plugin-state builder and verifies init/shutdown execution without relying on global cwd or config-home mutation, so the existing CLI suite stays stable under parallel execution.\n\nConstraint: Keep the change inside the current worktree and avoid touching unrelated pre-existing edits\nRejected: Add plugin hook execution inside the tools crate directly \| runtime feature merging is the existing execution boundary\nRejected: Use process-global CLAW_CONFIG_HOME/current_dir in tests \| races with the existing parallel CLI test suite\nConfidence: high\nScope-risk: moderate\nReversibility: clean\nDirective: Preserve plugin runtime shutdown when rebuilding LiveCli runtimes or temporary turn runtimes\nTested: cargo test -p rusty-claude-cli build_runtime_\nTested: cargo test -p rusty-claude-cli\nNot-tested: End-to-end live REPL session with a real plugin outside the test harness	2026-04-02 10:04:54 +00:00
Yeachan-Heo	9c67607670	Expose configured MCP servers from the CLI PARITY.md called out missing MCP management in the Rust CLI, so this adds a focused read-only /mcp path instead of expanding the broader config surface first. The new command works in the REPL, with --resume, and as a direct 7[1G[2K[m⠋ 🦀 Thinking...[0m8[1G[2K[m✘ ❌ Request failed [0m entrypoint. It lists merged MCP server definitions, supports detailed inspection for one server, and adds targeted tests for parsing, help text, completion hints, and config-backed rendering. Constraint: Keep the enhancement inside the existing Rust slash-command architecture Rejected: Extend /config with a raw mcp dump only \| less discoverable than a dedicated MCP workflow Confidence: high Scope-risk: narrow Directive: Keep /mcp read-only unless MCP lifecycle commands gain shared runtime orchestration Tested: cargo test -p commands parses_supported_slash_commands Tested: cargo test -p commands rejects_invalid_mcp_arguments Tested: cargo test -p commands renders_help_from_shared_specs Tested: cargo test -p commands renders_per_command_help_detail_for_mcp Tested: cargo test -p commands ignores_unknown_or_runtime_bound_slash_commands Tested: cargo test -p commands mcp_usage_supports_help_and_unexpected_args Tested: cargo test -p commands renders_mcp_reports_from_loaded_config Tested: cargo test -p rusty-claude-cli parses_login_and_logout_subcommands Tested: cargo test -p rusty-claude-cli parses_direct_agents_mcp_and_skills_slash_commands Tested: cargo test -p rusty-claude-cli repl_help_includes_shared_commands_and_exit Tested: cargo test -p rusty-claude-cli completion_candidates_include_workflow_shortcuts_and_dynamic_sessions Tested: cargo test -p rusty-claude-cli resume_supported_command_list_matches_expected_surface Tested: cargo test -p rusty-claude-cli init_help_mentions_direct_subcommand Tested: cargo run -p rusty-claude-cli -- mcp help Not-tested: Live MCP server connectivity against a real remote or stdio backend	2026-04-02 10:04:40 +00:00

1 2 3 4 5 ...

352 Commits