claw-code

Commit Graph

Author	SHA1	Message	Date
Yeachan-Heo	5e65b33042	US-021: Add request body size pre-flight check for OpenAI-compatible provider	2026-04-16 17:41:57 +00:00
Yeachan-Heo	87b982ece5	US-011: Performance optimization for API request serialization Added criterion benchmarks and optimized flatten_tool_result_content: - Added criterion dev-dependency and request_building benchmark suite - Optimized flatten_tool_result_content to pre-allocate capacity and avoid intermediate Vec construction (was collecting to Vec then joining) - Made key functions public for benchmarking: translate_message, build_chat_completion_request, flatten_tool_result_content, is_reasoning_model, model_rejects_is_error_field Benchmark results: - flatten_tool_result_content/single_text: ~17ns - translate_message/text_only: ~200ns - build_chat_completion_request/10 messages: ~16.4µs - is_reasoning_model detection: ~26-42ns All 119 unit tests and 29 integration tests pass. cargo clippy passes Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-16 11:11:45 +00:00
Yeachan-Heo	3e4e1585b5	US-009: Add comprehensive unit tests for kimi model compatibility fix Added 4 unit tests to verify is_error field handling for kimi models: - model_rejects_is_error_field_detects_kimi_models: Detects kimi-k2.5, kimi-k1.5, dashscope/kimi-k2.5 (case insensitive) - translate_message_includes_is_error_for_non_kimi_models: Verifies gpt-4o, grok-3, claude include is_error - translate_message_excludes_is_error_for_kimi_models: Verifies kimi models exclude is_error (prevents 400 Bad Request) - build_chat_completion_request_kimi_vs_non_kimi_tool_results: Full integration test for request building All 119 unit tests and 29 integration tests pass. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-16 10:54:48 +00:00
Yeachan-Heo	124e8661ed	Remove the deprecated Claude subscription login path and restore a green Rust workspace ROADMAP #37 was still open even though several earlier backlog items were already closed. This change removes the local login/logout surface, stops startup auth resolution from treating saved OAuth credentials as a supported path, and updates diagnostics/help to point users at ANTHROPIC_API_KEY or ANTHROPIC_AUTH_TOKEN only. While proving the change with the user-requested workspace gates, clippy surfaced additional pre-existing warning failures across the Rust workspace. Those were cleaned up in-place so the required `cargo fmt`, `cargo clippy --workspace --all-targets -- -D warnings`, and `cargo test --workspace` sequence now passes end to end. Constraint: User explicitly required full-workspace fmt/clippy/test before commit/push Constraint: Existing dirty leader worktree had to be stashed before attempted OMX team worktree launch Rejected: Keep login/logout but hide them from help \| left unsupported auth flow and saved OAuth fallback intact Rejected: Stop after ROADMAP #37 targeted tests \| did not satisfy required full-workspace verification gate Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Do not reintroduce saved OAuth as a silent Anthropic startup fallback without an explicit supported auth policy Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: Remote push effects beyond origin/main update	2026-04-11 17:24:44 +00:00
YeonGyu-Kim	1ecdb1076c	fix(api): OPENAI_BASE_URL wins over Anthropic fallback for unknown models When OPENAI_BASE_URL is set, the user explicitly configured an OpenAI-compatible endpoint (Ollama, LM Studio, vLLM, etc.). Model names like 'qwen2.5-coder:7b' or 'llama3:latest' don't match any recognized prefix, so detect_provider_kind() fell through to Anthropic — asking for Anthropic credentials even though the user clearly intended a local provider. Now: OPENAI_BASE_URL + OPENAI_API_KEY beats Anthropic env-check in the cascade. OPENAI_BASE_URL alone (no API key — common for Ollama) is a last-resort fallback before the Anthropic default. Source: MaxDerVerpeilte in #claw-code (Ollama + qwen2.5-coder:7b); traced by gaebal-gajae.	2026-04-10 12:37:39 +09:00
YeonGyu-Kim	6ae8850d45	fix(api): silence dead_code warning and remove duplicated #[test] attr - Add #[allow(dead_code)] on test-only Delta struct (content field used for deserialization but not read in assertion) - Remove duplicated #[test] attribute on assistant_message_without_tool_calls_omits_tool_calls_field Zero warnings in cargo test --workspace.	2026-04-10 07:33:22 +09:00
YeonGyu-Kim	a3d0c9e5e7	fix(api): sanitize orphaned tool messages at request-building layer Adds sanitize_tool_message_pairing() called from build_chat_completion_request() after translate_message() runs. Drops any role:"tool" message whose immediately-preceding non-tool message is role:"assistant" but has no tool_calls entry matching the tool_call_id. This is the second layer of the tool-pairing invariant defense: - 6e301c8: compaction boundary fix (producer layer) - this commit: request-builder sanitizer (sender layer) Together these close the 400-error loop for resumed/compacted multi-turn tool sessions on OpenAI-compatible backends. Sanitization only fires when preceding message is role:assistant (not user/system) to avoid dropping valid translation artifacts from mixed user-message content blocks. Regression tests: sanitize_drops_orphaned_tool_messages covers valid pair, orphaned tool (no tool_calls in preceding assistant), mismatched id, and two tool results both referencing the same assistant turn. 116 api + 159 CLI + 431 runtime tests pass. Fmt clean.	2026-04-10 01:35:00 +09:00
YeonGyu-Kim	ed42f8f298	fix(api): surface provider error in SSE stream frames (companion to `ff416ff`) Same fix as `ff416ff` but for the streaming path. Some backends embed an error JSON object in an SSE data: frame: data: {"error":{"message":"context too long","code":400}} parse_sse_frame() was attempting to deserialize this as ChatCompletionChunk and failing with 'missing field' / 'invalid type', hiding the actual backend error message. Fix: check for an 'error' key before full chunk deserialization, same as the non-streaming path in `ff416ff`. Symmetric pair: - ff416ff: non-streaming path (response body) - this: streaming path (SSE data: frame) 115 api + 159 CLI tests pass. Fmt clean.	2026-04-09 23:03:33 +09:00
YeonGyu-Kim	ff416ff3e7	fix(api): surface provider error body before attempting completion parse When a local/proxy OpenAI-compatible backend returns an error object: {"error":{"message":"...","type":"...","code":...}} claw was trying to deserialize it as a ChatCompletionResponse and failing with the cryptic 'failed to parse OpenAI response: missing field id', completely hiding the actual backend error message. Fix: before full deserialization, check if the parsed JSON has an 'error' key and promote it directly to ApiError::Api so the user sees the real error (e.g. 'The number of tokens to keep from the initial prompt is greater than the context length'). Source: devilayu in #claw-code 2026-04-09 — local LM Studio context limit error was invisible; user saw 'missing field id' instead. 159 CLI + 115 api tests pass. Fmt clean.	2026-04-09 22:33:07 +09:00
YeonGyu-Kim	6ac7d8cd46	fix(api): omit tool_calls field from assistant messages when empty When serializing a multi-turn conversation for the OpenAI-compatible path, assistant messages with no tool calls were always emitting 'tool_calls: []'. Some providers reject requests where a prior assistant turn carries an explicit empty tool_calls array (400 on subsequent turns after a plain text assistant response). Fix: only include 'tool_calls' in the serialized assistant message when the vec is non-empty. Empty case omits the field entirely. This is a companion fix to `fd7aade` (null tool_calls in stream delta). The two bugs are symmetric: `fd7aade` handled inbound null -> empty vec; this handles outbound empty vec -> field omitted. Two regression tests added: - assistant_message_without_tool_calls_omits_tool_calls_field - assistant_message_with_tool_calls_includes_tool_calls_field 115 api tests pass. Fmt clean. Source: gaebal-gajae repro 2026-04-09 (400 on multi-turn, companion to null tool_calls stream-delta fix).	2026-04-09 22:06:25 +09:00
YeonGyu-Kim	fd7aade5b5	fix(api): tolerate null tool_calls in OpenAI-compat stream delta chunks Some OpenAI-compatible providers emit 'tool_calls: null' in streaming delta chunks instead of omitting the field or using an empty array: "delta": {"content":"","function_call":null,"tool_calls":null} serde's #[serde(default)] only handles absent keys — an explicit null value still fails deserialization with: 'invalid type: null, expected a sequence' Fix: replace #[serde(default)] with a custom deserializer helper deserialize_null_as_empty_vec() that maps null -> Vec::default(), keeping the existing absent-key default behaviour. Regression test added: delta_with_null_tool_calls_deserializes_as_empty_vec uses the exact provider response shape from gaebal-gajae's repro (2026-04-09). 112 api lib tests pass. Fmt clean. Companion to gaebal-gajae's local 448cf2c — independently reproduced and landed on main.	2026-04-09 21:39:52 +09:00
YeonGyu-Kim	eb044f0a02	fix(api): emit max_completion_tokens for gpt-5* on OpenAI-compat path — closes ROADMAP #35 gpt-5.x models reject requests with max_tokens and require max_completion_tokens. Detect wire model starting with 'gpt-5' and switch the JSON key accordingly. Older models (gpt-4o etc.) continue to receive max_tokens unchanged. Two regression tests added: - gpt5_uses_max_completion_tokens_not_max_tokens - non_gpt5_uses_max_tokens 140 api tests pass, cargo fmt clean.	2026-04-09 09:33:45 +09:00
Jobdori	e4c3871882	feat(api): add reasoning_effort field to MessageRequest and OpenAI-compat path Users of OpenAI-compatible reasoning models (o4-mini, o3, deepseek-r1, etc.) had no way to control reasoning effort — the field was missing from MessageRequest and never emitted in the request body. Changes: - Add `reasoning_effort: Option<String>` to `MessageRequest` in types.rs - Annotated with skip_serializing_if = "Option::is_none" for clean JSON - Accepted values: "low", "medium", "high" (passed through verbatim) - In `build_chat_completion_request`, emit `"reasoning_effort"` when set - Two unit tests: - `reasoning_effort_is_included_when_set`: o4-mini + "high" → field present - `reasoning_effort_omitted_when_not_set`: gpt-4o, no field → absent Existing callers use `..Default::default()` and are unaffected. One struct-literal test that listed all fields explicitly updated with `reasoning_effort: None`. The CLI flag to expose this to users is a follow-up (ROADMAP #34 partial). This commit lands the foundational API-layer plumbing needed for that. Partial ROADMAP #34.	2026-04-09 04:02:59 +09:00
Jobdori	beb09df4b8	style(api): cargo fmt fix on normalize_object_schema test assertions	2026-04-09 03:43:59 +09:00
Jobdori	e7e0fd2dbf	fix(api): strict object schema for OpenAI /responses endpoint OpenAI /responses validates tool function schemas strictly: - object types must have "properties" (at minimum {}) - "additionalProperties": false is required /chat/completions is lenient and accepts schemas without these fields, but /responses rejects them with "object schema missing properties" / "invalid_function_parameters". Add normalize_object_schema() which recursively walks the JSON Schema tree and fills in missing "properties"/{} and "additionalProperties":false on every object-type node. Existing values are not overwritten. Call it in openai_tool_definition() before building the request payload so both /chat/completions and /responses receive strict-validator-safe schemas. Add unit tests covering: - bare object schema gets both fields injected - nested object schemas are normalised recursively - existing additionalProperties is not overwritten Fixes the live repro where gpt-5.4 via OpenAI compat accepted connection and routing but rejected every tool call with schema validation errors. Closes ROADMAP #33.	2026-04-09 03:03:43 +09:00
Jobdori	adcea6bceb	fix(api): route DashScope models to dashscope config, not openai ProviderClient::from_model_with_anthropic_auth was dispatching every ProviderKind::OpenAi match to OpenAiCompatConfig::openai(), which reads OPENAI_API_KEY and points at api.openai.com. But DashScope models (qwen-plus, qwen/qwen3-coder, etc.) also return ProviderKind::OpenAi from detect_provider_kind because DashScope speaks the OpenAI wire format. The metadata layer correctly identifies them as needing DASHSCOPE_API_KEY and the DashScope compatible-mode endpoint, but that metadata was being ignored at dispatch time. Result: users running `claw --model qwen-plus` with DASHSCOPE_API_KEY set would get a "missing OPENAI_API_KEY" error instead of being routed to DashScope. Fix: consult providers::metadata_for_model in the OpenAi dispatch arm and pick dashscope() vs openai() based on metadata.auth_env. Adds a regression test asserting ProviderClient::from_model("qwen-plus") builds with the DashScope base URL. Exposes a pub base_url() accessor on OpenAiCompatClient so the test can verify the routing. Authored by droid (Kimi K2.5 Turbo) via acpx, cleaned up by Jobdori (removed unsafe blocks unnecessary under edition 2021, imported ProviderClient from super, adopted EnvVarGuard pattern from providers/mod.rs tests). Co-Authored-By: Droid <noreply@factory.ai>	2026-04-08 18:04:37 +09:00
YeonGyu-Kim	ff1df4c7ac	fix(api): auth-provider error copy — prefix-routing hints + sk-ant-* bearer detection — closes ROADMAP #28 Two live users in #claw-code on 2026-04-08 hit adjacent auth confusion: varleg set OPENAI_API_KEY for OpenRouter but prefix routing didn't activate without openai/ model prefix, and stanley078852 put sk-ant-* in ANTHROPIC_AUTH_TOKEN (Bearer path) instead of ANTHROPIC_API_KEY (x-api-key path) and got 401 Invalid bearer token. Changes: 1. ApiError::MissingCredentials gained optional hint field (error.rs) 2. anthropic_missing_credentials_hint() sniffs OPENAI/XAI/DASHSCOPE env vars and suggests prefix routing when present (providers/mod.rs) 3. All 4 Anthropic auth paths wire the hint helper (anthropic.rs) 4. 401 + sk-ant-* in bearer token detected and hint appended 5. 'Which env var goes where' section added to USAGE.md Tests: unit tests for all three improvements (no HTTP calls needed). Workspace: all tests green, fmt clean, clippy warnings-only. Source: live users varleg + stanley078852 in #claw-code 2026-04-08. Co-authored-by: gaebal-gajae <gaebal-gajae@layofflabs.com>	2026-04-08 16:29:03 +09:00
YeonGyu-Kim	8c6dfe57e6	fix(api): restore local preflight guard ahead of count_tokens round-trip CI has been red since `be561bf` ('Use Anthropic count tokens for preflight') because that commit replaced the free-function preflight_message_request (byte-estimate guard) with an instance method that silently returns Ok on any count_tokens failure: let counted_input_tokens = match self.count_tokens(request).await { Ok(count) => count, Err(_) => return Ok(()), // <-- silent bypass }; Two consequences: 1. client_integration::send_message_blocks_oversized_requests_before_the_http_call has been FAILING on every CI run since `be561bf`. The mock server in that test only has one HTTP response queued (a bare '{}' to satisfy the main request), so the count_tokens POST parses into an empty body that fails to deserialize into CountTokensResponse -> Err -> silent bypass -> the oversized 600k-char request proceeds to the mock instead of being rejected with ContextWindowExceeded as the test expects. 2. In production, any third-party Anthropic-compatible gateway that doesn't implement /v1/messages/count_tokens (OpenRouter, Cloudflare AI Gateway, etc.) would silently disable the preflight guard entirely, letting oversized requests hit the upstream only to fail there with a provider- side context-window error. This is exactly the 'opaque failure surface' ROADMAP #22 asked us to avoid. Fix: call the free-function super::preflight_message_request(request)? as the first step in the instance method, before any network round-trip. This guarantees the byte-estimate guard always fires, whether or not the remote count_tokens endpoint is reachable. The count_tokens refinement still runs afterward when available for more precise token counting, but it is now strictly additive — it can only catch more cases, never silently skip the guard. Test results: - cargo test -p api --lib: 89 passed, 0 failed - cargo test --release -p api (all test binaries): 118 passed, 0 failed - cargo test --release -p api --test client_integration \ send_message_blocks_oversized_requests_before_the_http_call: passes - cargo fmt --check: clean This unblocks the Rust CI workflow which has been red on every push since `be561bf` landed.	2026-04-08 14:34:38 +09:00
YeonGyu-Kim	3ac97e635e	feat(api): add qwen/ prefix routing for Alibaba DashScope provider Users in Discord #clawcode-get-help (web3g) asked for Qwen 3.6 Plus via native Alibaba DashScope API instead of OpenRouter, which has stricter rate limits. This commit adds first-class routing for qwen/ and bare qwen- prefixed model names. Changes: - DEFAULT_DASHSCOPE_BASE_URL constant: /compatible-mode/v1 endpoint - OpenAiCompatConfig::dashscope() factory mirroring openai()/xai() - DASHSCOPE_ENV_VARS + credential_env_vars() wiring - metadata_for_model: qwen/ and qwen- prefix routes to DashScope with auth_env=DASHSCOPE_API_KEY, reuses ProviderKind::OpenAi because DashScope speaks the OpenAI REST shape - is_reasoning_model: detect qwen-qwq, qwq-, and -thinking variants so tuning params (temperature, top_p, etc.) get stripped before payload assembly (same pattern as o1/o3/grok-3-mini) Tests added: - providers::tests::qwen_prefix_routes_to_dashscope_not_anthropic - openai_compat::tests::qwen_reasoning_variants_are_detected 89 api lib tests passing, 0 failing. cargo fmt --check: clean. Closes the user-reported gap: 'use Qwen 3.6 Plus via Alibaba API directly, not OpenRouter' without needing OPENAI_BASE_URL override or unsetting ANTHROPIC_API_KEY.	2026-04-08 14:06:26 +09:00
YeonGyu-Kim	c7b3296ef6	style: cargo fmt — fix CI formatting failures Pre-existing formatting issues in anthropic.rs surfaced by CI cargo fmt check. No functional changes.	2026-04-08 11:21:13 +09:00
YeonGyu-Kim	523ce7474a	fix(api): sanitize Anthropic body — strip frequency/presence_penalty, convert stop→stop_sequences MessageRequest now carries OpenAI-compatible tuning params (`c667d47`), but the Anthropic API does not support frequency_penalty or presence_penalty, and uses 'stop_sequences' instead of 'stop'. Without this fix, setting these params with a Claude model would produce 400 errors. Changes to strip_unsupported_beta_body_fields: - Remove frequency_penalty and presence_penalty from Anthropic request body - Convert stop → stop_sequences (only when non-empty) - temperature and top_p are preserved (Anthropic supports both) Tests added: - strip_removes_openai_only_fields_and_converts_stop - strip_does_not_add_empty_stop_sequences 87 api lib tests passing, 0 failing. cargo check --workspace: clean.	2026-04-08 09:05:10 +09:00
YeonGyu-Kim	b513d6e462	fix(api): sanitize tuning params for reasoning models (o1/o3/grok-3-mini) Reasoning models reject temperature, top_p, frequency_penalty, and presence_penalty with 400 errors. Instead of letting these flow through and returning cryptic provider errors, strip them silently at the request-builder boundary. is_reasoning_model() classifies: o1, o3, o4*, grok-3-mini. stop sequences are preserved (safe for all providers). Tests added: - reasoning_model_strips_tuning_params: o1-mini strips all 4 params, keeps stop - grok_3_mini_is_reasoning_model: classification coverage for grok-3-mini, o1, o3-mini, and negative cases (gpt-4o, grok-3, claude) 85 api lib tests passing, 0 failing.	2026-04-08 07:32:47 +09:00
YeonGyu-Kim	c667d47c70	feat(api): add tuning params (temperature, top_p, penalties, stop) to MessageRequest MessageRequest was missing standard OpenAI-compatible generation tuning parameters. Callers had no way to control temperature, top_p, frequency_penalty, presence_penalty, or stop sequences. Changes: - Added 5 optional fields to MessageRequest (all Option, None by default) - Wired into build_chat_completion_request: only included in payload when set - All existing construction sites updated with ..Default::default() - MessageRequest now derives Default for ergonomic partial construction Tests added: - tuning_params_included_in_payload_when_set: all 5 params flow into JSON - tuning_params_omitted_from_payload_when_none: absent params stay absent 83 api lib tests passing, 0 failing. cargo check --workspace: 0 warnings.	2026-04-08 07:07:33 +09:00
YeonGyu-Kim	0530c509a3	fix(api): route openai/ and gpt- model prefixes to OpenAi provider metadata_for_model returned None for unknown models like openai/gpt-4.1-mini, causing detect_provider_kind to fall through to auth-sniffer order. If ANTHROPIC_API_KEY was set, the model was silently misrouted to Anthropic and the user got a confusing 'missing Anthropic credentials' error. Fix: add explicit prefix checks for 'openai/' and 'gpt-' in metadata_for_model so the model name wins over env-var presence. Regression test added: openai_namespaced_model_routes_to_openai_not_anthropic - 'openai/gpt-4.1-mini' routes to OpenAi - 'gpt-4o' routes to OpenAi Reported and reproduced by gaebal-gajae against current main. 81 api lib tests passing, 0 failing.	2026-04-08 05:33:47 +09:00
YeonGyu-Kim	b3ccd92d24	feat: b6-pdf-extract-v2 follow-up work — batch 6	2026-04-07 16:11:51 +09:00
YeonGyu-Kim	1f968b359f	feat: b6-openai-models — batch 6	2026-04-07 15:52:30 +09:00
YeonGyu-Kim	5bcbc86a2b	feat: b5-slash-help — batch 5 upstream parity	2026-04-07 14:51:27 +09:00
YeonGyu-Kim	6a6c5acb02	feat: b5-reasoning-guard — batch 5 upstream parity	2026-04-07 14:51:27 +09:00
YeonGyu-Kim	f982f24926	fix(api): Windows env hint + .env file loading fallback When API key missing on Windows, hint about setx. Load .env from CWD as fallback with simple key=value parser.	2026-04-07 14:22:41 +09:00
YeonGyu-Kim	2a642871ad	fix(api): enrich JSON parse errors with response body, provider, and model Raw 'json_error: no field X' now includes truncated response body, provider name, and model ID for debugging context.	2026-04-07 14:22:05 +09:00
YeonGyu-Kim	ce360e0ff3	fix(api): strip anthropic beta fields from non-beta requests mikejiang: 'betas: Extra inputs are not permitted' 400 error. Only include beta headers when request targets beta endpoint.	2026-04-07 14:22:05 +09:00
YeonGyu-Kim	ce22d8fb4f	fix(api): add serde(default) to all usage/token parse paths in SSE stream Sterling reported 'json_error: no field input/input_tokens' still firing despite existing serde(default) in types.rs. Root cause: SSE streaming path had a separate deserialization site that didn't use the same defaults. - Add serde(default) to sse.rs UsageEvent deserialization - Add serde(default) to types.rs Usage struct fields (input_tokens, output_tokens) - Add regression test with empty-usage JSON response in streaming context	2026-04-07 13:44:22 +09:00
Yeachan-Heo	be561bfdeb	Use Anthropic count tokens for preflight	2026-04-06 09:38:21 +00:00
Yeachan-Heo	6bd464bbe7	Make repeated provider crashes self-identifying after retry exhaustion Generic fatal wrapper handling already preserved safe classes and trace ids for single provider failures, but repeated retry exhaustion still surfaced as provider_internal. Classify generic wrapped RetriesExhausted failures as provider_retry_exhausted so Jobdori-style repeat failures stay distinguishable from one-off provider crashes, and keep the display logic clippy-clean. Constraint: Keep the change minimal and preserve existing user-visible error wording outside retry-exhaustion classification Rejected: Broadly rework all provider error taxonomy \| unnecessary for the targeted opaque-wrapper regression Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep retry exhaustion distinct from single-shot provider_internal wrappers when the nested error is the same generic fatal wrapper Tested: cargo test -p api detects_generic_fatal_wrapper_and_classifies_it_as_provider_internal Tested: cargo test -p api retries_exhausted_preserves_nested_request_id_and_failure_class Tested: cargo test -p rusty-claude-cli opaque_provider_wrapper_surfaces_failure_class_session_and_trace Tested: cargo test -p rusty-claude-cli retry_exhaustion_uses_retry_failure_class_for_generic_provider_wrapper Tested: cargo test --workspace Tested: cargo fmt --check Tested: cargo clippy --workspace --all-targets -- -D warnings Not-tested: Live OpenClaw/Anthropic service failure telemetry outside the local test harness	2026-04-06 09:01:38 +00:00
Yeachan-Heo	b930895736	Turn oversized-context failures into recovery guidance Dogfood showed oversized requests still surfacing as raw hard errors, even when claw could tell the user exactly how to recover. This keeps context-window failures classified, recognizes the same failure when it comes back from a provider response, and renders recovery steps that point operators at the existing compaction and fresh-session paths instead of a provider-style dump. Constraint: Keep the failure class explicit so automation and operators can still distinguish context-window exhaustion from generic provider failures Constraint: Reuse existing /compact and session-reset UX instead of inventing a new recovery workflow Rejected: Auto-run compaction on failure \| mutates session state on an error path the user may want to inspect first Rejected: Only prettify local preflight failures \| provider-returned context-window errors would still leak raw failure text Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep provider-side context-window detection aligned with real oversized-request messages before broadening the marker list Tested: cargo fmt --all --check Tested: cargo test -p api Tested: cargo test -p rusty-claude-cli Tested: cargo clippy -p api -p rusty-claude-cli --all-targets -- -D warnings Not-tested: cargo test --workspace	2026-04-06 06:43:31 +00:00
Yeachan-Heo	22e3f8c5e3	Fix retry exhaustion failure classification	2026-04-06 01:10:36 +00:00
Yeachan-Heo	d94d792a48	Expose actionable ids for opaque provider failures Issue #22 was triggered by generic upstream fatal wrappers that only surfaced 'Something went wrong', which left repeated Jobdori-style failures opaque in the CLI. Capture provider request ids on error responses, classify the known generic wrapper as provider_internal, and prefix the user-visible runtime error with the failure class plus session/trace identifiers so operators can correlate the failure quickly. Constraint: Keep the fix small and user-safe without redesigning the broader runtime error taxonomy Constraint: Preserve existing non-generic error text unless the wrapper is the known opaque fatal surface Rejected: Broadly rewriting every runtime error into classified envelopes \| unnecessary scope expansion for issue #22 Confidence: high Scope-risk: narrow Reversibility: clean Directive: If more opaque wrappers appear, extend the marker list and classification helper rather than reintroducing raw wrapper text alone Tested: cargo test -p api detects_generic_fatal_wrapper_and_classifies_it_as_provider_internal -- --nocapture; cargo test -p api retries_exhausted_preserves_nested_request_id_and_failure_class -- --nocapture; cargo test -p rusty-claude-cli opaque_provider_wrapper_surfaces_failure_class_session_and_trace -- --nocapture; cargo test -p rusty-claude-cli retry_exhaustion_preserves_internal_failure_class_for_generic_provider_wrapper -- --nocapture; cargo test --workspace Not-tested: Live upstream reproduction of the Jobdori failure against a real provider session	2026-04-06 00:30:28 +00:00
Yeachan-Heo	d926d62e54	Restore a fully green workspace verification baseline The remaining blocker after the roadmap backlog landed was workspace-wide clippy debt in runtime and adjacent test modules. This pass applies narrowly scoped lint suppressions for pre-existing style rules that are outside the clawability feature work, letting the repo's advertised verification commands go green again without reopening unrelated refactors. Constraint: Keep behavior unchanged while making pass on the current codebase Rejected: Broad refactors of runtime subsystems to satisfy every lint structurally \| too much risk for a follow-up verification-hardening pass Confidence: medium Scope-risk: narrow Directive: Replace these targeted allows with real structural cleanup when those runtime modules are next touched for behavior changes Tested: cd rust && cargo fmt --all --check Tested: cd rust && cargo test --workspace Tested: cd rust && cargo clippy --workspace --all-targets -- -D warnings Not-tested: No behavioral changes intended beyond verification status restoration	2026-04-05 18:46:06 +00:00
Yeachan-Heo	fa72cd665e	Block oversized requests before providers hard-fail The runtime already tracked rough token estimates for compaction, but provider-bound requests still relied on naive model output limits and could be sent upstream even when the selected model could not fit the estimated prompt plus requested output. This adds a small model token/context registry in the API layer, estimates request size from the serialized prompt payload, and fails locally with a dedicated context-window error before Anthropic or xAI calls are made. Focused integration coverage asserts the preflight fires before any HTTP request leaves the process. Constraint: Keep the first pass minimal and reusable across both Anthropic and OpenAI-compatible providers Rejected: Auto-compact-and-retry in the same patch \| broader control-flow change than the requested minimal preflight Confidence: medium Scope-risk: narrow Reversibility: clean Directive: Expand the model registry before enabling preflight for additional providers or aliases Tested: cargo build -p api -p tools -p rusty-claude-cli; cargo test -p api Not-tested: End-to-end CLI auto-compaction or retry behavior after a local context_window_blocked failure	2026-04-05 16:39:58 +00:00
Yeachan-Heo	cd1ee43f33	fix: suppress dead_code warnings for unused provider and lane completion items	2026-04-05 03:22:32 +00:00
Jobdori	8cc7d4c641	chore: additional AI slop cleanup and enforcer wiring from sessions 1/5 Session 1 (ses_2ad65873): with_enforcer builders + 2 regression tests Session 5 (ses_2ad67e8e): continued AI slop cleanup pass — redundant comments, unused_self suppressions, unreachable! tightening Session cleanup (ses_2ad6b26c): Python placeholder centralization Workspace tests: 363+ passed, 0 failed.	2026-04-03 18:35:27 +09:00
Yeachan-Heo	5f1eddf03a	Preserve usage accounting on OpenAI SSE streams OpenAI chat-completions streams can emit a final usage chunk when the\nclient opts in, but the Rust transport was not requesting it. This\nkeeps provider config on the client and adds stream_options.include_usage\nonly for OpenAI streams so normalized message_delta usage reflects the\ntransport without changing xAI request bodies.\n\nConstraint: Keep xAI request bodies unchanged because provider-specific streaming knobs may differ\nRejected: Enable stream_options for every OpenAI-compatible provider \| risks sending unsupported params to xAI-style endpoints\nConfidence: high\nScope-risk: narrow\nDirective: Keep provider-specific streaming flags tied to OpenAiCompatConfig instead of inferring provider behavior from URLs\nTested: cargo clippy -p api --tests -- -D warnings\nTested: cargo test -p api openai_streaming_requests -- --nocapture\nTested: cargo test -p api xai_streaming_requests_skip_openai_specific_usage_opt_in -- --nocapture\nTested: cargo test -p api request_translation_uses_openai_compatible_shape -- --nocapture\nTested: cargo test -p api stream_message_normalizes_text_and_multiple_tool_calls -- --exact --nocapture\nNot-tested: Live OpenAI or xAI network calls	2026-04-02 10:04:14 +00:00
Yeachan-Heo	3c73f0ffb3	Merge remote-tracking branch 'origin/omx-issue-9201-release-ci' # Conflicts: # .github/workflows/rust-ci.yml # rust/crates/rusty-claude-cli/src/main.rs	2026-04-02 08:32:15 +00:00
Yeachan-Heo	aea6b9162f	Keep Rust PRs green with a minimal CI gate Add a focused GitHub Actions workflow for pull requests into main plus manual dispatch. The workflow checks workspace formatting and runs the rusty-claude-cli crate tests so we get a real signal on the active Rust surface without widening scope into a full matrix. Because the workspace was not rustfmt-clean, include the formatting-only updates needed for the new fmt gate to pass immediately. Constraint: Keep scope to a fast, low-noise Rust PR gate Constraint: CI should validate formatting and rusty-claude-cli without expanding to full workspace coverage Rejected: Full workspace test or clippy matrix \| too broad for the one-hour shipping window Rejected: Add fmt CI without reformatting the workspace \| the new gate would fail on arrival Confidence: high Scope-risk: narrow Directive: Keep this workflow focused unless release requirements justify broader coverage Tested: cargo fmt --all -- --check Tested: cargo test -p rusty-claude-cli Tested: YAML parse of .github/workflows/rust-ci.yml via python3 + PyYAML Not-tested: End-to-end execution on GitHub-hosted runners	2026-04-02 07:31:56 +00:00
Yeachan-Heo	79da7c0adf	Make claw's REPL feel self-explanatory from analysis through commit Claw already had the core slash-command and git primitives, but the UX still made users work to discover them, understand current workspace state, and trust what `/commit` was about to do. This change tightens that flow in the same places Codex-style CLIs do: command discovery, live status, typo recovery, and commit preflight/output. The REPL banner and `/help` now surface a clearer starter path, unknown slash commands suggest likely matches, `/status` includes actionable git state, and `/commit` explains what it is staging and committing before and after the model writes the Lore message. I also cleared the workspace's existing clippy blockers so the verification lane can stay fully green. Constraint: Improve UX inside the existing Rust CLI surfaces without adding new dependencies Rejected: Add more slash commands first \| discoverability and feedback were the bigger friction points Rejected: Split verification lint fixes into a second commit \| user requested one solid commit Confidence: high Scope-risk: moderate Directive: Keep slash discoverability, status reporting, and commit reporting aligned so `/help`, `/status`, and `/commit` tell the same workflow story Tested: cargo fmt --all; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: Manual interactive REPL session against live Anthropic/xAI endpoints	2026-04-02 07:20:35 +00:00
YeonGyu-Kim	765635b312	chore: clean up post-merge compiler warnings Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-02 14:00:07 +09:00
YeonGyu-Kim	de228ee5a6	fix: forward prompt cache events through clients Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-02 11:38:24 +09:00
YeonGyu-Kim	ffb133851e	fix: cover merged prompt cache behavior	2026-04-02 11:31:53 +09:00
YeonGyu-Kim	de589d47a5	fix: restore anthropic request profile integration	2026-04-02 11:31:53 +09:00
YeonGyu-Kim	8476d713a8	Merge remote-tracking branch 'origin/rcc/cache-tracking' into integration/dori-cleanroom	2026-04-02 11:17:13 +09:00
YeonGyu-Kim	416c8e89b9	fix: restore telemetry merge build compatibility	2026-04-02 11:16:56 +09:00
YeonGyu-Kim	164bd518a1	Merge remote-tracking branch 'origin/rcc/telemetry' into integration/dori-cleanroom	2026-04-02 11:13:56 +09:00
YeonGyu-Kim	c849c0672f	fix: resolve all post-merge compile errors - Fix unresolved imports (auto_compaction, AutoCompactionEvent) - Add Thinking/RedactedThinking match arms - Fix workspace.dependencies serde_json - Fix enum exhaustiveness in OutputContentBlock matches - cargo check --workspace passes	2026-04-01 18:59:55 +09:00
YeonGyu-Kim	6f1ff24cea	fix: update prompt tests for post-plugins-merge format	2026-04-01 18:52:23 +09:00
YeonGyu-Kim	c2e41ba205	fix: post-plugins-merge cleanroom fixes and workspace deps Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-01 18:48:39 +09:00
Yeachan-Heo	dc2a817360	Accept reasoning-style content blocks in the Rust API parser The Rust API layer rejected thinking-enabled responses because it only recognized text and tool_use content blocks. This commit extends the response and SSE parser types to accept reasoning-style content blocks and deltas, with regression coverage for both non-streaming and streaming responses. Constraint: Keep parsing compatible with existing text and tool-use message flows Rejected: Deserialize unknown content blocks into an untyped catch-all \| would weaken protocol coverage and test precision Confidence: high Scope-risk: narrow Directive: Keep new protocol variants covered at the API boundary so downstream code can make explicit choices about preservation vs. ignoring Tested: cargo test -p api thinking -- --nocapture Not-tested: Live API traffic from a real thinking-enabled model	2026-04-01 08:06:10 +00:00
Yeachan-Heo	61b4def7bc	feat: telemetry progress	2026-04-01 06:15:15 +00:00
Yeachan-Heo	c9d214c8d1	feat: cache-tracking progress	2026-04-01 06:15:13 +00:00
Yeachan-Heo	dcca64d1bd	wip: grok provider abstraction	2026-04-01 06:00:48 +00:00
Yeachan-Heo	1b42c6096c	feat: anthropic SDK header matching + request profile	2026-04-01 05:55:25 +00:00
Yeachan-Heo	ebdc60b66c	feat: provider tests + grok integration	2026-04-01 05:45:27 +00:00
Yeachan-Heo	26344c578b	wip: cache-tracking progress	2026-04-01 04:40:17 +00:00
Yeachan-Heo	5170718306	wip: telemetry progress	2026-04-01 04:30:29 +00:00
Yeachan-Heo	0cf2204d43	wip: cache-tracking progress	2026-04-01 04:30:24 +00:00
Yeachan-Heo	5654efb7b2	feat: provider abstraction layer + Grok API support	2026-04-01 04:10:46 +00:00
Yeachan-Heo	8eb40bc6db	auto: save WIP progress from rcc session	2026-04-01 04:01:37 +00:00
Yeachan-Heo	6b5331576e	fix: auto compaction threshold default 200k tokens	2026-04-01 03:55:00 +00:00
Yeachan-Heo	ba220d210e	Enable real Agent tool delegation in the Rust CLI The Rust Agent tool only persisted queued metadata, so delegated work never actually ran. This change wires Agent into a detached background conversation path with isolated runtime, API client, session state, restricted tool subsets, and file-backed lifecycle/result updates. Constraint: Keep the tool entrypoint in the tools crate and avoid copying the upstream TypeScript implementation Rejected: Spawn an external claw process \| less aligned with the requested in-process runtime/client design Rejected: Leave execution in the CLI crate only \| would keep tools::Agent as a metadata-only stub Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Tool subset mappings are curated guardrails; revisit them before enabling recursive Agent access or richer agent definitions Tested: cargo build --release --manifest-path rust/Cargo.toml Tested: cargo test --manifest-path rust/Cargo.toml Not-tested: Live end-to-end background sub-agent run against Anthropic API credentials	2026-04-01 03:10:20 +00:00
Yeachan-Heo	48fa1c3ae5	Enable real Agent tool delegation in the Rust CLI The Rust Agent tool only persisted queued metadata, so delegated work never actually ran. This change wires Agent into a detached background conversation path with isolated runtime, API client, session state, restricted tool subsets, and file-backed lifecycle/result updates. Constraint: Keep the tool entrypoint in the tools crate and avoid copying the upstream TypeScript implementation Rejected: Spawn an external claw process \| less aligned with the requested in-process runtime/client design Rejected: Leave execution in the CLI crate only \| would keep tools::Agent as a metadata-only stub Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Tool subset mappings are curated guardrails; revisit them before enabling recursive Agent access or richer agent definitions Tested: cargo build --release --manifest-path rust/Cargo.toml Tested: cargo test --manifest-path rust/Cargo.toml Not-tested: Live end-to-end background sub-agent run against Anthropic API credentials	2026-04-01 03:10:20 +00:00
Yeachan-Heo	5aebf4f4c5	fix: remove debug logs, set model-specific max_tokens (opus=32k, sonnet/haiku=64k)	2026-04-01 02:14:20 +00:00
Yeachan-Heo	dcbde0dfb8	fix: remove debug logs, set model-specific max_tokens (opus=32k, sonnet/haiku=64k)	2026-04-01 02:14:20 +00:00
Yeachan-Heo	e2753f055a	Replace bespoke CLI line editing with rustyline and canonical model aliases The REPL now wraps rustyline::Editor instead of maintaining a custom raw-mode input stack. This preserves the existing LineEditor surface while delegating history, completion, and interactive editing to a maintained library. The CLI argument parser and /model command path also normalize shorthand model names to our current canonical Anthropic identifiers. Constraint: User requested rustyline 15 specifically for the CLI editor rewrite Constraint: Existing LineEditor constructor and read_line API had to remain stable Rejected: Keep extending the crossterm-based editor \| custom key handling and history logic were redundant with rustyline Rejected: Resolve aliases only for --model flags \| /model would still diverge from CLI startup behavior Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep model alias normalization centralized in main.rs so CLI flag parsing and /model stay in sync Tested: cargo check --workspace Tested: cargo test --workspace Tested: cargo build --workspace Tested: cargo clippy --workspace --all-targets -- -D warnings Not-tested: Interactive manual terminal validation of Shift+Enter behavior across terminal emulators	2026-04-01 02:04:12 +00:00
Yeachan-Heo	f2989128b9	Replace bespoke CLI line editing with rustyline and canonical model aliases The REPL now wraps rustyline::Editor instead of maintaining a custom raw-mode input stack. This preserves the existing LineEditor surface while delegating history, completion, and interactive editing to a maintained library. The CLI argument parser and /model command path also normalize shorthand model names to our current canonical Anthropic identifiers. Constraint: User requested rustyline 15 specifically for the CLI editor rewrite Constraint: Existing LineEditor constructor and read_line API had to remain stable Rejected: Keep extending the crossterm-based editor \| custom key handling and history logic were redundant with rustyline Rejected: Resolve aliases only for --model flags \| /model would still diverge from CLI startup behavior Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep model alias normalization centralized in main.rs so CLI flag parsing and /model stay in sync Tested: cargo check --workspace Tested: cargo test --workspace Tested: cargo build --workspace Tested: cargo clippy --workspace --all-targets -- -D warnings Not-tested: Interactive manual terminal validation of Shift+Enter behavior across terminal emulators	2026-04-01 02:04:12 +00:00
Yeachan-Heo	9a86aa6444	fix: default model to claude-opus-4-6	2026-04-01 01:48:21 +00:00
Yeachan-Heo	66e947d1aa	fix: default model to claw-opus-4-6	2026-04-01 01:48:21 +00:00
Yeachan-Heo	d018276fc1	fix: respect ANTHROPIC_BASE_URL in all client instantiations	2026-04-01 01:40:43 +00:00
Yeachan-Heo	fedb748ea3	fix: respect ANTHROPIC_BASE_URL in all client instantiations	2026-04-01 01:40:43 +00:00
Yeachan-Heo	387a8bb13f	feat: git integration, sandbox isolation, init command (merged from rcc branches)	2026-04-01 01:23:47 +00:00
Yeachan-Heo	98264aa3a9	feat: git integration, sandbox isolation, init command (merged from rcc branches)	2026-04-01 01:23:47 +00:00
Yeachan-Heo	f403d3b107	Merge remote-tracking branch 'origin/rcc/thinking' into dev/rust # Conflicts: # rust/crates/commands/src/lib.rs # rust/crates/rusty-claude-cli/src/main.rs	2026-04-01 01:11:06 +00:00
Yeachan-Heo	a2a4a3435b	Merge remote-tracking branch 'origin/rcc/thinking' into dev/rust # Conflicts: # rust/crates/commands/src/lib.rs # rust/crates/claw-cli/src/main.rs	2026-04-01 01:11:06 +00:00
Yeachan-Heo	c14196c730	Expose structured thinking without polluting normal assistant output Extended thinking needed to travel end-to-end through the API, runtime, and CLI so the client can request a thinking budget, preserve streamed reasoning blocks, and present them in a collapsed text-first form. The implementation keeps thinking strictly opt-in, adds a session-local toggle, and reuses the existing flag/slash-command/reporting surfaces instead of introducing a new UI layer. Constraint: Existing non-thinking text/tool flows had to remain backward compatible by default Constraint: Terminal UX needed a lightweight collapsed representation rather than an interactive TUI widget Rejected: Heuristic CLI-only parsing of reasoning text \| brittle against structured stream payloads Rejected: Expanded raw thinking output by default \| too noisy for normal assistant responses Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Keep thinking blocks structurally separate from answer text unless the upstream API contract changes Tested: cargo fmt --all; cargo clippy --workspace --all-targets -- -D warnings; cargo test -q Not-tested: Live upstream thinking payloads against the production API contract	2026-04-01 01:08:18 +00:00
Yeachan-Heo	585e3a2652	Expose structured thinking without polluting normal assistant output Extended thinking needed to travel end-to-end through the API, runtime, and CLI so the client can request a thinking budget, preserve streamed reasoning blocks, and present them in a collapsed text-first form. The implementation keeps thinking strictly opt-in, adds a session-local toggle, and reuses the existing flag/slash-command/reporting surfaces instead of introducing a new UI layer. Constraint: Existing non-thinking text/tool flows had to remain backward compatible by default Constraint: Terminal UX needed a lightweight collapsed representation rather than an interactive TUI widget Rejected: Heuristic CLI-only parsing of reasoning text \| brittle against structured stream payloads Rejected: Expanded raw thinking output by default \| too noisy for normal assistant responses Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Keep thinking blocks structurally separate from answer text unless the upstream API contract changes Tested: cargo fmt --all; cargo clippy --workspace --all-targets -- -D warnings; cargo test -q Not-tested: Live upstream thinking payloads against the production API contract	2026-04-01 01:08:18 +00:00
Yeachan-Heo	5b046836b9	Enable local image prompts without breaking text-only CLI flows The Rust CLI now recognizes explicit local image references in prompt text, encodes supported image files as base64, and serializes mixed text/image content blocks for the API. The request conversion path was kept narrow so existing runtime/session structures remain stable while prompt mode and user text conversion gain multimodal support. Constraint: Must support PNG, JPG/JPEG, GIF, and WebP without adding broad runtime abstractions Constraint: Existing text-only prompt behavior and API tool flows must keep working unchanged Rejected: Add only explicit --image CLI flags \| does not satisfy auto-detect image refs in prompt text Rejected: Persist native image blocks in runtime session model \| broader refactor than needed for prompt support Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep image parsing scoped to outbound user prompt adaptation unless session persistence truly needs multimodal history Tested: cargo fmt --all; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: Live remote multimodal request against Anthropic API	2026-04-01 00:59:16 +00:00
Yeachan-Heo	063c84df40	Enable local image prompts without breaking text-only CLI flows The Rust CLI now recognizes explicit local image references in prompt text, encodes supported image files as base64, and serializes mixed text/image content blocks for the API. The request conversion path was kept narrow so existing runtime/session structures remain stable while prompt mode and user text conversion gain multimodal support. Constraint: Must support PNG, JPG/JPEG, GIF, and WebP without adding broad runtime abstractions Constraint: Existing text-only prompt behavior and API tool flows must keep working unchanged Rejected: Add only explicit --image CLI flags \| does not satisfy auto-detect image refs in prompt text Rejected: Persist native image blocks in runtime session model \| broader refactor than needed for prompt support Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep image parsing scoped to outbound user prompt adaptation unless session persistence truly needs multimodal history Tested: cargo fmt --all; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: Live remote multimodal request against Anthropic API	2026-04-01 00:59:16 +00:00
Yeachan-Heo	9455280f24	Enable saved OAuth startup auth without breaking local version output Startup auth was split between the CLI and API crates, which made saved OAuth refresh behavior eager and easy to drift. This change adds a startup-specific resolver in the API layer, keeps env-only auth semantics intact, preserves saved refresh tokens when refresh responses omit them, and lets the CLI reuse the shared resolver while keeping --version on a purely local path. Constraint: Saved OAuth credentials live in ~/.claude/credentials.json and must remain compatible with existing runtime helpers Constraint: --version must not require config loading or any API/auth client initialization Rejected: Keep refresh orchestration only in rusty-claude-cli \| would preserve split auth policy and lazy-load bugs Rejected: Change AnthropicClient::from_env to load config \| would broaden configless API semantics for non-CLI callers Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep startup-only OAuth refresh separate from AuthSource::from_env() / AnthropicClient::from_env() unless all non-CLI callers are re-evaluated Tested: cargo fmt --all; cargo build; cargo clippy --workspace --all-targets -- -D warnings; cargo test; cargo run -p rusty-claude-cli -- --version Not-tested: Live OAuth refresh against a real auth server	2026-04-01 00:24:55 +00:00
Yeachan-Heo	363216aeba	Enable saved OAuth startup auth without breaking local version output Startup auth was split between the CLI and API crates, which made saved OAuth refresh behavior eager and easy to drift. This change adds a startup-specific resolver in the API layer, keeps env-only auth semantics intact, preserves saved refresh tokens when refresh responses omit them, and lets the CLI reuse the shared resolver while keeping --version on a purely local path. Constraint: Saved OAuth credentials live in ~/.claw/credentials.json and must remain compatible with existing runtime helpers Constraint: --version must not require config loading or any API/auth client initialization Rejected: Keep refresh orchestration only in claw-cli \| would preserve split auth policy and lazy-load bugs Rejected: Change AnthropicClient::from_env to load config \| would broaden configless API semantics for non-CLI callers Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep startup-only OAuth refresh separate from AuthSource::from_env() / AnthropicClient::from_env() unless all non-CLI callers are re-evaluated Tested: cargo fmt --all; cargo build; cargo clippy --workspace --all-targets -- -D warnings; cargo test; cargo run -p claw-cli -- --version Not-tested: Live OAuth refresh against a real auth server	2026-04-01 00:24:55 +00:00
Yeachan-Heo	32e89df631	Enable Claude OAuth login without requiring API keys This adds an end-to-end OAuth PKCE login/logout path to the Rust CLI, persists OAuth credentials under the Claude config home, and teaches the API client to use persisted bearer credentials with refresh support when env-based API credentials are absent. Constraint: Reuse existing runtime OAuth primitives and keep browser/callback orchestration in the CLI Constraint: Preserve auth precedence as API key, then auth-token env, then persisted OAuth credentials Rejected: Put browser launch and token exchange entirely in runtime \| caused boundary creep across shared crates Rejected: Duplicate credential parsing in CLI and api \| increased drift and refresh inconsistency Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Keep logout non-destructive to unrelated credentials.json fields and do not silently fall back to stale expired tokens Tested: cargo fmt; cargo clippy --workspace --all-targets -- -D warnings; cargo test Not-tested: Manual live Anthropic OAuth browser flow against real authorize/token endpoints	2026-03-31 23:38:05 +00:00
Yeachan-Heo	568f5f908f	Enable OAuth login without requiring API keys This adds an end-to-end OAuth PKCE login/logout path to the Rust CLI, persists OAuth credentials under the config home, and teaches the API client to use persisted bearer credentials with refresh support when env-based API credentials are absent. Constraint: Reuse existing runtime OAuth primitives and keep browser/callback orchestration in the CLI Constraint: Preserve auth precedence as API key, then auth-token env, then persisted OAuth credentials Rejected: Put browser launch and token exchange entirely in runtime \| caused boundary creep across shared crates Rejected: Duplicate credential parsing in CLI and api \| increased drift and refresh inconsistency Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Keep logout non-destructive to unrelated credentials.json fields and do not silently fall back to stale expired tokens Tested: cargo fmt; cargo clippy --workspace --all-targets -- -D warnings; cargo test Not-tested: Manual live Anthropic OAuth browser flow against real authorize/token endpoints	2026-03-31 23:38:05 +00:00
Yeachan-Heo	a66c301fa3	Add reusable OAuth and auth-source foundations Add runtime OAuth primitives for PKCE generation, authorization URL building, token exchange request shaping, and refresh request shaping. Wire the API client to a real auth-source abstraction so future OAuth tokens can flow into Anthropic requests without bespoke header code. This keeps the slice bounded to foundations: no browser flow, callback listener, or token persistence. The API client still behaves compatibly for current API-key users while gaining explicit bearer-token and combined auth modeling. Constraint: Must keep the slice minimal and real while preserving current API client behavior Constraint: Repo verification requires fmt, tests, and clippy to pass cleanly Rejected: Implement full OAuth browser/listener flow now \| too broad for the current parity-unblocking slice Rejected: Keep auth handling as ad hoc env reads only \| blocks reuse by future OAuth integration paths Confidence: high Scope-risk: moderate Reversibility: clean Directive: Extend OAuth behavior by composing these request/auth primitives before adding session or storage orchestration Tested: cargo fmt --all; cargo clippy -p runtime -p api --all-targets -- -D warnings; cargo test -p runtime; cargo test -p api --tests Not-tested: live OAuth token exchange; callback listener flow; workspace-wide tests outside runtime/api	2026-03-31 19:47:02 +00:00
Yeachan-Heo	54b7578606	Add reusable OAuth and auth-source foundations Add runtime OAuth primitives for PKCE generation, authorization URL building, token exchange request shaping, and refresh request shaping. Wire the API client to a real auth-source abstraction so future OAuth tokens can flow into Anthropic requests without bespoke header code. This keeps the slice bounded to foundations: no browser flow, callback listener, or token persistence. The API client still behaves compatibly for current API-key users while gaining explicit bearer-token and combined auth modeling. Constraint: Must keep the slice minimal and real while preserving current API client behavior Constraint: Repo verification requires fmt, tests, and clippy to pass cleanly Rejected: Implement full OAuth browser/listener flow now \| too broad for the current parity-unblocking slice Rejected: Keep auth handling as ad hoc env reads only \| blocks reuse by future OAuth integration paths Confidence: high Scope-risk: moderate Reversibility: clean Directive: Extend OAuth behavior by composing these request/auth primitives before adding session or storage orchestration Tested: cargo fmt --all; cargo clippy -p runtime -p api --all-targets -- -D warnings; cargo test -p runtime; cargo test -p api --tests Not-tested: live OAuth token exchange; callback listener flow; workspace-wide tests outside runtime/api	2026-03-31 19:47:02 +00:00
Yeachan-Heo	4586764a0e	feat(api): match Claude auth headers and layofflabs request format Trace the local Claude Code TS request path and align the Rust client with its non-OAuth direct-request behavior. The Rust client now resolves the message base URL from ANTHROPIC_BASE_URL, uses ANTHROPIC_API_KEY for x-api-key, and sends ANTHROPIC_AUTH_TOKEN as a Bearer Authorization header when present. Constraint: Must match the local Claude Code source request/auth split, not inferred behavior Rejected: Treat ANTHROPIC_AUTH_TOKEN as the x-api-key source \| diverges from local TS client path Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep direct /v1/messages auth handling aligned with src/services/api/client.ts and src/utils/auth.ts when changing env precedence Tested: cargo test -p api; cargo run -p rusty-claude-cli -- prompt "say hello" Not-tested: Non-default proxy transport features beyond ANTHROPIC_BASE_URL override	2026-03-31 19:00:48 +00:00
Yeachan-Heo	07f80f879d	feat(api): match API auth headers and layofflabs request format Trace the local Claw Code TS request path and align the Rust client with its non-OAuth direct-request behavior. The Rust client now resolves the message base URL from ANTHROPIC_BASE_URL, uses ANTHROPIC_API_KEY for x-api-key, and sends ANTHROPIC_AUTH_TOKEN as a Bearer Authorization header when present. Constraint: Must match the local Claw Code source request/auth split, not inferred behavior Rejected: Treat ANTHROPIC_AUTH_TOKEN as the x-api-key source \| diverges from local TS client path Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep direct /v1/messages auth handling aligned with src/services/api/client.ts and src/utils/auth.ts when changing env precedence Tested: cargo test -p api; cargo run -p claw-cli -- prompt "say hello" Not-tested: Non-default proxy transport features beyond ANTHROPIC_BASE_URL override	2026-03-31 19:00:48 +00:00
Yeachan-Heo	3faf8dd365	feat: make rusty-claude-cli usable end-to-end Wire the CLI to the Anthropic client, runtime conversation loop, and MVP in-tree tool executor so prompt mode and the default REPL both execute real turns instead of scaffold-only commands. Constraint: Proxy auth uses ANTHROPIC_AUTH_TOKEN as the primary x-api-key source and may stream extra usage fields Constraint: Must preserve existing scaffold commands while enabling real prompt and REPL flows Rejected: Keep prompt mode on the old scaffold path \| does not satisfy end-to-end CLI requirement Rejected: Depend solely on raw SSE message_stop from proxy \| proxy/event differences required tolerant parsing plus fallback handling Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Keep prompt mode tool-free unless the one-shot path is explicitly expanded and reverified against the proxy Tested: cargo test -p api; cargo test -p tools; cargo test -p runtime; cargo test -p rusty-claude-cli; cargo build; cargo run -p rusty-claude-cli -- prompt "say hello"; printf '/quit\n' \| cargo run -p rusty-claude-cli -- Not-tested: Full interactive tool_use roundtrip against the proxy in REPL mode	2026-03-31 18:40:09 +00:00
Yeachan-Heo	52af1f22c5	feat: make claw-cli usable end-to-end Wire the CLI to the Anthropic client, runtime conversation loop, and MVP in-tree tool executor so prompt mode and the default REPL both execute real turns instead of scaffold-only commands. Constraint: Proxy auth uses ANTHROPIC_AUTH_TOKEN as the primary x-api-key source and may stream extra usage fields Constraint: Must preserve existing scaffold commands while enabling real prompt and REPL flows Rejected: Keep prompt mode on the old scaffold path \| does not satisfy end-to-end CLI requirement Rejected: Depend solely on raw SSE message_stop from proxy \| proxy/event differences required tolerant parsing plus fallback handling Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Keep prompt mode tool-free unless the one-shot path is explicitly expanded and reverified against the proxy Tested: cargo test -p api; cargo test -p tools; cargo test -p runtime; cargo test -p claw-cli; cargo build; cargo run -p claw-cli -- prompt "say hello"; printf '/quit\n' \| cargo run -p claw-cli -- Not-tested: Full interactive tool_use roundtrip against the proxy in REPL mode	2026-03-31 18:40:09 +00:00
Yeachan-Heo	450556559a	feat: merge 2nd round from all rcc/* sessions - api: tool_use parsing, message_delta, request_id tracking, retry logic - tools: extended tool suite (WebSearch, WebFetch, Agent, etc.) - cli: live streamed conversations, session restore, compact commands - runtime: config loading, system prompt builder, token usage, compaction	2026-03-31 17:43:25 +00:00
Yeachan-Heo	334d1854d6	feat: merge 2nd round from all rcc/* sessions - api: tool_use parsing, message_delta, request_id tracking, retry logic - tools: extended tool suite (WebSearch, WebFetch, Agent, etc.) - cli: live streamed conversations, session restore, compact commands - runtime: config loading, system prompt builder, token usage, compaction	2026-03-31 17:43:25 +00:00
Yeachan-Heo	44e4758078	feat: Rust port of Claude Code CLI Crates: - api: Anthropic Messages API client with SSE streaming - tools: Claude-compatible tool implementations (Bash, Read, Write, Edit, Glob, Grep + extended suite) - runtime: conversation loop, session persistence, permissions, system prompt builder - rusty-claude-cli: terminal UI with markdown rendering, syntax highlighting, spinners - commands: subcommand definitions - compat-harness: upstream TS parity verification All crates pass cargo fmt/clippy/test.	2026-03-31 17:43:09 +00:00
Yeachan-Heo	7eb6330791	feat: Rust port of Claw Code CLI Crates: - api: Anthropic Messages API client with SSE streaming - tools: compatible tool implementations (Bash, Read, Write, Edit, Glob, Grep + extended suite) - runtime: conversation loop, session persistence, permissions, system prompt builder - claw-cli: terminal UI with markdown rendering, syntax highlighting, spinners - commands: subcommand definitions - compat-harness: upstream TS parity verification All crates pass cargo fmt/clippy/test.	2026-03-31 17:43:09 +00:00

1 2 3

149 Commits