MicroFish

History

rqd6f4g6zn-bit c2d533d933 fix: robust LLM JSON parsing + sanitize report sections + chat history dedup	2026-05-17 08:23:16 +02:00

Closes #624, #622, #601, #599, #577

## LLM JSON parsing (#624 / #622 / #601)

- New `_parse_llm_json()` in llm_client.py with 5-stage fallback:
  1. Strip markdown fences (existing)
  2. Strict json.loads (fast path)
  3. json.JSONDecoder.raw_decode (handles trailing prose after JSON)
  4. Balanced-brace extraction (leading prose + embedded JSON)
  5. Strip control chars + retry
- Replaces strict json.loads in chat_json() that was failing on any LLM appending text after the JSON (common with qwen-plus, ollama, gemma even with response_format=json_object).
- Logs which fallback was used so problematic LLMs are visible.
- 8 unit-test cases covering each strategy.

## Report section tool_call leak (#599)

- New `_sanitize_section_content()` in report_agent.py detects when a section's "final answer" is actually an unexecuted tool_call JSON (e.g. `{"name":"quick_search","parameters":{...}}`) and replaces it with a clear fallback message instead of writing the raw artifact to the report.
- Applied at all 3 places where final_answer is returned in write_section(): the Final Answer path, the no-prefix fallback, and the force-finalize path.

## Chat history duplicate user message (#577)

- In report_agent.py chat(), defensively dedupe chat_history:
  - Only keep {role, content} from history items
  - Skip entries that match the current message exactly
- This prevents LLM from seeing a duplicate trailing user message and echoing back the previous answer.
- Added debug log of constructed messages array for diagnostics.
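
The 5-stage fallback listed under "LLM JSON parsing" can be sketched roughly as follows. This is an illustration only, not the code from llm_client.py: the function name `parse_llm_json_sketch`, the helper `_first_balanced_object`, the regexes, and the returned strategy labels are all assumptions made for this example. It returns the strategy name alongside the parsed value so a caller could log which fallback fired.

```python
import json
import re

# Hypothetical patterns for this sketch; the real implementation may differ.
_FENCE_RE = re.compile(r"^\s*`{3}(?:json)?\s*|\s*`{3}\s*$", re.IGNORECASE)
_CONTROL_RE = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f]")


def _first_balanced_object(text):
    """Return the first balanced {...} span in text, or None.

    Braces inside JSON string literals are ignored.
    """
    start = text.find("{")
    if start == -1:
        return None
    depth, in_string, escaped = 0, False, False
    for i in range(start, len(text)):
        ch = text[i]
        if in_string:
            if escaped:
                escaped = False
            elif ch == "\\":
                escaped = True
            elif ch == '"':
                in_string = False
        elif ch == '"':
            in_string = True
        elif ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                return text[start:i + 1]
    return None


def parse_llm_json_sketch(raw):
    """Return (value, strategy); raise ValueError if every stage fails."""
    # Stage 1: strip a leading/trailing markdown fence.
    text = _FENCE_RE.sub("", raw.strip()).strip()

    # Stage 2: strict parse (fast path).
    try:
        return json.loads(text), "strict"
    except json.JSONDecodeError:
        pass

    # Stage 3: raw_decode parses a JSON prefix and ignores trailing prose.
    try:
        value, _end = json.JSONDecoder().raw_decode(text)
        return value, "raw_decode"
    except json.JSONDecodeError:
        pass

    # Stage 4: extract the first balanced object (handles leading prose).
    candidate = _first_balanced_object(text)
    if candidate is not None:
        try:
            return json.loads(candidate), "balanced_brace"
        except json.JSONDecodeError:
            pass

    # Stage 5: drop control characters (tab, LF, CR are kept) and retry.
    cleaned = _CONTROL_RE.sub("", candidate if candidate is not None else text)
    try:
        return json.loads(cleaned), "control_chars_stripped"
    except json.JSONDecodeError as exc:
        raise ValueError("no parseable JSON found in LLM output") from exc
```

For instance, `parse_llm_json_sketch('{"a": 1} Hope this helps!')` returns `({"a": 1}, "raw_decode")`, while input with prose before the object falls through to the balanced-brace stage. Ordering the stages from cheapest to most permissive keeps well-formed responses on the fast path.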
__init__.py	Implement Interview feature for agent interactions in simulations	2025-12-08 15:55:39 +08:00
graph_builder.py	fix(i18n): pass locale to background threads via thread-local storage	2026-04-01 16:55:51 +08:00
oasis_profile_generator.py	feat(i18n): replace remaining Chinese in config generator and profile generator	2026-04-01 17:19:12 +08:00
ontology_generator.py	Merge pull request #428 from Ghostubborn/feat/i18n	2026-04-02 14:27:04 +08:00
report_agent.py	fix: robust LLM JSON parsing + sanitize report sections + chat history dedup	2026-05-17 08:23:16 +02:00
simulation_config_generator.py	feat(i18n): replace remaining Chinese in config generator and profile generator	2026-04-01 17:19:12 +08:00
simulation_ipc.py	Implement Interview feature for agent interactions in simulations	2025-12-08 15:55:39 +08:00
simulation_manager.py	feat(i18n): replace remaining hardcoded Chinese in progress callbacks	2026-04-01 16:53:29 +08:00
simulation_runner.py	fix(i18n): pass locale to background threads via thread-local storage	2026-04-01 16:55:51 +08:00
text_processor.py	Introduce Project ID for context management, finalizing the stateful API pipeline from file submission to graph construction.	2025-11-28 17:21:08 +08:00
zep_entity_reader.py	feat(graph): implement pagination for fetching nodes and edges; add utility functions for streamlined data retrieval	2026-02-27 15:53:29 +08:00
zep_graph_memory_updater.py	fix(i18n): pass locale to background threads via thread-local storage	2026-04-01 16:55:51 +08:00
zep_tools.py	feat(i18n): replace all user-visible Chinese logger messages in zep_tools.py	2026-04-01 17:46:39 +08:00