MicroFish

Commit Graph

Author	SHA1	Message	Date
Dominik Seemann	b15dc2ea2c	Merge pull request #19 from salestech-group/feat/i18n-6-externalize-backend-logs feat(i18n): externalize chinese log and api response strings	2026-05-07 17:09:38 +02:00
Dominik Seemann	4f6d5d19a8	Merge pull request #17 from salestech-group/feat/i18n-5-translate-report-agent-prompts feat(i18n): translate report_agent react prompts to english	2026-05-07 17:08:22 +02:00
Dominik Seemann	0f3ba443dd	Merge pull request #16 from salestech-group/feat/i18n-4-translate-sim-config-prompts feat(i18n): translate simulation_config_generator prompts to english	2026-05-07 17:07:52 +02:00
Dominik Seemann	74997fd088	feat(i18n): externalize chinese log and api response strings Extract every Chinese string inside backend logger.{info,warning,error, debug,exception} calls and inside user-facing jsonify({"error\|message": ...}) responses across the listed in-scope modules into locales/{en,zh}.json under nested namespaces (log.<module>., api.{error,message}.<scope>.). Locale dictionaries stay structurally identical; the existing flat frontend-facing keys at log.* / api.* are left untouched. The locale helper (backend/app/utils/locale.py) now emits a single deduplicated mirofish.locale warning per (locale, key) pair when a translation is missing instead of silently returning the raw key, so unknown keys are visible without crashing requests or background tasks. A repo-root scripts/check_i18n_logs.py verifier performs an AST-aware source scan for residual Chinese inside the in-scope logger/jsonify calls and a recursive parity diff between en.json and zh.json — both modes pass. Why: backend logs and API errors previously emitted Chinese-only strings, leaving English-speaking operators with unreadable log aggregator output and API consumers with locale-mismatched error messages. The t() helper and per-thread set_locale propagation already existed; this change makes every backend caller route through them. Closes #6	2026-05-07 13:52:22 +00:00
Dominik Seemann	22a3ca7af5	feat(i18n): translate report_agent react prompts to english translate every llm-facing string-literal in backend/app/services/report_agent.py — the four tool-description constants, the plan/section/chat system+user prompts, the react loop templates, the inline messages re-injected during the section and chat loops, the _execute_tool error returns, and the plan_outline default and fallback outline content. preserve every {interpolation} token, the literal final answer: trigger and <tool_call> xml tag, the four primary tool names, and all three get_language_instruction() postfix call sites. also switch the unused-tools join separator from "、" to ", " so it renders naturally inside the now-english react templates. removes the chinese language bias that the english postfix alone could not overcome — under accept-language: en the report agent now produces english-flavoured analytical reports and chat replies; under accept-language: zh the postfix continues to steer the model into chinese with no semantic delta. logger calls (#6) and docstrings or comments (#7) are deliberately untouched. Closes #5	2026-05-07 12:49:23 +00:00
Dominik Seemann	6c2a412196	feat(i18n): translate simulation_config_generator prompts to english translate the three llm prompt blocks plus the two prompt-feeding helpers (_build_context, _summarize_entities) in backend/app/services/simulation_config_generator.py from chinese to english. the chinese base prompts were biasing the model toward chinese structure and lexical choice for content, narrative_direction, hot_topics, and reasoning fields even when accept-language was en, because get_language_instruction() only steers the response language as a postfix. translation is in-place and preserves every functional contract: the json output schema for all three prompts, every variable interpolation, the per-entity-type heuristic ranges in the agent-config prompt, the trailing english IMPORTANT directives that lock poster_type to PascalCase and stance to {supportive,opposing,neutral,observer}, and all three get_language_instruction() postfix call sites. the two default-path reasoning literals are translated to locale-agnostic english so generation_reasoning no longer mixes chinese and english on the failure path. logger calls, docstrings, and inline comments are intentionally left chinese (out of scope; covered by issues #6 and #7). public api, dataclasses, class constants, and the SimulationParameters payload shape are unchanged. Closes #4	2026-05-07 11:43:47 +00:00
Dominik Seemann	080683295d	feat(i18n): translate ontology_generator prompts to english translate the system prompt constant and the user-message template in backend/app/services/ontology_generator.py from chinese to english. the chinese base prompt was biasing the model toward chinese structure and word choice even when accept-language was en, leaving ontology descriptions and analysis_summary fields chinese-flavoured. translation is in-place and preserves every functional contract: the json output schema, the entity-type and relationship-type taxonomies verbatim, the reserved-attribute-name list, the count and length constraints, and all f-string interpolations. the get_language_instruction() postfix call site and the trailing english identifier-format directive are unchanged, so zh and other locales continue to receive locale-appropriate descriptions. logger calls, docstrings, and inline comments are intentionally left in chinese — they are owned by issues #6 and #7. a small static guard script (backend/scripts/test_ontology_prompts_no_cjk.py) ast-parses the module and asserts zero cjk in the system prompt and in every string literal of _build_user_message except the docstring, so the regression cannot reappear silently. Closes #2	2026-05-07 09:40:27 +00:00
Dominik Seemann	2badf568e7	feat(graphiti): finalize neo4j migration with provider switch Adds a Neo4j service to docker-compose so `docker compose up -d` works on a clean checkout, and unhardcodes Graphiti's LLM/embedder so the documented default provider (Qwen via Dashscope) actually works. - docker-compose: neo4j:5-community service with cypher-shell healthcheck, named volumes, and `depends_on: service_healthy` on the app container; in-Docker NEO4J_URI override leaves the host-mode default untouched. - Config: new GRAPHITI_LLM_PROVIDER (openai\|gemini, default openai) plus optional EMBEDDING_API_KEY / EMBEDDING_BASE_URL that fall back to the chat LLM credentials. - graphiti_adapter: provider switch inside the singleton factory with lazy per-provider imports; Gemini path is preserved exactly. The no-op `_GeminiReranker` becomes a provider-agnostic `_PassthroughReranker`, still injected explicitly so Graphiti does not fall back to its OpenAI-only default reranker. - Drop the ignored `reranker=` kwarg from `_GraphNamespace.search` and the misleading callers in `zep_tools.py` and `oasis_profile_generator.py`. - Refresh `.env.example` to mirror the README env section. Spec, requirements, and design under `.kiro/specs/graphiti-neo4j-finalize/`. Closes #1	2026-05-07 08:43:36 +00:00

8 Commits