MicroFish

Commit Graph

Author	SHA1	Message	Date
Dominik Seemann	080683295d	feat(i18n): translate ontology_generator prompts to english translate the system prompt constant and the user-message template in backend/app/services/ontology_generator.py from chinese to english. the chinese base prompt was biasing the model toward chinese structure and word choice even when accept-language was en, leaving ontology descriptions and analysis_summary fields chinese-flavoured. translation is in-place and preserves every functional contract: the json output schema, the entity-type and relationship-type taxonomies verbatim, the reserved-attribute-name list, the count and length constraints, and all f-string interpolations. the get_language_instruction() postfix call site and the trailing english identifier-format directive are unchanged, so zh and other locales continue to receive locale-appropriate descriptions. logger calls, docstrings, and inline comments are intentionally left in chinese — they are owned by issues #6 and #7. a small static guard script (backend/scripts/test_ontology_prompts_no_cjk.py) ast-parses the module and asserts zero cjk in the system prompt and in every string literal of _build_user_message except the docstring, so the regression cannot reappear silently. Closes #2	2026-05-07 09:40:27 +00:00
BaiFu	af71244974	Merge pull request #428 from Ghostubborn/feat/i18n feat(i18n): add multi-language switching with Chinese and English support	2026-04-02 14:27:04 +08:00
666ghj	e3350a919d	fix(graph): enforce PascalCase for entity names and SCREAMING_SNAKE_CASE for edge names in ontology validation	2026-04-01 17:42:27 +08:00
ghostubborn	97aa58384e	fix(i18n): ensure ontology names stay PascalCase regardless of language setting The language instruction was causing LLM to change entity/relation naming conventions. Now explicitly enforce PascalCase/UPPER_SNAKE_CASE for technical identifiers while only applying language preference to description fields.	2026-04-01 16:40:17 +08:00
ghostubborn	f75c6487b3	fix(i18n): replace remaining hardcoded language directives in LLM prompts - oasis_profile_generator: replace hardcoded "使用中文" with dynamic get_language_instruction() - ontology_generator: remove hardcoded "（中文）" from schema annotation - report_agent: replace Chinese-specific language consistency rules with language-neutral ones - zep_tools: dynamically select quote style based on locale	2026-04-01 15:55:04 +08:00
ghostubborn	8f6110df0f	feat(i18n): inject language instruction into LLM system prompts	2026-04-01 15:24:12 +08:00
666ghj	e98da6b53e	Enhance backend startup logging and API endpoint display - Updated `run.py` to conditionally print startup information only in the reloader process to avoid duplicate logs in debug mode. - Modified `__init__.py` to log startup and completion messages based on the reloader process condition. - Added warnings suppression in `graph_builder.py` for Pydantic v2 regarding Field usage. - Revised `ontology_generator.py` to enforce strict design guidelines for entity types and relationships, ensuring compliance with new requirements. - Improved logging behavior in `logger.py` to prevent log propagation to the root logger, avoiding duplicate outputs.	2025-11-28 18:59:36 +08:00
666ghj	08f417f3b7	Introduce Project ID for context management, finalizing the stateful API pipeline from file submission to graph construction.	2025-11-28 17:21:08 +08:00

8 Commits
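
Note on commit 080683295d: the message describes a static guard that AST-parses the module and asserts there is no CJK text in the string literals of _build_user_message, docstring excepted. The following is only a minimal sketch of that idea, not the contents of backend/scripts/test_ontology_prompts_no_cjk.py. The helper name find_cjk_literals and the character ranges in CJK_RE are assumptions made for illustration.

```python
import ast
import re

# Assumption: "CJK" is approximated by CJK punctuation, the CJK Unified
# Ideographs block, and fullwidth forms. The real script may use other ranges.
CJK_RE = re.compile(r"[\u3000-\u303f\u4e00-\u9fff\uff00-\uffef]")


def _docstring_node(func: ast.FunctionDef):
    """Return the Constant node holding the function's docstring, if any."""
    first = func.body[0] if func.body else None
    if (
        isinstance(first, ast.Expr)
        and isinstance(first.value, ast.Constant)
        and isinstance(first.value.value, str)
    ):
        return first.value
    return None


def find_cjk_literals(source: str, func_name: str) -> list[str]:
    """List string literals inside `func_name` that contain CJK characters.

    The docstring is skipped. Literal segments of f-strings are covered
    because they appear as Constant nodes inside JoinedStr.
    """
    offenders = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.FunctionDef) and node.name == func_name:
            doc = _docstring_node(node)
            for sub in ast.walk(node):
                if sub is doc:
                    continue
                if isinstance(sub, ast.Constant) and isinstance(sub.value, str):
                    if CJK_RE.search(sub.value):
                        offenders.append(sub.value)
    return offenders
```

A guard script built on this could read the module source from disk, call find_cjk_literals(source, "_build_user_message"), and fail when the returned list is non-empty. Parsing the source rather than importing the module means the check does not need the backend's dependencies installed.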