MicroFish

Commit Graph

Author	SHA1	Message	Date
Dominik Seemann	74997fd088	feat(i18n): externalize chinese log and api response strings Extract every Chinese string inside backend logger.{info,warning,error, debug,exception} calls and inside user-facing jsonify({"error\|message": ...}) responses across the listed in-scope modules into locales/{en,zh}.json under nested namespaces (log.<module>., api.{error,message}.<scope>.). Locale dictionaries stay structurally identical; the existing flat frontend-facing keys at log.* / api.* are left untouched. The locale helper (backend/app/utils/locale.py) now emits a single deduplicated mirofish.locale warning per (locale, key) pair when a translation is missing instead of silently returning the raw key, so unknown keys are visible without crashing requests or background tasks. A repo-root scripts/check_i18n_logs.py verifier performs an AST-aware source scan for residual Chinese inside the in-scope logger/jsonify calls and a recursive parity diff between en.json and zh.json — both modes pass. Why: backend logs and API errors previously emitted Chinese-only strings, leaving English-speaking operators with unreadable log aggregator output and API consumers with locale-mismatched error messages. The t() helper and per-thread set_locale propagation already existed; this change makes every backend caller route through them. Closes #6	2026-05-07 13:52:22 +00:00
Abhishek Yadav	28827067c0	feat: migrate knowledge graph from Zep Cloud to Graphiti + local Neo4j Replaces the paid, rate-limited Zep Cloud service with Graphiti (graphiti-core 0.11.6) backed by a local Neo4j instance — free, unlimited, and self-hosted. Key changes: - Add GraphitiAdapter: drop-in Zep-compatible wrapper around Graphiti with a persistent event-loop thread to avoid asyncio/Neo4j driver conflicts - Switch LLM client to native GeminiClient + GeminiEmbedder (text-embedding-004 fails on Gemini compat endpoint; use google-genai SDK directly) - Add _GeminiReranker passthrough replacing OpenAIRerankerClient (which hardcodes gpt-4.1-nano and uses logprobs unsupported by Gemini) - Fix Cypher queries: use s.uuid/t.uuid for edge source/target instead of r.source_node_uuid (null property in Graphiti's schema) - Add ontology-based entity type classifier (_classify_entity_type) so nodes get colored by type in the D3 graph visualization instead of all being Entity - Apply classifier in ZepEntityReader so filter_defined_entities finds entities (previously 0 personas loaded because all labels were ['Entity']) - Add startup recovery: auto-mark graph_building projects as graph_completed on backend restart if Neo4j already has their data - Add resume capability to graph build: skip already-processed episodes after a restart (断点续传) - Add non-blocking graph data cache with background refresh in graph.py - Add EMBEDDING_MODEL config (default: gemini-embedding-001 for Gemini users) - Add CLAUDE.md with project architecture and dev commands Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-22 01:30:28 +05:30
666ghj	5ece3f670b	Implement Report Agent for automated report generation and interaction - Introduced the Report Agent module to facilitate the automatic generation of simulation analysis reports using LangChain and Zep, following the ReACT model. - Added functionality for report outline planning, segmented content generation, and user interaction through a dialogue interface. - Implemented new API endpoints for report generation, status checking, and retrieval, enhancing the overall reporting capabilities. - Updated README.md to include detailed instructions on the new report generation features and API usage. - Enhanced the project structure to accommodate the new report management functionalities, including report storage and retrieval mechanisms.	2025-12-09 15:10:55 +08:00
666ghj	91eb73ae44	Enhance signal handling and suppress warnings in simulation scripts - Added signal handling to gracefully manage shutdown events across simulation scripts, ensuring proper cleanup of resources. - Introduced a global shutdown event to allow simulations to respond to termination signals, improving robustness. - Suppressed warnings related to multiprocessing resource tracking to avoid unnecessary log clutter during execution. - Updated cleanup logic in `SimulationRunner` and `ZepGraphMemoryManager` to prevent redundant calls and ensure efficient resource management. - Enhanced logging to provide clearer feedback during shutdown processes, improving traceability.	2025-12-09 00:37:12 +08:00
666ghj	5b4f02f421	Enhance simulation configuration and management features - Added support for a `max_rounds` parameter in simulation API, allowing users to limit the number of simulation rounds, improving control over simulation duration. - Updated README.md to reflect the new `max_rounds` parameter and its usage in simulation requests. - Enhanced error handling for `max_rounds` input validation to ensure it is a positive integer. - Modified simulation runner and related scripts to incorporate `max_rounds` functionality, ensuring consistent application across Twitter and Reddit simulations. - Improved logging to indicate when the number of rounds is truncated due to the `max_rounds` setting, enhancing traceability during simulation execution.	2025-12-05 15:50:54 +08:00
666ghj	d4fac63eb4	Enhance simulation management and logging features - Registered a cleanup function for simulation processes to ensure proper termination on server shutdown. - Improved logging during application startup to confirm the registration of the cleanup function. - Updated simulation preparation checks to clarify the conditions for considering a simulation ready, enhancing error handling and user feedback. - Added detailed logging for simulation status changes, improving traceability during the simulation lifecycle. - Introduced new files for simulation configuration and profile data, supporting enhanced testing and visualization capabilities.	2025-12-02 17:11:47 +08:00
666ghj	5f159f6d88	Enhance backend functionality with OASIS simulation features - Updated README.md to include new simulation scripts and configuration details for OASIS, including API retry mechanisms and environment variable settings. - Added simulation management and configuration generation services to streamline the simulation process across Twitter and Reddit platforms. - Introduced new API routes for simulation-related operations, including entity retrieval and simulation status management. - Implemented a robust retry mechanism for external API calls to improve system stability. - Enhanced task management model to include detailed progress tracking. - Added logging capabilities for action tracking during simulations. - Included new scripts for running parallel simulations and testing profile formats.	2025-12-01 15:03:44 +08:00
666ghj	e98da6b53e	Enhance backend startup logging and API endpoint display - Updated `run.py` to conditionally print startup information only in the reloader process to avoid duplicate logs in debug mode. - Modified `__init__.py` to log startup and completion messages based on the reloader process condition. - Added warnings suppression in `graph_builder.py` for Pydantic v2 regarding Field usage. - Revised `ontology_generator.py` to enforce strict design guidelines for entity types and relationships, ensuring compliance with new requirements. - Improved logging behavior in `logger.py` to prevent log propagation to the root logger, avoiding duplicate outputs.	2025-11-28 18:59:36 +08:00
666ghj	08f417f3b7	Introduce Project ID for context management, finalizing the stateful API pipeline from file submission to graph construction.	2025-11-28 17:21:08 +08:00

9 Commits