MicroFish

Commit Graph

Author	SHA1	Message	Date
Christian Möllmann	609ea6c5bd	Merge `895a5fbaee` into `96096ea0ff`	2026-05-25 00:06:17 +02:00
BaiFu	96096ea0ff	Merge pull request #640 from lllopic/fix/add-type-hints-and-helper-method refactor: add type hints and FileParser.is_supported() helper	2026-05-25 00:48:57 +08:00
666ghj	3f4d56116c	fix(backend): constrain Python version to 3.11-3.12	2026-05-24 22:59:36 +08:00
Christian Moellmann	895a5fbaee	fix(interviews): accept stringified ints in all 4 subagent validators Real LLMs (observed with anthropic/claude-haiku-4-5 on a 23-agent run) sometimes return Likert values as JSON strings ('3' not 3). The 4 subagent validators rejected this with isinstance(v, int), losing ~30% of agents at N=23. Added a shared coerce_int helper in base.py that accepts ints and numeric strings, rejects bools/floats/garbage, and is now used by: - Longitudinal: response values 1-5 - Diversity: Q-sort placements -3..+3 and 6 Likert axes 1-7 - Delphi: R2 and R3 importance/plausibility 1-5 - Scenario: 4 dimensions 1-7 Validators now coerce in place so downstream code sees ints regardless of the wire format. Added 8 tests (4 unit on coerce_int + 4 per-subagent contract tests showing stringified values are accepted). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 14:03:34 +02:00
Christian Moellmann	6a53c110b7	feat(interviews): capture raw LLM output on schema-validation failures Adds SchemaValidationFailure exception carrying both retry attempts' raw output, so audit.jsonl preserves what the model actually said when an agent's response can't be coerced into the instrument schema. Lets us diagnose persona-vs-format failures without re-running. Two new tests. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 13:40:43 +02:00
Christian Moellmann	6e1489fe08	fix(interviews): wire Zep updater/memory/hooks correctly for production runs (C1-C5) Five tightly-coupled fixes that were causing the interview subsystem to silently degrade in production: - C1+C2: `_build_orchestrator` now resolves `graph_id` from `SimulationManager().get_simulation(sim_id).graph_id` (the real persisted state) instead of a `graph_id.txt` that nothing in the codebase writes. `ZepGraphMemoryUpdater(graph_id=...)` is now called with the correct positional argument; the bare `try/except Exception` that was swallowing the TypeError is replaced with a narrow fallback that logs explicitly. - C3: `SimulationManager._on_ready_hooks` / `_on_completed_hooks` are now class-level (mirroring `SimulationRunner._on_completed_callbacks`). Hooks registered at app startup now survive across the per-request `SimulationManager()` instances created by the Flask API, so the T0 longitudinal auto-survey actually fires. - C4: `ZepGraphMemoryUpdater` gains an explicit `add_text_episode(graph_id, text)` method for synchronous text writes. `InterviewZepWriter._emit` no longer silently falls back to a dict-shaped `add_activity` call that the real implementation rejects (its `add_activity` requires an `AgentActivity` dataclass). - C5: `FileSystemPersonaProvider.agent_to_entity()` builds an `{agent_id: zep_entity_uuid}` map from the persisted profile files; the map is now passed to `ZepMemoryProvider` so `get_entity_with_context` is called with real Zep UUIDs instead of `str(agent_id)`. To make this work, `OasisProfileGenerator._save_reddit_json` and `_save_twitter_csv` now persist `source_entity_uuid` (Reddit JSON: optional field; Twitter CSV: appended column). Tests: 51 unit + 2 integration pass (was 40 + 2). New tests lock in each fix: - `test_hooks_survive_across_instances` (C3) - `test_build_orchestrator_reads_graph_id_from_state` (C1+C2+C5) - `test_build_orchestrator_falls_back_when_state_missing` (C1+C2) - `test_emit_uses_add_text_episode_with_graph_id`, `test_emit_raises_when_updater_lacks_add_text_episode`, `test_real_updater_exposes_add_text_episode` (C4) - `test_agent_to_entity_from_reddit_json`, `test_agent_to_entity_empty_when_no_field`, `test_agent_to_entity_falls_back_to_twitter_csv`, `test_agent_to_entity_reddit_takes_precedence` (C5) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 13:27:47 +02:00
Christian Moellmann	6b04ea5c27	feat(interviews): auto-trigger lifecycle hooks + bridge SimulationRunner→Manager on COMPLETED - Add backend/app/services/interviews/lifecycle.py with install_hooks() that registers on_ready (pre-survey) and on_completed (post-survey + synthesis) daemon-thread callbacks on a SimulationManager. - Add SimulationRunner.register_on_completed() / _fire_on_completed() so external callbacks can be notified when _monitor_simulation transitions to COMPLETED (both exit-code-0 path and simulation_end event path). - Wire both in app/__init__.py: create singleton SimulationManager, install lifecycle hooks, and register its _notify_on_completed with SimulationRunner. - Add test_lifecycle.py: verifies install_hooks registers one callable for each of ready and completed. - All 40 unit tests + 2 integration tests pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:51:13 +02:00
Christian Moellmann	61f13a806d	test(interviews): end-to-end pipeline test + content-aware LLM stubs for all 4 subagents Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:40:53 +02:00
Christian Moellmann	52bae0a3da	feat(interviews): Flask blueprint /api/interview with task-based async + CSV export Add /api/interview blueprint with POST pre/post/rerun, GET status/results/synthesis/export.csv endpoints. Background tasks tracked by UUID in module-level dict. Add register_blueprints() helper to api/__init__.py and wire app factory through it. Add UPLOADS_DIR to Config with env-override default. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:37:04 +02:00
Christian Moellmann	bc07170dbf	feat(interviews): persona + Zep memory adapters bridging existing services to interview subsystem Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:34:03 +02:00
Christian Moellmann	d79c81d2b7	feat(interviews): synthesiser emits cross-method report + tidy CSV + limitations section Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:32:14 +02:00
Christian Moellmann	3322bcb20c	feat(interviews): on_ready / on_completed hook registry on SimulationManager Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:29:30 +02:00
Christian Moellmann	b3e2039817	feat(interviews): orchestrator with two-phase lifecycle, parallel fan-out, isolated failures Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:27:47 +02:00
Christian Moellmann	cca67365b9	feat(interviews): Zep writer adapts add_activity/add_text_episode for per-agent + aggregate episodes Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:24:33 +02:00
Christian Moellmann	998cf1ac27	feat(interviews): JSONL/JSON storage layout with run_id directories and latest pointer Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:23:06 +02:00
Christian Moellmann	ae4941df8e	feat(interviews): scenario subagent with 4 futures × 4 dimensions + polarity matrix Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:21:21 +02:00
Christian Moellmann	5d7111b54e	feat(interviews): Delphi subagent (3 rounds: open, rate, revise) + convergence metrics Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:19:07 +02:00
Christian Moellmann	75762ccc18	feat(interviews): diversity subagent with Q-sort + 6 Likert axes + PCA/k-means typology Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:16:21 +02:00
Christian Moellmann	0fcb815cde	feat(interviews): longitudinal subagent + 12-item Likert instrument Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:12:46 +02:00
Christian Moellmann	289a0cff56	feat(interviews): StakeholderInterviewer base with in-character prompting and schema retry Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:10:01 +02:00
Christian Moellmann	eb3c3629c1	feat(interviews): LLM stub mode for deterministic CI tests Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:08:29 +02:00
Christian Moellmann	29be754ff4	feat(interviews): YAML instrument loader with pydantic validation and hash freezing Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:06:52 +02:00
Christian Moellmann	f1898b4eac	feat(interviews): add pydantic models for instruments and responses Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:04:45 +02:00
Christian Moellmann	071f8b5c4c	feat(interviews): add interview config keys (token budget, workers, language, stub mode) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:02:33 +02:00
Christian Moellmann	f63bc5542a	chore(interviews): add deps and pytest scaffold for interview subsystem Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 12:00:09 +02:00
lllopic	daec4b6be4	refactor: add type hints and FileParser.is_supported() helper - Add return type annotation (list[str]) to Config.validate() - Add type annotations (msg: str, -> None) to logger convenience functions - Add FileParser.is_supported() classmethod for checking file format support	2026-05-23 14:57:46 +08:00
BaiFu	af71244974	Merge pull request #428 from Ghostubborn/feat/i18n feat(i18n): 添加多语言切换功能，支持中英文	2026-04-02 14:27:04 +08:00
ghostubborn	f2404903d6	fix(i18n): validate Accept-Language header against registered locales	2026-04-02 14:20:15 +08:00
ghostubborn	24e9bee5be	feat(i18n): replace all user-visible Chinese logger messages in zep_tools.py These are shown to users via ConsoleLogger in the report page.	2026-04-01 17:46:39 +08:00
ghostubborn	e79569ab4f	feat(i18n): replace all user-visible Chinese in report_agent.py Covers ReportLogger message fields and logger messages shown via ConsoleLogger.	2026-04-01 17:44:52 +08:00
666ghj	e3350a919d	fix(graph): enforce PascalCase for entity names and SCREAMING_SNAKE_CASE for edge names in ontology validation	2026-04-01 17:42:27 +08:00
ghostubborn	380e456d41	fix(i18n): replace hardcoded Chinese stage names in simulation prepare SSE	2026-04-01 17:31:00 +08:00
ghostubborn	0e55e4cf6b	feat(i18n): replace remaining Chinese in config generator and profile generator Also update simulation prompts to be locale-neutral for timezone/schedule.	2026-04-01 17:19:12 +08:00
ghostubborn	7c07237544	fix(i18n): pass locale to background threads via thread-local storage Background threads (graph building, simulation prep, report generation, profile generation) now inherit the requesting user's locale preference. Previously these fell back to 'zh' because Flask request context was unavailable in spawned threads.	2026-04-01 16:55:51 +08:00
ghostubborn	592ee52f59	feat(i18n): replace remaining hardcoded Chinese in progress callbacks	2026-04-01 16:53:29 +08:00
ghostubborn	da2490ec31	fix(i18n): protect JSON field values from language instruction in config generator Ensure poster_type stays PascalCase English and stance stays English enum values regardless of language setting. Only natural language fields follow the user's language preference.	2026-04-01 16:41:22 +08:00
ghostubborn	97aa58384e	fix(i18n): ensure ontology names stay PascalCase regardless of language setting The language instruction was causing LLM to change entity/relation naming conventions. Now explicitly enforce PascalCase/UPPER_SNAKE_CASE for technical identifiers while only applying language preference to description fields.	2026-04-01 16:40:17 +08:00
ghostubborn	9d43b77511	feat(i18n): replace hardcoded Chinese in backend SSE progress messages	2026-04-01 16:32:10 +08:00
ghostubborn	f75c6487b3	fix(i18n): replace remaining hardcoded language directives in LLM prompts - oasis_profile_generator: replace hardcoded "使用中文" with dynamic get_language_instruction() - ontology_generator: remove hardcoded "（中文）" from schema annotation - report_agent: replace Chinese-specific language consistency rules with language-neutral ones - zep_tools: dynamically select quote style based on locale	2026-04-01 15:55:04 +08:00
ghostubborn	74f673a238	feat(i18n): replace hardcoded Chinese in backend API responses with t() calls	2026-04-01 15:32:24 +08:00
ghostubborn	8f6110df0f	feat(i18n): inject language instruction into LLM system prompts	2026-04-01 15:24:12 +08:00
ghostubborn	0c18e1aeca	feat(i18n): add backend translation utility with shared locale files	2026-04-01 15:22:14 +08:00
666ghj	985f89f49a	fix: resolve 500 error caused by <think> tags and markdown code fences in content field from reasoning models like MiniMax/GLM	2026-03-06 00:30:31 +08:00
666ghj	da6548e96f	feat(graph): implement pagination for fetching nodes and edges; add utility functions for streamlined data retrieval	2026-02-27 15:53:29 +08:00
666ghj	25aa4f75d2	fix(report_agent): refine tool call handling and response validation; enforce strict separation between tool calls and final answers	2026-02-24 17:47:44 +08:00
666ghj	08ec856a58	fix(report_agent): update max_agents parameter description and enforce maximum limit of 10 agents	2026-02-14 18:35:05 +08:00
666ghj	ddd9ff2479	feat(report_agent): update report language consistency guidelines; ensure all quoted content is translated to the report language for clarity	2026-02-14 18:24:03 +08:00
666ghj	7601d78fd4	feat(report_agent): enhance interview text processing and response handling; improve quote extraction and formatting for better clarity	2026-02-14 16:56:48 +08:00
666ghj	dc0a9261d1	feat(report_agent): add detailed tool descriptions and prompts for future prediction report generation	2026-02-14 15:16:17 +08:00
666ghj	d2041f6fb8	fix(report_agent): update description of insight_forge tool to remove "最强大" and enhance clarity	2026-02-14 14:48:23 +08:00

1 2 3

112 Commits