MicroFish

Commit Graph

Author	SHA1	Message	Date
Maggie Chen	29fdb64fa0	fix: address critical security vulnerabilities — auth and path traversal Two critical issues and several high/medium issues were identified during a security review of the backend API. Critical fixes: 1. Path traversal (CWE-22): user-supplied `simulation_id`, `report_id`, and `project_id` values were passed directly to `os.path.join()` without validation, allowing `../` sequences to escape intended directories. - Added `backend/app/utils/id_validator.py` with `validate_safe_id()` (rejects anything that isn't alphanumeric/underscore/hyphen) and `safe_join()` (resolves realpath and verifies containment). - Applied to all 3 path-construction sites in simulation.py, all 12 relevant handlers in report.py, and 6 sites in graph.py. - Sanitized uploaded filenames with `os.path.basename()` in graph.py. 2. Missing authentication: all API endpoints were publicly accessible with no auth mechanism. - Added `backend/app/utils/auth.py` with an `X-Api-Key` middleware registered as a `before_request` hook. - Auth is opt-in: set `API_KEY` in `.env` to enforce it; if unset a startup warning is logged. This preserves local dev workflows. High fixes: 3. Hardcoded `SECRET_KEY` fallback replaced with `os.urandom(32).hex()` so an unset key is never predictable. 4. `FLASK_DEBUG` now defaults to `False` instead of `True`. 5. Full Python tracebacks removed from all API error responses (51 total across graph.py, report.py, simulation.py) — tracebacks still go to the logger. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 19:35:50 -04:00
ghostubborn	f2404903d6	fix(i18n): validate Accept-Language header against registered locales	2026-04-02 14:20:15 +08:00
ghostubborn	7c07237544	fix(i18n): pass locale to background threads via thread-local storage Background threads (graph building, simulation prep, report generation, profile generation) now inherit the requesting user's locale preference. Previously these fell back to 'zh' because Flask request context was unavailable in spawned threads.	2026-04-01 16:55:51 +08:00
ghostubborn	0c18e1aeca	feat(i18n): add backend translation utility with shared locale files	2026-04-01 15:22:14 +08:00
666ghj	985f89f49a	fix: resolve 500 error caused by <think> tags and markdown code fences in content field from reasoning models like MiniMax/GLM	2026-03-06 00:30:31 +08:00
666ghj	da6548e96f	feat(graph): implement pagination for fetching nodes and edges; add utility functions for streamlined data retrieval	2026-02-27 15:53:29 +08:00
666ghj	390c120fef	fix(file_parser): handle non-UTF-8 encoded text files with automatic encoding detection	2026-01-22 18:28:37 +08:00
666ghj	f46c1a9ec7	Add UTF-8 encoding support for Windows console in run.py and logger.py to prevent character encoding issues	2025-12-26 17:58:48 +08:00
666ghj	5f159f6d88	Enhance backend functionality with OASIS simulation features - Updated README.md to include new simulation scripts and configuration details for OASIS, including API retry mechanisms and environment variable settings. - Added simulation management and configuration generation services to streamline the simulation process across Twitter and Reddit platforms. - Introduced new API routes for simulation-related operations, including entity retrieval and simulation status management. - Implemented a robust retry mechanism for external API calls to improve system stability. - Enhanced task management model to include detailed progress tracking. - Added logging capabilities for action tracking during simulations. - Included new scripts for running parallel simulations and testing profile formats.	2025-12-01 15:03:44 +08:00
666ghj	e98da6b53e	Enhance backend startup logging and API endpoint display - Updated `run.py` to conditionally print startup information only in the reloader process to avoid duplicate logs in debug mode. - Modified `__init__.py` to log startup and completion messages based on the reloader process condition. - Added warnings suppression in `graph_builder.py` for Pydantic v2 regarding Field usage. - Revised `ontology_generator.py` to enforce strict design guidelines for entity types and relationships, ensuring compliance with new requirements. - Improved logging behavior in `logger.py` to prevent log propagation to the root logger, avoiding duplicate outputs.	2025-11-28 18:59:36 +08:00
666ghj	08f417f3b7	Introduce Project ID for context management, finalizing the stateful API pipeline from file submission to graph construction.	2025-11-28 17:21:08 +08:00

11 Commits