MicroFish

Commit Graph

Author	SHA1	Message	Date
Maggie Chen	29fdb64fa0	fix: address critical security vulnerabilities — auth and path traversal Two critical issues and several high/medium issues were identified during a security review of the backend API. Critical fixes: 1. Path traversal (CWE-22): user-supplied `simulation_id`, `report_id`, and `project_id` values were passed directly to `os.path.join()` without validation, allowing `../` sequences to escape intended directories. - Added `backend/app/utils/id_validator.py` with `validate_safe_id()` (rejects anything that isn't alphanumeric/underscore/hyphen) and `safe_join()` (resolves realpath and verifies containment). - Applied to all 3 path-construction sites in simulation.py, all 12 relevant handlers in report.py, and 6 sites in graph.py. - Sanitized uploaded filenames with `os.path.basename()` in graph.py. 2. Missing authentication: all API endpoints were publicly accessible with no auth mechanism. - Added `backend/app/utils/auth.py` with an `X-Api-Key` middleware registered as a `before_request` hook. - Auth is opt-in: set `API_KEY` in `.env` to enforce it; if unset a startup warning is logged. This preserves local dev workflows. High fixes: 3. Hardcoded `SECRET_KEY` fallback replaced with `os.urandom(32).hex()` so an unset key is never predictable. 4. `FLASK_DEBUG` now defaults to `False` instead of `True`. 5. Full Python tracebacks removed from all API error responses (51 total across graph.py, report.py, simulation.py) — tracebacks still go to the logger. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 19:35:50 -04:00
666ghj	5ece3f670b	Implement Report Agent for automated report generation and interaction - Introduced the Report Agent module to facilitate the automatic generation of simulation analysis reports using LangChain and Zep, following the ReACT model. - Added functionality for report outline planning, segmented content generation, and user interaction through a dialogue interface. - Implemented new API endpoints for report generation, status checking, and retrieval, enhancing the overall reporting capabilities. - Updated README.md to include detailed instructions on the new report generation features and API usage. - Enhanced the project structure to accommodate the new report management functionalities, including report storage and retrieval mechanisms.	2025-12-09 15:10:55 +08:00
666ghj	91eb73ae44	Enhance signal handling and suppress warnings in simulation scripts - Added signal handling to gracefully manage shutdown events across simulation scripts, ensuring proper cleanup of resources. - Introduced a global shutdown event to allow simulations to respond to termination signals, improving robustness. - Suppressed warnings related to multiprocessing resource tracking to avoid unnecessary log clutter during execution. - Updated cleanup logic in `SimulationRunner` and `ZepGraphMemoryManager` to prevent redundant calls and ensure efficient resource management. - Enhanced logging to provide clearer feedback during shutdown processes, improving traceability.	2025-12-09 00:37:12 +08:00
666ghj	5b4f02f421	Enhance simulation configuration and management features - Added support for a `max_rounds` parameter in simulation API, allowing users to limit the number of simulation rounds, improving control over simulation duration. - Updated README.md to reflect the new `max_rounds` parameter and its usage in simulation requests. - Enhanced error handling for `max_rounds` input validation to ensure it is a positive integer. - Modified simulation runner and related scripts to incorporate `max_rounds` functionality, ensuring consistent application across Twitter and Reddit simulations. - Improved logging to indicate when the number of rounds is truncated due to the `max_rounds` setting, enhancing traceability during simulation execution.	2025-12-05 15:50:54 +08:00
666ghj	d4fac63eb4	Enhance simulation management and logging features - Registered a cleanup function for simulation processes to ensure proper termination on server shutdown. - Improved logging during application startup to confirm the registration of the cleanup function. - Updated simulation preparation checks to clarify the conditions for considering a simulation ready, enhancing error handling and user feedback. - Added detailed logging for simulation status changes, improving traceability during the simulation lifecycle. - Introduced new files for simulation configuration and profile data, supporting enhanced testing and visualization capabilities.	2025-12-02 17:11:47 +08:00
666ghj	5f159f6d88	Enhance backend functionality with OASIS simulation features - Updated README.md to include new simulation scripts and configuration details for OASIS, including API retry mechanisms and environment variable settings. - Added simulation management and configuration generation services to streamline the simulation process across Twitter and Reddit platforms. - Introduced new API routes for simulation-related operations, including entity retrieval and simulation status management. - Implemented a robust retry mechanism for external API calls to improve system stability. - Enhanced task management model to include detailed progress tracking. - Added logging capabilities for action tracking during simulations. - Included new scripts for running parallel simulations and testing profile formats.	2025-12-01 15:03:44 +08:00
666ghj	e98da6b53e	Enhance backend startup logging and API endpoint display - Updated `run.py` to conditionally print startup information only in the reloader process to avoid duplicate logs in debug mode. - Modified `__init__.py` to log startup and completion messages based on the reloader process condition. - Added warnings suppression in `graph_builder.py` for Pydantic v2 regarding Field usage. - Revised `ontology_generator.py` to enforce strict design guidelines for entity types and relationships, ensuring compliance with new requirements. - Improved logging behavior in `logger.py` to prevent log propagation to the root logger, avoiding duplicate outputs.	2025-11-28 18:59:36 +08:00
666ghj	08f417f3b7	Introduce Project ID for context management, finalizing the stateful API pipeline from file submission to graph construction.	2025-11-28 17:21:08 +08:00

8 Commits