fischer-agentkit

Commit Graph

Author	SHA1	Message	Date
chiguyong	bbedfff597	feat: hub-and-spoke experts, tiered tool injection, unified event model (U3/U7/U10)	2026-06-17 10:46:16 +08:00
chiguyong	200174c5c7	feat: SQLite persistence, verification loop, spec-driven execution. Phase 2 of architecture optimization (U5/U6/U9). U5: SqliteConversationStore with WAL mode + LRU cache (1000 convs); replaces in-memory ConversationStore in portal.py; data survives server restarts (ref: Codex Thread persistence). U6: VerificationLoop with verify/verify_and_retry; default commands: pytest + ruff check; ReActEngine integration via verification_enabled flag; new run_tests tool for LLM to invoke verification. U9: SpecManager for plan-as-contract (ref: Qoder Quest Mode); plans persisted to .agentkit/specs/{spec_id}.yaml; API: GET/PUT /api/v1/specs, POST /api/v1/specs/{id}/confirm; PlanExecEngine emits spec_created event after plan generation. Also fixes: portal skill_name routing, app.py SessionManager guard, test_telemetry CostAwareRouter removal, test_compression_config fixture.	2026-06-17 10:45:20 +08:00
chiguyong	64d62a2b60	feat: autonomous task execution - connect PlanExecEngine + TeamOrchestrator. U1: TeamOrchestrator._execute_phase real execution (Expert.agent.execute). U2: LLM-based merge strategies (BEST/VOTE/FUSION) with fallback. U3: ReActStepExecutor replacing _LLMStepAgent for tool-enabled steps. U4: SharedWorkspace integration for cross-phase/cross-execution state. U5: GoalPlanner prompt tuning with few-shot and verb pattern matching. U6: Replan-before-fallback in TeamOrchestrator. U7: End-to-end validation tests for multi-step research tasks. U8: WebSocket progress events (step_event_callback + new event types). Code review fixes: P0 response.strip fix, P1 competitor status check, milestone real impl, VOTE self-bias fix, confirmation_handler wiring, ExpertTeam public API, DRY _build_result_summaries, replan tests. Also: geo_server.py refactor (ServerConfig.from_yaml), delete llm_config.yaml.	2026-06-15 12:41:32 +08:00
chiguyong	7384ecb03e	feat: Expert Team Mode - plan-execute collaboration with conversation UI. Implements B+C hybrid Expert Team Mode with ExpertConfig, CollaborationPlan, TeamOrchestrator, ExpertTeamRouter, HandoffTransport, SharedWorkspace, and Expert wrapper. Frontend includes ExpertTeamView, ExpertMessage, PlanVisualization, team store, and WS event handlers. Code review fixes: sentinel-based close, per-phase retry, name validation, Vue component integration, teamState dedup, Redis reset, plan reassign, event_type validation, hmac timing-safe compare, message dedup, reactive updatePhases, O(1) phase lookup, iterative DFS, bounded Queue. 232 unit tests passing.	2026-06-14 22:20:14 +08:00
chiguyong	d47f279887	fix: resolve code review issues from deferred improvements. (1) InMemoryMessageBus.request(): fix param name (timeout→timeout_seconds) to match ABC. (2) InMemoryMessageBus: track consumer tasks, cancel on unsubscribe. (3) InMemoryMessageBus: _try_resolve_pending() in queue consumer path. (4) evolve_soul(): use "default" category when patterns is empty. (5) quick_classify(): use delimiter-based prompt to mitigate injection risk. (6) Use asyncio.get_running_loop() instead of deprecated get_event_loop().	2026-06-11 13:49:02 +08:00
chiguyong	658e188939	fix(review): resolve P0/P1 findings from final code review	2026-06-10 09:57:29 +08:00
chiguyong	1d1805753c	fix: resolve key P2 findings from code review. Shell whitelist: use exact binary match instead of startswith; Shell audit log: use deque(maxlen=10000) to cap memory; Terminal history: use deque(maxlen) for O(1) eviction; Path optimizer: cap _pending_paths at 50 entries per task_type; Pitfall detector: only add tips to matching steps, not all; Experience store: handle non-numeric _parse_time_window input; Extract shared is_safe_url() to utils/security.py (DRY); Workflow condition evaluator: handle float() ValueError.	2026-06-10 09:01:23 +08:00
chiguyong	fd4a811929	feat(phase1): implement core kernel and experience foundation (U1-U5). U1: GoalPlanner - structured goal decomposition wrapping _decompose_task(); U2: PlanExecutor - parallel execution with retry/skip/replace strategies; U3: PlanChecker - quality gate + review + experience writing; U4: Skill spec upgrade - dependencies, capabilities, version management; U5: ExperienceStore - PostgreSQL+pgvector task experience storage. 208 new tests passing, fully backward compatible.	2026-06-09 23:57:03 +08:00

8 Commits
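
Two of the fixes listed in commit 1d1805753c (exact binary match for the shell whitelist, and a deque(maxlen=10000) audit log) can be illustrated with a minimal sketch. This is not the repository's code: the names ALLOWED_BINARIES, audit_log and is_allowed, and the whitelist contents, are hypothetical and chosen only for illustration.

```python
import shlex
from collections import deque

# Hypothetical whitelist; the binaries the project actually allows are not shown in the log.
ALLOWED_BINARIES = frozenset({"ls", "cat", "git"})

# Bounded audit log: once 10000 entries are stored, each append drops the oldest entry.
audit_log = deque(maxlen=10000)


def is_allowed(command: str) -> bool:
    """Return True only if the first token exactly equals a whitelisted binary name."""
    try:
        tokens = shlex.split(command)
    except ValueError:
        # Unbalanced quotes and similar parse errors are rejected outright.
        allowed = False
    else:
        # Exact membership test: "lsblk" does not pass just because it starts with "ls".
        allowed = bool(tokens) and tokens[0] in ALLOWED_BINARIES
    audit_log.append((command, allowed))
    return allowed
```

With a prefix check such as command.startswith("ls"), the command "lsblk" would be accepted; the exact match on the first token rejects it. The bounded deque keeps memory use capped without any manual trimming.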