fischer-agentkit

Commit Graph

Author	SHA1	Message	Date
chiguyong	1599d193c7	test: fix async generator mock for U3 streaming orchestrator Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details U3 streaming refactor switched orchestrator from agent.execute() to agent.execute_stream() (async gen), but tests still mocked execute(). AsyncMock() returns a coroutine lacking __aiter__, causing: - 'async for' requires an object with __aiter__ method, got coroutine - RuntimeWarning: coroutine was never awaited Add shared helpers in tests/unit/experts/_helpers.py: - make_chat_stream_mock: async gen for gateway.chat_stream - make_execute_stream_mock: async gen yielding final_answer event - make_execute_stream_raising_mock: async gen that raises (for failure tests) Update 3 test files to use the helpers: - test_team_orchestrator.py: _make_mock_expert, _make_mock_pool, failure tests (phase_failed, all_phases_fail, fallback_uses_lead, phase_failure_marks_dependents), assertion updates (execute_stream instead of execute), synthesizer warning cleanup - test_pm_collaboration.py: _make_mock_expert, _make_mock_llm_gateway, collaboration/risk/rework assertions - test_board_orchestrator.py: _make_mock_gateway (warning cleanup) All 483 experts/ tests pass with 0 warnings.	2026-07-02 22:52:10 +08:00
chiguyong	78a7faa17b	refactor: remove all emoji from agentkit Deploy to Production / deploy (push) Waiting to run Details Test / backend-test (push) Waiting to run Details Test / frontend-unit (push) Waiting to run Details Test / api-e2e (push) Waiting to run Details Test / frontend-e2e (push) Waiting to run Details Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details Replace emoji across codebase: YAML avatars -> first char, frontend banners -> Ant Design Vue components, CLI status -> OK/FAIL/WARN labels, terminal -> [WARN]/[OK]/[PENDING], Bitable DB default -> table, App.vue font cleanup, test fixtures -> first char letters. shell.avatar type upgraded to string \| Component.	2026-07-02 01:33:28 +08:00
chiguyong	47a437c5e3	fix(experts): resolve residual review findings from PR #13 Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details Addresses 4 actionable findings (1 P1 + 3 P2) from ce-code-review of feat/ui-ue-enhancement (PR #13), now merged to main (`8066e0b`). P1 — expert_step payload alignment (_phase_executor.py) The thinking/tool_call/tool_result event payloads were missing the fields the frontend WsServerMessage contract requires (expert_name/expert_color/content/step). Frontend code consuming these events silently degraded. Now all expert_step broadcasts carry the full contract; tool_call/tool_result keep step_data for the raw payload. P2 #1 — execute_stream CancellationToken registration (config_driven.py) execute_stream() bypassed BaseAgent.execute() and never registered a CancellationToken, so cancel_task() could not cooperatively cancel a streaming task. Now registers the token and cleans it up in finally. P2 #2 — team_synthesis orphan milestone cleanup (orchestrator.py) If synthesis streaming was interrupted (cancel/exception), no terminal team_synthesis event was emitted, leaving the frontend streaming milestone spinning forever. Now an inner try/except emits a terminal team_synthesis with status=cancelled\|error before re-raising, so the frontend can finalize the milestone. The success path also carries the synthesis_id. P2 #3 — synthesis_id dedup (orchestrator.py + types.ts + chatStream.ts) Without an identifier, the frontend could not precisely match a team_synthesis terminal event to its streaming milestone (especially across retries/concurrent teams). The backend now injects a stable synthesis_id (`{plan.id}:synthesis`) into both team_synthesis_chunk and team_synthesis events; the frontend uses it for exact milestone matching and treats error/cancelled status as terminal. Test updates - Updated test_thinking_events_forwarded_as_expert_step to assert the new payload contract (expert_id/name/color/content/step). - Added test_tool_call_events_forwarded_as_expert_step covering tool_call/tool_result payload shape (content=tool_name摘要 + step_data=原始 payload). Verification - ruff check: clean - pytest tests/unit/experts/test_phase_executor_streaming.py: 14/14 - npm run typecheck: clean - vitest: 126/127 (1 unrelated baseline failure in tauri-auth.test.ts) Residuals doc: docs/residual-review-findings/feat-ui-ue-enhancement.md	2026-07-01 13:26:19 +08:00
chiguyong	f872a3fac6	feat: UI/UE enhancement — streaming, sticky header, hover actions, calendar tokens Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details U1 ThinkingBlock: streaming cursor + auto-collapse to summary bar U2 StickyModeHeader: new component replacing ExpertTeamView + BoardStatusView U3 Backend _phase_executor: execute_stream() with token/thinking/final_answer forwarding U4 Frontend chatStream: expert_result_chunk/team_synthesis_chunk token accumulation U5 AssistantText: routing tag hover fade-in U6 UserBubble: hover actions (copy/delete/refill) U7 CalendarGrid: token-based color redesign Review fixes (ce-code-review): - P0: _VALID_TEAM_EVENT_TYPES whitelist adds 3 new streaming event types - P0: final_answer no longer double-accumulates token content - P2: exception handling expanded to except Exception for LLMProviderError etc. Simplification (ce-simplify-code): - _synthesizer.py: O(n²) concat -> list+join, _concat_results extraction - config_driven.py: 4 duplicate _handle_*_stream -> _wrap_sync_as_stream - chatStream.ts: 5x [...messages].reverse().find() -> findLastMessage helper Tests: pytest 13/13, vitest 126/127 (1 baseline), typecheck pass, ruff clean	2026-07-01 12:51:45 +08:00
chiguyong	be5c4e09f8	refactor(core,experts): classify except Exception + structured ReviewResult (U3) ReviewResult dataclass (passed/degraded/feedback) replaces tuple+[DEGRADED] prefix in _review_phase_output; 3 review_result WS payloads now carry degraded field (AE3). except Exception narrowed to specific types across 10 files (core/react, rewoo, base, orchestrator, dispatcher, plan_exec_engine + experts/orchestrator, _phase_executor, _review_gate + orchestrator/pipeline_engine). Baseline 140 -> 66 occurrences (>=50% reduction). Fix RuntimeError regression: review-gate + compression paths now catch RuntimeError (LLM/provider internal errors) to preserve degradation semantics. Test side_effect switched to functional form to avoid StopIteration on list exhaustion. ruff clean; 135 key + 469 experts + 163 core tests pass.	2026-06-30 18:03:58 +08:00
chiguyong	ef84e3fd53	feat(experts): add SharedWorkspace state offloading for long-horizon runs U4: ExpertTeam accepts redis_client, passes to SharedWorkspace. After phase completion, full result is written to workspace and in-memory phase.result is replaced with a 500-char summary + _ref_key. Dependency output reading resolves offloaded content from workspace on demand, with graceful fallback to summary on read failure. Tests: 8 scenarios (offload creation, short content, dependency resolution, workspace failure fallback, non-offloaded passthrough, redis_client wiring, memory dict fallback, pipeline integration) — all pass.	2026-06-24 20:32:10 +08:00
chiguyong	717aad1303	feat(experts): add concurrency limit to TeamOrchestrator parallel phases U2: Add asyncio.Semaphore to bound concurrent phase execution and debate argument generation. Default limit=3, configurable via max_concurrent_phases. Prevents LLM rate-limit spikes when many phases run in the same layer. Tests: 5 scenarios (happy path, 5-phase edge case, serial mode, failure release, debate integration) — all pass.	2026-06-24 20:23:30 +08:00
chiguyong	574db8458f	fix(experts): PM 协同代码审查全量修复 P0: 跨阶段契约状态同步 — _notify_collaborators 更新接收方契约状态为 received P0: 4 个 PM 事件加入 _VALID_TEAM_EVENT_TYPES 白名单 P1: 验收 fail-open 改标注降级原因 P1: 返工失败抛 RuntimeError 而非返回 dict P1: 验收 prompt injection 防护 — 专家输出用 XML 标签包裹 P1: 契约字段校验 _EXPERT_NAME_RE P1: bool("false") 修复 — 显式比较避免字符串真值陷阱 P1: _parse_risk_flags(None) 防御 P2: _notify_collaborators 移到验收通过后 P2: SharedWorkspace 写入移到验收通过后 P2: 验收贪婪正则修复 P2: 风险标记数量上限 MAX_RISK_FLAGS=10 P2: 返工 feedback 截断 P2: 前端会话隔离 — 切换会话时清除/恢复 collaborationState P2: 前端契约状态更新 — collaboration_notice 时标记 delivered P2: CLI 死代码标注 + 异常改 debug 日志 P2: 模块级 _RISK_FLAG_RE 预编译	2026-06-24 18:56:27 +08:00
chiguyong	5487cca199	feat(experts): U4 专家风险标记 + risk_flagged 事件 - orchestrator 新增 _parse_risk_flags 静态方法，正则解析 [RISK: ...] 标记 - _execute_execution_phase 在协作通知后、验收前解析风险标记 - 风险标记通过 risk_flagged 事件广播，供前端/CLI 渲染 - 无风险标记时行为不变，向后兼容 - 新增 TestRiskFlagging 7 个测试（单/多/无/格式错误/事件发出/内容/兼容）	2026-06-24 14:17:58 +08:00
chiguyong	62fcbc0feb	feat(experts): U3 Lead 验收环节 + 返工机制 - PlanPhase 添加 rework_count 和 review_feedback 字段 - 添加 _review_phase_output 方法，Lead 用 LLM 验收阶段输出 - _execute_execution_phase 重构为返工循环（MAX_REWORKS=2） - 验收通过/返工/失败三种路径，发出 review_result 事件 - LLM 不可用时优雅降级直接通过 - 6 个新测试，全套 449 passed 无回归	2026-06-24 14:09:18 +08:00
chiguyong	c46cf06f6d	feat(experts): U2 协作契约执行 — 专家可见 + 主动通知 - _execute_execution_phase 按协作契约读取相关专家输出（可见性） - 添加 _notify_collaborators 方法，完成后通知相关专家（可协助） - 发出 collaboration_notice 事件，契约状态更新为 delivered - 7 个新测试，全套 443 passed 无回归	2026-06-24 13:54:38 +08:00
chiguyong	f219c5f016	feat(experts): U1 协作契约数据模型 + Lead 生成契约 - PlanPhase 添加 collaboration_contracts 字段（CollaborationContract dataclass） - 修改 _decompose_task prompt，要求 Lead 分解任务时定义协作契约 - 修改 _parse_phases 解析 LLM 返回的协作契约信息 - plan_update 事件自动包含协作契约（通过 to_dict 序列化） - 71 + 9 = 80 个新测试，全套 436 passed 无回归	2026-06-24 13:44:50 +08:00
chiguyong	c831e925b6	feat(experts): U4 用户干预通道 + 手动辩论触发建立 @team 执行期间的用户干预通道，支持 /stop、/debate <topic>、普通文本追加上下文。 ExpertTeam (src/agentkit/experts/team.py): - 新增 _interventions: asyncio.Queue (maxsize=64) 干预队列 - add_user_intervention(msg): 广播 + 入队 - consume_user_interventions(): 排空并返回待处理干预 - broadcast_user_message 现在同时入队干预队列 TeamOrchestrator (src/agentkit/experts/orchestrator.py): - 新增 _user_context: list[str] 累积普通文本干预 - 新增 _process_interventions(lead, plan) 在每层执行前调用： * /stop → 终止执行，广播 plan_update(stopped_by_user) * /debate <topic> → 动态插入 DEBATE phase（受 MAX_DEBATES 限制） * 普通文本 → 累积到 _user_context - _synthesize_results 将 _user_context 追加到 synthesis prompt WS 路由 (src/agentkit/server/routes/chat.py): - 模块级 _active_teams dict 跟踪每个 session 的活跃团队 - _execute_team_collab 执行前注册、finally 注销 - WS 消息循环：若 session 有活跃团队，message 路由为干预而非新任务 - 新增 team_intervention_ack 确认消息测试：tests/unit/experts/test_team_intervention.py（20 测试），覆盖队列基础、/stop、/debate、普通文本、混合消息、synthesis 影响。同步更新 test_orchestrator_debate.py 的干预通道兼容性测试（U4 已实现 consume_user_interventions）。全部 418 experts 测试 + 325 server 测试通过。	2026-06-24 12:17:09 +08:00
chiguyong	ac26d417b3	feat(experts): U3 分歧检测 + 方案评审辩论自动触发在 TeamOrchestrator 中新增 4 个方法实现自动辩论触发： - _maybe_add_plan_review_debate: 任务分解后可选插入方案评审 DEBATE phase（phases > 2 且 LLM 判断需要时），所有执行阶段依赖它 - _detect_divergence: 每层执行后用 LLM 判断已完成阶段产出是否与其他阶段存在分歧，偏好 false negative - _insert_debate_phase: 动态插入 DEBATE phase 并重 wiring 依赖（原依赖 trigger 的 phase 现在依赖 DEBATE） - _check_divergence_and_insert_debates: 每层完成后的协调入口，受 MAX_DEBATES=3 上限保护主循环从 `for layer in layers` 改为 `while True` + 重新计算 topological_sort()，以支持动态插入 DEBATE phase 后的依赖分层。测试：tests/unit/experts/test_divergence_detection.py（21 测试），覆盖 happy path / 边界 / 错误路径 / 集成分层。同步修复 test_team_orchestrator.py 的 mock gateway 以适配 U3 的额外 LLM 调用。全部 398 测试通过。	2026-06-24 11:09:53 +08:00
chiguyong	fbe08cb1e2	feat(experts): add debate phase executor to TeamOrchestrator (U2) Implement _execute_debate_phase() with Lead-facilitated structured debate: - Lead opens with divergence point + dependency context - Experts argue in parallel per round (asyncio.gather) - Lead summarizes each round, then adjudicates final verdict - Verdict produces decision (adopt/compromise/shelve/inconclusive) + conclusion - Conclusion written to SharedWorkspace for downstream phases Escape hatches: - debate_config.skip=true short-circuits with template text - MAX_DEBATE_ROUNDS=4 hard cap on rounds - User /stop intervention ends debate early (U4-compatible via getattr fallback) - LLM unavailable falls back to template verdict, no crash New events: debate_started, expert_argument, debate_round_summary, debate_resolved (plus existing phase_completed for consistency). Phase dispatcher (_execute_phase) routes by phase_type: EXECUTION to _execute_execution_phase, DEBATE to _execute_debate_phase. 36 new tests in test_orchestrator_debate.py covering happy path (2 rounds, 2 experts), max_rounds=1 boundary, empty participants, user stop, skip escape hatch, LLM unavailable, SharedWorkspace integration, event broadcasting, intervention channel compatibility, and helper methods. All 377 expert tests pass. Also includes planning artifacts (brainstorm requirements + implementation plan with 6 units U1-U6).	2026-06-24 10:54:51 +08:00
chiguyong	e539122314	feat(experts): add PhaseType enum and debate_config to PlanPhase U1: Data model foundation for structured debate collaboration. - Add PhaseType enum (EXECUTION \| DEBATE) - Add phase_type and debate_config fields to PlanPhase - Update to_dict/from_dict for serialization with backward compatibility - Add tests for PhaseType, debate phase creation, serialization, and mixed EXECUTION+DEBATE topological sort	2026-06-24 10:42:11 +08:00
chiguyong	91f56ca663	feat: 企业级客户端-服务端架构 + 代码审查修复 ## 主要变更 ### 新增功能 - 企业级客户端-服务端架构（JWT 认证 + RBAC 权限 + 终端安全） - Tauri 桌面客户端与服务端配置同步 - 远程 LLM 网关（RemoteLLMProvider，支持 401 token 刷新重试） - 服务端终端 WebSocket（带管理员审批流程） - 终端白名单六层防御（黑名单 → shell 操作符检测 → 内置安全 → 全局/用户/会话白名单 → 危险检测） ### 代码审查修复（P0/P1/P2） - P0: 危险二进制（rm/docker 等）不再加入白名单，compute_whitelist_entry 返回 None - P1: 终端审批所有权追踪（_approval_owners dict）+ 会话清理防泄漏 - P1: 本地终端 WebSocket URL 补齐 JWT token - P1: 审计日志支持 terminal_mode 过滤 - P1: /system/resources 端点强制 SYSTEM_CONFIG 权限 - P1: RemoteLLMProvider 增加 401 token 刷新重试机制 - P1: auth/models.py 使用 Mapping[str, object] 替代 Any 类型 - P2: 终端授权依赖检查 is_active 账户状态 - 修复 app.py 未使用的 APIKeyAuthMiddleware 导入 ### 文档更新 - README.md: 新增第 16 章「企业级客户端-服务端架构」 - AGENTS.md / CLAUDE.md: 同步模块映射、路由表、前端页面 - 计划文档标记为 completed Closes: docs/plans/2026-06-19-003-feat-enterprise-client-server-evolution-plan.md	2026-06-20 06:48:18 +08:00
chiguyong	771756814f	fix(review): 修复代码审查发现的 P0/P1/P2 问题 P0 (Critical): - orchestrator: plan_update 事件 key 从 phases 改为 plan_phases 匹配前端契约 - orchestrator: team_formed 事件 payload 从 string[] 改为 IExpertInfo[] + plan_phases:[] P1 (High): - orchestrator: 新增 phase_failed 事件广播 (3处: gather 失败/_execute_phase 异常/_mark_dependents_failed 级联) - orchestrator: 新增 team_dissolved 事件广播 (3处: 正常完成/ValueError/Exception) - orchestrator: _mark_dependents_failed 改为 async 以支持事件广播 - orchestrator: gather 结果检查增加 asyncio.CancelledError (Python 3.11+ BaseException) - plan: PhaseStatus.RUNNING 值从 running 改为 in_progress 匹配前端联合类型 - team.ts: updatePhaseStatus 增加 plan_phases undefined 防御守卫 - chat.py: 增加 asyncio.CancelledError 处理 + team.dissolve() 移入 finally 块 P2 (Medium): - orchestrator: _get_isolated_agent 返回类型 Any 改为 ConfigDrivenAgent - orchestrator: _get_llm_gateway 返回类型 Any 改为 LLMGateway \| None - orchestrator: 依赖输出从 SharedWorkspace 读取改为内存 dep_phase.result (减少冗余 I/O) - plan: PlanPhase.to_dict() result 序列化为 string 匹配前端 ITeamPlanPhase.result 类型 - types.ts: expert_step.step 类型从 number 改为 string (后端发送 phase ID) Tests: 377 passed (experts + chat_team + expert_team)	2026-06-18 13:00:59 +08:00
chiguyong	ee6d16345c	feat(experts): U7 新增 5 个编程专家模板 + dev_team 团队模板 + ExpertTeamRouter 模板展开	2026-06-18 01:50:43 +08:00
chiguyong	0f8ea6e21e	feat(experts):重写 TeamOrchestrator 为流水线模式 + TeamStatus.PLANNING	2026-06-18 01:39:22 +08:00
chiguyong	1075598ebf	feat(experts):恢复 plan.py 阶段依赖图 (PlanPhase + topological_sort) - 新增 PhaseStatus 枚举 (PENDING/RUNNING/COMPLETED/FAILED) - 新增 PlanPhase 数据类 (id/name/assigned_expert/task_description/depends_on/status/result) - TeamPlan 新增 phases 字段及配套方法: get_phase/update_phase_status/topological_sort/get_ready_phases - topological_sort 使用 Kahn 算法返回执行层 (list[list[PlanPhase]])，检测循环依赖 - 保留 SubTask/MergeStrategy 向后兼容 - 新增 54 个单元测试覆盖线性/并行/循环依赖、无效引用、就绪阶段、序列化	2026-06-18 01:28:18 +08:00
chiguyong	dddcbd24e3	feat: 私董会讨论模式 + 回测集成 + WS持久化修复私董会讨论模式 (Board Meeting Mode): - BoardRouter: @board 前缀路由, 专家名验证, 模板回退 - BoardTeam: 讨论容器, 状态机 (FORMING->DISCUSSING->CONCLUDING->COMPLETED) - BoardOrchestrator: 多轮自主循环讨论引擎, 主持人小结, 停止命令检测 - 9个预设名人专家 YAML (马斯克/贝佐斯/张小龙/芒格等) - 前端 BoardStatusView 群聊式 UI + WebSocket 事件处理 - 后端 chat.py 集成 @board 路由到主聊天流程回测集成: - benchmark.py: 新增 board_meeting 维度 (18 tasks, 6 categories) - benchmark_dataset.py: 新增 BOARD_BENCHMARKS (11 E2E cases) - test_board_backtest.py: 66 个回测测试 (9 test classes) Bug 修复: - resolve_expert_configs: deep-copy 防止 is_lead 修改污染共享模板 - 所有专家名无效时回退到默认模板 - board_router: 非匹配路径 topic 未 strip - benchmark_dataset: board-name-invalid-001 输入修正 WebSocket 持久化修复: - chat.py: 三层防御机制确保任务结果不丢失 - chat store: 断线恢复逻辑部署配置: - Gitea Actions CI/CD workflow - docker-compose.deploy.yaml 部署编排 - scripts/deploy.sh 自动化部署脚本测试结果: 120 单元测试通过, 71 benchmark 测试 100% 通过, ruff 全部通过	2026-06-17 23:52:53 +08:00
chiguyong	bbedfff597	feat: hub-and-spoke experts, tiered tool injection, unified event model (U3/U7/U10)	2026-06-17 10:46:16 +08:00
chiguyong	64d62a2b60	feat: autonomous task execution - connect PlanExecEngine + TeamOrchestrator U1: TeamOrchestrator._execute_phase real execution (Expert.agent.execute) U2: LLM-based merge strategies (BEST/VOTE/FUSION) with fallback U3: ReActStepExecutor replacing _LLMStepAgent for tool-enabled steps U4: SharedWorkspace integration for cross-phase/cross-execution state U5: GoalPlanner prompt tuning with few-shot and verb pattern matching U6: Replan-before-fallback in TeamOrchestrator U7: End-to-end validation tests for multi-step research tasks U8: WebSocket progress events (step_event_callback + new event types) Code review fixes: P0 response.strip fix, P1 competitor status check, milestone real impl, VOTE self-bias fix, confirmation_handler wiring, ExpertTeam public API, DRY _build_result_summaries, replan tests Also: geo_server.py refactor (ServerConfig.from_yaml), delete llm_config.yaml	2026-06-15 12:41:32 +08:00
chiguyong	7384ecb03e	feat: Expert Team Mode — plan-execute collaboration with conversation UI Implements B+C hybrid Expert Team Mode with ExpertConfig, CollaborationPlan, TeamOrchestrator, ExpertTeamRouter, HandoffTransport, SharedWorkspace, and Expert wrapper. Frontend includes ExpertTeamView, ExpertMessage, PlanVisualization, team store, and WS event handlers. Code review fixes: sentinel-based close, per-phase retry, name validation, Vue component integration, teamState dedup, Redis reset, plan reassign, event_type validation, hmac timing-safe compare, message dedup, reactive updatePhases, O(1) phase lookup, iterative DFS, bounded Queue. 232 unit tests passing.	2026-06-14 22:20:14 +08:00

25 Commits