fischer-agentkit

Commit Graph

Author	SHA1	Message	Date
Fischer	8633f60831	feat: complex-task-quality-loop (R1-R12) — 11 P1 blockers fixed (#22 ) Deploy to Production / deploy (push) Waiting to run Details Test / backend-test (push) Waiting to run Details Test / frontend-unit (push) Waiting to run Details Test / api-e2e (push) Waiting to run Details Test / frontend-e2e (push) Waiting to run Details Merge feat/complex-task-quality-loop into main. Includes U1-U9/R1-R12 implementation + 11 P1 blocker fixes from ce-code-review. P1 fixes: trace_outcome propagation, portal execute_stream routing, network_block reentrancy, spec review gate wiring, max_reflections threading, phase budgets, plan aggregation, failure status mapping, evolution drain timeout, portal spec_review_reply, spec_review persistence.	2026-07-05 22:31:21 +08:00
chiguyong	826b766af0	docs(solutions): record bitable agent tool parity patterns + final review findings Add docs/solutions/architecture-patterns/bitable-agent-tool-parity-patterns.md capturing three architecture patterns from U6 (R15a): - Dual-sync action registration (KTD10): handlers dict + input_schema.enum - 404-before-403 ownership check (KTD9): prevent existence leak via DELETE - 409 last-view protection: prevent invalid zero-view table state Update residual findings with DR-4 (TOCTOU race in delete_view) and DR-5 (_update_field silent type drop) surfaced in final pre-merge ce-code-review pass. Both P2, neither blocks merge. Documented in the solutions doc under Known Limitations with concrete fix paths.	2026-07-04 01:04:46 +08:00
chiguyong	0f4a418408	docs(review): record residual review findings for feat/bitable-enhancement	2026-07-04 00:29:02 +08:00
chiguyong	7c900ce280	docs: add complex-task-quality-loop plan and requirements documents Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details Adds the brainstorm requirements and implementation plan that drove the 9-unit quality-loop feature (R1-R12). Also gitignores local worktree directories.	2026-07-03 22:54:11 +08:00
chiguyong	96ccca3d87	docs(bitable-p0): add implementation plan for P0 UX polish + agent parity ce-plan Deep plan (6 Implementation Units, 3 delivery phases): - Phase 1: U1 R5 design token system + vxe-table dependency declaration - Phase 2: U2-U5 R1-R4 frontend UX (inline field config, record drawer, view type switcher, grouping + conditional formatting) - Phase 3: U6 R15a BitableTool 4 new actions + DELETE /views endpoint 11 KTDs covering: CSS token layer, vxe-table ghost dependency fix, inline field configurator (hybrid vxe-table slot + custom component), record detail drawer (single column 480/640px), view type dropdown with disabled states, grouping + conditional format in View.config with backend Pydantic validation, BitableTool action registration (handlers dict + input_schema enum), X-Internal-Token ownership semantics, 3-phase delivery with config schema freeze for parallel U6. Phase 5.3 headless ce-doc-review (5 reviewers, 14 findings): - Applied 2 safe_auto (U6 verification method, U5→U6 dependency) - Applied 2 gated_auto (input_schema enum step, color_token→color_key) - Applied 5 P1 manual fixes (backend config validation, X-Internal-Token ownership, grouping+CF combo state, LoadingState/ErrorState justification, R3/R4 backend assumption verification) - 8 P2/P3 manual findings appended to Open Questions Origin: docs/brainstorms/2026-07-03-bitable-comparative-evaluation-requirements.md	2026-07-03 13:49:57 +08:00
chiguyong	f8927d1749	docs(bitable-eval): apply ce-doc-review best-judgment fixes (20 gated_auto + 12 manual) ce-doc-review（7 reviewers, 39 raw findings → 32 actionable + 3 FYI 经合成管道），用户选择"自动用最佳判断处理"路径。本提交应用全部 20 个 gated_auto 修复，并把 12 个 manual findings 追加到 Outstanding Questions 的 From 2026-07-03 review 子节。主要修复： - 修正 BitableTool 动作清单：实际为 create_table/import_excel/import_database/ collect_api/upsert_records/query_records（原文 4/6 错），消除 R15a 范围误判 - R15a 从 B 线提升至 P0（4 reviewers 独立标记的优先级矛盾——B 线"non-blocking" 与"agent 对等最高优先级子项"自相矛盾） - G23 闭合路径标注（R15c 路径 (a)/(b)） - 默认字段类型未来 user/datetime 标注（Inventory + G6） - R3 后端依赖标注（POST /views schema 扩展） - 视图删除端点补 P0 验收标准（R15a 验收 + 前端 deleteView 方法） - vxe-table 幽灵依赖标注（package.json 未声明，靠主仓 hoisting） - create_field 动作标注为必需（R8 17 新类型需 agent 能批量建字段） - R15 测试映射拆分为 R15a/R15b/R15c 三行 - R8 验收矩阵补 PII/XSS/auto-number 写保护列 + schema V3 迁移成本估算 - R15c 安全要求补 SSRF/认证/凭据加密 + 端点访问控制 - 横切验收标准补 WCAG AA 可访问性 + 空状态要求 - R8 矩阵范围标注（覆盖 P1，非 P0） Open Questions 新增 12 个 manual findings（ce-plan 阶段决策）： - user 字段用户模型 - C 先行优先级策略的实证依据 - 并发编辑 UX 策略 - 加载/错误状态统一模式 - 条件格式规则构建器 UX 形态 - 分组交互细节 - 响应式断点定义 - R2 记录详情抽屉宽度 - vxe-table 容量上限评估 - R13 仪表盘图表库 buy-vs-build - 禁用态视图类型路线图 - schema V3 双向关联回滚策略文件：docs/brainstorms/2026-07-03-bitable-comparative-evaluation-requirements.md （107 insertions, 31 deletions）	2026-07-03 13:32:07 +08:00
chiguyong	e9821a3b7f	docs(bitable): add comparative evaluation requirements with ce-code-review P1 fixes 新增三向对比评估需求文档（agentkit bitable vs Twenty vs 飞书），并应用 ce-code-review 产出的全部 P1 缺口修复（共 9 项）： - P1-1: R8 字段类型计数对齐 16+1=17（KD6 与 R8 同步） - P1-2: 新增 R8 字段类型验收矩阵（17 行表，含 V2->V3 迁移列） - P1-3: KTD7 引用具体文件 formula/parser.py 替代裸引用 - P1-4: R-ID 命名空间冲突，加日期前缀 2026-06-29-R1..R5 - P1-5: created-time 统一为 datetime（通用类型 + 默认字段使用 datetime） - P1-6: 新增 P0 验收标准段落（R1-R5 Given/When/Then） - P1-7: 新增测试策略段落 + 测试文件映射表（R1-R5、R8、R15） - P1-8: R15 拆解为 R15a/R15b/R15c + 新增 Agent 对等评估方法段落 - P1-9: R4 补充后端扩展（group_by/conditional_formatting schema）+ agent 对等说明同时包含 2 项 gated_auto 修复： - 组件计数 14 -> 15 - 移除文档中的全部 emoji，替换为 [OK] ce-code-review run-id: 20260703-123134-c7c2b2ea	2026-07-03 12:59:41 +08:00
Fischer	00b2dad36e	feat(compressor): CJK-aware token estimation + linear compress flow (#21 ) Deploy to Production / deploy (push) Waiting to run Details Test / backend-test (push) Waiting to run Details Test / frontend-unit (push) Waiting to run Details Test / api-e2e (push) Waiting to run Details Test / frontend-e2e (push) Waiting to run Details Squash merge PR #21: CJK-aware token estimation + linear compress flow + solution doc	2026-07-03 09:40:28 +08:00
Fischer	2296d0b209	refactor: remove all emoji from source code (#20 ) Deploy to Production / deploy (push) Waiting to run Details Test / backend-test (push) Waiting to run Details Test / frontend-unit (push) Waiting to run Details Test / api-e2e (push) Waiting to run Details Test / frontend-e2e (push) Waiting to run Details Replace emoji/glyph characters with Ant Design Vue Outlined icons (frontend), text labels with ANSI colors (CLI/shell), and ASCII art (docstrings). Add pre-commit guard (scripts/check-no-emoji.sh) and style guide to prevent regression. Closes: docs/plans/2026-07-02-001-refactor-remove-all-emoji-plan.md	2026-07-03 02:46:40 +08:00
chiguyong	e04e2868c3	docs(compound): message bubble empty-content and card-type exclusion pattern Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details Documents the G1 (:empty never matches Vue root), F4-A (card-bearing type exclusion via messageType prop + Set), and pure-function extraction pattern for testability without @vue/test-utils.	2026-07-03 01:58:00 +08:00
chiguyong	981a794a54	docs(plan): private-board restrictions + scheme B bubbles plan ready Plan document finalized after 4 rounds of ce-doc-review: - F4-A exclusion list extended from 5 to 9 card-bearing types - Verified root class names for all 9 card components - Corrected chrome description (2 full chrome + 7 partial chrome) - Added U1 modal focus restoration note (WAI-ARIA) - Documented R4-DA1/R4-A3/R4-A4 as Open Questions for implementation	2026-07-03 01:14:37 +08:00
chiguyong	53347ed1fe	test(u6): add L4 real-LLM smoke test for ReAct tool-use prompt Manual smoke test verifying U4 L0 prompt rule rearrangement under real LLM calls (bailian-coding/qwen3.7-plus). 5 probe queries covering external_info / realtime_data / multi_step / realtime_simple / no_tool. Results: - Probe #1 external_info: PASS (8 web_search calls, 99.9s) - Probe #2 realtime_data: ERROR (120s timeout, not LLM refusal) - Probe #3 multi_step: PASS (8 web_search calls, 62.6s) - Probe #4 realtime_data_simple: PASS (3 web_search calls, 23.8s) - Probe #5 no_tool_escape_hatch: PASS (0 tool calls, direct answer, 4.2s) Verdict: 3/4 tool-call pass (>=3/4 threshold) + 1/1 direct pass Bug 2 status upgraded to 'L4 verified'. Plan Progress table updated: U6 done, U7 done.	2026-07-02 22:08:45 +08:00
chiguyong	96f459c27d	docs: add brainstorm/plan decision artifacts + plan progress update Add ce-brainstorm requirements doc and ce-plan plan doc for private board restrictions and scheme B bubbles (decision artifacts). Update 2026-07-02-002 plan with U6/U7 progress table. Add .compound-engineering/config.local.example.yaml from ce-setup. gitignore tmp_*.html and delete_old_cluster.sh.	2026-07-02 21:27:20 +08:00
chiguyong	7376005868	fix: 修复 transient state 重置口径 + ReAct 工具调用规则 Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details Bug 1: chatStore 三个 action 重置 boardState/debateState/collaborationState - createConversation: 新增三态重置（原缺失，旧私董会状态泄漏到新会话） - selectConversation: 统一为条件重置（prevConvId !== id），避免 force-reload 误清空 - deleteConversation: 补全 collaborationState 重置 - 附带：selectConversation 中 board_speech/board_summary 消息缺失 expert_avatar/expert_color 时从 boardState.experts 兜底补全 Bug 2: ReAct _build_tool_use_prompt L0 规则调整 - 新增规则 1：涉及外部信息/实时数据/多步骤分析/不确定事实时必须使用工具 - 原规则 3 降为规则 4，收窄为仅在确实无需工具时可直接回答 - base_prompt 与工具描述不动（L1/L2 拆为独立 plan）测试：5 前端 transient-state reset matrix + 6 后端 prompt rules 断言 Plan: docs/plans/2026-07-02-002-fix-transient-state-reset-and-react-tool-guidance-plan.md	2026-07-02 20:51:57 +08:00
chiguyong	36b0296730	fix: 私董会数据持久化修复 + emoji 移除计划 - 修复 board_started/expert_speech/round_summary/board_concluded 事件持久化 - 添加 is_board 标记到会话列表和详情接口 - 实现 restoreBoardStateFromMessages 从持久化消息恢复 boardState - 添加 ChatSidebar 私董会徽章 - 添加 emoji 移除计划文档 (docs/plans/2026-07-02-001)	2026-07-02 01:07:12 +08:00
chiguyong	fe93b0f2a4	docs: compound streaming-event-contract-residuals learning Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details Knowledge sedimentation for PR #14's 4 residual findings (1 P1 + 3 P2) from ce-code-review of feat/ui-ue-enhancement. ce-compound Full mode run. Created: - docs/solutions/integration-issues/streaming-event-contract-residuals.md Bug-track doc covering the 4-fix cluster: expert_step payload alignment, execute_stream CancellationToken registration, team_synthesis orphan milestone cleanup, synthesis_id dedup. Includes code examples, root cause analysis, and prevention strategies (streaming contract testing, cancellation registration checklist, terminal event symmetry, milestone identifier pattern). Updated: - AGENTS.md: WebSocket Chat 协议 section expanded with streaming event types (expert_step/expert_result_chunk/team_synthesis_chunk), synthesis_id dedup contract, and execute_stream cancellation contract. - CONCEPTS.md: Added "Streaming Milestone" entry to Expert Orchestration cluster — the UI pattern for streaming progress indicators that transition through streaming → completed\|error states, including orphan failure mode and synthesis_id matching semantics. Overlap with existing docs/solutions/runtime-errors/streaming-event-whitelist-and-accumulation.md is MODERATE (same area, different specific bugs). Flagged for potential consolidation via ce-compound-refresh.	2026-07-01 13:53:10 +08:00
chiguyong	4866a16109	docs: compound streaming-event-whitelist-and-accumulation learning Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details Captures the ReAct streaming contract bug + WS event whitelist governance from PR #13's review fixes. Three intertwined runtime issues documented: 1. P0: final_answer double-accumulated token content (logic_error) 2. P0: _VALID_TEAM_EVENT_TYPES whitelist missing 3 new streaming event types 3. P2: except (RuntimeError, TimeoutError, ConnectionError) too narrow for LLMProviderError/ConfigValidationError in async generator Adds ReAct Streaming Contract entry to CONCEPTS.md — defines the protocol execute_stream() yields (token events with incremental content, then one final_answer event with the concatenated full text). Consumers must pick one accumulation strategy, cannot mix both without doubled output.	2026-07-01 13:15:01 +08:00
chiguyong	f872a3fac6	feat: UI/UE enhancement — streaming, sticky header, hover actions, calendar tokens Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details U1 ThinkingBlock: streaming cursor + auto-collapse to summary bar U2 StickyModeHeader: new component replacing ExpertTeamView + BoardStatusView U3 Backend _phase_executor: execute_stream() with token/thinking/final_answer forwarding U4 Frontend chatStream: expert_result_chunk/team_synthesis_chunk token accumulation U5 AssistantText: routing tag hover fade-in U6 UserBubble: hover actions (copy/delete/refill) U7 CalendarGrid: token-based color redesign Review fixes (ce-code-review): - P0: _VALID_TEAM_EVENT_TYPES whitelist adds 3 new streaming event types - P0: final_answer no longer double-accumulates token content - P2: exception handling expanded to except Exception for LLMProviderError etc. Simplification (ce-simplify-code): - _synthesizer.py: O(n²) concat -> list+join, _concat_results extraction - config_driven.py: 4 duplicate _handle_*_stream -> _wrap_sync_as_stream - chatStream.ts: 5x [...messages].reverse().find() -> findLastMessage helper Tests: pytest 13/13, vitest 126/127 (1 baseline), typecheck pass, ruff clean	2026-07-01 12:51:45 +08:00
chiguyong	975b7c4e57	docs: compound any-and-except-exception-governance convention Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details Record the strategies established during PR #8-#11 (1214+ tech debt governance) for Any replacement priority, except Exception classification, framework boundary preservation, and intentional-design retention.	2026-07-01 08:16:02 +08:00
chiguyong	03b1e3d751	docs: add systematic tech debt cleanup plan (U1-U5)	2026-06-30 14:27:47 +08:00
chiguyong	a872a459a6	docs: add PLAN_EXEC concepts + commit Wave 4 plan Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details CONCEPTS.md: new PLAN_EXEC section (Phase State Machine, PhasePolicy, Phase Violation, AdvancePhaseTool, _build_phase_engine). docs/plans/: commit the Wave 4 plan document (was untracked).	2026-06-30 12:46:24 +08:00
Fischer	2b8a7d8909	feat(agent): Wave 3 strategic coupling (G5/G6) (#6 ) Deploy to Production / deploy (push) Waiting to run Details Test / backend-test (push) Waiting to run Details Test / frontend-unit (push) Waiting to run Details Test / api-e2e (push) Waiting to run Details Test / frontend-e2e (push) Waiting to run Details	2026-06-30 09:17:19 +08:00
Fischer	a2dcde01b8	feat(agent): Wave 2 medium coupling (G4/G7/G9) (#5 ) Deploy to Production / deploy (push) Waiting to run Details Test / backend-test (push) Waiting to run Details Test / frontend-unit (push) Waiting to run Details Test / api-e2e (push) Waiting to run Details Test / frontend-e2e (push) Waiting to run Details	2026-06-30 09:09:33 +08:00
chiguyong	2747bb4e64	chore(prior): malformed tool call handling, auth whitelist, dev scripts, wave1 plan	2026-06-29 20:25:03 +08:00
chiguyong	a6e1bf5884	feat(bitable): 多维表格文件层 + 默认字段 + 表内字段操作 + ce-code-review 修复 (Stage 1) Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details 实现多维表格 UI 完整性 Stage 1（U1-U6），补齐飞书/twenty 对齐缺失的文件层、默认字段与表内字段操作能力，并修复 ce-code-review 走查发现的 P0/P1 级问题。后端（U1-U2）: - 新增 BitableFile 实体（models/db/repository/service/routes），三级层级：文件→数据表→字段/记录 - Schema V2 迁移：bitable_files 表 + tables.file_id 列，幂等（IF NOT EXISTS），保留 V1 孤儿表 - 新建数据表自动创建 5 个默认字段（标题/状态/日期/创建人/创建时间） - agent-owned 字段在 create_record 时自动填充（按 type+owner 匹配，传 actor_user_id） - 7 个文件 REST 端点 + IDOR ownership 检查（404-before-403，internal token 旁路）前端（U3-U5）: - 文件列表页（FileCard 网格 + 新建/重命名/删除）+ 文件详情页（侧栏表格列表 + vxe-table 网格） - Vue Router 嵌套路由 /bitable → /bitable/:fileId → /bitable/:fileId/:tableId - 列头菜单（编辑/隐藏/删除字段）+ 末尾 + 列新增字段 - select/multiselect 字段自定义单元格编辑器 + Tag 展示 - Pinia store 扩展 file 状态与动作，深链直访回退 getFile，fileId 切换 watch 测试（U6）: - 文件 CRUD（12 例）+ 默认字段（10 例）单元测试 - 3 个 E2E spec（视图加载、文件流、字段操作），后端不可用时优雅跳过 ce-code-review 修复（P0/P1）: - P0 路由冲突：GET /files/{file_id} 遮蔽下载端点 → 下载改 /uploads/{filename} - P0 IDOR：update/delete field/record/view 五端点补 ownership 检查 - P1 is_initialized property 缺失致二次初始化崩溃 - P1 直接 URL 导航失效（files 数组为空）→ selectFile 回退 getFile - P1 fileId 切换不重载 → 增加 watch - P1 轮询丢弃最终公式值（wasCalculating 守卫）+ 复用视图 filters - P1 测试断言 200→201；test_db 无 URL 用例解除 postgres 标记得以执行 - P2 _check_table_ownership 403→404；输入长度校验；upload field-table 一致性校验 - P2 multiselect 浅比较 → 深比较；E2E bitable-view 补 waitForServer 守卫验证：ruff check 通过；pytest 91 passed/116 skipped；vue-tsc --noEmit 通过。	2026-06-29 04:07:45 +08:00
chiguyong	5c15238a5a	fix(calendar): 修复 agent 创建日历事件后 UI 不刷新 + 文档化三根因三部曲 Test / backend-test (pull_request) Has been cancelled Details Test / frontend-unit (pull_request) Has been cancelled Details Test / api-e2e (pull_request) Has been cancelled Details Test / frontend-e2e (pull_request) Has been cancelled Details 代码修复 (ce-debug): - CalendarService.create_event 注入 notify_callback，成功后广播 calendar_event_created WS 消息 - app.py 调整 _calendar_ws_sender 闭包定义顺序，注入 CalendarService（与 ReminderScheduler 共享） - tauri-auth.ts keychain fallback 修复（localStorage 始终作为备份） - 新增 2 个广播回归测试文档 (ce-compound + ce-compound-refresh): - 新增 docs/solutions/ui-bugs/calendar-agent-create-no-refresh.md（第三根因：WS 广播缺失） - 更新 calendar-capability-and-ui-fixes.md：刷新 test count + 加 Related Issues 前向引用 - 更新 jwt-secret-dev-mode-user-id-mismatch.md：扩展 e2e bullet + 加第三个根因引用 - CONCEPTS.md 新增 Service Broadcast Callback 条目 (Real-Time Fan-Out 节) 测试: - 新增 E2E 测试套件 (admin/auth-persistence/bitable/calendar/conversation/documents/evolution/settings/skills) - 新增 tests/e2e/test_api_coverage.py - CI: .gitea/.github workflows/test.yml	2026-06-29 02:20:33 +08:00
chiguyong	c9ce15fa4b	fix(code-review): 修复走查发现的 13 High + Medium 安全/可靠性问题代码修复（8 High + 9 Medium）： - portal.py — C1 IDOR 文档 / C2 类型修复 / C3 WS 连接上限 16 / C4 ws_user_id 早初始化 / M silent swallow 日志化 - auth/middleware.py — C5 WS sid 补齐 - calendar_tool.py — C6 偏移量 ±43200 双向校验 + reminder_channels 类型/白名单校验 - sqlite_conversation_store.py — C7 DELETE 事务回滚 - chat.ts (Pinia) — C8 deleteConversation 清理 pending 缓存 - app.py — M except: pass → logger.debug(exc_info=True) - Scene6Error.vue — M onUnmounted 清理 setTimeout - DocumentsTab.vue — M Invalid Date 守卫 - ChatSidebar/RightPanel/TopNav.vue — M aria-label 无障碍标签 - SystemMonitorPanel.vue — M v-else 兜底 + active 边框色 + tablist 键盘导航 - CalendarDrawer.vue — M overflow-y: auto - CalendarGrid.vue — M ResizeObserver 反馈循环防护 - SkillsTab.vue — M onMounted 始终 fetchSkills 文档修复（5 High + 6 Medium）： - portal-platform-security-reliability-fixes.md — D2 测试路径 / D3 Root Cause+Impact 章节 / D4 severity: mixed / 标题中文化 / 12 处绝对路径转相对 / P2 #12 数字口径 - AGENTS.md — D5 路由表 22→28 / 专家模板 5→15 / LiteLLM U15 迁移 / 配置查找 fallback - README.md — 8 处端口 8000→8001 新增测试： - tests/unit/calendar/test_calendar_tool.py — ponytail 自检断言验证： - ruff check (5 文件) — All checks passed - vue-tsc --noEmit — exit 0 - git stash baseline 验证 — portal 17 个 401 失败为预存在问题已知限制（预存在）： - 17 个 portal 测试 401 失败 — 需另起 ce-debug 调查 - README.md 7 处 CostAwareRouter 引用过时 — 文档同步另起任务	2026-06-28 15:06:41 +08:00
chiguyong	43e9025c6d	fix(calendar): 日历能力缺失修复 + UI 布局优化 + 会话404处理 P0: calendar_tool reminder_rules 未传入 create_event，提醒功能完全失效。P1: chat.ts deleteConversation 未清理 pending + 404 递归保护。P2: app.py 系统提示重复段落 + gui_mode F821 + SystemMonitorPanel flex 布局。P3: portal send_json 快照 + WS connected 清除 is_local + 移除死代码。验证: ruff+pytest 98passed+typecheck 通过。	2026-06-28 14:24:58 +08:00
chiguyong	31c65e01b8	fix(security): P0 安全加固 + 多实例部署一致性 (U1-U4 + U5c) Deploy to Production / deploy (push) Has been cancelled Details U1: LLM gateway KB 缓存 fail-closed — 异常时默认禁用缓存防止 KB 数据泄漏 U2: MCP 危险工具黑名单过滤 — 6+1 端点覆盖，防止绕过 chat confirmation U3: SecretsStore Redis 迁移 — 多 worker 共享凭证，内存降级保留开发模式 U4: channels webhook Redis 状态 — ZSET 滑动窗口限流 + nonce dedup + backpressure U5c: ce-code-review 修复批次: - P0: 统一 MCP 黑名单与 publisher.py 一致 (terminal_execute -> terminal, +file_read) - P1: ZSET 限流 member 加 uuid 后缀避免同时间戳碰撞 - P1: SecretsStore redis 参数 Any -> aioredis.Redis \| None (AGENTS.md 合规) - P1: Redis client 添加 socket_timeout 防止单点故障请求挂死测试: 171 scoped tests pass, ruff clean	2026-06-26 04:05:33 +08:00
chiguyong	75e9b58e46	docs(ce-compound): 记录 portal-platform 安全/可靠性修复批次记录 ce-code-review 修复批次（commit 53faa60）的 10 个 P1/P2/P3 修复： - P1: WeCom 重放、缓存跨用户泄漏、webhook 异常风暴、shutdown 泄漏 - P2: Feishu TTL、无界任务集、配额 N+1、冗余 SHA-256、未用参数 - P3: DIRECT_CHAT 去重新增 docs/solutions/security-issues/portal-platform-security-reliability-fixes.md CONCEPTS.md 补充 3 个领域术语：Per-User Cache Namespace、Webhook Signature Freshness、Webhook Backpressure	2026-06-26 01:47:57 +08:00
chiguyong	af96cb49bd	docs(plan): deepen portal platform evolution plan — KTD5/7/8/9 expanded, KTD11 added	2026-06-25 20:13:27 +08:00
chiguyong	22c89763e2	docs: add long-horizon reliability fixes learning + scrub CONCEPTS.md - New solution doc: logic-errors/long-horizon-reliability-code-review-fixes.md Documents 13 code-review fixes (2 P0, 5 P1, 6 P2) across U1-U7 long-horizon reliability features (disclosure_level default, resume plan_id mismatch, middleware dataclass compat, state offload readback, checkpoint dedup, dynamic phase persistence, debate count restore, loop detection reset, concurrent resume lock, FAILED phase handling, checkpoint cleanup, offload type guard). - CONCEPTS.md: add Expert Orchestration cluster (Disclosure Level, State Offloading, Pipeline Checkpoint, Debate Phase, Resume). Scrub Bitable entries to remove implementation specifics per vocabulary rules (API paths, library calls, SQL syntax, class names, enum values).	2026-06-25 02:40:22 +08:00
chiguyong	71eaf8dc7c	docs: add bitable security/reliability patterns solution doc + CONCEPTS.md Deploy to Production / deploy (push) Has been cancelled Details - docs/solutions/architecture-patterns/bitable-companion-service-security-reliability-patterns.md Knowledge-track doc capturing 10 security/reliability patterns from the bitable companion service (SSRF prevention, SQL injection, IDOR, atomic task claiming, cache invalidation, composite cursor, batch ops, async I/O safety, OOM prevention, internal token auth) - CONCEPTS.md Seeded with 3 core domain nouns: Bitable, Field Ownership, Recalc - AGENTS.md Added discoverability tips for docs/solutions/ and CONCEPTS.md	2026-06-25 01:25:06 +08:00
chiguyong	bbbf9cd40a	feat(bitable): add bitable companion service with full P0-P2 fixes Bitable is a multi-dimensional table companion service that runs alongside the main AgentKit server. It provides structured data storage with formula fields, views, and ingestion pipelines. Major components: - Domain models (Pydantic v2): Table, Field, Record, View, RecalcTask - SQLAlchemy 2 async ORM with independent bitable PostgreSQL schema - Formula engine: AST parser, DAG, Kahn topological sort, safe eval - RecalcWorker: atomic task claiming (FOR UPDATE SKIP LOCKED), topo-order processing, stale-threshold reaper for crash recovery - REST API (/api/v1/bitable): tables, fields, records, views, files - BitableTool: agent-facing tool with batch chunking (500/batch) - CLI: agentkit bitable subcommands (create, list, import-excel, etc.) - Frontend: Vue 3 + vxe-table grid with field management, views, filters - Ingestion: Excel (openpyxl), database reflection, API collector Security fixes (ce-code-review P0 + ce-debug P1): - SQL injection prevention (field_id validation, parameterized queries) - IDOR protection (_check_table_ownership on all table-level endpoints) - SSRF prevention (URL scheme + private IP validation in parse_excel_url) - OOM prevention (streaming file upload, batch delete, batch insert) - Atomic recalc task claiming (FOR UPDATE SKIP LOCKED) - Formula engine cache invalidation on field changes - Composite cursor pagination for non-id sort orders - Batch upsert (eliminates N+1 queries) - Sync I/O offloaded to thread pool in async contexts - Internal token auth (X-Internal-Token, hmac.compare_digest) - PK unique index enforcement Test coverage: 88 unit tests (95 skipped without Docker)	2026-06-25 01:09:59 +08:00
chiguyong	a312e584ae	Merge branch 'feat/expert-team-pm-collaboration' — PM 协同模式 + 代码审查全量修复 Deploy to Production / deploy (push) Waiting to run Details # Conflicts: # src/agentkit/server/frontend/components.d.ts	2026-06-24 18:57:37 +08:00
chiguyong	574db8458f	fix(experts): PM 协同代码审查全量修复 P0: 跨阶段契约状态同步 — _notify_collaborators 更新接收方契约状态为 received P0: 4 个 PM 事件加入 _VALID_TEAM_EVENT_TYPES 白名单 P1: 验收 fail-open 改标注降级原因 P1: 返工失败抛 RuntimeError 而非返回 dict P1: 验收 prompt injection 防护 — 专家输出用 XML 标签包裹 P1: 契约字段校验 _EXPERT_NAME_RE P1: bool("false") 修复 — 显式比较避免字符串真值陷阱 P1: _parse_risk_flags(None) 防御 P2: _notify_collaborators 移到验收通过后 P2: SharedWorkspace 写入移到验收通过后 P2: 验收贪婪正则修复 P2: 风险标记数量上限 MAX_RISK_FLAGS=10 P2: 返工 feedback 截断 P2: 前端会话隔离 — 切换会话时清除/恢复 collaborationState P2: 前端契约状态更新 — collaboration_notice 时标记 delivered P2: CLI 死代码标注 + 异常改 debug 日志 P2: 模块级 _RISK_FLAG_RE 预编译	2026-06-24 18:56:27 +08:00
chiguyong	fef7ecea39	feat(skills): SkillHarness 激活前置条件 + 风险守卫学习基于 SkillHarness 论文（arXiv:2606.20636）与 Agent Skills 综述（arXiv:2602.12430）引入激活前置条件（preconditions）与来源标记（provenance），并新增从失败轨迹学习风险守卫建议的能力。变更内容： - U1: SkillConfig 新增 v7 preconditions/provenance 字段（base.py） - U2: build_skill_system_prompt 注入 preconditions 软检查段落 - U3: SkillLoader 三路径记录 provenance + entry_points 危险能力告警 - U4: 10 个业务 Skill YAML 补充 preconditions（2-4 条中文短句） - U5: RiskGuardLearner 从失败轨迹学习风险守卫建议（人工审查，不自动应用） - U6: CLI 命令 agentkit skill learn-risk-guards 关键决策： - KTD1: preconditions 通过 system_prompt 注入（软检查），不做硬 LLM 调用 - KTD2: RiskGuardLearner 不自动应用，需人工审查（论文显示 75% 自动学习不安全） - KTD3: provenance 为轻量字符串，不加 hash/签名（无合规需求）测试：39 个新增单元测试全部通过，ruff 检查通过。	2026-06-24 13:56:37 +08:00
chiguyong	d4bc79e409	test(calendar): wire calendar router into app.py + test plan - Register calendar router in create_app() so /api/v1/calendar/* is reachable - Initialize CalendarService + ReminderScheduler in lifespan - Register CalendarTool into tool registry for ReAct integration - Lazy-import ICSProvider in routes to break circular import - Add test plan document (5 layers: unit/integration/e2e)	2026-06-24 11:51:31 +08:00
chiguyong	460cf6e926	docs(calendar): add implementation history with code review summary	2026-06-24 11:36:10 +08:00
chiguyong	fbe08cb1e2	feat(experts): add debate phase executor to TeamOrchestrator (U2) Implement _execute_debate_phase() with Lead-facilitated structured debate: - Lead opens with divergence point + dependency context - Experts argue in parallel per round (asyncio.gather) - Lead summarizes each round, then adjudicates final verdict - Verdict produces decision (adopt/compromise/shelve/inconclusive) + conclusion - Conclusion written to SharedWorkspace for downstream phases Escape hatches: - debate_config.skip=true short-circuits with template text - MAX_DEBATE_ROUNDS=4 hard cap on rounds - User /stop intervention ends debate early (U4-compatible via getattr fallback) - LLM unavailable falls back to template verdict, no crash New events: debate_started, expert_argument, debate_round_summary, debate_resolved (plus existing phase_completed for consistency). Phase dispatcher (_execute_phase) routes by phase_type: EXECUTION to _execute_execution_phase, DEBATE to _execute_debate_phase. 36 new tests in test_orchestrator_debate.py covering happy path (2 rounds, 2 experts), max_rounds=1 boundary, empty participants, user stop, skip escape hatch, LLM unavailable, SharedWorkspace integration, event broadcasting, intervention channel compatibility, and helper methods. All 377 expert tests pass. Also includes planning artifacts (brainstorm requirements + implementation plan with 6 units U1-U6).	2026-06-24 10:54:51 +08:00
chiguyong	d1250cf32b	docs(calendar): mark plan as completed — all 12 units implemented	2026-06-24 05:04:39 +08:00
chiguyong	47f3bfecfc	feat(documents): add document processing capability (U1-U9) Implements end-to-end document generation, template filling, and reading: - DocumentService: unified business layer for create/query/download - Renderers: Word (Markdown->docx), Excel (Markdown/JSON->xlsx), PDF (Markdown->pdf with CJK font), Template (Jinja2 sandbox .docx fill) - DocumentLoader: read PDF/Word/Excel/Markdown/HTML/text -> Document - DocumentTool: Agent tool with action=create\|read - REST API: /api/v1/documents (create, upload-template, list, download) - Frontend: DocumentPanel, DocumentCard, documents Pinia store, chat store tool_result detection - Security: path traversal guard (Path.resolve + relative_to), SSTI guard (SandboxedEnvironment), API key auth, 50MB upload limit - Bug fixes: template path traversal (400 not 500), TemplateRenderer lazy-load (no external registration dependency) - Tests: 168 tests (unit + security + E2E F1/F2/F3 + bug hunt) - Docs: README section 17, requirements + plan + test-plan docs Requirements R1-R28 verified, F1-F3 user flows pass.	2026-06-23 15:05:01 +08:00
chiguyong	3efdaafb5f	docs: mark admin console plan as completed	2026-06-21 20:02:27 +08:00
chiguyong	ad65f7a8d7	feat(admin): U1+U2+U4 — schema v3, department service, context filtering U1: Bump _SCHEMA_VERSION to 3, add 5 department tables (departments, user_departments, department_skill_bindings, department_kb_bindings, department_quotas) + 5 ORM models + helpers. U2: DepartmentService (12 async methods: CRUD + bind/unbind skill/KB + count_users). Mount admin_router in app.py. 36 unit + 28 integration tests. U4: DepartmentContext FastAPI dependency (per-route, admin bypasses filtering). filter_skills_by_department / filter_kb_sources_by_department helpers. Applied to GET /skills and GET /kb-management/* routes. 15 integration tests for department isolation. Also includes brainstorm + plan docs. 108 new tests, all pass.	2026-06-21 15:03:27 +08:00
chiguyong	67c0d67262	fix(auth,chat): P0 security fixes + stop-generation button + doc sync U1: whoami cold-start security — add is_active check (disabled users now get 401, not 200) and replace create_token_pair with create_access_token to avoid minting a discarded refresh token (token-amplification risk). U2: list_active_by_provider now filters expired sessions (expires_at > now) matching its docstring promise; previously only checked revoked = 0. U3: Fix asyncio.run() crash in test_revoke_other_user_session_returns_404 (converted to async). Add U1/U2 verification tests (disabled-user whoami, no-refresh-leak, expired-session filtering, provider filtering) and strengthen admin route tests (404 boundary, non-admin 403 on /admin/sessions). U4: Update CLAUDE.md/AGENTS.md Request Flow — CostAwareRouter 3-layer diagram replaced with actual RequestPreprocessor architecture (@board/@team prefix intercepts then @skill: prefix then trivial-input regex then default REACT). ExecutionMode list expanded to all 7 values. U5: Frontend stop-generation button — ChatInput.vue shows a stop button when isGenerating is true; chat store gains stopGeneration() that sends {type:"cancel"} over WebSocket (backend portal.py already handles cancel). Tests: 120 auth tests pass (unit + integration). ruff clean. vue-tsc clean.	2026-06-21 11:36:58 +08:00
chiguyong	54955aab50	plan: 计划审查修订 + AuthProvider 抽象层设计 - 修复 U1 (Schema): 澄清不使用 Alembic，采用 _SCHEMA_SQL + init_auth_db()，新增 user_sessions → auth_sessions 一次性数据回填 - 修复 U4 (Routes): whoami 端点添加到中间件白名单并实现自主认证，明确 get_current_session / load_user / user_to_response 等函数定义 - 新增 AuthProvider 抽象层：Protocol 接口、LocalAuthProvider、StubOIDCProvider 及依赖注入工厂，支持未来对接集团 IdP - 新增 AE-10 (Provider 切换) + AE-11 (审计字段) 验收用例 - 更新 Component Map，添加 AuthProvider 相关组件	2026-06-21 00:21:52 +08:00
TraeAI	3d1cad4710	plan: 集中鉴权与 Token 持久化实施计划 10 个实施单元，分 5 个阶段： - Phase 1 (U1-U3): 后端 Schema / JWT sid / SessionService + reuse 检测 - Phase 2 (U4, U10): 新端点 + 向后兼容 shim - Phase 3 (U5, U6): Tauri keyring + 前端 adapter - Phase 4 (U7-U9): auth store 重构 + 登录/Settings/Admin UI - Phase 5: 30 天后清理 legacy path 验收 9 条端到端 AE 覆盖 F1-F12 / N5 / N6。	2026-06-20 23:48:58 +08:00
TraeAI	df8a995ec4	docs: 集中鉴权与 Token 持久化需求文档覆盖 A+B+C 一次到位方案： - A 当前实现加固（refresh 轮换、记住我、预刷新、启动三态） - B Tauri OS Keychain 集成（keyring crate 跨 macOS/Win/Linux） - C 服务端 Session 表（滑动过期、踢出、改密码强踢、reuse 检测） Out of scope: 企业 IdP / SSO / 2FA / 多租户（后续单独 brainstorm）	2026-06-20 23:42:34 +08:00
chiguyong	cac9c73dd5	fix(routing): U1-U6 路由优化 + 修复方案 + 代码审查修复实现 6 个修复单元（U1-U6）并应用 ce-code-review 发现的 5 项安全修复。 ## U1: benchmark 超时阈值 - 按 difficulty 分级超时：easy=45s, medium=60s, hard=90s - 替换原单一 60s 硬编码 ## U2: OpenAICompatibleProvider httpx 超时 - 新增 timeout 参数（默认 120s），替换硬编码 60s - ProviderConfig.timeout 透传到 Provider - 新增 2 项单元测试 ## U3: 激活 QualityGate skill_match 校验 - BaseAgent._build_skill_context() 构造 skill_context - 在 base.py / tasks.py / runner.py 三处传入 QualityGate.validate() ## U4: 添加 disambiguation_keywords 字段 - IntentConfig 新增 disambiguation_keywords 字段 - 8 个 skill YAML 补充该字段 ## U5: 优化 RequestPreprocessor 路由正则 - 拆分 _FACTUAL_RE 为 CN/EN 双正则（中文无空格） - 新增 _MATH_RE / _TRANSLATION_RE 纯模式 - _TOOL_CONTEXT_RE 排除需要工具的实时查询 - 多行输入守卫 + 结尾标点支持 - 新增 21 项单元测试（共 40 项全通过） ## U6: 重新基准测试 - 真实 LLM benchmark：准确率 60% -> 93.3% - 4/5 通过，p50=40.8s，一致性=100% - 旧基线备份至 baseline_2026-06-17_old_arch.json ## ce-code-review 修复（5 项） - 修复 \s 字符类匹配换行符的安全隐患 - 添加事实/数学正则的结尾标点支持 - 修复 geo_optimizer.yaml 关键词重复 - 修复 _login_with_retry 不可达 return - 修复 real_llm_server fixture stderr_fh 资源泄漏测试：tests/unit/chat/ 63 项全通过，ruff 检查通过。	2026-06-20 19:31:49 +08:00
chiguyong	91f56ca663	feat: 企业级客户端-服务端架构 + 代码审查修复 ## 主要变更 ### 新增功能 - 企业级客户端-服务端架构（JWT 认证 + RBAC 权限 + 终端安全） - Tauri 桌面客户端与服务端配置同步 - 远程 LLM 网关（RemoteLLMProvider，支持 401 token 刷新重试） - 服务端终端 WebSocket（带管理员审批流程） - 终端白名单六层防御（黑名单 → shell 操作符检测 → 内置安全 → 全局/用户/会话白名单 → 危险检测） ### 代码审查修复（P0/P1/P2） - P0: 危险二进制（rm/docker 等）不再加入白名单，compute_whitelist_entry 返回 None - P1: 终端审批所有权追踪（_approval_owners dict）+ 会话清理防泄漏 - P1: 本地终端 WebSocket URL 补齐 JWT token - P1: 审计日志支持 terminal_mode 过滤 - P1: /system/resources 端点强制 SYSTEM_CONFIG 权限 - P1: RemoteLLMProvider 增加 401 token 刷新重试机制 - P1: auth/models.py 使用 Mapping[str, object] 替代 Any 类型 - P2: 终端授权依赖检查 is_active 账户状态 - 修复 app.py 未使用的 APIKeyAuthMiddleware 导入 ### 文档更新 - README.md: 新增第 16 章「企业级客户端-服务端架构」 - AGENTS.md / CLAUDE.md: 同步模块映射、路由表、前端页面 - 计划文档标记为 completed Closes: docs/plans/2026-06-19-003-feat-enterprise-client-server-evolution-plan.md	2026-06-20 06:48:18 +08:00

1 2

77 Commits