fischer-agentkit

Commit Graph

Author	SHA1	Message	Date
chiguyong	cac9c73dd5	fix(routing): U1-U6 路由优化 + 修复方案 + 代码审查修复实现 6 个修复单元（U1-U6）并应用 ce-code-review 发现的 5 项安全修复。 ## U1: benchmark 超时阈值 - 按 difficulty 分级超时：easy=45s, medium=60s, hard=90s - 替换原单一 60s 硬编码 ## U2: OpenAICompatibleProvider httpx 超时 - 新增 timeout 参数（默认 120s），替换硬编码 60s - ProviderConfig.timeout 透传到 Provider - 新增 2 项单元测试 ## U3: 激活 QualityGate skill_match 校验 - BaseAgent._build_skill_context() 构造 skill_context - 在 base.py / tasks.py / runner.py 三处传入 QualityGate.validate() ## U4: 添加 disambiguation_keywords 字段 - IntentConfig 新增 disambiguation_keywords 字段 - 8 个 skill YAML 补充该字段 ## U5: 优化 RequestPreprocessor 路由正则 - 拆分 _FACTUAL_RE 为 CN/EN 双正则（中文无空格） - 新增 _MATH_RE / _TRANSLATION_RE 纯模式 - _TOOL_CONTEXT_RE 排除需要工具的实时查询 - 多行输入守卫 + 结尾标点支持 - 新增 21 项单元测试（共 40 项全通过） ## U6: 重新基准测试 - 真实 LLM benchmark：准确率 60% -> 93.3% - 4/5 通过，p50=40.8s，一致性=100% - 旧基线备份至 baseline_2026-06-17_old_arch.json ## ce-code-review 修复（5 项） - 修复 \s 字符类匹配换行符的安全隐患 - 添加事实/数学正则的结尾标点支持 - 修复 geo_optimizer.yaml 关键词重复 - 修复 _login_with_retry 不可达 return - 修复 real_llm_server fixture stderr_fh 资源泄漏测试：tests/unit/chat/ 63 项全通过，ruff 检查通过。	2026-06-20 19:31:49 +08:00
chiguyong	a1318df420	feat: add LLM and GUI benchmark modes with real agent testing	2026-06-17 12:55:19 +08:00
chiguyong	1fbfd9d132	refactor: standardize benchmark with industry methodology (P/R/F1, multi-run, baseline)	2026-06-17 12:01:34 +08:00
chiguyong	89a9534678	feat: add benchmark_runner skill for capability testing and report generation	2026-06-17 11:31:15 +08:00
chiguyong	11e2009cb8	feat(router): improve colloquial/mixed-lang routing, fix low-complexity IntentRouter bypass Key improvements: - Low-complexity queries (<0.3) now try IntentRouter keyword match before falling back to DIRECT_CHAT, fixing 0% F1 on keyword_match - SemanticRouter similarity_low lowered from 0.6 to 0.4 - Short text (<20 chars) uses effective_low = max(0.25, low - 0.15) - Short text with no semantic match forces LLM classify fallback - Added colloquial keywords to 7 skill YAMLs - Fixed code_reviewer.yaml output_schema placement - Fixed SemanticRouter build in e2e tests - Fixed base_url detection for bailian-coding API keys Results: keyword_match F1 0->60.87%, colloquial F1 0->100%, mixed_lang F1 0->100%	2026-06-15 23:54:57 +08:00
chiguyong	5ef08a3b30	fix(review): comprehensive P0-P2 code review fixes	2026-06-12 22:18:25 +08:00
chiguyong	6731d96c65	feat(configs): add code_reviewer skill and coding_harness pipeline - code_reviewer.yaml: Verifier Agent skill config for adversarial review with structured output schema for ReviewFeedback format - coding_harness.yaml: Example pipeline with adversarial loop develop → test → review (Worker↔Verifier) → archive	2026-06-12 09:38:37 +08:00
chiguyong	cc4c6fe346	fix: direct-mode agent falls through to default when task needs tools When IntentRouter matches a direct-mode agent (no tools), but the task content suggests tool needs (shell, search, file ops, etc.), the routing now falls through to the default agent which has full tool access. This fixes the issue where "帮我执行个命令" would be routed to direct_agent and fail because direct mode doesn't support tool calling. Also restored "你好" in direct_agent keywords since it's correctly handled now — greetings don't need tools, direct mode is fine.	2026-06-11 15:26:19 +08:00
chiguyong	52b7d6007d	fix: remove '你好' from direct_agent keywords so greetings route to default agent with tools	2026-06-11 14:49:59 +08:00
chiguyong	93bc7c4e3e	fix: change all agent YAML model from hardcoded provider to 'default' Hardcoded model names like 'openai/gpt-4o-mini' or 'anthropic/claude-sonnet' cause 'No provider available' errors when the specific provider isn't configured. Using 'default' lets the system pick the available provider automatically.	2026-06-11 14:19:26 +08:00
chiguyong	5b42487d8a	feat(core): add ReWOO, Plan-and-Execute, Reflexion execution engines Phase A of Multi-Agent Marketplace architecture: - ReWOOEngine: plan-all-then-execute pattern for parallel data fetch - PlanExecEngine: adapter wrapping GoalPlanner+PlanExecutor+PipelineReplanner - ReflexionEngine: ReAct + Evaluate + Reflect + Retry for high-precision tasks - SkillConfig: extend VALID_EXECUTION_MODES with rewoo/plan_exec/reflexion - ConfigDrivenAgent: add _handle_rewoo/_handle_plan_exec/_handle_reflexion routes - 5 professional agent YAML configs with layered model defaults - 107 unit tests passing	2026-06-10 17:08:48 +08:00
chiguyong	7874e875af	merge: integrate feat/agentkit-phase8-chat-adaptive (chat/gui commands + GUI mode) Restores agentkit chat, agentkit gui CLI commands, onboarding wizard, and GUI mode (AGENTKIT_GUI_MODE) with static file serving. Resolves merge conflicts in orchestrator.py, app.py, tools/__init__.py, shell.py.	2026-06-10 07:44:06 +08:00
chiguyong	b34f74f598	feat(phase6): implement end-to-end enterprise scenario validation (U15) - Add goal-driven agent skill config and pipeline config - Add 9 E2E integration tests covering all 7 capabilities: - SC1: Goal-driven SEO analysis (GoalPlanner→PlanExecutor→PlanChecker→ExperienceStore) - SC2: Knowledge Q&A with system operation (MultiSourceRAG) - SC3: Workflow with approval (WorkflowStore + approval node) - SC4: Self-evolution experience accumulation (ExperienceStore→PitfallDetector→PathOptimizer) - SC5: Parallel execution efficiency verification - SC6: Skill registry integration (capabilities, versions, health) - Cross-capability: Plan+Experience+Pitfall, Review+Experience, RAG+Workflow - All 2472 tests passing (9 integration + 2463 unit)	2026-06-10 01:38:28 +08:00
chiguyong	31bd3b126c	feat(phase8): chat adaptive enhancements, pipeline reflection, search tools upgrade - Enhanced chat CLI with adaptive mode and session management - Added pipeline reflection and schema extensions - Upgraded BaiduSearch and WebSearch tools with advanced capabilities - Expanded server routes for skills and chat - Added session store enhancements - New chat module and pipeline reflection support	2026-06-09 23:18:06 +08:00
chiguyong	2e547e345a	feat(geo): U4 GEO skill tool binding with BaiduSearch and E2E tests Add BaiduSearchTool (API mode + scraping fallback), bind tools to GEO skill YAML configs (baidu_search, web_crawl, schema_extract, schema_generate), extend geo_full_pipeline with generate_content and deai steps, add 36 E2E integration tests.	2026-06-07 17:25:37 +08:00
chiguyong	669ca604e5	feat(configs): add GEO AgentKit Server configuration - llm_config.yaml: DeepSeek + OpenAI-compatible providers with env var substitution - skills/ (8 YAML): citation_detector, content_generator, deai_agent, geo_optimizer, monitor, schema_advisor, competitor_analyzer, trend_agent - Added intent fields for content_generator, competitor_analyzer, trend_agent - Added quality_gate fields for content_generator, deai_agent, geo_optimizer - Updated custom_handler paths to configs.geo_handlers - geo_tools.py: 14 FunctionTools calling GEO Backend via HTTP - geo_handlers.py: 3 custom handlers (citation/monitor/schema) calling /internal/ API - geo_server.py: FastAPI factory with LLM Gateway, Tool Registry, Skill Registry	2026-06-05 23:25:14 +08:00

16 Commits