fischer-agentkit/tests/e2e
chiguyong 11e2009cb8 feat(router): improve colloquial/mixed-lang routing, fix low-complexity IntentRouter bypass
Key improvements:
- Low-complexity queries (complexity < 0.3) now try an IntentRouter keyword match
  before falling back to DIRECT_CHAT, which fixes the 0% F1 on keyword_match
- SemanticRouter similarity_low lowered from 0.6 to 0.4
- Short text (<20 chars) uses effective_low = max(0.25, low - 0.15)
- Short text with no semantic match forces LLM classify fallback
- Added colloquial keywords to 7 skill YAMLs
- Fixed code_reviewer.yaml output_schema placement
- Fixed SemanticRouter build in e2e tests
- Fixed base_url detection for bailian-coding API keys
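
The threshold and fallback rules above can be sketched roughly as follows. This is an illustrative sketch, not the repository's code: the function names (effective_low, route_low_complexity, needs_llm_fallback), the keyword_match callable, and the reading of "no semantic match" as "best score below the effective lower bound" are all assumptions. Only the constants (0.3, 0.4, 20, 0.25, 0.15) and the DIRECT_CHAT label come from the commit message.

```python
from typing import Callable, Optional

LOW_COMPLEXITY = 0.3     # queries below this count as low-complexity
SIMILARITY_LOW = 0.4     # SemanticRouter lower bound (previously 0.6)
SHORT_TEXT_CHARS = 20    # texts shorter than this count as "short"


def effective_low(text: str, low: float = SIMILARITY_LOW) -> float:
    """Relax the lower similarity bound for short text, with a floor of 0.25."""
    if len(text) < SHORT_TEXT_CHARS:
        return max(0.25, low - 0.15)
    return low


def route_low_complexity(
    text: str,
    complexity: float,
    keyword_match: Callable[[str], Optional[str]],  # hypothetical stand-in for IntentRouter
) -> Optional[str]:
    """Try a keyword match first; fall back to DIRECT_CHAT only if it misses.

    Returns None when the query is not low-complexity, so the caller can
    continue with whatever routing it would otherwise do.
    """
    if complexity >= LOW_COMPLEXITY:
        return None
    skill = keyword_match(text)
    return skill if skill is not None else "DIRECT_CHAT"


def needs_llm_fallback(text: str, best_score: float, low: float = SIMILARITY_LOW) -> bool:
    """Short text whose best semantic score misses the relaxed bound goes to the LLM."""
    is_short = len(text) < SHORT_TEXT_CHARS
    return is_short and best_score < effective_low(text, low)
```

With the new default of 0.4, short text ends up at the 0.25 floor; with the old 0.6 it would have used roughly 0.45, which is one plausible reason short colloquial queries were previously missed.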

Results: keyword_match F1 0% -> 60.87%, colloquial F1 0% -> 100%, mixed_lang F1 0% -> 100%
2026-06-15 23:54:57 +08:00
__init__.py feat(router): optimize routing intelligence — ExecutionMode expansion, multi-candidate scoring, quality gate skill match 2026-06-15 22:43:13 +08:00
benchmark_dataset.py feat(router): enable SemanticRouter + upgrade benchmark to L3/L5 2026-06-15 23:02:47 +08:00
benchmark_generator.py feat(router): optimize routing intelligence — ExecutionMode expansion, multi-candidate scoring, quality gate skill match 2026-06-15 22:43:13 +08:00
capability_metrics.py feat(router): enable SemanticRouter + upgrade benchmark to L3/L5 2026-06-15 23:02:47 +08:00
conftest.py feat(router): enable SemanticRouter + upgrade benchmark to L3/L5 2026-06-15 23:02:47 +08:00
test_basic_api.py feat(router): optimize routing intelligence — ExecutionMode expansion, multi-candidate scoring, quality gate skill match 2026-06-15 22:43:13 +08:00
test_basic_cli.py feat(router): optimize routing intelligence — ExecutionMode expansion, multi-candidate scoring, quality gate skill match 2026-06-15 22:43:13 +08:00
test_basic_websocket.py feat(router): optimize routing intelligence — ExecutionMode expansion, multi-candidate scoring, quality gate skill match 2026-06-15 22:43:13 +08:00
test_capability_alignment.py feat(router): optimize routing intelligence — ExecutionMode expansion, multi-candidate scoring, quality gate skill match 2026-06-15 22:43:13 +08:00
test_capability_react.py feat(router): optimize routing intelligence — ExecutionMode expansion, multi-candidate scoring, quality gate skill match 2026-06-15 22:43:13 +08:00
test_capability_router_direct.py feat(router): improve colloquial/mixed-lang routing, fix low-complexity IntentRouter bypass 2026-06-15 23:54:57 +08:00
test_capability_routing.py feat(router): optimize routing intelligence — ExecutionMode expansion, multi-candidate scoring, quality gate skill match 2026-06-15 22:43:13 +08:00
test_capability_team.py feat(router): optimize routing intelligence — ExecutionMode expansion, multi-candidate scoring, quality gate skill match 2026-06-15 22:43:13 +08:00