AgentRouter / README.md
ENTUM-AI's picture
Upload AgentRouter
0355487 verified
---
language:
- en
license: apache-2.0
library_name: transformers
tags:
- text-classification
- intent-classification
- query-routing
- agent
- llm-router
pipeline_tag: text-classification
---
# ⚑ AgentRouter
Ultra-fast intent classification for LLM query routing. Classifies user queries into 10 intent categories in **<5ms** on GPU.
Built on [MiniLM](https://huggingface.co/microsoft/MiniLM-L12-H384-uncased) (33M params) β€” small enough for CPU inference, fast enough for real-time routing.
## πŸš€ Usage
```python
from transformers import pipeline
router = pipeline("text-classification", model="ENTUM-AI/AgentRouter")
router("Write a Python function to sort a list")
# [{'label': 'code_generation', 'score': 0.98}]
router("Why am I getting a TypeError?")
# [{'label': 'code_debugging', 'score': 0.97}]
router("Translate hello to Spanish")
# [{'label': 'translation', 'score': 0.99}]
router("What is quantum computing?")
# [{'label': 'information_retrieval', 'score': 0.96}]
```
## 🏷️ Intent Classes
| Intent | Description | Suggested Tools |
|--------|-------------|----------------|
| `code_generation` | Write new code | code_interpreter, file_editor |
| `code_debugging` | Fix bugs and errors | code_interpreter, debugger |
| `math_reasoning` | Solve math problems | calculator, wolfram_alpha |
| `creative_writing` | Write stories, poems, essays | β€” |
| `summarization` | Summarize text | file_reader |
| `translation` | Translate between languages | translator |
| `information_retrieval` | Answer questions, explain topics | knowledge_base |
| `data_analysis` | Analyze data, create charts | code_interpreter, data_visualizer |
| `web_search` | Search the web for current info | web_browser, search_engine |
| `general_chat` | Casual conversation | β€” |
## πŸ” Use Cases
- **LLM routing** β€” route queries to specialized models or tools
- **Agent frameworks** β€” decide which tool to invoke
- **Cost optimization** β€” use cheap models for simple intents, expensive for complex
- **Latency optimization** β€” skip heavy pipelines for general chat
## ⚠️ Limitations
- English only
- 10 fixed intent categories