| <!DOCTYPE html> |
| <html lang="en"> |
| <head> |
| <meta charset="UTF-8"> |
| <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| <title>dispatchAI Inference API</title> |
| <style> |
| body { font-family: system-ui, sans-serif; max-width: 900px; margin: 0 auto; padding: 40px; background: #0A0F1A; color: #F5F7FA; } |
| h1 { color: #2E6BFF; } |
| h2 { color: #1FE0E6; margin-top: 2em; } |
| code { background: #1a2332; padding: 2px 6px; border-radius: 4px; color: #1FE0E6; } |
| pre { background: #1a2332; padding: 20px; border-radius: 8px; overflow-x: auto; } |
| .badge { display: inline-block; padding: 4px 12px; border-radius: 12px; font-size: 0.85em; font-weight: bold; } |
| .live { background: #4ade80; color: #0A0F1A; } |
| .price { background: #2E6BFF; color: white; } |
| table { border-collapse: collapse; width: 100%; margin: 1em 0; } |
| th, td { border: 1px solid #2a3a4a; padding: 8px 12px; text-align: left; } |
| th { background: #1a2332; color: #2E6BFF; } |
| </style> |
| </head> |
| <body> |
| <h1>π dispatchAI Inference API</h1> |
| <p>Small. Mobile. Free. UAE-built. Mobile-optimized LLM inference on real Snapdragon hardware.</p> |
| <p> |
| <span class="badge live">β
LIVE</span> |
| <span class="badge price">$0.001/1K tokens</span> |
| <span class="badge" style="background:#1FE0E6;color:#0A0F1A">10x cheaper than OpenAI</span> |
| </p> |
| <h2>Base URL</h2> |
| <code>https://api.dispatchai.ai/v1</code> |
| <h2>Quick Start</h2> |
| <pre>import openai |
| client = openai.OpenAI(base_url="https://api.dispatchai.ai/v1", api_key="da-demo-key-0001") |
| response = client.chat.completions.create( |
| model="dispatchAI/SmolLM2-135M-Instruct-mobile", |
| messages=[{"role": "user", "content": "What is the capital of France?"}] |
| ) |
| print(response.choices[0].message.content) |
| # β "The capital of France is Paris."</pre> |
| <h2>Available Models</h2> |
| <table> |
| <tr><th>Model</th><th>Size</th><th>Phone Speed</th></tr> |
| <tr><td>dispatchAI/SmolLM2-135M-Instruct-mobile</td><td>101MB</td><td>46 t/s</td></tr> |
| <tr><td>dispatchAI/Qwen2.5-0.5B-Instruct-mobile-int4</td><td>469MB</td><td>23 t/s</td></tr> |
| <tr><td>dispatchAI/Llama-3.2-1B-Instruct-Q4-mobile</td><td>770MB</td><td>5.4 t/s</td></tr> |
| </table> |
| <h2>Pricing</h2> |
| <table> |
| <tr><th>Type</th><th>Price</th></tr> |
| <tr><td>Input</td><td>$0.001/1K tokens</td></tr> |
| <tr><td>Output</td><td>$0.002/1K tokens</td></tr> |
| </table> |
| <hr> |
| <p>Dispatch AI (FZE) β Sharjah Free Zone, UAE β License No. 10818</p> |
| </body> |
| </html> |
|
|