Oxlo.ai
company
AI & ML interests
None defined yet.
Recent Activity
Organization Card
Oxlo.ai - Flat Monthly Pricing for AI Inference
What is Oxlo.ai?
Oxlo.ai is a developer-first AI inference platform with request-based pricing. Unlike token-based providers, we charge a flat fee per API call - a 100-token prompt costs the same as a 50,000-token prompt.
💡 Why Oxlo.ai?
| Feature | Oxlo.ai | Token-Based Providers |
|---|---|---|
| Pricing model | Per request (flat) | Per token (variable) |
| Cost predictability | ✅ Fixed monthly bill | ❌ Scales with usage |
| Long-context cost | Same as short context | 10-100x more expensive |
| OpenAI SDK compatible | ✅ Drop-in replacement | Varies |
📦 Available Models (35+)
Text & Chat
- Qwen 3 32B, Llama 3.3 70B, DeepSeek R1 671B, DeepSeek V3.2
- Llama 4 Maverick 17B, GPT-OSS 120B, Mistral 7B, Gemma 3
Code
- Qwen 3 Coder 30B, DeepSeek Coder 33B
Vision
- Gemma 3 27B, Kimi K2.5, Kimi K2 Thinking
Image Generation
- Oxlo Image Pro, SDXL Lightning, Stable Diffusion 1.5, Flux 1.5
Audio
- Whisper Large v3, Whisper Turbo, Kokoro 82M TTS
Embeddings & Detection
- BGE-Large, E5-Large, YOLOv9, YOLOv11
🔗 Quick Start
from openai import OpenAI
client = OpenAI(
base_url="https://api.oxlo.ai/v1",
api_key="your-oxlo-api-key"
)
response = client.chat.completions.create(
model="qwen-3-32b",
messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)
🔗 Links
- Website
- Documentation
- Get Started
- GitHub
- Contact: hello@oxlo.ai
models 0
None public yet
datasets 0
None public yet