Active filters: 8-bit
GadflyII/GLM-4.7-Flash-NVFP4 • Text Generation • 18B • Updated • 172k • 38
Text Generation • 120B • Updated • 2.99M • 4.38k
Text Generation • 22B • Updated • 6.65M • 4.24k
mlx-community/GLM-4.7-Flash-8bit • Text Generation • 30B • Updated • 4.88k • 15
mlx-community/Qwen3-TTS-12Hz-0.6B-CustomVoice-8bit • Text-to-Speech • 0.5B • Updated • 1.14k • 8
microsoft/bitnet-b1.58-2B-4T • Text Generation • 0.8B • Updated • 5.95k • 1.26k
MultiverseComputingCAI/HyperNova-60B • Text Generation • 60B • Updated • 1.45k • 48
mlx-community/GLM-4.7-Flash-8bit-gs32 • Text Generation • 30B • Updated • 437 • 5
openai/gpt-oss-safeguard-20b • Text Generation • 22B • Updated • 12.8k • 182
AlicanKiraz0/Mihenk-LLM-14B-Turkish-Financial-Model-mlx-8Bit • 15B • Updated • 29 • 7
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4 • Text Generation • 16B • Updated • 3.82k • 6
Text Generation • 177B • Updated • 4.97k • 10
nvidia/DeepSeek-V3.2-NVFP4 • Text Generation • 394B • Updated • 1.07k • 3
LiquidAI/LFM2.5-1.2B-Thinking-MLX-8bit • Text Generation • 0.3B • Updated • 179 • 3
lmstudio-community/GLM-4.7-Flash-MLX-8bit • Text Generation • 30B • Updated • 292k • 3
mlx-community/Qwen3-TTS-12Hz-1.7B-VoiceDesign-8bit • Text-to-Speech • 0.8B • Updated • 754 • 3
ragraph-ai/stable-cypher-instruct-3b • Text Generation • 3B • Updated • 361 • 31
MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF • Text Generation • 2B • Updated • 144k • 9
tiiuae/Falcon-E-3B-Instruct • Text Generation • 0.9B • Updated • 291 • 36
MaziyarPanahi/Qwen3-1.7B-GGUF • Text Generation • 2B • Updated • 220k • 6
drwlf/medgemma-4b-it-abliterated • Text Generation • Updated • 17 • 6
nvidia/Qwen3-30B-A3B-NVFP4 • Text Generation • 16B • Updated • 32.5k • 21
Text Generation • 5B • Updated • 6.45k • 12
GY2233/Qwen2.5-32B-NVFP4A16 • Text Generation • 19B • Updated • 2
FabioSarracino/VibeVoice-Large-Q8 • Text-to-Audio • 9B • Updated • 2.71k • 78
mlx-community/DeepSeek-OCR-8bit • Image-Text-to-Text • 1B • Updated • 1.37k • 30
ig1/Qwen3-VL-30B-A3B-Instruct-NVFP4 • Image-Text-to-Text • 18B • Updated • 2.2k • 5
Firworks/NVIDIA-Nemotron-3-Nano-30B-A3B-nvfp4 • 18B • Updated • 2.08k • 7
mlx-community/GLM-4.7-8bit • Text Generation • 353B • Updated • 1.19k • 4
Tengyunw/MiniMax-M2.1-NVFP4 • Text Generation • 115B • Updated • 187 • 6