Inference Providers
Active filters: 4bit
mlx-community/Qwen3.5-9B-MLX-4bit
Image-Text-to-Text
• 2B • Updated • 116k
• 81
mlx-community/Qwen3.5-9B-OptiQ-4bit
Text Generation
• 9B • Updated • 10.9k
• 25
mlx-community/Qwen3.5-0.8B-OptiQ-4bit
Text Generation
• 0.2B • Updated • 4.75k
• 11
mlx-community/Qwen3.5-4B-MLX-4bit
1.0B • Updated • 32.4k
• 16
livadies/gemma-4-31B-Ghetto-NF4
Image-to-Text
• 32B • Updated • 336
• 4
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
• 14B • Updated • 212k
• 23
mlx-community/Qwen2-Audio-7B-Instruct-4bit
Audio-Text-to-Text
• Updated • 187
• 2
unsloth/Z-Image-Turbo-unsloth-bnb-4bit
Text-to-Image
• Updated • 534
• 5
manu02/Octen-Embedding-8B-bnb-4bit-nf4-dq
Text Generation
• 8B • Updated • 132
• 3
ahoybrotherbear/MiniMax-M2.5-4bit-MLX
Text Generation
• 229B • Updated • 210
• 1
EricRollei/HunyuanImage-3.0-Instruct-Distil-NF4-v2
Text-to-Image
• 83B • Updated • 733
• 9
Sepolian/Huihui-Qwen3.5-27B-Claude-4.6-Opus-abliterated-Q4_K_M
Text Generation
• 27B • Updated • 3.23k
• 14
groxaxo/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-heretic-v2-AutoRound-W4A16
Image-Text-to-Text
• 3B • Updated • 712
• 2
Cabdi1/Qwen3.5-9B-MLX-4bit
2B • Updated • 125
• 1
empero-ai/openNemo-Cascade-2-30B-A3B
Text Generation
• 32B • Updated • 2.03k
• 5
Image-Text-to-Text
• 3B • Updated • 807
• 1
mconcat/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-AWQ-4bit
Text Generation
• 29B • Updated • 1.41k
• 1
CyberYui/Codestral-22B-Yui-MLX
Text Generation
• 22B • Updated • 365
• 1
3amthoughts/DeepLink-R1-GGUF
Text Generation
• 8B • Updated • 1
Chun121/Qwen3-4B-RPG-Roleplay-V2
Text Generation
• 4B • Updated • 10.1k
• 46
mayaeary/pygmalion-6b-4bit-128g
Text Generation
• Updated • 25
• 40
mayaeary/pygmalion-6b_dev-4bit-128g
Text Generation
• Updated • 16
• 121
mayaeary/PPO_Pygway-V8p4_Dev-6b-4bit-128g
Text Generation
• Updated • 11
• 2
mayaeary/PPO_Pygway-6b-Mix-4bit-128g
Text Generation
• Updated • 5
• 2
Ancestral/Dolly_Shygmalion-6b-4bit-128g
Text Generation
• Updated • 6
• 5
Ancestral/PPO_Shygmalion-6b-4bit-128g
Text Generation
• Updated • 34
Ancestral/Dolly_Malion-6b-4bit-128g
Text Generation
• Updated • 4
• 1
4bit/pygmalion-6b-4bit-128g
Text Generation
• Updated • 7
• 3
Text Generation
• Updated • 8
• 1
seonglae/opt-125m-4bit-gptq
Text Generation
• Updated • 20