Models: 54

Active filters: open4bits

Open4bits/LFM2.5-1.2B-Base-Quantized

Text Generation • Updated Feb 2

Open4bits/granite-4.0-h-micro-quantized

Text Generation • Updated Feb 2

Open4bits/gemma-3-270m-it-gguf

Text Generation • 0.3B • Updated Feb 19 • 169

Open4bits/Qwen3-0.6b-gguf

Text Generation • 0.8B • Updated Jan 31 • 345

Open4bits/whisper-tiny-f16

Automatic Speech Recognition • 37.8M • Updated Jan 31 • 2

Open4bits/whisper-base-f16

Automatic Speech Recognition • 72.6M • Updated Jan 31 • 2

Open4bits/llama-3.2-1b-onnx

Text Generation • Updated Feb 2 • 3

Open4bits/Qwen3-0.6B-onnx

Text Generation • Updated Feb 2 • 2

Open4bits/llama3.2-1b-gguf

Text Generation • 1B • Updated Feb 4 • 31

Open4bits/EXAONE-4.0-1.2B-gguf

Text Generation • 1B • Updated Feb 5 • 226

Open4bits/Ministral-3-3B-Base-2512-gguf

Image-Text-to-Text • 3B • Updated Feb 6 • 175

Open4bits/Schematron-3B-gguf

Text Generation • 3B • Updated Feb 6 • 98

Open4bits/granite-4.0-micro-mlx-3Bit

Text Generation • 0.4B • Updated Feb 11 • 59

Open4bits/gpt-oss-120b-mlx-2Bit

Text Generation • 117B • Updated Feb 9 • 61 • 1

Open4bits/gpt-oss-20b-mlx-2Bit

Text Generation • 21B • Updated Feb 9 • 23

Open4bits/EXAONE-4.0-1.2B-mlx-fp16

Text Generation • 1B • Updated Feb 10 • 31

Open4bits/Qwen3-Coder-Next-mlx-2Bit

Text Generation • 80B • Updated Feb 10 • 169

Open4bits/Soprano-1.1-80M-mlx-fp16

Text-to-Speech • 79.7M • Updated Feb 10 • 2

Open4bits/granite-4.0-micro-mlx-fp16

Text Generation • 3B • Updated Feb 10 • 6

Open4bits/gpt-oss-20b-mlx-fp16

Text Generation • 21B • Updated Feb 10 • 9

Open4bits/sarvam-1-GGUF

Text Generation • 3B • Updated Feb 28 • 100 • 1

Open4bits/granite-4.0-h-tiny-mlx-fp16

Text Generation • 7B • Updated Feb 11 • 28 • 1

Open4bits/Llama-3.2-3B-GGUF

Text Generation • 3B • Updated Feb 12 • 45

Open4bits/Ministral-3-3B-Base-2512-mlx-mxfp4

3B • Updated Feb 11 • 9

Open4bits/DeepSeek-R1-mlx-2Bit

Text Generation • 671B • Updated Feb 11 • 110 • 2

Open4bits/Ministral-3-3B-Base-2512-mlx-fp16

3B • Updated Feb 11 • 21

Open4bits/Olmo-3.1-32B-Think-mlx-2Bit

Text Generation • 32B • Updated Feb 11 • 11

Open4bits/Qwen2.5-Omni-7B-GGUF

Any-to-Any • 8B • Updated Feb 12 • 16

Open4bits/MiniMax-M2-GGUF

Text Generation • 229B • Updated Feb 13 • 8

Open4bits/Qwen3-14B-Base-mlx-fp16

Text Generation • 15B • Updated Feb 14 • 11