Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

efficient-inference

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

118

Base only

Active filters: efficient-inference

owensong/Inflect-Nano-v1

Text-to-Speech • Updated about 2 hours ago • 182

Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash-GGUF

Image-Text-to-Text • 9B • Updated May 2 • 375k • 236

Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash

Image-Text-to-Text • 10B • Updated May 2 • 3.1k • 35

Qapdex/SLM750-Edge-1.58-bit

Text Generation • 1B • Updated about 1 hour ago • 226 • 1

vhab10/llama_3.1_8b_Q4_K_M-gguf

Text Generation • 8B • Updated Oct 6, 2024 • 10

saytes/SoT_DistilBERT

Text Classification • 67M • Updated Mar 11, 2025 • 490 • 7

stiger1000/TC-MoE

Text Generation • 2B • Updated Jul 25, 2025 • 14 • 1

agentlans/Qwen3-4B-multilingual-sft-GGUF

Text Generation • 4B • Updated Jun 29, 2025 • 18

sudeshmu/fine_tune

Text Generation • Updated Aug 28, 2025 • 297 • 9

weathermanj/Nemotron-nano-9b-fp8

Text Generation • 9B • Updated Aug 29, 2025 • 40 • 6

jackal79/gpt2-ibce-lowrank-192

Text Generation • Updated Sep 19, 2025 • 4

huawei-csl/Qwen3-1.7B-3bit-SINQ

Text Generation • 0.5B • Updated Feb 2 • 6 • 7

huawei-csl/Qwen3-1.7B-3bit-ASINQ

Text Generation • 0.5B • Updated Feb 2 • 7 • 7

huawei-csl/Qwen3-14B-3bit-SINQ

Text Generation • 3B • Updated Feb 2 • 13 • 5

huawei-csl/Qwen3-14B-3bit-ASINQ

Text Generation • 3B • Updated Feb 2 • 5 • 5

huawei-csl/Qwen3-32B-3bit-SINQ

Text Generation • 6B • Updated Feb 2 • 7 • 6

huawei-csl/Qwen3-32B-3bit-ASINQ

Text Generation • 6B • Updated Feb 2 • 5 • 5

huawei-csl/Qwen3-1.7B-4bit-SINQ

Text Generation • 1B • Updated Feb 2 • 3 • 5

huawei-csl/Qwen3-1.7B-4bit-ASINQ

Text Generation • 1B • Updated Feb 2 • 5 • 5

huawei-csl/Qwen3-32B-4bit-SINQ

Text Generation • 18B • Updated Feb 2 • 6 • 7

huawei-csl/Qwen3-14B-4bit-SINQ

Text Generation • 9B • Updated Feb 2 • 4 • 5

huawei-csl/Qwen3-14B-4bit-ASINQ

Text Generation • 9B • Updated Feb 2 • 6 • 6

huawei-csl/Qwen3-32B-4bit-ASINQ

Text Generation • 18B • Updated Feb 2 • 7 • 8

huawei-csl/Qwen3-235B-A22B-3bit-SINQ

Text Generation • Updated Feb 2 • 7 • 2

huawei-csl/Apertus-8B-2509-4bit-SINQ

Text Generation • 5B • Updated Feb 2 • 6 • 2

huawei-csl/Apertus-8B-2509-4bit-ASINQ

Text Generation • 5B • Updated Feb 2 • 303 • 3

huawei-csl/Kimi-Linear-48B-A3B-Instruct-4bit-SINQ

Text Generation • 27B • Updated Feb 2 • 7 • 3

huawei-csl/Qwen3-Next-80B-A3B-Instruct-4bit-SINQ

Text Generation • Updated Feb 2 • 24 • 2

huawei-csl/Kimi-Linear-48B-A3B-Instruct-3bit-SINQ

Text Generation • 7B • Updated Feb 2 • 5 • 1

huawei-csl/Qwen3-Next-80B-A3B-Instruct-3bit-SINQ

Text Generation • Updated Feb 2 • 24 • 2