Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

288

Full-text search

Active filters: ollama

vito95311/Qwen3-Omni-30B-A3B-Thinking-GGUF-INT8FP16

Text Generation • 16B • Updated Sep 28, 2025 • 462 • 18

DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters

Updated Jul 27, 2025 • 152

bartowski/Llama-SmolTalk-3.2-1B-Instruct-GGUF

Text Generation • 1B • Updated Nov 26, 2024 • 311 • 2

Chemin-AI/malaysian-Llama-3.2-3B-Instruct-gguf

4B • Updated Nov 27, 2024 • 226 • 1

fibonacciai/RealRobot-Chatbot-Ecommerce-Robot-Fibonacci-Nano-llm

Question Answering • 4B • Updated Dec 2, 2025 • 1.51k • 10

sylvester-francis/typescript-slm-7b-reasoning-full

Text Generation • 8B • Updated Nov 30, 2025 • 129 • 1

amihai4by/logic-reasoner-v2

Text Generation • 8B • Updated 1 day ago • 27 • 1

pacozaa/mistral-unsloth-chatml-first

4B • Updated Apr 14, 2024 • 23

pacozaa/tinyllama-alpaca-lora

Updated Apr 14, 2024 • 1

pacozaa/bonito-gguf

7B • Updated Apr 14, 2024 • 3

pacozaa/TinyLlama-1.1B-intermediate-step-1431k-3T-GGUF

1B • Updated Apr 19, 2024 • 15

pacozaa/mistral-sharegpt90k

Updated Aug 2, 2024

pacozaa/mistral-sharegpt90k-merged_16bit

Text Generation • 7B • Updated Jul 30, 2024

TrabEsrever/dolphin-2.9-llama3-70b-GGUF

Updated Apr 29, 2024

daekeun-ml/Phi-3-medium-4k-instruct-ko-poc-gguf-v0.1

Text Generation • 14B • Updated May 26, 2024 • 19 • 1

hierholzer/Llama-3.1-70B-Instruct-GGUF

Text Generation • 71B • Updated Dec 11, 2024 • 343 • 3

LucasInsight/Meta-Llama-3.1-8B-Instruct

8B • Updated Aug 20, 2024 • 20 • 1

LucasInsight/Meta-Llama-3-8B-Instruct

8B • Updated Aug 20, 2024 • 59

Shyamnath/Llama-3.2-3b-Uncensored-GGUF

Text Generation • 4B • Updated Oct 21, 2024 • 130 • 4

ghost-x/ghost-8b-beta-1608-gguf

Text Generation • 8B • Updated Aug 26, 2024 • 115 • 6

cahaj/Phi-3.5-mini-instruct-text2sql-GGUF

4B • Updated Aug 29, 2024 • 17

Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python_Spanish_English_16bit

0.5B • Updated Sep 2, 2024

Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-extra_small_quantization_GGUF_3bit

0.5B • Updated Sep 2, 2024

Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python-Spanish_English_GGUF_4bit

0.5B • Updated Sep 2, 2024

Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-Spanish_English_GGUF_q5_k

0.5B • Updated Sep 2, 2024

Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-Spanish_English_GGUF_q6_k

0.5B • Updated Sep 2, 2024

Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python-GGUF_Spanish_English_8bit

0.5B • Updated Sep 2, 2024

Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python_English_GGUF_16bit

0.5B • Updated Sep 2, 2024 • 1

Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-Spanish_English_GGUF_32bit

Updated Sep 2, 2024

saberbx/XO

3B • Updated Jun 22, 2025 • 18