marksverdhei/Qwen3-Voice-Embedding-12Hz-1.7B Feature Extraction • Updated 2 days ago • 184 • 13
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 5 days ago • 407
stepfun-ai/Step-3.5-Flash-GGUF-Q4_K_S Text Generation • 197B • Updated 12 days ago • 18.4k • 132
bartowski/allura-forge_Llama-3.3-8B-Instruct-GGUF Text Generation • 8B • Updated Dec 30, 2025 • 6.33k • 27
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 28 items • Updated 6 days ago • 119