TurboQuant 4-bit mlx-lm models. TriAttention compatible. PR #1 merged MIT+NVIDIA.
-
deadbydawn101/gemma-4-E4B-mlx-4bit
Image-Text-to-Text • 2B • Updated • 993 • 6 -
deadbydawn101/gemma-4-E4B-Agentic-Opus-Reasoning-GeminiCLI-mlx-4bit
Text Generation • Updated • 11.2k • 20 -
deadbydawn101/gemma-4-E2B-Heretic-Uncensored-mlx-4bit
Image-Text-to-Text • 1B • Updated • 7.95k • 14 -
deadbydawn101/gemma-4-21b-REAP-Tool-Calling-mlx-4bit
Image-Text-to-Text • 4B • Updated • 1.24k • 4