RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation • 8B • Updated 12 days ago • 26.8k • 9
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3 Text Generation • 1.0B • Updated Dec 19, 2025 • 20.8k • 2
Quantization Formats & CUDA Compute Capability Support • Running • 1