RedHatAI/granite-3.1-2b-instruct-quantized.w4a16 Text Generation • 0.5B • Updated Feb 28, 2025 • 81.2k
RedHatAI/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w4a16 Text Generation • 2B • Updated Feb 27, 2025 • 150 • 1
RedHatAI/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w8a8 Text Generation • 2B • Updated Feb 27, 2025 • 2.33k • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-7B-quantized.w4a16 Text Generation • 8B • Updated Feb 27, 2025 • 122 • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-7B-quantized.w8a8 Text Generation • 8B • Updated Feb 27, 2025 • 2.01k • 5
RedHatAI/DeepSeek-R1-Distill-Qwen-7B-FP8-dynamic Text Generation • 8B • Updated Feb 27, 2025 • 346 • 1
RedHatAI/DeepSeek-R1-Distill-Qwen-14B-quantized.w4a16 Text Generation • 15B • Updated Feb 27, 2025 • 365 • 1
RedHatAI/DeepSeek-R1-Distill-Qwen-14B-quantized.w8a8 Text Generation • 15B • Updated Feb 27, 2025 • 1.75k • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-14B-FP8-dynamic Text Generation • 15B • Updated Feb 27, 2025 • 309 • 3
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w4a16 Text Generation • 33B • Updated Feb 27, 2025 • 523 • 5
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8 Text Generation • 33B • Updated Feb 27, 2025 • 3.56k • 13
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-FP8-dynamic Text Generation • 33B • Updated Feb 27, 2025 • 7.7k • 10
RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w4a16 Text Generation • 71B • Updated Feb 27, 2025 • 1.14k • 6
RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w8a8 Text Generation • 71B • Updated Feb 27, 2025 • 177 • 2
RedHatAI/DeepSeek-R1-Distill-Llama-70B-FP8-dynamic Text Generation • 71B • Updated Feb 27, 2025 • 11.6k • 10
RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w4a16 Text Generation • 8B • Updated Feb 27, 2025 • 495
RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w8a8 Text Generation • 8B • Updated Feb 27, 2025 • 1.87k • 2
RedHatAI/DeepSeek-R1-Distill-Llama-8B-FP8-dynamic Text Generation • 8B • Updated Feb 27, 2025 • 290 • 5