RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic
Text Generation
•
8B
•
Updated
•
4
•
1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic
Text Generation
•
8B
•
Updated
•
1
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic
Text Generation
•
8B
•
Updated
•
55
•
2
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16
Text Generation
•
2B
•
Updated
•
5
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16
Text Generation
•
2B
•
Updated
•
4
•
3
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16
Text Generation
•
2B
•
Updated
•
2
RedHatAI/Qwen2.5-3B-quantized.w4a16
Text Generation
•
1.0B
•
Updated
•
609
RedHatAI/Qwen2.5-1.5B-quantized.w4a16
Text Generation
•
0.6B
•
Updated
•
1
RedHatAI/Qwen2.5-0.5B-quantized.w4a16
Text Generation
•
0.3B
•
Updated
•
3
RedHatAI/Qwen2.5-14B-Instruct-quantized.w8a8
Text Generation
•
15B
•
Updated
•
26
RedHatAI/granite-3.1-8b-instruct-GGUF
8B
•
Updated
RedHatAI/Sparse-Llama-3.1-8B-2of4
Text Generation
•
8B
•
Updated
•
32
•
62
RedHatAI/Qwen2.5-Math-7B-Instruct-FP8-dynamic
8B
•
Updated
•
1
RedHatAI/Qwen2.5-0.5B-Instruct-quantized.w8a8
Text Generation
•
0.6B
•
Updated
•
36
RedHatAI/Qwen2.5-72B-FP8-dynamic
Text Generation
•
73B
•
Updated
•
16
•
1
RedHatAI/Qwen2.5-72B-quantized.w8a8
Text Generation
•
73B
•
Updated
•
1
RedHatAI/Qwen2.5-14B-quantized.w8a8
Text Generation
•
15B
•
Updated
•
2
•
2
RedHatAI/Qwen2.5-14B-FP8-dynamic
Text Generation
•
15B
•
Updated
•
73
•
2
RedHatAI/Qwen2.5-7B-quantized.w8a8
Text Generation
•
8B
•
Updated
•
22
•
1
RedHatAI/Qwen2.5-3B-FP8-dynamic
Text Generation
•
3B
•
Updated
•
15
RedHatAI/Qwen2.5-1.5B-FP8-dynamic
Text Generation
•
2B
•
Updated
•
94
RedHatAI/Qwen2.5-0.5B-FP8-dynamic
Text Generation
•
0.6B
•
Updated
•
2
RedHatAI/Qwen2.5-3B-quantized.w8a8
Text Generation
•
3B
•
Updated
•
3
•
1
RedHatAI/Qwen2.5-1.5B-quantized.w8a8
Text Generation
•
2B
•
Updated
•
837k
•
2
RedHatAI/Qwen2.5-0.5B-quantized.w8a8
Text Generation
•
0.6B
•
Updated
•
277
RedHatAI/Meta-Llama-3.1-405B-Instruct-quantized.w8a8
Text Generation
•
406B
•
Updated
•
11
•
2
RedHatAI/Qwen2.5-14B-Instruct-FP8-dynamic
15B
•
Updated
•
10.2k
RedHatAI/Qwen2.5-72B-Instruct-FP8-dynamic
73B
•
Updated
•
219
•
1
RedHatAI/Qwen2.5-Coder-7B-FP8-dynamic
8B
•
Updated
•
13
RedHatAI/Qwen2.5-Coder-7B-Instruct-FP8-dynamic
8B
•
Updated
•
34