RedHatAI/SparseLlama-3-8B-pruned_50.2of4
Text Generation
• 8B • Updated
• 10
RedHatAI/Llama-3.2-90B-Vision-Instruct-FP8-dynamic
Text Generation
• 89B • Updated
• 1.22k
• 11
RedHatAI/Phi-3.5-mini-instruct-FP8-KV
Text Generation
• 4B • Updated
• 372
• 2
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w4a16
Text Generation
• 71B • Updated
• 191
• 2
RedHatAI/Mixtral-8x22B-Instruct-v0.1-AutoFP8
Text Generation
• 141B • Updated
• 8
• 3
RedHatAI/DeepSeek-Coder-V2-Base-FP8
Text Generation
• 236B • Updated
• 7
RedHatAI/DeepSeek-Coder-V2-Instruct-FP8
Text Generation
• 236B • Updated
• 106
• 7
RedHatAI/Mistral-Nemo-Instruct-2407-FP8
Text Generation
• 12B • Updated
• 9.99k
• 18
RedHatAI/Qwen2-57B-A14B-Instruct-FP8
Text Generation
• 57B • Updated
• 916
• 1
RedHatAI/Llama-2-7b-chat-hf-FP8
Text Generation
• 7B • Updated
• 388
RedHatAI/Mistral-7B-Instruct-v0.3-FP8
Text Generation
• 7B • Updated
• 2.9k
• 3
RedHatAI/Qwen2-0.5B-Instruct-FP8
Text Generation
• 0.5B • Updated
• 315
• 3
RedHatAI/Qwen2-1.5B-Instruct-FP8
Text Generation
• 2B • Updated
• 23.2k
RedHatAI/Qwen2-7B-Instruct-FP8
Text Generation
• 8B • Updated
• 3.68k
• 2
RedHatAI/Qwen2-72B-Instruct-FP8
Text Generation
• 73B • Updated
• 944
• 15
RedHatAI/Mixtral-8x7B-Instruct-v0.1-AutoFP8
Text Generation
• 47B • Updated
• 7
• 3
RedHatAI/Meta-Llama-3-70B-Instruct-FP8
Text Generation
• 71B • Updated
• 1.24k
• 13
RedHatAI/Meta-Llama-3-8B-Instruct-FP8
Text Generation
• Updated
• 3.45k
• 24
RedHatAI/DeepSeek-Coder-V2-Lite-Base-FP8
Text Generation
• 16B • Updated
• 6
RedHatAI/DeepSeek-Coder-V2-Lite-Instruct-FP8
Text Generation
• 16B • Updated
• 78.2k
• 9
RedHatAI/Qwen2-7B-Instruct-quantized.w4a16
Text Generation
• 8B • Updated
• 6
RedHatAI/Qwen2-72B-Instruct-quantized.w4a16
Text Generation
• 73B • Updated
• 682
• 4
RedHatAI/Qwen2-1.5B-Instruct-quantized.w4a16
Text Generation
• 2B • Updated
• 3
RedHatAI/Qwen2-0.5B-Instruct-quantized.w4a16
Text Generation
• 0.6B • Updated
• 1
RedHatAI/Qwen2-72B-Instruct-quantized.w8a16
Text Generation
• 73B • Updated
• 2
• 1
RedHatAI/Qwen2-7B-Instruct-quantized.w8a16
Text Generation
• 8B • Updated
• 11
RedHatAI/Qwen2-1.5B-Instruct-quantized.w8a16
Text Generation
• 2B • Updated
• 5
RedHatAI/Qwen2-0.5B-Instruct-quantized.w8a16
Text Generation
• 0.5B • Updated
• 9
RedHatAI/Llama-2-7b-chat-quantized.w4a16
Text Generation
• 7B • Updated
• 215