amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-fp16-onnx-hybrid Text Generation • Updated Aug 27, 2025 • 15
amd/Auto-Mixed-Precision-Mixtral-8x7B-Instruct-v0.1-Weight-Activation-Mixed-MXFP4-FP8PT-KVFP8 Updated Aug 26, 2025
amd/Llama-2-70b-chat-hf-WMXFP4-AMXFP4-KVFP8-Scale-UINT8-MLPerf-GPTQ 37B • Updated Aug 5, 2025 • 7
amd/Llama-3.1-8B-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Jun 28, 2025 • 13 • 2
amd/Llama-3-8B-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Jun 28, 2025 • 13 • 2
amd/Llama2-7b-chat-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Jun 28, 2025 • 7
amd/Llama-2-7b-hf-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Jun 28, 2025 • 11
amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Jun 28, 2025 • 8 • 1
amd/gemma-2-2b-awq-uint4-asym-g128-lmhead-g32-fp16-onnx-hybrid_v2 Text Generation • Updated Jun 23, 2025