RedHatAI/quantization
kernel
License: apache-2.0
refs/pr/3
quantization/fp8
Commit History
Sync updates for CUDA 13 compat
70da85f
danieldk
HF Staff
committed on
16 days ago
Sync to vLLM 20250627
8aa00a3
danieldk
HF Staff
committed on
Jul 2, 2025
Sync with vLLM
0da5bf5
danieldk
HF Staff
committed on
Jan 16, 2025
Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
5c6fb68
danieldk
HF Staff
committed on
Dec 9, 2024