Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
RedHatAI
/
quantization
like
6
Follow
Red Hat AI
1.92k
kernel
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
3
3c8bb73
quantization
1.08 GB
2 contributors
History:
30 commits
danieldk
HF Staff
Add support for ROCm
3c8bb73
11 months ago
build
Build (Torch 2.6)
about 1 year ago
compressed_tensors
Sync with vLLM
about 1 year ago
core
Sync with vLLM
about 1 year ago
cutlass_extensions
Sync with vLLM
about 1 year ago
cutlass_w8a8
Sync with vLLM
about 1 year ago
fp8
Sync with vLLM
about 1 year ago
gptq_marlin
Sync with vLLM
about 1 year ago
marlin
Add full Marlin support and tests for Marlin/CUTLASS
about 1 year ago
tests
Add full Marlin support and tests for Marlin/CUTLASS
about 1 year ago
torch-ext
Add support for ROCm
11 months ago
.gitattributes
Safe
1.56 kB
Build
about 1 year ago
LICENSE
Safe
11.4 kB
Add cutlass_w8a8
about 1 year ago
README.md
Safe
195 Bytes
Update README.md (#1)
12 months ago
build.toml
3.14 kB
Add support for ROCm
11 months ago
dispatch_utils.h
Safe
1.49 kB
Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
about 1 year ago
flake.lock
3.03 kB
Add support for ROCm
11 months ago
flake.nix
Safe
335 Bytes
Add support for ROCm
11 months ago
vectorization.cuh
Safe
778 Bytes
Sync with vLLM
about 1 year ago