ERNIE-Image-nf4 / quantization_info.json
ovedrive's picture
Upload folder using huggingface_hub
c50702f verified
raw
history blame contribute delete
246 Bytes
{
"quantization_method": "mixed_precision_nf4",
"model_type": "ernie",
"description": "First and last blocks plus boundary modules kept at bfloat16, middle layers quantized to NF4 (ernie architecture)",
"high_precision_layers_count": 20
}