inference-optimization
/

sarvam-30b-NVFP4

Text Generation

8-bit precision

compressed-tensors

Model card Files Files and versions

README.md exists but content is empty.

Downloads last month: 13

Safetensors

Model size

19B params

Tensor type

F32

·

F8_E4M3

·

U8

·

Model tree for inference-optimization/sarvam-30b-NVFP4

Base model

sarvamai/sarvam-30b

Quantized

(20)

this model