lainlives/codellama-34b-python-merge (Quantized)

Description

This model is a quantized version of the original model lainlives/codellama-34b-python-merge.

It was quantized to 4-bit with the BitsAndBytes library, using the bnb-my-repo space.

Quantization Details

  • Quantization Type: int4
  • bnb_4bit_quant_type: nf4
  • bnb_4bit_use_double_quant: True
  • bnb_4bit_compute_dtype: bfloat16
  • bnb_4bit_quant_storage: uint8
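
The settings above map onto the `BitsAndBytesConfig` class in the `transformers` library. The following is a minimal sketch, not the exact procedure used by the bnb-my-repo space: it assumes `torch`, `transformers`, `bitsandbytes`, and `accelerate` are installed, and the helper names `QUANT_SETTINGS`, `build_bnb_config`, and `load_model` are hypothetical. It applies the same settings to the original model at load time.

```python
# Settings copied from the "Quantization Details" list in this card.
# Dtypes are kept as strings here and converted to torch dtypes below.
QUANT_SETTINGS = {
    "load_in_4bit": True,
    "bnb_4bit_quant_type": "nf4",
    "bnb_4bit_use_double_quant": True,
    "bnb_4bit_compute_dtype": "bfloat16",
    "bnb_4bit_quant_storage": "uint8",
}


def build_bnb_config():
    """Build a transformers BitsAndBytesConfig from QUANT_SETTINGS.

    Imports are deferred so the settings can be inspected on a machine
    without torch or transformers installed.
    """
    import torch
    from transformers import BitsAndBytesConfig

    kwargs = dict(QUANT_SETTINGS)
    # Replace the dtype names with the corresponding torch dtypes.
    kwargs["bnb_4bit_compute_dtype"] = torch.bfloat16
    kwargs["bnb_4bit_quant_storage"] = torch.uint8
    return BitsAndBytesConfig(**kwargs)


def load_model(model_id="lainlives/codellama-34b-python-merge"):
    """Load the original model with 4-bit quantization applied on the fly.

    Assumption: a CUDA GPU with enough memory is available and
    `accelerate` is installed so that device_map="auto" works.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        quantization_config=build_bnb_config(),
        device_map="auto",
    )
    return tokenizer, model
```

Calling `load_model()` downloads the full-precision weights and quantizes them during loading, so it needs considerably more disk space and bandwidth than loading a checkpoint that is already quantized.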

Model size: 33B params (Safetensors)
Tensor types: F32, BF16, U8