Upload folder using huggingface_hub

Files changed (4)

README.md ADDED

+---
+license: llama2
+tags:
+- code
+---
+This is a quantized version of **WizardLM/WizardCoder-Python-7B-V1.0**, quantized using [ctranslate2](https://github.com/OpenNMT/CTranslate2) (see inference instructions there).
+**The license, caveats, and intended usage are the same as for the original model**.
+The quality of its output may have been negatively affected by the quantization process.
+The command run to quantize the model was:
+ `ct2-transformers-converter --model ./models-hf/WizardLM/WizardCoder-Python-7B-V1.0 --quantization float16 --output_dir ./models-ct/WizardLM/WizardCoder-Python-7B-V1.0-ct2-float16`
+The quantization was run on a high-memory, CPU-only (8-core, 51 GB) Colab instance and took approximately 10 minutes.
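
The README defers to the CTranslate2 project for inference instructions. As a rough illustration only, the converted directory could be loaded along these lines. This is a minimal sketch, not the uploader's method: the `generate` helper is hypothetical, loading the tokenizer from the original `WizardLM/WizardCoder-Python-7B-V1.0` repository is an assumption, and so are the CPU device and greedy decoding settings.

```python
def generate(model_dir: str, prompt: str, max_length: int = 256, device: str = "cpu") -> str:
    """Hypothetical helper: complete `prompt` with a CTranslate2-converted model."""
    # Imported lazily so the helper can be defined without the packages installed.
    import ctranslate2
    from transformers import AutoTokenizer

    # Assumption: the tokenizer comes from the original Hugging Face repository.
    tokenizer = AutoTokenizer.from_pretrained("WizardLM/WizardCoder-Python-7B-V1.0")
    generator = ctranslate2.Generator(model_dir, device=device)

    # CTranslate2 generators expect token strings rather than token ids.
    tokens = tokenizer.convert_ids_to_tokens(tokenizer.encode(prompt))
    results = generator.generate_batch(
        [tokens],
        max_length=max_length,
        sampling_topk=1,  # assumption: greedy decoding
        include_prompt_in_result=False,
    )
    # Convert the generated ids back to text.
    return tokenizer.decode(results[0].sequences_ids[0], skip_special_tokens=True)
```

A call might look like `generate("./models-ct/WizardLM/WizardCoder-Python-7B-V1.0-ct2-float16", "def fibonacci(n):")`, where the directory is the `--output_dir` from the conversion command above. The prompt format the original model expects is not covered here.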

config.json ADDED

+{
+  "bos_token": "</s>",
+  "eos_token": "</s>",
+  "layer_norm_epsilon": 1e-05,
+  "unk_token": "</s>"
+}
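
All three special-token entries in this `config.json` carry the same string. A small check like the one below makes that visible; it is only a sketch that parses the file content exactly as shown in the diff, and the variable names are illustrative.

```python
import json

# The config.json content exactly as added in this commit.
CONFIG_TEXT = """{
  "bos_token": "</s>",
  "eos_token": "</s>",
  "layer_norm_epsilon": 1e-05,
  "unk_token": "</s>"
}"""

config = json.loads(CONFIG_TEXT)

# Collect the special-token entries and the distinct strings they map to.
special_tokens = {key: value for key, value in config.items() if key.endswith("_token")}
distinct_values = set(special_tokens.values())

print(special_tokens)
print("distinct special-token strings:", distinct_values)
```

Whether sharing one string across `bos_token`, `eos_token`, and `unk_token` is intended is not stated in the commit, so it may be worth comparing against the original model's tokenizer files before relying on it.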

model.bin ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:c88b6ed131dcb3f0f4c3763018e0557dcff933ec2b658e298d3f2a4fa3671a90
+size 13476866371
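
The three lines above are a Git LFS pointer, not the model weights themselves: they record the SHA-256 digest and byte size of the real `model.bin`. After downloading the file, it could be checked against the pointer roughly as follows. This is a sketch using only the Python standard library; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names.

```python
import hashlib
import os

# The pointer content exactly as shown in the diff.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:c88b6ed131dcb3f0f4c3763018e0557dcff933ec2b658e298d3f2a4fa3671a90
size 13476866371
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its version, hash algorithm, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Read in chunks so a multi-gigabyte file is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("model.bin", pointer)` on a downloaded copy would confirm the download is complete and uncorrupted. The recorded size, about 13.5 GB, is in line with storing roughly 7 billion parameters at two bytes each in float16.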

vocabulary.json ADDED

The diff for this file is too large to render.