ekurtic commited on
Commit
aa91cc7
·
verified ·
1 Parent(s): dedec7d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ All layers within transformer blocks are compressed. Weights are quantized using
15
 
16
  Model checkpoint is saved in [compressed_tensors](https://github.com/neuralmagic/compressed-tensors) format.
17
 
18
- | Models | Experts Quantized | Attention blocks quantized | Size (Gb) |
19
  | ------ | --------- | --------- | --------- |
20
  | [deepseek-ai/DeepSeek-R1](https://huggingface.co/deepseek-ai/DeepSeek-R1) | ❌ | ❌ | 671 GB |
21
  | [ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g](https://huggingface.co/ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g) | ✅ | ✅ | 325 GB |
 
15
 
16
  Model checkpoint is saved in [compressed_tensors](https://github.com/neuralmagic/compressed-tensors) format.
17
 
18
+ | Models | Experts Quantized | Attention blocks quantized | Size (GB) |
19
  | ------ | --------- | --------- | --------- |
20
  | [deepseek-ai/DeepSeek-R1](https://huggingface.co/deepseek-ai/DeepSeek-R1) | ❌ | ❌ | 671 GB |
21
  | [ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g](https://huggingface.co/ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g) | ✅ | ✅ | 325 GB |