CompactAI commited on
Commit
4c960a6
·
verified ·
1 Parent(s): cc6ca85

Update pruned model - 8 files

Browse files
Files changed (3) hide show
  1. README.md +9 -3
  2. model.safetensors +1 -1
  3. tokenizer.json +1 -1
README.md CHANGED
@@ -11,10 +11,16 @@ pipeline_tag: text-generation
11
 
12
  # Falcon-H1-Tiny-R-0.6B-math-safe
13
 
14
- > 🎯 **MATH-optimized** | 📦 **Safe** pruning | **1% weights pruned**
15
 
16
  This model is a **conservatively pruned** version of [tiiuae/Falcon-H1-Tiny-R-0.6B](https://huggingface.co/tiiuae/Falcon-H1-Tiny-R-0.6B).
17
 
 
 
 
 
 
 
18
  ## Performance Comparison
19
 
20
  | Category | Original | Pruned | Change |
@@ -28,7 +34,7 @@ This model is a **conservatively pruned** version of [tiiuae/Falcon-H1-Tiny-R-0.
28
  | Linux | 0.0% | 0.0% | → |
29
  | Writing | 0.0% | 0.0% | → |
30
 
31
- **Average**: 0.0% 0.0% (+0.0%)
32
 
33
 
34
 
@@ -54,7 +60,7 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
54
  | Base Model | [tiiuae/Falcon-H1-Tiny-R-0.6B](https://huggingface.co/tiiuae/Falcon-H1-Tiny-R-0.6B) |
55
  | Specialization | Math |
56
  | Prune Mode | Safe |
57
- | Weight Reduction | 1% weights pruned |
58
 
59
  ## License
60
 
 
11
 
12
  # Falcon-H1-Tiny-R-0.6B-math-safe
13
 
14
+ > **MATH-optimized** | **Safe** pruning | **30% weights pruned**
15
 
16
  This model is a **conservatively pruned** version of [tiiuae/Falcon-H1-Tiny-R-0.6B](https://huggingface.co/tiiuae/Falcon-H1-Tiny-R-0.6B).
17
 
18
+
19
+
20
+ > **Pruning Alert:** The benchmarks show virtually NO quality drop! This isn't a bug -- it is a feature. The Wanda pruning algorithm is so effective at identifying unimportant weights that it can remove a large percentage of parameters without affecting performance. Think of it like pruning dead leaves from a tree -- the tree does not miss them because they were not doing anything anyway!
21
+
22
+
23
+
24
  ## Performance Comparison
25
 
26
  | Category | Original | Pruned | Change |
 
34
  | Linux | 0.0% | 0.0% | → |
35
  | Writing | 0.0% | 0.0% | → |
36
 
37
+ **Average**: 0.0% -> 0.0% (+0.0%)
38
 
39
 
40
 
 
60
  | Base Model | [tiiuae/Falcon-H1-Tiny-R-0.6B](https://huggingface.co/tiiuae/Falcon-H1-Tiny-R-0.6B) |
61
  | Specialization | Math |
62
  | Prune Mode | Safe |
63
+ | Weight Reduction | 30% weights pruned |
64
 
65
  ## License
66
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:411d7061fc6a05a33fc64bd1f6c04f32488f71ca55e92b3e5f8622f95505fdfd
3
  size 1244731728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7a08c1df8d0e6ea580cf0cb7ee99b152c3997a4c2e0e271bf4f14ffc1b3521b2
3
  size 1244731728
tokenizer.json CHANGED
@@ -2,7 +2,7 @@
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
- "max_length": 261994,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },
 
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
+ "max_length": 253952,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },