Sweaterdog committed on
Commit
7ebdd4e
·
verified ·
1 Parent(s): b3dd979

Update README.md

Files changed (1)
  1. README.md +4 -51
README.md CHANGED
@@ -1,7 +1,7 @@
  ---
  license: apache-2.0
  language:
- - en
+ - en
  - zh
  - fr
  - de
@@ -28,6 +28,7 @@ tags:
  - coding
  - math
  - science
+ new_version: SL-AI/GRaPE-2.1-Flash
  ---

  ![grape_2_banner](https://cdn-uploads.huggingface.co/production/uploads/66960602f0ffd8e3a381106a/XqhlL-CCTeRgPKDbqyyT7.png)
@@ -37,7 +38,7 @@ _The **G**eneral **R**easoning **A**gent (for) **P**roject **E**xploration_

  | Model | Size | Modalities | Domain |
  | :--- | :--- | :--- | :--- |
- | **GRaPE 2 Pro** | TBA | Image + Text in, Text out | Large-Scale Intelligence and "Raw Reasoning" |
+ | **GRaPE 2 Pro** | 27B | Image + Text in, Text out | Large-Scale Intelligence and "Raw Reasoning" |
  | **GRaPE 2 Flash** | 9B | Image + Text in, Text out | Advanced Device Deployment |
  | **GRaPE 2 Mini** | 5B | Image + Text in, Text out | On-Device Deployment |

@@ -100,55 +101,7 @@ Implement a red-black tree in Python with insertion and deletion. <thinking_mode

  # Benchmarks

- Scores sourced from official technical reports (Qwen3 Technical Report, May 2025; Qwen2.5 Technical Report, January 2025).
-
- > **Note:** *Benchmarks are Underway for GRaPE 2 Mini, they will be empty and set as "TBD" for the time being*
-
- ### General Knowledge — MMLU (5-shot)
-
- | Model | Params | MMLU |
- | :--- | :--- | :--- |
- | **GRaPE 2 Mini** | **5B** | **TBD** |
- | Qwen3-4B-Instruct | 4B | 83.7\* |
- | Qwen3-8B-Instruct | 8B | ~85.0 |
- | Qwen2.5-7B-Instruct | 7B | 74.2 |
- | Gemma-3-12B | 12B | 73.9 |
- | Qwen2.5-14B | 14B | 79.7 |
-
- ### Mathematics — MATH (4-shot)
-
- | Model | Params | MATH |
- | :--- | :--- | :--- |
- | **GRaPE 2 Mini** | **5B** | **TBD** |
- | Qwen3-4B (Thinking) | 4B | 54.1 |
- | Qwen3-8B (Thinking) | 8B | ~65.0 |
- | Qwen2.5-7B-Instruct | 7B | 75.5 |
- | Qwen2.5-14B | 14B | 55.6 |
- | Gemma-3-12B | 12B | 44.4 |
-
- ### Coding — EvalPlus (avg. HumanEval + MBPP)
-
- | Model | Params | EvalPlus |
- | :--- | :--- | :--- |
- | **GRaPE 2 Mini** | **5B** | **TBD** |
- | Qwen3-4B-Instruct | 4B | 72.1 |
- | Qwen3-8B-Instruct | 8B | ~76.0 |
- | Qwen2.5-7B-Instruct | 7B | ~65.0 |
- | Gemma-3-12B | 12B | 52.7 |
- | Qwen2.5-14B | 14B | 60.7 |
-
- ### Math Word Problems — GSM8K (4-shot)
-
- | Model | Params | GSM8K |
- | :--- | :--- | :--- |
- | **GRaPE 2 Mini** | **5B** | **TBD** |
- | Qwen3-4B (Thinking) | 4B | 87.8 |
- | Qwen2.5-7B-Instruct | 7B | 91.1 |
- | Qwen2.5-14B | 14B | 90.2 |
- | Gemma-3-12B | 12B | 78.0 |
-
-
- ***
+ Benchmarks for GRaPE 2 Flash will not exist due to GRaPE 2.1's release.

  # Recommended Inference Settings
