TicklingShell commited on
Commit
bf86d92
·
verified ·
1 Parent(s): 7c3ef3f

Delete checkpoint-150

Browse files
checkpoint-150/README.md DELETED
@@ -1,209 +0,0 @@
1
- ---
2
- base_model: /root/.cache/huggingface/hub/models--Qwen--Qwen3-1.7B/snapshots/70d244cc86ccca08cf5af4e1e306ecf908b1ad5e
3
- library_name: peft
4
- pipeline_tag: text-generation
5
- tags:
6
- - base_model:adapter:/root/.cache/huggingface/hub/models--Qwen--Qwen3-1.7B/snapshots/70d244cc86ccca08cf5af4e1e306ecf908b1ad5e
7
- - lora
8
- - sft
9
- - transformers
10
- - trl
11
- ---
12
-
13
- # Model Card for Model ID
14
-
15
- <!-- Provide a quick summary of what the model is/does. -->
16
-
17
-
18
-
19
- ## Model Details
20
-
21
- ### Model Description
22
-
23
- <!-- Provide a longer summary of what this model is. -->
24
-
25
-
26
-
27
- - **Developed by:** [More Information Needed]
28
- - **Funded by [optional]:** [More Information Needed]
29
- - **Shared by [optional]:** [More Information Needed]
30
- - **Model type:** [More Information Needed]
31
- - **Language(s) (NLP):** [More Information Needed]
32
- - **License:** [More Information Needed]
33
- - **Finetuned from model [optional]:** [More Information Needed]
34
-
35
- ### Model Sources [optional]
36
-
37
- <!-- Provide the basic links for the model. -->
38
-
39
- - **Repository:** [More Information Needed]
40
- - **Paper [optional]:** [More Information Needed]
41
- - **Demo [optional]:** [More Information Needed]
42
-
43
- ## Uses
44
-
45
- <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
46
-
47
- ### Direct Use
48
-
49
- <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
50
-
51
- [More Information Needed]
52
-
53
- ### Downstream Use [optional]
54
-
55
- <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
56
-
57
- [More Information Needed]
58
-
59
- ### Out-of-Scope Use
60
-
61
- <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
62
-
63
- [More Information Needed]
64
-
65
- ## Bias, Risks, and Limitations
66
-
67
- <!-- This section is meant to convey both technical and sociotechnical limitations. -->
68
-
69
- [More Information Needed]
70
-
71
- ### Recommendations
72
-
73
- <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
74
-
75
- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
76
-
77
- ## How to Get Started with the Model
78
-
79
- Use the code below to get started with the model.
80
-
81
- [More Information Needed]
82
-
83
- ## Training Details
84
-
85
- ### Training Data
86
-
87
- <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
88
-
89
- [More Information Needed]
90
-
91
- ### Training Procedure
92
-
93
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
94
-
95
- #### Preprocessing [optional]
96
-
97
- [More Information Needed]
98
-
99
-
100
- #### Training Hyperparameters
101
-
102
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
103
-
104
- #### Speeds, Sizes, Times [optional]
105
-
106
- <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
107
-
108
- [More Information Needed]
109
-
110
- ## Evaluation
111
-
112
- <!-- This section describes the evaluation protocols and provides the results. -->
113
-
114
- ### Testing Data, Factors & Metrics
115
-
116
- #### Testing Data
117
-
118
- <!-- This should link to a Dataset Card if possible. -->
119
-
120
- [More Information Needed]
121
-
122
- #### Factors
123
-
124
- <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
125
-
126
- [More Information Needed]
127
-
128
- #### Metrics
129
-
130
- <!-- These are the evaluation metrics being used, ideally with a description of why. -->
131
-
132
- [More Information Needed]
133
-
134
- ### Results
135
-
136
- [More Information Needed]
137
-
138
- #### Summary
139
-
140
-
141
-
142
- ## Model Examination [optional]
143
-
144
- <!-- Relevant interpretability work for the model goes here -->
145
-
146
- [More Information Needed]
147
-
148
- ## Environmental Impact
149
-
150
- <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
151
-
152
- Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
153
-
154
- - **Hardware Type:** [More Information Needed]
155
- - **Hours used:** [More Information Needed]
156
- - **Cloud Provider:** [More Information Needed]
157
- - **Compute Region:** [More Information Needed]
158
- - **Carbon Emitted:** [More Information Needed]
159
-
160
- ## Technical Specifications [optional]
161
-
162
- ### Model Architecture and Objective
163
-
164
- [More Information Needed]
165
-
166
- ### Compute Infrastructure
167
-
168
- [More Information Needed]
169
-
170
- #### Hardware
171
-
172
- [More Information Needed]
173
-
174
- #### Software
175
-
176
- [More Information Needed]
177
-
178
- ## Citation [optional]
179
-
180
- <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
181
-
182
- **BibTeX:**
183
-
184
- [More Information Needed]
185
-
186
- **APA:**
187
-
188
- [More Information Needed]
189
-
190
- ## Glossary [optional]
191
-
192
- <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
193
-
194
- [More Information Needed]
195
-
196
- ## More Information [optional]
197
-
198
- [More Information Needed]
199
-
200
- ## Model Card Authors [optional]
201
-
202
- [More Information Needed]
203
-
204
- ## Model Card Contact
205
-
206
- [More Information Needed]
207
- ### Framework versions
208
-
209
- - PEFT 0.18.1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
checkpoint-150/adapter_config.json DELETED
@@ -1,43 +0,0 @@
1
- {
2
- "alora_invocation_tokens": null,
3
- "alpha_pattern": {},
4
- "arrow_config": null,
5
- "auto_mapping": null,
6
- "base_model_name_or_path": "/root/.cache/huggingface/hub/models--Qwen--Qwen3-1.7B/snapshots/70d244cc86ccca08cf5af4e1e306ecf908b1ad5e",
7
- "bias": "none",
8
- "corda_config": null,
9
- "ensure_weight_tying": false,
10
- "eva_config": null,
11
- "exclude_modules": null,
12
- "fan_in_fan_out": false,
13
- "inference_mode": true,
14
- "init_lora_weights": true,
15
- "layer_replication": null,
16
- "layers_pattern": null,
17
- "layers_to_transform": null,
18
- "loftq_config": {},
19
- "lora_alpha": 16,
20
- "lora_bias": false,
21
- "lora_dropout": 0.05,
22
- "megatron_config": null,
23
- "megatron_core": "megatron.core",
24
- "modules_to_save": null,
25
- "peft_type": "LORA",
26
- "peft_version": "0.18.1",
27
- "qalora_group_size": 16,
28
- "r": 8,
29
- "rank_pattern": {},
30
- "revision": null,
31
- "target_modules": [
32
- "q_proj",
33
- "o_proj",
34
- "v_proj",
35
- "k_proj"
36
- ],
37
- "target_parameters": null,
38
- "task_type": "CAUSAL_LM",
39
- "trainable_token_indices": null,
40
- "use_dora": false,
41
- "use_qalora": false,
42
- "use_rslora": false
43
- }
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
checkpoint-150/adapter_model.safetensors DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:7fe9856c79a036d0387c1e33bc3b5c4c11f6c402443d6ba88c8509079b278fd0
3
- size 6452200
 
 
 
 
checkpoint-150/chat_template.jinja DELETED
@@ -1,89 +0,0 @@
1
- {%- if tools %}
2
- {{- '<|im_start|>system\n' }}
3
- {%- if messages[0].role == 'system' %}
4
- {{- messages[0].content + '\n\n' }}
5
- {%- endif %}
6
- {{- "# Tools\n\nYou may call one or more functions to assist with the user query.\n\nYou are provided with function signatures within <tools></tools> XML tags:\n<tools>" }}
7
- {%- for tool in tools %}
8
- {{- "\n" }}
9
- {{- tool | tojson }}
10
- {%- endfor %}
11
- {{- "\n</tools>\n\nFor each function call, return a json object with function name and arguments within <tool_call></tool_call> XML tags:\n<tool_call>\n{\"name\": <function-name>, \"arguments\": <args-json-object>}\n</tool_call><|im_end|>\n" }}
12
- {%- else %}
13
- {%- if messages[0].role == 'system' %}
14
- {{- '<|im_start|>system\n' + messages[0].content + '<|im_end|>\n' }}
15
- {%- endif %}
16
- {%- endif %}
17
- {%- set ns = namespace(multi_step_tool=true, last_query_index=messages|length - 1) %}
18
- {%- for message in messages[::-1] %}
19
- {%- set index = (messages|length - 1) - loop.index0 %}
20
- {%- if ns.multi_step_tool and message.role == "user" and message.content is string and not(message.content.startswith('<tool_response>') and message.content.endswith('</tool_response>')) %}
21
- {%- set ns.multi_step_tool = false %}
22
- {%- set ns.last_query_index = index %}
23
- {%- endif %}
24
- {%- endfor %}
25
- {%- for message in messages %}
26
- {%- if message.content is string %}
27
- {%- set content = message.content %}
28
- {%- else %}
29
- {%- set content = '' %}
30
- {%- endif %}
31
- {%- if (message.role == "user") or (message.role == "system" and not loop.first) %}
32
- {{- '<|im_start|>' + message.role + '\n' + content + '<|im_end|>' + '\n' }}
33
- {%- elif message.role == "assistant" %}
34
- {%- set reasoning_content = '' %}
35
- {%- if message.reasoning_content is string %}
36
- {%- set reasoning_content = message.reasoning_content %}
37
- {%- else %}
38
- {%- if '</think>' in content %}
39
- {%- set reasoning_content = content.split('</think>')[0].rstrip('\n').split('<think>')[-1].lstrip('\n') %}
40
- {%- set content = content.split('</think>')[-1].lstrip('\n') %}
41
- {%- endif %}
42
- {%- endif %}
43
- {%- if loop.index0 > ns.last_query_index %}
44
- {%- if loop.last or (not loop.last and reasoning_content) %}
45
- {{- '<|im_start|>' + message.role + '\n<think>\n' + reasoning_content.strip('\n') + '\n</think>\n\n' + content.lstrip('\n') }}
46
- {%- else %}
47
- {{- '<|im_start|>' + message.role + '\n' + content }}
48
- {%- endif %}
49
- {%- else %}
50
- {{- '<|im_start|>' + message.role + '\n' + content }}
51
- {%- endif %}
52
- {%- if message.tool_calls %}
53
- {%- for tool_call in message.tool_calls %}
54
- {%- if (loop.first and content) or (not loop.first) %}
55
- {{- '\n' }}
56
- {%- endif %}
57
- {%- if tool_call.function %}
58
- {%- set tool_call = tool_call.function %}
59
- {%- endif %}
60
- {{- '<tool_call>\n{"name": "' }}
61
- {{- tool_call.name }}
62
- {{- '", "arguments": ' }}
63
- {%- if tool_call.arguments is string %}
64
- {{- tool_call.arguments }}
65
- {%- else %}
66
- {{- tool_call.arguments | tojson }}
67
- {%- endif %}
68
- {{- '}\n</tool_call>' }}
69
- {%- endfor %}
70
- {%- endif %}
71
- {{- '<|im_end|>\n' }}
72
- {%- elif message.role == "tool" %}
73
- {%- if loop.first or (messages[loop.index0 - 1].role != "tool") %}
74
- {{- '<|im_start|>user' }}
75
- {%- endif %}
76
- {{- '\n<tool_response>\n' }}
77
- {{- content }}
78
- {{- '\n</tool_response>' }}
79
- {%- if loop.last or (messages[loop.index0 + 1].role != "tool") %}
80
- {{- '<|im_end|>\n' }}
81
- {%- endif %}
82
- {%- endif %}
83
- {%- endfor %}
84
- {%- if add_generation_prompt %}
85
- {{- '<|im_start|>assistant\n' }}
86
- {%- if enable_thinking is defined and enable_thinking is false %}
87
- {{- '<think>\n\n</think>\n\n' }}
88
- {%- endif %}
89
- {%- endif %}
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
checkpoint-150/optimizer.pt DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:b5bc68a9981be9e6a79bc8013192bb2fe7f83d96264abdd4ec8ad33aab6b8d9e
3
- size 6787851
 
 
 
 
checkpoint-150/rng_state.pth DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:3407948e662beedeb42c69cc63a020cc985bee47d060db8bb61e73cf8be14b20
3
- size 14645
 
 
 
 
checkpoint-150/scheduler.pt DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:f847bf189265d8eee7da89b2c635bd5ad82bf85f644dadbc5646ac3ed438aa14
3
- size 1465
 
 
 
 
checkpoint-150/tokenizer.json DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:be75606093db2094d7cd20f3c2f385c212750648bd6ea4fb2bf507a6a4c55506
3
- size 11422650
 
 
 
 
checkpoint-150/tokenizer_config.json DELETED
@@ -1,29 +0,0 @@
1
- {
2
- "add_prefix_space": false,
3
- "backend": "tokenizers",
4
- "bos_token": null,
5
- "clean_up_tokenization_spaces": false,
6
- "eos_token": "<|im_end|>",
7
- "errors": "replace",
8
- "extra_special_tokens": [
9
- "<|im_start|>",
10
- "<|im_end|>",
11
- "<|object_ref_start|>",
12
- "<|object_ref_end|>",
13
- "<|box_start|>",
14
- "<|box_end|>",
15
- "<|quad_start|>",
16
- "<|quad_end|>",
17
- "<|vision_start|>",
18
- "<|vision_end|>",
19
- "<|vision_pad|>",
20
- "<|image_pad|>",
21
- "<|video_pad|>"
22
- ],
23
- "is_local": true,
24
- "model_max_length": 131072,
25
- "pad_token": "<|endoftext|>",
26
- "split_special_tokens": false,
27
- "tokenizer_class": "Qwen2Tokenizer",
28
- "unk_token": null
29
- }
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
checkpoint-150/trainer_state.json DELETED
@@ -1,184 +0,0 @@
1
- {
2
- "best_global_step": null,
3
- "best_metric": null,
4
- "best_model_checkpoint": null,
5
- "epoch": 0.2437043054427295,
6
- "eval_steps": 500,
7
- "global_step": 150,
8
- "is_hyper_param_search": false,
9
- "is_local_process_zero": true,
10
- "is_world_process_zero": true,
11
- "log_history": [
12
- {
13
- "entropy": 1.350339876115322,
14
- "epoch": 0.016246953696181964,
15
- "grad_norm": 0.435546875,
16
- "learning_rate": 0.00012,
17
- "loss": 2.059933090209961,
18
- "mean_token_accuracy": 0.5740167520940304,
19
- "num_tokens": 43502.0,
20
- "step": 10
21
- },
22
- {
23
- "entropy": 1.6725194156169891,
24
- "epoch": 0.03249390739236393,
25
- "grad_norm": 0.314453125,
26
- "learning_rate": 0.00019407407407407408,
27
- "loss": 1.978948974609375,
28
- "mean_token_accuracy": 0.5732895873486996,
29
- "num_tokens": 86625.0,
30
- "step": 20
31
- },
32
- {
33
- "entropy": 1.938981404900551,
34
- "epoch": 0.048740861088545896,
35
- "grad_norm": 0.240234375,
36
- "learning_rate": 0.00017925925925925927,
37
- "loss": 1.81705322265625,
38
- "mean_token_accuracy": 0.6026726946234703,
39
- "num_tokens": 139355.0,
40
- "step": 30
41
- },
42
- {
43
- "entropy": 1.7795559912919998,
44
- "epoch": 0.06498781478472786,
45
- "grad_norm": 0.369140625,
46
- "learning_rate": 0.00016444444444444444,
47
- "loss": 1.625088882446289,
48
- "mean_token_accuracy": 0.6222493700683117,
49
- "num_tokens": 186902.0,
50
- "step": 40
51
- },
52
- {
53
- "entropy": 1.7282595068216324,
54
- "epoch": 0.08123476848090982,
55
- "grad_norm": 0.302734375,
56
- "learning_rate": 0.00014962962962962963,
57
- "loss": 1.6160415649414062,
58
- "mean_token_accuracy": 0.620630569756031,
59
- "num_tokens": 232500.0,
60
- "step": 50
61
- },
62
- {
63
- "entropy": 1.732401692867279,
64
- "epoch": 0.09748172217709179,
65
- "grad_norm": 0.36328125,
66
- "learning_rate": 0.00013481481481481482,
67
- "loss": 1.5774802207946776,
68
- "mean_token_accuracy": 0.6243203669786453,
69
- "num_tokens": 278841.0,
70
- "step": 60
71
- },
72
- {
73
- "entropy": 1.7628644570708274,
74
- "epoch": 0.11372867587327376,
75
- "grad_norm": 0.259765625,
76
- "learning_rate": 0.00012,
77
- "loss": 1.5647431373596192,
78
- "mean_token_accuracy": 0.6330625265836716,
79
- "num_tokens": 328289.0,
80
- "step": 70
81
- },
82
- {
83
- "entropy": 1.9395479083061218,
84
- "epoch": 0.12997562956945571,
85
- "grad_norm": 0.3203125,
86
- "learning_rate": 0.00010518518518518518,
87
- "loss": 1.726154899597168,
88
- "mean_token_accuracy": 0.6044244647026062,
89
- "num_tokens": 374976.0,
90
- "step": 80
91
- },
92
- {
93
- "entropy": 1.887816959619522,
94
- "epoch": 0.1462225832656377,
95
- "grad_norm": 0.302734375,
96
- "learning_rate": 9.037037037037038e-05,
97
- "loss": 1.5822885513305665,
98
- "mean_token_accuracy": 0.6185309916734696,
99
- "num_tokens": 425760.0,
100
- "step": 90
101
- },
102
- {
103
- "entropy": 1.8419553756713867,
104
- "epoch": 0.16246953696181965,
105
- "grad_norm": 0.283203125,
106
- "learning_rate": 7.555555555555556e-05,
107
- "loss": 1.6049184799194336,
108
- "mean_token_accuracy": 0.6241745337843895,
109
- "num_tokens": 474271.0,
110
- "step": 100
111
- },
112
- {
113
- "entropy": 1.9641838252544404,
114
- "epoch": 0.17871649065800163,
115
- "grad_norm": 0.35546875,
116
- "learning_rate": 6.074074074074074e-05,
117
- "loss": 1.6779600143432618,
118
- "mean_token_accuracy": 0.6183042153716087,
119
- "num_tokens": 521971.0,
120
- "step": 110
121
- },
122
- {
123
- "entropy": 1.9284702748060227,
124
- "epoch": 0.19496344435418358,
125
- "grad_norm": 0.37109375,
126
- "learning_rate": 4.592592592592593e-05,
127
- "loss": 1.5790912628173828,
128
- "mean_token_accuracy": 0.6303764775395393,
129
- "num_tokens": 568996.0,
130
- "step": 120
131
- },
132
- {
133
- "entropy": 1.9886349856853485,
134
- "epoch": 0.21121039805036557,
135
- "grad_norm": 0.466796875,
136
- "learning_rate": 3.111111111111111e-05,
137
- "loss": 1.6734039306640625,
138
- "mean_token_accuracy": 0.6165395066142082,
139
- "num_tokens": 613096.0,
140
- "step": 130
141
- },
142
- {
143
- "entropy": 1.937325766682625,
144
- "epoch": 0.22745735174654752,
145
- "grad_norm": 0.373046875,
146
- "learning_rate": 1.62962962962963e-05,
147
- "loss": 1.6472185134887696,
148
- "mean_token_accuracy": 0.6231428682804108,
149
- "num_tokens": 659038.0,
150
- "step": 140
151
- },
152
- {
153
- "entropy": 1.989447721838951,
154
- "epoch": 0.2437043054427295,
155
- "grad_norm": 0.318359375,
156
- "learning_rate": 1.4814814814814817e-06,
157
- "loss": 1.6484994888305664,
158
- "mean_token_accuracy": 0.6125213362276554,
159
- "num_tokens": 707348.0,
160
- "step": 150
161
- }
162
- ],
163
- "logging_steps": 10,
164
- "max_steps": 150,
165
- "num_input_tokens_seen": 0,
166
- "num_train_epochs": 1,
167
- "save_steps": 5,
168
- "stateful_callbacks": {
169
- "TrainerControl": {
170
- "args": {
171
- "should_epoch_stop": false,
172
- "should_evaluate": false,
173
- "should_log": false,
174
- "should_save": true,
175
- "should_training_stop": true
176
- },
177
- "attributes": {}
178
- }
179
- },
180
- "total_flos": 1.26001189484544e+16,
181
- "train_batch_size": 4,
182
- "trial_name": null,
183
- "trial_params": null
184
- }
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
checkpoint-150/training_args.bin DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:35fc86963ba9050c9035bf36d67f9fc595a64db629bd16f643bee8223d72371a
3
- size 5585