dzungpham commited on
Commit
8cafee4
·
verified ·
1 Parent(s): 4d2bc22

upload graphcodebert robust, best f1 score at 0.54 at robust checkpoint 200

Browse files
inference/graphcodebert-robust/inference.log CHANGED
@@ -67,3 +67,117 @@ RobertaForSequenceClassification(
67
  [[422 355]
68
  [142 81]]
69
  2026-04-16 09:16:55,921 - INFO - ✅ Predictions saved to test/inference/graphcodebert-robust/submission.csv
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
67
  [[422 355]
68
  [142 81]]
69
  2026-04-16 09:16:55,921 - INFO - ✅ Predictions saved to test/inference/graphcodebert-robust/submission.csv
70
+ 2026-04-16 10:06:49,138 - INFO - Loading model and tokenizer from: ./output_checkpoints/graphcodebert-robust/checkpoint-1000
71
+ 2026-04-16 10:06:49,327 - INFO - ===== Model Architecture =====
72
+ 2026-04-16 10:06:49,331 - INFO -
73
+ RobertaForSequenceClassification(
74
+ (roberta): RobertaModel(
75
+ (embeddings): RobertaEmbeddings(
76
+ (word_embeddings): Embedding(50265, 768, padding_idx=1)
77
+ (position_embeddings): Embedding(514, 768, padding_idx=1)
78
+ (token_type_embeddings): Embedding(1, 768)
79
+ (LayerNorm): LayerNorm((768,), eps=1e-05, elementwise_affine=True)
80
+ (dropout): Dropout(p=0.2, inplace=False)
81
+ )
82
+ (encoder): RobertaEncoder(
83
+ (layer): ModuleList(
84
+ (0-11): 12 x RobertaLayer(
85
+ (attention): RobertaAttention(
86
+ (self): RobertaSdpaSelfAttention(
87
+ (query): Linear(in_features=768, out_features=768, bias=True)
88
+ (key): Linear(in_features=768, out_features=768, bias=True)
89
+ (value): Linear(in_features=768, out_features=768, bias=True)
90
+ (dropout): Dropout(p=0.2, inplace=False)
91
+ )
92
+ (output): RobertaSelfOutput(
93
+ (dense): Linear(in_features=768, out_features=768, bias=True)
94
+ (LayerNorm): LayerNorm((768,), eps=1e-05, elementwise_affine=True)
95
+ (dropout): Dropout(p=0.2, inplace=False)
96
+ )
97
+ )
98
+ (intermediate): RobertaIntermediate(
99
+ (dense): Linear(in_features=768, out_features=3072, bias=True)
100
+ (intermediate_act_fn): GELUActivation()
101
+ )
102
+ (output): RobertaOutput(
103
+ (dense): Linear(in_features=3072, out_features=768, bias=True)
104
+ (LayerNorm): LayerNorm((768,), eps=1e-05, elementwise_affine=True)
105
+ (dropout): Dropout(p=0.2, inplace=False)
106
+ )
107
+ )
108
+ )
109
+ )
110
+ )
111
+ (classifier): RobertaClassificationHead(
112
+ (dense): Linear(in_features=768, out_features=768, bias=True)
113
+ (dropout): Dropout(p=0.2, inplace=False)
114
+ (out_proj): Linear(in_features=768, out_features=2, bias=True)
115
+ )
116
+ )
117
+ 2026-04-16 10:06:49,337 - INFO - ===== Parameter Summary =====
118
+ 2026-04-16 10:06:49,340 - INFO - Total Parameters: 124,647,170
119
+ 2026-04-16 10:06:49,343 - INFO - Trainable Parameters: 124,647,170
120
+ 2026-04-16 10:06:49,346 - INFO - Non-trainable Parameters: 0
121
+ 2026-04-16 10:06:49,349 - INFO - ===== Tokenizer Summary =====
122
+ 2026-04-16 10:06:49,366 - INFO - Vocab size: 50265 | Special tokens: ['<s>', '</s>', '<unk>', '<pad>', '<mask>']
123
+ 2026-04-16 10:06:49,369 - INFO - ===== End of Architecture Log =====
124
+ 2026-04-16 10:06:49,539 - INFO - Loading dataset: dzungpham/SemEval-2026-TaskA-dataset (default)
125
+ 2026-04-16 10:08:44,659 - INFO - Loading model and tokenizer from: ./output_checkpoints/graphcodebert-robust/checkpoint-1000
126
+ 2026-04-16 10:08:44,856 - INFO - ===== Model Architecture =====
127
+ 2026-04-16 10:08:44,861 - INFO -
128
+ RobertaForSequenceClassification(
129
+ (roberta): RobertaModel(
130
+ (embeddings): RobertaEmbeddings(
131
+ (word_embeddings): Embedding(50265, 768, padding_idx=1)
132
+ (position_embeddings): Embedding(514, 768, padding_idx=1)
133
+ (token_type_embeddings): Embedding(1, 768)
134
+ (LayerNorm): LayerNorm((768,), eps=1e-05, elementwise_affine=True)
135
+ (dropout): Dropout(p=0.2, inplace=False)
136
+ )
137
+ (encoder): RobertaEncoder(
138
+ (layer): ModuleList(
139
+ (0-11): 12 x RobertaLayer(
140
+ (attention): RobertaAttention(
141
+ (self): RobertaSdpaSelfAttention(
142
+ (query): Linear(in_features=768, out_features=768, bias=True)
143
+ (key): Linear(in_features=768, out_features=768, bias=True)
144
+ (value): Linear(in_features=768, out_features=768, bias=True)
145
+ (dropout): Dropout(p=0.2, inplace=False)
146
+ )
147
+ (output): RobertaSelfOutput(
148
+ (dense): Linear(in_features=768, out_features=768, bias=True)
149
+ (LayerNorm): LayerNorm((768,), eps=1e-05, elementwise_affine=True)
150
+ (dropout): Dropout(p=0.2, inplace=False)
151
+ )
152
+ )
153
+ (intermediate): RobertaIntermediate(
154
+ (dense): Linear(in_features=768, out_features=3072, bias=True)
155
+ (intermediate_act_fn): GELUActivation()
156
+ )
157
+ (output): RobertaOutput(
158
+ (dense): Linear(in_features=3072, out_features=768, bias=True)
159
+ (LayerNorm): LayerNorm((768,), eps=1e-05, elementwise_affine=True)
160
+ (dropout): Dropout(p=0.2, inplace=False)
161
+ )
162
+ )
163
+ )
164
+ )
165
+ )
166
+ (classifier): RobertaClassificationHead(
167
+ (dense): Linear(in_features=768, out_features=768, bias=True)
168
+ (dropout): Dropout(p=0.2, inplace=False)
169
+ (out_proj): Linear(in_features=768, out_features=2, bias=True)
170
+ )
171
+ )
172
+ 2026-04-16 10:08:44,865 - INFO - ===== Parameter Summary =====
173
+ 2026-04-16 10:08:44,867 - INFO - Total Parameters: 124,647,170
174
+ 2026-04-16 10:08:44,870 - INFO - Trainable Parameters: 124,647,170
175
+ 2026-04-16 10:08:44,874 - INFO - Non-trainable Parameters: 0
176
+ 2026-04-16 10:08:44,876 - INFO - ===== Tokenizer Summary =====
177
+ 2026-04-16 10:08:44,893 - INFO - Vocab size: 50265 | Special tokens: ['<s>', '</s>', '<unk>', '<pad>', '<mask>']
178
+ 2026-04-16 10:08:44,896 - INFO - ===== End of Architecture Log =====
179
+ 2026-04-16 10:08:45,082 - INFO - Loading dataset: dzungpham/SemEval-2026-TaskA-dataset (default)
180
+ 2026-04-16 10:08:51,304 - WARNING - Default loading failed due to schema mismatch: An error occurred while generating the dataset
181
+ 2026-04-16 10:08:51,307 - INFO - Attempting to load split 'test' using data_files...
182
+ 2026-04-16 10:08:55,114 - INFO - Tokenizing dataset...
183
+ 2026-04-16 10:14:03,634 - INFO - Running inference on 500000 examples...