d1_code_all_large / all_results.json
{
    "epoch": 4.998875140607424,
    "total_flos": 2.8638155373837025e+18,
    "train_loss": 0.09022162760700192,
    "train_runtime": 11331.6206,
    "train_samples_per_second": 25.101,
    "train_steps_per_second": 0.049
}
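
The throughput fields above can be turned into rough totals for the run. The sketch below is illustrative only and is not part of the repository. It assumes `train_runtime` is in seconds and that `train_steps_per_second` counts optimizer steps, which is how the Hugging Face `Trainer` usually reports them. The helper name `derive_totals` and its output keys are hypothetical. Because the two per-second rates are rounded to three decimals, every derived number is an estimate, the step count in particular.

```python
import json

# Metrics copied verbatim from all_results.json above.
RESULTS_JSON = """
{
    "epoch": 4.998875140607424,
    "total_flos": 2.8638155373837025e+18,
    "train_loss": 0.09022162760700192,
    "train_runtime": 11331.6206,
    "train_samples_per_second": 25.101,
    "train_steps_per_second": 0.049
}
"""


def derive_totals(results: dict) -> dict:
    """Estimate run totals from the reported rates (hypothetical helper).

    Assumes train_runtime is in seconds; the rates are rounded in the
    source file, so the results are approximations.
    """
    runtime = results["train_runtime"]
    samples = runtime * results["train_samples_per_second"]
    steps = runtime * results["train_steps_per_second"]
    return {
        "runtime_hours": runtime / 3600,
        "approx_samples_seen": samples,
        "approx_optimizer_steps": steps,
        # Samples per step approximates the effective batch size.
        "approx_samples_per_step": samples / steps,
        "approx_samples_per_epoch": samples / results["epoch"],
    }


results = json.loads(RESULTS_JSON)
totals = derive_totals(results)
for name, value in totals.items():
    print(f"{name}: {value:.2f}")
```

The samples-per-step estimate is the least reliable figure here, since it divides by a rate that has only two significant digits.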