math-reasoning/med_gpt2_serial_attn_mlp_weight_tying_13-3.1_checkpoint-74000 0.1B • Updated 1 day ago • 13
math-reasoning/med_gpt2_serial_attn_mlp_weight_tying_13-3.1_checkpoint-74000 0.1B • Updated 1 day ago • 13