MTP-LM Collection Models to accompany research paper on training multi token prediction language models using self-distillation. • 4 items • Updated 7 days ago • 3
MTP-LM Collection Models to accompany research paper on training multi token prediction language models using self-distillation. • 4 items • Updated 7 days ago • 3
MTP-LM Collection Models to accompany research paper on training multi token prediction language models using self-distillation. • 4 items • Updated 7 days ago • 3
MTP-LM Collection Models to accompany research paper on training multi token prediction language models using self-distillation. • 4 items • Updated 7 days ago • 3
jwkirchenbauer/debug_metamath_full_rand_k2-8_ex_valk_baseline_latest Text Generation • 8B • Updated 28 days ago • 10