mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT Paper • 2603.21606 • Published 8 days ago • 37
MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models Paper • 2602.17602 • Published Feb 19 • 56
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Gemma-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 11 • 181
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Gemma-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 11 • 181
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 11 • 221
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 11 • 221
thwannbe/Llama-3.1-8B-Instruct-GSM8K-PO-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 9 • 298
thwannbe/Llama-3.1-8B-Instruct-GSM8K-PO-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 9 • 298
thwannbe/Llama-3.1-8B-Instruct-GSM8K-GPT5-mini-Style-distill Text Generation • 8B • Updated Feb 5 • 195
thwannbe/Llama-3.1-8B-Instruct-GSM8K-GPT5-mini-Style-distill Text Generation • 8B • Updated Feb 5 • 195