nsjain/modernbert-dclm-coconot-cover-lr5e-5-wd1e-06-class-1Mpool-100k-nn50-cossim-7B-OLMo2-v2-epochs2 Text Classification • 0.1B • Updated 8 days ago • 717
nsjain/modernbert_dclm_coconot_1M_bf16_1e-5_bs16_wd1e-5_regression Text Classification • 0.1B • Updated Nov 14, 2025 • 1
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence Paper • 2511.07384 • Published Nov 10, 2025 • 19