Self-Improvement of Large Language Models: A Technical Overview and Future Outlook Paper • 2603.25681 • Published Mar 26 • 1
Capability Self-Assessment: Teaching LLMs to Know Their Limits Paper • 2606.00251 • Published 5 days ago • 9
joyfine/router_SFT_larger_model_generated_data_mmlu_pro_science_Qwen3-4B_aime Viewer • Updated Apr 30 • 860 • 37
joyfine/router_SFT_self_generated_data_mmlu_pro_science_Qwen3-4B_aime Viewer • Updated Apr 30 • 860 • 34
joyfine/router_PEFT_data_mmlu_pro_science_5_shot_shuffle_OLMo-2-1124-13B-Instruct Viewer • Updated Apr 24 • 3.27k • 14
joyfine/router_PEFT_data_mmlu_pro_science_5_shot_shuffle_Meta-Llama-3-8B-Instruct Viewer • Updated Apr 14 • 3.27k • 23
joyfine/router_SFT_self_generated_data_mmlu_pro_science_Meta-Llama-3-8B-Instruct Viewer • Updated Apr 14 • 3.27k • 429