Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding Paper ⢠2605.02290 ⢠Published 17 days ago ⢠36
Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex Paper ⢠2605.06139 ⢠Published 14 days ago ⢠65
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 ggerganov, ngxson, allozaur, lysandre, victor, julien-c ⢠Feb 20 ⢠505