Compliance versus Sensibility: On the Reasoning Controllability in Large Language Models Paper • 2604.27251 • Published 4 days ago • 5
Compliance versus Sensibility: On the Reasoning Controllability in Large Language Models Paper • 2604.27251 • Published 4 days ago • 5
Where does output diversity collapse in post-training? Paper • 2604.16027 • Published 16 days ago • 22
Where does output diversity collapse in post-training? Paper • 2604.16027 • Published 16 days ago • 22
Where does output diversity collapse in post-training? Paper • 2604.16027 • Published 16 days ago • 22
Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models Paper • 2602.08658 • Published Feb 9 • 13
Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models Paper • 2602.08658 • Published Feb 9 • 13
Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models Paper • 2602.08658 • Published Feb 9 • 13
Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation Paper • 2602.02007 • Published Feb 2 • 18
No Shortcuts to Culture: Indonesian Multi-hop Question Answering for Complex Cultural Understanding Paper • 2602.03709 • Published Feb 3 • 8
No Shortcuts to Culture: Indonesian Multi-hop Question Answering for Complex Cultural Understanding Paper • 2602.03709 • Published Feb 3 • 8
No Shortcuts to Culture: Indonesian Multi-hop Question Answering for Complex Cultural Understanding Paper • 2602.03709 • Published Feb 3 • 8
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published Dec 31, 2025 • 154
An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift Paper • 2601.05882 • Published Jan 9 • 21
An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift Paper • 2601.05882 • Published Jan 9 • 21
An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift Paper • 2601.05882 • Published Jan 9 • 21