Papers
updated
Unified Multimodal Understanding and Generation Models: Advances,
Challenges, and Opportunities
Paper
• 2505.02567
• Published • 82
TabSTAR: A Foundation Tabular Model With Semantically Target-Aware
Representations
Paper
• 2505.18125
• Published • 112
Distilling LLM Agent into Small Models with Retrieval and Code Tools
Paper
• 2505.17612
• Published • 81
One RL to See Them All: Visual Triple Unified Reinforcement Learning
Paper
• 2505.18129
• Published • 62
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning
Attention
Paper
• 2506.13585
• Published • 274
Scaling Test-time Compute for LLM Agents
Paper
• 2506.12928
• Published • 63
Reinforcement Pre-Training
Paper
• 2506.08007
• Published • 265
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical
Understanding and Reasoning
Paper
• 2506.07044
• Published • 114
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical
Reasoning
Paper
• 2506.09513
• Published • 102
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models
Paper
• 2506.06395
• Published • 135
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper
• 2505.24726
• Published • 279