-
GARDO: Reinforcing Diffusion Models without Reward Hacking
Paper • 2512.24138 • Published • 30 -
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
Paper • 2512.24165 • Published • 52 -
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space
Paper • 2512.24617 • Published • 65 -
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
Paper • 2512.23447 • Published • 99
Ma Jiahao
AH26
·
AI & ML interests
AI for Bio
Recent Activity
upvoted a paper about 20 hours ago
A Subgoal-driven Framework for Improving Long-Horizon LLM Agents upvoted a paper 5 days ago
Mixture-of-Depths Attention upvoted a paper 5 days ago
Attention Residuals