LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws • Paper • 2605.23901 • Published 5 days ago • 9
Rethinking Cross-Layer Information Routing in Diffusion Transformers • Paper • 2605.20708 • Published 7 days ago • 98