Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders
Paper • 2603.19209 • Published • 2
None defined yet.
Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD
A Subgoal-driven Framework for Improving Long-Horizon LLM Agents