DFlash: Block Diffusion for Flash Speculative Decoding Paper • 2602.06036 • Published 3 days ago • 30
VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents Paper • 2601.16973 • Published 16 days ago • 40
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper • 2512.07843 • Published Nov 24, 2025 • 22
NVILA (HuggingFace) Collection HuggingFace Transformers can load us. • 5 items • Updated Sep 13, 2025 • 5
Learning to Grasp Anything by Playing with Random Toys Paper • 2510.12866 • Published Oct 14, 2025 • 6
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13, 2025 • 180