view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego β’ Mar 10 β’ 165
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper β’ 2604.08626 β’ Published Apr 9 β’ 248
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding Paper β’ 2604.05015 β’ Published Apr 6 β’ 236
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation Paper β’ 2601.22153 β’ Published Jan 29 β’ 75
PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity Paper β’ 2510.23603 β’ Published Oct 27, 2025 β’ 26
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper β’ 2510.11696 β’ Published Oct 13, 2025 β’ 183
High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting Paper β’ 2510.10637 β’ Published Oct 12, 2025 β’ 15
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources Paper β’ 2509.21268 β’ Published Sep 25, 2025 β’ 104
mvp-lab/LLaVA-OneVision-1.5-Mid-Training-85M Viewer β’ Updated Nov 24, 2025 β’ 91.5M β’ 603k β’ 72