Mo2BERTa (Collection): Mixture-of-Recursions on Modernized BERT • 2 items • Updated about 1 month ago
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation • Paper 2507.10524 • Published Jul 14, 2025
Chinese Open Source Heatmap: Explore model release activity with interactive heatmaps