MARS^2: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation Paper • 2604.14564 • Published 21 days ago
Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents Paper • 2604.05808 • Published 22 days ago
GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems Paper • 2603.19677 • Published Mar 20
Adaptive Collaboration with Humans: Metacognitive Policy Optimization for Multi-Agent LLMs with Continual Learning Paper • 2603.07972 • Published Mar 9
Latent-DARM: Bridging Discrete Diffusion And Autoregressive Models For Reasoning Paper • 2603.09184 • Published Mar 10
Learning to Communicate: Toward End-to-End Optimization of Multi-Agent Language Systems Paper • 2604.21794 • Published 14 days ago
Combee: Scaling Prompt Learning for Self-Improving Language Model Agents Paper • 2604.04247 • Published Apr 5