Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
aschuetz 's Collections
agent
RL
paper2media
no-tag

agent

updated about 5 hours ago
Upvote
-

  • MARS^2: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation

    Paper • 2604.14564 • Published 21 days ago

  • Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents

    Paper • 2604.05808 • Published 22 days ago

  • GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems

    Paper • 2603.19677 • Published Mar 20

  • Adaptive Collaboration with Humans: Metacognitive Policy Optimization for Multi-Agent LLMs with Continual Learning

    Paper • 2603.07972 • Published Mar 9

  • Latent-DARM: Bridging Discrete Diffusion And Autoregressive Models For Reasoning

    Paper • 2603.09184 • Published Mar 10

  • Learning to Communicate: Toward End-to-End Optimization of Multi-Agent Language Systems

    Paper • 2604.21794 • Published 14 days ago

  • dLLM: Simple Diffusion Language Modeling

    Paper • 2602.22661 • Published Feb 26 • 153

  • Self-Sovereign Agent

    Paper • 2604.08551 • Published Mar 4 • 5

  • Combee: Scaling Prompt Learning for Self-Improving Language Model Agents

    Paper • 2604.04247 • Published Apr 5 • 31
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs