MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation Paper • 2606.02470 • Published 2 days ago • 12
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories Paper • 2605.04036 • Published 29 days ago • 68