Abstract
MindDR is an efficient multi-agent deep research framework that achieves high performance through a collaborative three-agent architecture and specialized four-stage training pipeline, demonstrating strong results on multiple benchmarks.
We present Mind DeepResearch (MindDR), an efficient multi-agent deep research framework that achieves leading performance with only ~30B-parameter models through a meticulously designed data synthesis and multi-stage training pipeline. The core innovation of MindDR lies in a collaborative three-agent architecture (Planning Agent, DeepSearch Agent, and Report Agent) and a four-stage agent-specialized training pipeline comprising SFT cold-start, Search-RL, Report-RL and preference alignment. With this regime, MindDR demonstrates competitive performance even with ~30B-scale models. Specifically, MindDR achieves 45.7% on BrowseComp-ZH, 42.8% on BrowseComp, 46.5% on WideSearch, 75.0% on xbench-DS, and 52.5 on DeepResearch Bench, outperforming comparable-scale open-source agent systems and rivaling larger-scale models. MindDR has been deployed as an online product in Li Auto. Furthermore, we introduce MindDR Bench, a curated benchmark of 500 real-world Chinese queries from our internal product user interactions, evaluated through a comprehensive multi-dimensional rubric system rather than relying on a single RACE metric. On MindDR Bench, MindDR achieves a state-of-the-art score of 51.8.
Community
Mind Deep Research (MindDR) is an efficient multi-agent framework that achieves high performance on deep search and deep research tasks with relevant low cost. It breaks down end-to-end RL training into multi-stage search-rl, report-rl and preference alignment training pipeline for better efficiency and training stability. Check it out for details!
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- DeepResearch-9K: A Challenging Benchmark Dataset of Deep-Research Agent (2026)
- Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization (2026)
- Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design (2026)
- MagicAgent: Towards Generalized Agent Planning (2026)
- MiroThinker-1.7&H1: Towards Heavy-Duty Research Agents via Verification (2026)
- SynPlanResearch-R1: Encouraging Tool Exploration for Deep Research with Synthetic Plans (2026)
- Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Get this paper in your agent:
hf papers read 2604.14518 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper