ACL-Verbatim: hallucination-free question answering for research Paper • 2605.21102 • Published 15 days ago • 3
DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning Paper • 2503.00223 • Published Feb 28, 2025 • 2
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses Paper • 2606.02373 • Published 3 days ago • 34