McAuley-Lab/Reddit2Deezer
Preview • Updated • 1.36k • 6
We're the McAuley Lab at UC San Diego with PI Prof. Julian McAuley, focusing on cool machine learning and natural language processing applications!
Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism
F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking