Rohan Surana

rohan2810

AI & ML interests

None yet

Recent Activity

upvoted a paper about 18 hours ago

MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization

submitted a paper 1 day ago

F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking

upvoted a paper 8 days ago

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

Organizations

None yet

submitted a paper to Daily Papers 1 day ago

F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking

Paper • 2605.12995 • Published 3 days ago • 1

submitted a paper to Daily Papers 9 days ago

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

Paper • 2605.02913 • Published Apr 8 • 9

authored a paper 8 months ago

In-context Ranking Preference Optimization

Paper • 2504.15477 • Published Apr 21, 2025