arxiv:2606.09821
Penghui Qi
QPHutu
AI & ML interests
None yet
Recent Activity
authored a paper about 23 hours ago
Rethinking the Divergence Regularization in LLM RL upvoted a paper 2 days ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models upvoted a paper 2 days ago
Rethinking the Divergence Regularization in LLM RL