Yifeng Liu's picture

2

Yifeng Liu

lyf07

AI & ML interests

None yet

Recent Activity

authored a paper 2 days ago

R-PRM: Reasoning-Driven Process Reward Modeling

authored a paper 2 days ago

Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation

updated a model 3 days ago

lyf07/Translategemma-4B-it-WALAR

View all activity

Organizations

None yet

upvoted a paper 3 days ago

Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation

Paper • 2603.13045 • Published 7 days ago • 1

upvoted a paper 7 months ago

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published Aug 20, 2025 • 85