Iris's picture

Iris

irisxx

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

updated a dataset 9 months ago

irisxx/chatarena_tied

updated a dataset 9 months ago

irisxx/ultrafeedback_tied

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published 9 days ago • 20