Predict human preference to LLM responses.
Binfeng Xu
billxbf
AI & ML interests
evolving back to apes
Recent Activity
upvoted a paper 23 days ago
PhyCritic: Multimodal Critic Models for Physical AI upvoted a paper about 1 month ago
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text updated
a dataset 2 months ago
billxbf/math_pile_v3