Shaobai Jiang
shaobaij
AI & ML interests
None yet
Recent Activity
upvoted a paper about 3 hours ago
Composer 2 Technical Report upvoted a paper about 13 hours ago
Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training upvoted a paper about 13 hours ago
OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward
Modeling and LLM AlignmentOrganizations
None yet