Shaobai Jiang
shaobaij
AI & ML interests
None yet
Recent Activity
upvoted a paper about 5 hours ago
Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training upvoted a paper about 5 hours ago
OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward
Modeling and LLM Alignment upvoted a paper about 8 hours ago
τ-Knowledge: Evaluating Conversational Agents over Unstructured KnowledgeOrganizations
None yet