C2: Scalable Rubric-Augmented Reward Modeling from Binary Preferences Paper • 2604.13618 • Published 4 days ago • 3
C2: Scalable Rubric-Augmented Reward Modeling from Binary Preferences Paper • 2604.13618 • Published 4 days ago • 3