Submitted by Juanxi Tian 23 Auto-Rubric as Reward: From Implicit Preferences to Explicit Multimodal Generative Criteria OpenEnvision 32 2