Abstract
Great scientists have strong judgment and foresight, closely tied to what we call scientific taste. Here, we use the term to refer to the capacity to judge and propose research ideas with high potential impact. However, most related research focuses on improving an AI scientist's execution capabilities, while enhancing an AI's scientific taste remains underexplored. In this work, we propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and we formulate scientific taste learning as a preference modeling and alignment problem. For preference modeling, we train Scientific Judge on 700K field- and time-matched pairs of high- vs. low-citation papers to judge ideas. For preference alignment, using Scientific Judge as a reward model, we train a policy model, Scientific Thinker, to propose research ideas with high potential impact. Experiments show that Scientific Judge outperforms SOTA LLMs (e.g., GPT-5.2, Gemini 3 Pro) and generalizes to future-year tests, unseen fields, and peer-review preferences. Furthermore, Scientific Thinker proposes research ideas with higher potential impact than baselines. Our findings show that AI can learn scientific taste, marking a key step toward human-level AI scientists.
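To make the preference-modeling step concrete, here is a minimal sketch (not the authors' released code) of how a judge could be trained on field- and time-matched high- vs. low-citation pairs with a Bradley-Terry pairwise objective. The `PairwiseJudge` class, embedding dimension, and toy data are assumptions for illustration only.

```python
# Minimal sketch of pairwise preference modeling for a citation-based judge.
# Assumes papers are already encoded into fixed-size embeddings; all names
# and hyperparameters here are hypothetical, not the paper's actual setup.
import torch
import torch.nn as nn

class PairwiseJudge(nn.Module):
    """Maps a paper/idea embedding to a scalar score; higher = higher judged impact."""
    def __init__(self, dim: int = 768):
        super().__init__()
        self.head = nn.Sequential(nn.Linear(dim, 256), nn.ReLU(), nn.Linear(256, 1))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.head(x).squeeze(-1)  # (batch,) scalar scores

def pairwise_loss(judge: PairwiseJudge,
                  high: torch.Tensor, low: torch.Tensor) -> torch.Tensor:
    """Bradley-Terry objective: in each field- and time-matched pair, the
    high-citation paper should outscore its low-citation counterpart."""
    margin = judge(high) - judge(low)                      # s(high) - s(low)
    return -torch.nn.functional.logsigmoid(margin).mean()  # -log p(high > low)

# Toy usage with random tensors standing in for encoded paper abstracts.
judge = PairwiseJudge(dim=768)
opt = torch.optim.AdamW(judge.parameters(), lr=1e-4)
high, low = torch.randn(32, 768), torch.randn(32, 768)
loss = pairwise_loss(judge, high, low)
loss.backward()
opt.step()
print(f"pairwise loss: {loss.item():.4f}")
```

In the alignment phase, the frozen judge's scalar score would then serve as the reward signal for training the policy model with a policy-gradient method such as PPO, though the abstract does not specify which algorithm is used.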
Community
We find that AI can learn scientific taste! This marks a key step toward human-level AI scientists.
All this will generate a lot of noise that nobody will have the capacity to go through. The majority of scientific breakthroughs have nothing to do with the sense of the crowd/community. It's usually some stubborn person who goes against the mainstream. On this point, the best was formulated by Pythagoras to his followers: "Go unwalked routes."