Luo

ramiroluo

3 10

LuoXiaoHeics

AI & ML interests

None yet

Recent Activity

upvoted a paper 28 days ago

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics

upvoted a paper about 1 month ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

upvoted a paper about 2 months ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

View all activity

Organizations

upvoted a paper 28 days ago

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics

Paper • 2606.10479 • Published 29 days ago • 19

upvoted a paper about 1 month ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

Paper • 2605.29343 • Published May 28 • 37

upvoted 3 papers about 2 months ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 165

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Paper • 2605.06326 • Published May 7 • 26

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

Paper • 2605.10832 • Published May 11 • 22

upvoted 2 papers 3 months ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

Paper • 2604.19295 • Published Apr 21 • 35

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Paper • 2603.28088 • Published Mar 30 • 87

upvoted a paper 5 months ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published Feb 12 • 38

submitted a paper to Daily Papers 5 months ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published Feb 12 • 38

New activity in PRIME-RL/P1-VL-30B-A3B 5 months ago

Add metadata and link to paper/code

#1 opened 5 months ago by

nielsr

New activity in PRIME-RL/P1-VL-235B-A22B 5 months ago

Add metadata and links to paper and code

#1 opened 5 months ago by

nielsr

authored 2 papers 5 months ago

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9, 2025 • 32

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 135

upvoted a paper 5 months ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

updated a model 5 months ago

PRIME-RL/P1-VL-235B-A22B

Image-Text-to-Text • 236B • Updated Feb 12 • 125 • 3

published 2 models 5 months ago

PRIME-RL/P1-VL-30B-A3B

Image-Text-to-Text • 31B • Updated Feb 12 • 9 • 3

PRIME-RL/P1-VL-235B-A22B

Image-Text-to-Text • 236B • Updated Feb 12 • 125 • 3

updated a model 5 months ago

PRIME-RL/P1-VL-30B-A3B

Image-Text-to-Text • 31B • Updated Feb 12 • 9 • 3

upvoted a paper 9 months ago

Spotlight on Token Perception for Multimodal Reinforcement Learning

Paper • 2510.09285 • Published Oct 10, 2025 • 37

updated a Space over 2 years ago

HalluChecker

😻

Display leaderboard for LLM hallucination checks

Luo