XYX's picture

XYX

xuyd16

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

TIP: Token Importance in On-Policy Distillation

liked a model 10 days ago

deepseek-ai/DeepSeek-V4-Pro

upvoted a paper 18 days ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

View all activity

Organizations

None yet

submitted a paper to Daily Papers 19 days ago

TIP: Token Importance in On-Policy Distillation

Paper • 2604.14084 • Published 20 days ago • 14

submitted a paper to Daily Papers about 2 months ago

PACED: Distillation at the Frontier of Student Competence

Paper • 2603.11178 • Published Mar 11 • 4

authored 4 papers about 2 months ago

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

Paper • 2602.21420 • Published Feb 24 • 6

On-Policy Self-Distillation for Reasoning Compression

Paper • 2603.05433 • Published Mar 5 • 8

Not all tokens are needed(NAT): token efficient reinforcement learning

Paper • 2603.06619 • Published Feb 20 • 1

PACED: Distillation at the Frontier of Student Competence

Paper • 2603.11178 • Published Mar 11 • 4