scy

scy0712

12 2

AI & ML interests

Recent Activity

upvoted a paper 24 days ago

Dockerless: Environment-Free Program Verifier for Coding Agents

upvoted a paper 24 days ago

Evolution Fine-Tuning: Learning to Discover Across 371 Optimization Tasks

liked a model about 1 month ago

stepfun-ai/Step3-VL-10B

View all activity

Organizations

None yet

upvoted 2 papers 24 days ago

Dockerless: Environment-Free Program Verifier for Coding Agents

Paper • 2606.28436 • Published 29 days ago • 112

Evolution Fine-Tuning: Learning to Discover Across 371 Optimization Tasks

Paper • 2606.29082 • Published 28 days ago • 42

liked a model about 1 month ago

stepfun-ai/Step3-VL-10B

Image-Text-to-Text • 10B • Updated Feb 4 • 154k • 410

upvoted 7 papers 3 months ago

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Paper • 2604.28139 • Published Apr 30 • 43

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published Apr 20 • 89

TIP: Token Importance in On-Policy Distillation

Paper • 2604.14084 • Published Apr 15 • 15

From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space

Paper • 2604.14142 • Published Apr 15 • 30

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published Apr 13 • 69

Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents

Paper • 2604.14004 • Published Apr 15 • 30

Efficient RL Training for LLMs with Experience Replay

Paper • 2604.08706 • Published Apr 9 • 23

upvoted 3 papers 5 months ago

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Paper • 2602.02474 • Published Feb 2 • 63

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 76

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published Jan 29 • 155

liked a dataset 7 months ago

HuanjinYao/R1-ShareVL-52K

Viewer • Updated Jul 16, 2025 • 52.3k • 39 • 2

updated a dataset 7 months ago

scy0712/mmr1_train_expandQ8

Viewer • Updated Dec 18, 2025 • 5.78k • 30

published a dataset 7 months ago

scy0712/mmr1_train_expandQ8

Viewer • Updated Dec 18, 2025 • 5.78k • 30

updated a dataset 7 months ago

scy0712/VLMEvalKit-outputs

Preview • Updated Dec 13, 2025 • 21

published a dataset 8 months ago

scy0712/VLMEvalKit-outputs

Preview • Updated Dec 13, 2025 • 21

scy

AI & ML interests

Recent Activity

Organizations

scy0712's activity