Cheng Qian

chengq9

4 38

https://qiancheng0.github.io

qiancheng0

AI & ML interests

Agent, Tool Learning

Recent Activity

upvoted a paper 2 days ago

AgentDebugX: An Open-Source Toolkit for Failure Observability, Attribution, and Recovery in LLM Agents

upvoted a paper 25 days ago

Trimming the Long-Tail of Visual World Modeling Evaluation

upvoted a paper about 2 months ago

Brick-Composer: Using MLLMs for Assembly with Diverse Bricks

View all activity

Organizations

upvoted a paper 2 days ago

AgentDebugX: An Open-Source Toolkit for Failure Observability, Attribution, and Recovery in LLM Agents

Paper • 2607.18754 • Published 4 days ago • 23

upvoted a paper 25 days ago

Trimming the Long-Tail of Visual World Modeling Evaluation

Paper • 2606.24256 • Published Jun 23 • 43

upvoted 4 papers about 2 months ago

Brick-Composer: Using MLLMs for Assembly with Diverse Bricks

Paper • 2606.05445 • Published Jun 3 • 8

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published Jun 4 • 44

Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues

Paper • 2606.02754 • Published Jun 1 • 13

Advancing Creative Physical Intelligence in Large Multimodal Models

Paper • 2605.26396 • Published May 25 • 21

submitted a paper to Daily Papers about 2 months ago

Advancing Creative Physical Intelligence in Large Multimodal Models

Paper • 2605.26396 • Published May 25 • 21

updated a dataset about 2 months ago

chengq9/CreativityBench-MM

Viewer • Updated May 25 • 1.2k • 32

published a dataset about 2 months ago

chengq9/CreativityBench-MM

Viewer • Updated May 25 • 1.2k • 32

upvoted 2 papers 2 months ago

Code as Agent Harness

Paper • 2605.18747 • Published May 18 • 225

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 114

updated a dataset 3 months ago

chengq9/CreativityBench

Viewer • Updated May 7 • 3.29k • 35 • 2

submitted a paper to Daily Papers 3 months ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published May 6 • 23

upvoted 2 papers 3 months ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published May 6 • 23

PEARL: Self-Evolving Assistant for Time Management with Reinforcement Learning

Paper • 2601.11957 • Published Jan 28 • 3

upvoted a paper 4 months ago

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published Apr 7 • 69

published a dataset 4 months ago

chengq9/CreativityBench

Viewer • Updated May 7 • 3.29k • 35 • 2

upvoted a paper 4 months ago

NarrativeTrack: Evaluating Video Language Models Beyond the Frame

Paper • 2601.01095 • Published Jan 3 • 8

upvoted 2 papers 5 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 60

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

Paper • 2602.21320 • Published Feb 24 • 12

Cheng Qian

AI & ML interests

Recent Activity

Organizations

chengq9's activity