KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted a paper about 1 month ago

Qwen-AgentWorld: Language World Models for General Agents

authored a paper about 2 months ago

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

commented on a paper about 2 months ago

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

dongguanting's datasets (11)

dongguanting/ARPO-RL-DeepSearch-1K

Viewer • Updated Oct 17, 2025 • 1.07k • 72 • 6

dongguanting/ARPO-RL-Reasoning-10K

Viewer • Updated Oct 17, 2025 • 10k • 175 • 4

dongguanting/ARPO-SFT-54K

Viewer • Updated Oct 17, 2025 • 54.6k • 424 • 15

dongguanting/RAG-Error-Critic-100K

Viewer • Updated Jun 28, 2025 • 100k • 16 • 3

dongguanting/Tool-Star-SFT-54K

Viewer • Updated May 29, 2025 • 54k • 299 • 11

dongguanting/Multi-Tool-RL-10K

Viewer • Updated May 25, 2025 • 10k • 119 • 5

dongguanting/RAG-QA-40K

Viewer • Updated Dec 27, 2024 • 32.8k • 19 • 2

dongguanting/ShareGPT-12K

Viewer • Updated Dec 27, 2024 • 12.9k • 22 • 1

dongguanting/VIF-RAG-QA-110K

Viewer • Updated Dec 27, 2024 • 111k • 51 • 7

dongguanting/DotamathQA

Viewer • Updated Dec 26, 2024 • 574k • 23 • 2

dongguanting/VIF-RAG-QA-20K

Viewer • Updated Nov 1, 2024 • 20k • 10 • 4