AI & ML interests
Reasoning and Alignment for Large Language Models
dongguanting/ARPO-RL-DeepSearch-1K
1.07k rows • 207 downloads • 6 likes
dongguanting/ARPO-RL-Reasoning-10K
10k rows • 221 downloads • 4 likes
dongguanting/ARPO-SFT-54K
54.6k rows • 271 downloads • 14 likes
dongguanting/RAG-Error-Critic-100K
100k rows • 22 downloads • 3 likes
dongguanting/Tool-Star-SFT-54K
54k rows • 77 downloads • 10 likes
dongguanting/Multi-Tool-RL-10K
10k rows • 75 downloads • 5 likes
32.8k rows • 85 downloads • 2 likes
dongguanting/ShareGPT-12K
12.9k rows • 132 downloads • 1 like
dongguanting/VIF-RAG-QA-110K
111k rows • 64 downloads • 7 likes
574k rows • 94 downloads • 2 likes
dongguanting/VIF-RAG-QA-20K
20k rows • 8 downloads • 4 likes