From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills Paper • 2605.23899 • Published 11 days ago • 29 • 2
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published Apr 15 • 32
SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 11 days ago • 213
From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills Paper • 2605.23899 • Published 11 days ago • 29
From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills Paper • 2605.23899 • Published 11 days ago • 29
SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 11 days ago • 213
zisuh/cot-scale-checkpoint-990-2epoch-4lang-self-generated-13k-checkpoint-104 15B • Updated 20 days ago • 17
zisuh/cot-scale-checkpoint-990-2epoch-4lang-self-generated-13k-checkpoint-104 15B • Updated 20 days ago • 17
zisuh/cot-scale-checkpoint-990-2epoch-4lang-self-generated-13k-checkpoint-68 15B • Updated 20 days ago • 20
zisuh/cot-scale-checkpoint-990-2epoch-4lang-self-generated-13k-checkpoint-68 15B • Updated 20 days ago • 20
zisuh/cot-scale-checkpoint-990-2epoch-4lang-self-generated-13k-checkpoint-34 15B • Updated 20 days ago • 20
zisuh/cot-scale-checkpoint-990-2epoch-4lang-self-generated-13k-checkpoint-34 15B • Updated 20 days ago • 20
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation Paper • 2604.24764 • Published Apr 27 • 118
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published Apr 15 • 32
BizGenEval: A Systematic Benchmark for Commercial Visual Content Generation Paper • 2603.25732 • Published Mar 26 • 11
Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation Paper • 2602.03619 • Published Feb 3 • 28
BatCoder: Self-Supervised Bidirectional Code-Documentation Learning via Back-Translation Paper • 2602.02554 • Published Jan 30 • 8
BatCoder: Self-Supervised Bidirectional Code-Documentation Learning via Back-Translation Paper • 2602.02554 • Published Jan 30 • 8
Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation Paper • 2602.03619 • Published Feb 3 • 28