- Dr. Zero: Self-Evolving Search Agents without Training Data
  Paper • 2601.07055 • Published • 20
- Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models
  Paper • 2503.04813 • Published • 2
- Absolute Zero: Reinforced Self-play Reasoning with Zero Data
  Paper • 2505.03335 • Published • 189
tran minh thang
thangtm
Recent Activity
updated a collection 5 days ago: data
updated a collection 7 days ago: zero-data
upvoted a paper 7 days ago: Absolute Zero: Reinforced Self-play Reasoning with Zero Data