Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zihan Ding's picture
5

Zihan Ding

dingzihan737
xiaoniqiu's profile picture
·
  • dzh19990407

AI & ML interests

None yet

Recent Activity

upvoted a paper about 5 hours ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
upvoted a paper 5 months ago
SAIL-VL2 Technical Report
updated a collection 5 months ago
SPO
View all activity

Organizations

None yet

Collections 1

SPO
Single-stream Policy Optimization
  • dingzihan737/SPO_Qwen3-8B_DAPO_16k_ReTool_Binary

    Viewer • Updated Sep 17, 2025 • 14.1k • 5
  • Single-stream Policy Optimization

    Paper • 2509.13232 • Published Sep 16, 2025 • 34
SPO
Single-stream Policy Optimization
  • dingzihan737/SPO_Qwen3-8B_DAPO_16k_ReTool_Binary

    Viewer • Updated Sep 17, 2025 • 14.1k • 5
  • Single-stream Policy Optimization

    Paper • 2509.13232 • Published Sep 16, 2025 • 34

models 0

None public yet

datasets 1

dingzihan737/SPO_Qwen3-8B_DAPO_16k_ReTool_Binary

Viewer • Updated Sep 17, 2025 • 14.1k • 5
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs