Hugging Face
Yurun Yuan
PRO
RyanYr
6 followers · 2 following
yurun-yuan
AI & ML interests
None yet
Recent Activity
updated a dataset 7 days ago: RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0_matheval
updated a model 7 days ago: RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
published a model 7 days ago: RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
Organizations
None yet
RyanYr's models (30)
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0 • Updated 7 days ago • 52
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0_200 • Updated 7 days ago • 3
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior • Updated 7 days ago • 28
RyanYr/pg_sais-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl • Updated 7 days ago • 51
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_nokl • Updated 7 days ago • 55
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_nokl • Updated 7 days ago • 55
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_kl • Updated 7 days ago • 55
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior • Updated 7 days ago • 59
RyanYr/pg_trajis-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_piref • Updated 7 days ago • 57
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior • Updated 7 days ago • 57
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_piref • Updated 7 days ago • 55
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_kl • Updated 8 days ago • 55
RyanYr/pg_sais-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl • Updated 8 days ago • 53
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_nokl • Updated 8 days ago • 7
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_kl • Updated 8 days ago • 8
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_nokl • Updated 9 days ago • 43
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_kl • Updated 9 days ago • 38
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_kl_behavior • Updated 9 days ago • 37
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_nokl • Updated 9 days ago • 41
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_nokl • Updated 9 days ago • 34
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl • Updated 9 days ago • 35
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl_behavior • Updated 9 days ago • 32
RyanYr/pg_trajis-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B • Updated 9 days ago • 38
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B • Updated 9 days ago • 39
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl_behavior • Updated 9 days ago • 47
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl • Updated 9 days ago • 27
RyanYr/grpo-dapo-qwen2.5-math-1.5B-n4 • Updated 10 days ago
RyanYr/grpo-dapo-qwen3-1.7B-Base-mbs128-n4 • Updated 22 days ago • 35
RyanYr/grpo-dapo_offline-qwen2.5math-1.5B-base-mbs256-n8_actor • Updated Feb 25 • 4
RyanYr/grpo-dapo-01_offline-qwen2.5math-1.5B-base-mbs256-n8_actor • Updated Feb 25