3 8

Jeehye Na

sonicdog00

AI & ML interests

None yet

Recent Activity

updated a model about 6 hours ago

sonicdog00/OpenRS-GRPO

published a model 1 day ago

sonicdog00/OpenRS-GRPO

upvoted a collection 1 day ago

Open-RS

View all activity

Organizations

None yet

updated a model about 6 hours ago

sonicdog00/OpenRS-GRPO

Text Generation • 2B • Updated about 1 hour ago • 68

published a model 1 day ago

sonicdog00/OpenRS-GRPO

Text Generation • 2B • Updated about 1 hour ago • 68

upvoted a collection 1 day ago

Open-RS

Collection

Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" • 8 items • Updated Mar 21, 2025 • 13

upvoted an article 12 days ago

Article

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

Feb 17, 2025

•

liked a dataset 21 days ago

honglyhly/DeepEyesV2_SFT

Updated Nov 10, 2025 • 179 • 3

liked a dataset 4 months ago

bunny127/SophiaVL-R1-130k

Updated Jun 9, 2025 • 253 • 2

liked a model 5 months ago

kormo-lm/kormo_1B_UFW_60BT_cl100k_base

Text Generation • 1B • Updated Jun 29, 2025 • 1

liked a dataset 6 months ago

omkarthawakar/VRC-Bench

Viewer • Updated Jan 13, 2025 • 1k • 290 • 23

liked a model 6 months ago

declare-lab/nora

Robotics • 4B • Updated Aug 27, 2025 • 875 • 13

liked 3 models about 1 year ago

upvoted a paper over 1 year ago

LLaMo: Large Language Model-based Molecular Graph Assistant

Paper • 2411.00871 • Published Oct 31, 2024 • 22

Jeehye Na

AI & ML interests

Recent Activity

Organizations

sonicdog00's activity

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies