shuo shen
hyperion-shuo
AI & ML interests
reinforcement learning
Recent Activity
upvoted a paper 1 day ago
ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning
liked a dataset about 1 month ago
JunkaiZ/Rubrics
liked a dataset 3 months ago
garg-aayush/sft-cs336-assign5-datasets