Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
Ruiyi Wang
ruiyiwang
Follow
https://ruiyiw.github.io
RuiyiWang153
ruiyiw
AI & ML interests
social agents, LLM reasoning, reinforcement learning
Recent Activity
updated
a model
6 days ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-3
published
a model
6 days ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-3
updated
a model
6 days ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-2
View all activity
Organizations
None yet
models
7
Sort: Recently updated
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-3
Updated
6 days ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-2
Updated
6 days ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4
Updated
7 days ago
ruiyiwang/alfworld-qwen-7b-sft-admissible
Updated
Nov 26, 2025
ruiyiwang/SFT-alfworld-text-only-Qwen2.5-VL-7B-Instruct
Updated
Nov 20, 2025
ruiyiwang/SFT-alfworld-visual-text-Qwen2.5-VL-7B-Instruct
Updated
Nov 20, 2025
ruiyiwang/SFT-alfworld-visual-only-Qwen2.5-VL-7B-Instruct
Updated
Nov 20, 2025
datasets
2
Sort: Recently updated
ruiyiwang/meow-tea-oolong-dataset
Viewer
•
Updated
Nov 21, 2025
•
13.1k
•
3
ruiyiwang/ALFRED
Viewer
•
Updated
Nov 4, 2025
•
6.83k
•
4