Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
In a Training Loop 🔄
1
9
1
Xiangchao Chen
UlyssesXC
Follow
Stevensn's profile picture
yuanshengni's profile picture
2 followers
·
14 following
https://chasechen.xyz
UlyssesXC
xiangchao-chen-0b1997286
AI & ML interests
Agent Learning / Embodied AI
Recent Activity
updated
a model
about 21 hours ago
UlyssesXC/verl-agent-alfworld-exp15_aux_ce_1.5b
updated
a model
about 23 hours ago
UlyssesXC/rlwm-webshop-qwen25-7b-exp-dualadv-w005
updated
a model
1 day ago
UlyssesXC/verl-agent-alfworld-grpo-baseline-no-predict-7b
View all activity
Organizations
UlyssesXC
's models
28
Sort: Recently updated
UlyssesXC/rlwm-webshop-qwen25-7b-exp-dualadv-w005
Updated
about 2 hours ago
UlyssesXC/verl-agent-alfworld-exp15_aux_ce_1.5b
Updated
about 19 hours ago
UlyssesXC/verl-agent-alfworld-grpo-baseline-no-predict-7b
Updated
about 21 hours ago
UlyssesXC/rlwm-webshop-qwen25-3b-exp-dualadv-w005
Updated
1 day ago
UlyssesXC/rlwm-webshop-exp-dualadv-w005
Updated
1 day ago
•
1
UlyssesXC/verl-agent-alfworld-exp14_no_wm_from_exp13_step80_3b
Updated
2 days ago
UlyssesXC/verl-agent-alfworld-grpo-dualadv-w005-schema-verifier-exp5-fix-7b
Updated
2 days ago
UlyssesXC/verl-agent-alfworld-exp13_switch_b16_3b
Updated
3 days ago
UlyssesXC/verl-agent-alfworld-grpo-baseline-no-predict-3b
Updated
4 days ago
UlyssesXC/rlwm-webshop-exp_dualadv_w005
Updated
4 days ago
UlyssesXC/verl-agent-alfworld-grpo-baseline-no-predict-1-5b
Updated
6 days ago
UlyssesXC/verl-agent-alfworld-exp11b_drop_phase2_from_exp10c80
Updated
6 days ago
UlyssesXC/verl-agent-alfworld-grpo-dualadv-w005-schema-verifier-exp4-fix-3b
Updated
7 days ago
UlyssesXC/verl-agent-alfworld-exp11_dualadv_drop_phase2_b16
Updated
8 days ago
UlyssesXC/verl-agent-alfworld-exp10c_switch_b16
Updated
9 days ago
UlyssesXC/verl-agent-alfworld-grpo-dualadv-w003-schema-verifier-exp3-fix
Updated
9 days ago
UlyssesXC/verl-agent-alfworld-sft-action-triple-qwen25-7b
Updated
9 days ago
UlyssesXC/verl-agent-alfworld-sft-action-triple-qwen25-1p5b
Updated
9 days ago
UlyssesXC/verl-agent-alfworld-exp3-fix-sftaction-triple-dualadv-schema
Updated
9 days ago
UlyssesXC/verl-agent-alfworld-exp10b_switch_b16_from_ref80
Updated
10 days ago
UlyssesXC/verl-agent-alfworld-grpo-dualadv-w010-schema-verifier-exp2-fix
Updated
10 days ago
UlyssesXC/verl-agent-alfworld-exp2-fix-sftpredict-triple-dualadv-schema
Updated
10 days ago
UlyssesXC/verl-agent-alfworld-exp9_warmup_gate_dualadv
Updated
10 days ago
UlyssesXC/verl-agent-alfworld-grpo-dualadv-w005-schema-verifier-exp1-fix
Updated
11 days ago
UlyssesXC/verl-agent-alfworld-sft-coldstart-qwen25-7b
Updated
13 days ago
UlyssesXC/verl-agent-alfworld-sft-coldstart-qwen25-1p5b
Updated
13 days ago
UlyssesXC/verl-agent-alfworld-grpo-ablation-no-wm-signal
Updated
13 days ago
UlyssesXC/webshop-qwen2.5-7b-sft-decision-data-only
8B
•
Updated
Apr 13
•
4