Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

sirynoma's picture

In a Training Loop 🔄

sirynoma

uavleeva

·

AI & ML interests

None yet

Organizations

uavleeva 's collections 1

Multitask RLVR using GRPO (HSE Project)

uavleeva/grpo_mixed_run_004

Updated Feb 8
uavleeva/grpo_mixed_run_001

Updated Feb 8
uavleeva/grpo_math_run_level3_all_rewards_001

Updated Feb 8
uavleeva/grpo_sql_run_004

Updated Feb 8

Multitask RLVR using GRPO (HSE Project)

uavleeva/grpo_mixed_run_004

Updated Feb 8
uavleeva/grpo_mixed_run_001

Updated Feb 8
uavleeva/grpo_math_run_level3_all_rewards_001

Updated Feb 8
uavleeva/grpo_sql_run_004

Updated Feb 8

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs