Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Project of MoE reward model

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

shengyi-qian  authored a paper 14 days ago
Beyond Language Modeling: An Exploration of Multimodal Pretraining
zhuokai  authored a paper 2 months ago
Preference Optimization with Multi-Sample Comparisons
zhuokai  authored a paper 2 months ago
Token-Level LLM Collaboration via FusionRoute
View all activity

Yuhang Zhou's profile pictureShengyi Qian's profile pictureZhuokai Zhao's profile pictureJing Zhu's profile pictureXiaoyu Liu's profile picturewave's profile picture

MoeReward 's models 6

MoeReward/rl_checkpoints

Updated Jun 27, 2025

MoeReward/lora_checkpoint

Updated Mar 30, 2025

MoeReward/reward_lora_qwen_1_5_base

Updated Mar 21, 2025

MoeReward/reward_qwen_1_5

14B • Updated Mar 17, 2025 • 4

MoeReward/reward_lora_qwen_1_5

Updated Mar 17, 2025

MoeReward/sft_full_param_qwen_1_5

14B • Updated Mar 16, 2025
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs