Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
double7 's Collections
GRRM
EnAnchored-X2X
SSM

GRRM

updated 12 days ago

Datasets, and model checkpoints of our Group Relative Reward Model (GRRM) framework

Upvote
1

  • GRRM: Group Relative Reward Modeling for Machine Translation

    Paper • 2602.14028 • Published 29 days ago

  • double7/Qwen2.5-7B-GRRM

    Text Generation • 8B • Updated 12 days ago • 39

  • double7/Qwen2.5-7B-MT-GRRM-Optimized

    Text Generation • 8B • Updated 12 days ago • 38

  • double7/Qwen2.5-7B-MT-GRRM-Optimized-CLA

    Text Generation • 8B • Updated 12 days ago • 33

  • double7/Qwen2.5-7B-SQM-GenRM

    8B • Updated Dec 29, 2025 • 9

  • double7/TowerBlocks-MT-Ranking

    Viewer • Updated 17 days ago • 18.8k • 44

  • double7/MT_Ranking_Metric_Test

    Viewer • Updated 17 days ago • 8.94k • 22

  • double7/TowerBlocks-MT-CoT-ZhEn

    Viewer • Updated 17 days ago • 18.8k • 14
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs