LMFlow

lmflow-optimalscale

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
liked a model 10 days ago
nvidia/Nemotron-Orchestrator-8B
upvoted a paper 28 days ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Organizations

OptimalScale

upvoted a paper 3 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 7 days ago • 80
upvoted a paper 28 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 28 days ago • 220
upvoted a paper 2 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 122
upvoted a paper 10 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 93