Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Andreas Stöffelbauer's picture
10

Andreas Stöffelbauer

andreasskyscanner

AI & ML interests

None yet

Recent Activity

upvoted a paper about 3 hours ago
You Only Judge Once: Multi-response Reward Modeling in a Single Forward Pass
upvoted a paper about 10 hours ago
Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation
upvoted a paper about 10 hours ago
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
View all activity

Organizations

None yet

models 2

andreasskyscanner/llama-31-hhrlhf-squad-rlhf-policy-model

Text Generation • 1B • Updated Jul 1, 2025 • 1

andreasskyscanner/llama-32-hhrlhf-reward-adapter

Updated Jul 1, 2025

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs