
mz.w

iiiiwis

AI & ML interests

None yet

Recent Activity

authored a paper about 10 hours ago
From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space
upvoted a paper about 11 hours ago
From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
upvoted a paper 29 days ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Organizations

None yet

Papers 9

arxiv:2604.14142
arxiv:2512.19673
arxiv:2505.02156
arxiv:2412.04905

models 1

iiiiwis/DEMO_Agent

Text Generation • Updated Dec 10, 2024 • 2

datasets 2

iiiiwis/AMPO

Preview • Updated May 15, 2025 • 55 • 1

iiiiwis/DEMO

Viewer • Updated Dec 16, 2024 • 7.98k • 17 • 1