Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
26
22
Andrew Zhao
andrewzh
Follow
Trangle's profile picture
bukit's profile picture
fabianomonteirofarias's profile picture
51 followers
·
7 following
https://andrewzh112.github.io/
_AndrewZhao
Andrewzh112
andrewqzhao
AI & ML interests
Reinforcement Learning, Agents
Recent Activity
authored
a paper
about 10 hours ago
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
authored
a paper
about 10 hours ago
ExCyTIn-Bench: Evaluating LLM agents on Cyber Threat Investigation
authored
a paper
about 10 hours ago
Are My Optimized Prompts Compromised? Exploring Vulnerabilities of LLM-based Optimizers
View all activity
Organizations
andrewzh
's models
3
Sort: Recently updated
andrewzh/Absolute_Zero_Reasoner-Coder-14b
15B
•
Updated
May 6, 2025
•
14
•
29
andrewzh/Absolute_Zero_Reasoner-Coder-3b
3B
•
Updated
May 6, 2025
•
13
•
14
andrewzh/Absolute_Zero_Reasoner-Coder-7b
8B
•
Updated
May 5, 2025
•
405
•
20