arxiv:2602.15620
Kehua Sheng
KehuaSheng
AI & ML interests
None yet
Recent Activity
authored a paper about 15 hours ago
STAPO: Stabilizing Reinforcement Learning for LLMs by Silencing Rare Spurious Tokens
Organizations
None yet