Sirui Zhang
zsr200901
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 10 hours ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
upvoted
a
paper
20 days ago
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning