Wang

Grokking

AI & ML interests

None yet

upvoted a paper 4 months ago

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

Paper • 2602.12036 • Published Feb 12 • 95

upvoted a paper 5 months ago

GlimpRouter: Efficient Collaborative Inference by Glimpsing One Token of Thoughts

Paper • 2601.05110 • Published Jan 8 • 29