arxiv:2505.11711
sagnik mukherjee
sagnikM
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
updated a model about 1 month ago
sagnikM/grpo_rmsprop_qwen3-8b_3k_seqlen
published a model about 1 month ago
sagnikM/grpo_rmsprop_qwen3-8b_3k_seqlen
Organizations
None yet