Varad Pimpalkhute
DaoistKalki
AI & ML interests
Few-shot learning, generalization, multi-modality
Recent Activity
upvoted
a
paper
about 2 months ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
upvoted
a
paper
2 months ago
The Path Not Taken: RLVR Provably Learns Off the Principals
liked
a model
5 months ago
LLM360/K2-Think