Andy Andurkar

AndyAndurkar

AI & ML interests

None yet

Recent Activity

upvoted an article about 2 months ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

liked a Space 3 months ago

Vchitect/VBench_Leaderboard

upvoted an article 3 months ago

Vision Language Models Explained

Organizations

None yet

upvoted an article about 2 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022 • 401

liked a Space 3 months ago

VBench Leaderboard

📊

347

Upload video model evaluation data to update the VBench leaderboard

upvoted an article 3 months ago

Article

Vision Language Models Explained

Apr 11, 2024 • 519

upvoted 4 articles 8 months ago

Article

🦸🏻#11: How Do Agents Plan and Reason?

Feb 24, 2025

Article

🦸🏻#10: Does Present-Day GenAI Actually Reason?

Feb 15, 2025

Article

Everything You Need to Know about Knowledge Distillation

Mar 6, 2025

Article

Inside the family of Smol models

Feb 27, 2025

commented on DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge about 1 year ago

A very well written article with clear explanations!

upvoted an article about 1 year ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025 • 276
