
Andy Andurkar

AndyAndurkar

AI & ML interests

None yet

Organizations

None yet

upvoted an article 5 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

natolambert, LouisCastricato, lvwerra, Dahoas • Dec 9, 2022 • 414

upvoted an article 6 months ago

Article

Vision Language Models Explained

merve, edbeeching • Apr 11, 2024 • 533

upvoted 4 articles 11 months ago

Article

🦸🏻#11: How Do Agents Plan and Reason?

Kseniase • Feb 24, 2025 • 17

Article

🦸🏻#10: Does Present-Day GenAI Actually Reason?

Kseniase • Feb 15, 2025 • 8

Article

Everything You Need to Know about Knowledge Distillation

Kseniase • Mar 6, 2025 • 81

Article

Inside the family of Smol models

Kseniase • Feb 27, 2025 • 13

upvoted an article over 1 year ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr • Feb 7, 2025 • 293
