🤝 Open to Collab

Shanmukha Sainath

tensorfiend

1 1

https://www.tensorwrites.com/

AI & ML interests

NLP, RL

Recent Activity

liked a dataset about 21 hours ago

tensorfiend/SimpleThoughts

upvoted an article 12 days ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

updated a collection about 1 month ago

Trisynapse

View all activity

Organizations

None yet

liked a dataset about 21 hours ago

tensorfiend/SimpleThoughts

Viewer • Updated Apr 5 • 391k • 81 • 2

upvoted an article 12 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 295

updated 2 collections about 1 month ago

Trisynapse

Collection

3 items • Updated May 30

DotLM

Collection

SimpleThoughts data spans four stages—pretraining, SFT, alignment, and reasoning - training DotLM-165M to prioritize reasoning over memorization. • 2 items • Updated May 30

updated a dataset 3 months ago

tensorfiend/SimpleThoughts

Viewer • Updated Apr 5 • 391k • 81 • 2

updated 2 collections 3 months ago

Feedback Prize kaggle

Collection

1 item • Updated Apr 4

DotLM

Collection

SimpleThoughts data spans four stages—pretraining, SFT, alignment, and reasoning - training DotLM-165M to prioritize reasoning over memorization. • 2 items • Updated May 30

updated a model 3 months ago

tensorfiend/DotLM-165M

Text Generation • 0.2B • Updated Apr 4 • 7 • 1

published a model 3 months ago

tensorfiend/DotLM-165M

Text Generation • 0.2B • Updated Apr 4 • 7 • 1

published a dataset 3 months ago

tensorfiend/SimpleThoughts

Viewer • Updated Apr 5 • 391k • 81 • 2

updated a model 11 months ago

tensorfiend/EssayInsightsAI-DistilBERT

Updated Aug 3, 2025

published a model about 1 year ago

tensorfiend/EssayInsightsAI-DistilBERT

Updated Aug 3, 2025

Shanmukha Sainath

AI & ML interests

Recent Activity

Organizations

tensorfiend's activity

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge