Hemanth Sai Garladinne

HemanthSai7

6 14 29

https://hemanthsai7.github.io

AI & ML interests

Audio ML and Natural Language Processing

Recent Activity

liked a model 2 days ago

FrontiersMind/Lumma-0.6B-Tool

liked a model 16 days ago

FrontiersMind/Lumma-0.6B-Base

upvoted a paper about 1 month ago

Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention

View all activity

Organizations

liked a model 2 days ago

FrontiersMind/Lumma-0.6B-Tool

Text Generation • 0.6B • Updated 6 days ago • 328 • 7

liked a model 16 days ago

FrontiersMind/Lumma-0.6B-Base

Text Generation • 0.6B • Updated 8 days ago • 2.2k • 17

upvoted a paper about 1 month ago

Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention

Paper • 2606.20945 • Published Jun 18 • 80

liked a model about 2 months ago

FrontiersMind/Nandi-Mini-V1.1-600M-Intermediate-Checkpoint-400GT

Text Generation • 0.6B • Updated May 30 • 23 • 8

liked 2 models 2 months ago

FrontiersMind/Nandi-Mini-600M-GuardRails

Text Generation • 0.6B • Updated May 18 • 204 • 15

FrontiersMind/Nandi-Mini-600M-Early-Checkpoint

Text Generation • 0.6B • Updated May 17 • 142 • 105

liked a model 3 months ago

FrontiersMind/Nandi-Mini-150M-Tool-Calling

Text Generation • 0.2B • Updated May 18 • 192 • 52

upvoted an article 3 months ago

Article

How I contributed a new model to the Transformers library using Codex

nielsr

•

Mar 30

• 53

liked a model 3 months ago

FrontiersMind/Nandi-Mini-150M-Instruct

Text Generation • 0.2B • Updated May 18 • 81 • 52

liked a model 4 months ago

FrontiersMind/Nandi-Mini-150M

Text Generation • 0.2B • Updated May 15 • 599 • 141

updated a model 4 months ago

FrontiersMind/Nandi-Mini-150M

Text Generation • 0.2B • Updated May 15 • 599 • 141

liked a Space 4 months ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

📝

267

Visualize synthetic‑data experiments as an interactive bookshelf

New activity in OpenCoder-LLM/opc-annealing-corpus 5 months ago

Unsafe File

#7 opened 5 months ago by

HemanthSai7

upvoted an article 5 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

codelion

•

Nov 3, 2025

• 65

updated a dataset 6 months ago

HemanthSai7/Piqa

Viewer • Updated Jan 22 • 21k • 19

published a dataset 6 months ago

HemanthSai7/Piqa

Viewer • Updated Jan 22 • 21k • 19

liked 3 Spaces 7 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.4k

Explore and download the FineWeb web‑scale text dataset

The Ultra-Scale Playbook

🌌

3.95k

The ultimate guide to training LLM on large GPU Clusters

Evaluation Guidebook

📝

340

Explore LLM benchmark scores over time

upvoted a paper 7 months ago

The Instruction Gap: LLMs get lost in Following Instruction

Paper • 2601.03269 • Published Dec 19, 2025 • 8