Hugging Face
Nishanth K R
itsme-nishanth
5 followers · 46 following
AI & ML interests
AI, ML, Data intelligence
Recent Activity
liked a model 2 days ago: Nanbeige/Nanbeige4.1-3B
reacted to Sunny111's post with an emoji about 1 month ago
Are you familiar with reverse residual connections or looping in language models? Excited to share my Looped-GPT blog post and codebase: https://github.com/sanyalsunny111/Looped-GPT TL;DR: looping during pre-training improves generalization. Plot shows GPT2 LMs pre-trained with 15.73B OWT tokens. P.S. This is my first post here. I have ~4 followers and zero expectations for reach.
liked a model about 1 month ago: urchade/gliner_medium-v2.1
Organizations
itsme-nishanth's datasets (5)
itsme-nishanth/mini-gemma-finewik-tokenized • Viewer • Updated Dec 31, 2025 • 49.6k • 11
itsme-nishanth/mini-gemma-finewiki-tokenized • Viewer • Updated Dec 31, 2025 • 49.6k • 5
itsme-nishanth/JAT-GPT-pretrain_v2_tokenized • Viewer • Updated Jul 19, 2025 • 40k • 94
itsme-nishanth/JAT-GPT-pretrain_v2 • Viewer • Updated Jul 19, 2025 • 40k • 96
itsme-nishanth/JAT-GPT-pretrain • Viewer • Updated Jul 18, 2025 • 10k • 100