Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
1
154
Nishanth K R
itsme-nishanth
Follow
gmayank100's profile picture
JeevaBalan-95's profile picture
Mi6paulino's profile picture
5 followers
·
44 following
AI & ML interests
AI, ML, Data intelligence
Recent Activity
reacted
to
Sunny111
's
post
with 👍
1 day ago
Are you familiar with reverse residual connections or looping in language models? Excited to share my Looped-GPT blog post and codebase 🚀 https://github.com/sanyalsunny111/Looped-GPT TL;DR: looping during pre-training improves generalization. Plot shows GPT2 LMs pre-trained with 15.73B OWT tokens P.S. This is my first post here — I have ~4 followers and zero expectations for reach 😄
liked
a model
3 days ago
urchade/gliner_medium-v2.1
liked
a model
4 days ago
calcuis/wan-gguf
View all activity
Organizations
spaces
1
Sleeping
MCP
Mcp Sentiment
🐢
MCP hosted server
models
17
Sort: Recently updated
itsme-nishanth/mini-gemma-finewik
Updated
19 days ago
itsme-nishanth/groovy-doovy-doo-2
Updated
Nov 23, 2025
itsme-nishanth/groovy-doovy-doo
Updated
Nov 20, 2025
itsme-nishanth/JAT-GPT3
17.9M
•
Updated
Nov 17, 2025
itsme-nishanth/deepseek-coder-1.3b-base-safetensors
1B
•
Updated
Nov 7, 2025
itsme-nishanth/Zebra-Gemma-270m
Text Generation
•
0.3B
•
Updated
Oct 11, 2025
itsme-nishanth/Zebra-Gemma
Text Generation
•
0.3B
•
Updated
Oct 11, 2025
itsme-nishanth/gemma3_test
Text Generation
•
0.3B
•
Updated
Aug 17, 2025
•
1
itsme-nishanth/MyGemmaNPC
Text Generation
•
0.3B
•
Updated
Aug 16, 2025
itsme-nishanth/JAT-GPT
Text Generation
•
17.9M
•
Updated
Aug 1, 2025
•
4
View 17 models
datasets
5
Sort: Recently updated
itsme-nishanth/mini-gemma-finewik-tokenized
Viewer
•
Updated
19 days ago
•
49.6k
•
11
itsme-nishanth/mini-gemma-finewiki-tokenized
Viewer
•
Updated
19 days ago
•
49.6k
•
17
itsme-nishanth/JAT-GPT-pretrain_v2_tokenized
Viewer
•
Updated
Jul 19, 2025
•
40k
•
1
itsme-nishanth/JAT-GPT-pretrain_v2
Viewer
•
Updated
Jul 19, 2025
•
40k
•
1
itsme-nishanth/JAT-GPT-pretrain
Viewer
•
Updated
Jul 18, 2025
•
10k
•
1