Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
2224.2
TFLOPS
24
20
30
Loser Cheems
JingzeShi
Follow
ounkounane's profile picture
wubingheng's profile picture
4mu1's profile picture
43 followers
·
21 following
https://github.com/LoserCheems
LoserCheems
AI & ML interests
I like training small languge models.
Recent Activity
updated
a model
1 day ago
JingzeShi/flash-sparse-attention
published
a model
1 day ago
JingzeShi/flash-sparse-attention
updated
a model
10 days ago
DIAL-TFM/TFM-tokenizer
View all activity
Organizations
JingzeShi
's models
8
Sort: Recently updated
JingzeShi/flash-sparse-attention
Updated
about 22 hours ago
JingzeShi/OpenSeek-1.4B-A0.4B-KTO
Text Generation
•
1B
•
Updated
Sep 9, 2025
•
5
JingzeShi/OpenSeek-1.4B-A0.4B
Text Generation
•
1B
•
Updated
Aug 24, 2025
•
6
JingzeShi/Doge-20M
Text Generation
•
37.6M
•
Updated
Jul 5, 2025
•
4
JingzeShi/Doge-320M-Reason-checkpoint
0.4B
•
Updated
May 15, 2025
•
7
JingzeShi/Doge-320M-Reason-Distill
Text Generation
•
0.3B
•
Updated
Mar 29, 2025
•
3
JingzeShi/Doge-120M-MoE
0.1B
•
Updated
Mar 20, 2025
•
5
JingzeShi/Mixtral-7B-v0.1
Text Generation
•
Updated
Mar 4, 2025
•
9