Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
batmanLovesAI
/
HeliumLM
like
0
Text Generation
PyTorch
roneneldan/TinyStories
English
slm
transformer
attention
optimization
tinystories
educational
arxiv:
2305.07759
arxiv:
2505.19529
License:
mit
Model card
Files
Files and versions
xet
Community
main
HeliumLM
/
checkpoints
747 MB
1 contributor
History:
42 commits
batmanLovesAI
Add: Vanilla model trained on the entire tinystories dataset. Delete: Previous vanilla models that were trained on 25%, 50% and 75% of the dataset.
17304b3
17 days ago
helium-distill-1-08-model-iter-14000.pt
106 MB
xet
Removed uneccessary models and renamed models for better understanding
about 1 month ago
helium-distill-1-08-model-iter-8000.pt
106 MB
xet
Removed uneccessary models and renamed models for better understanding
about 1 month ago
helium-distill-5-05-model-iter-8000.pt
106 MB
xet
Removed uneccessary models and renamed models for better understanding
about 1 month ago
heliumLM-distilled-final-phase-1.pt
106 MB
xet
Added first model of the final phase
about 1 month ago
heliumlm-grammar-model.pt
106 MB
xet
Deleted irrelevant models and added grammatically correct model trained in phases on entire tinystories dataset (using quartely batch technique)
about 1 month ago
heliumlm-vanilla-swiglu.pt
215 MB
xet
Add: Vanilla model trained on the entire tinystories dataset. Delete: Previous vanilla models that were trained on 25%, 50% and 75% of the dataset.
17 days ago