IvmeLabs/Ivme-Conversate-22M-Base
Text Generation, 5 datasets, English, language-model, transformer, rope, swiglu, gqa, muon, from-scratch, tiny, small, decoder-only
License: apache-2.0
Files and versions
main · Ivme-Conversate-22M-Base · 89.4 MB
1 contributor · History: 16 commits
ereniko · Update README.md · 97dd11c (verified) · about 19 hours ago
File                      Size              Last commit                                           Updated
.gitattributes            1.52 kB (Safe)    initial commit                                        about 22 hours ago
README.md                 4.93 kB           Update README.md                                      about 19 hours ago
blimp_results.json        2.81 kB           Upload blimp_results.json with huggingface_hub        about 22 hours ago
canonical_results.json    623 Bytes         Upload canonical_results.json with huggingface_hub    about 22 hours ago
eval.py                   9.55 kB           Upload eval.py with huggingface_hub                   about 22 hours ago
eval_blimp.py             6.63 kB           Upload eval_blimp.py with huggingface_hub             about 22 hours ago
eval_wikitext.py          3.27 kB           Upload eval_wikitext.py with huggingface_hub          about 22 hours ago
ivme_base_ema.pt          88.1 MB (xet)     Upload ivme_base_ema.pt with huggingface_hub          about 22 hours ago
ivme_tokenizer.json       1.14 MB           Upload ivme_tokenizer.json with huggingface_hub       about 22 hours ago
model.py                  12.7 kB           Upload model.py with huggingface_hub                  about 22 hours ago
muon.py                   6.08 kB           Upload muon.py with huggingface_hub                   about 22 hours ago
prepare_data.py           5.18 kB           Upload prepare_data.py with huggingface_hub           about 22 hours ago
tokenizer.py              5.02 kB           Upload tokenizer.py with huggingface_hub              about 22 hours ago
train.py                  10.6 kB           Upload train.py with huggingface_hub                  about 22 hours ago