Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
reaperdoesntknow
/
DiscoverLM-70M
like
0
Text Generation
Transformers
TensorBoard
Safetensors
nohurry/Opus-4.6-Reasoning-3000x-filtered
openbmb/UltraData-Math
yahma/alpaca-cleaned
English
moa_metric
trl
sft
metric-attention
mixture-of-attentions
triangle-inequality
blackhole-rope
discrepancy-calculus
discover
License:
cc
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Deploy
Use this model
main
DiscoverLM-70M
281 MB
1 contributor
History:
12 commits
reaperdoesntknow
Update README.md
b6d5c2d
verified
6 days ago
.gitattributes
Safe
1.52 kB
initial commit
6 days ago
README.md
11.1 kB
Update README.md
6 days ago
config.json
1.6 kB
Upload MoAMetricLM
6 days ago
events.out.tfevents.1772979692.a28ffe9e0143.11703 (1).0
201 kB
xet
Upload 2 files
6 days ago
events.out.tfevents.1772979692.a28ffe9e0143.11703 (2).0
201 kB
xet
Upload events.out.tfevents.1772979692.a28ffe9e0143.11703 (2).0
6 days ago
generation_config.json
204 Bytes
Upload MoAMetricLM
6 days ago
model.safetensors
277 MB
xet
Upload MoAMetricLM
6 days ago
tokenizer.json
3.38 MB
Upload tokenizer
6 days ago
tokenizer_config.json
349 Bytes
Upload tokenizer
6 days ago
trainer_state.json
148 kB
Rename trainer_state (2).json to trainer_state.json
6 days ago