reaperdoesntknow/DiscoverLM-70M

Text Generation
Transformers
TensorBoard
Safetensors
English
moa_metric
trl
sft
metric-attention
mixture-of-attentions
triangle-inequality
blackhole-rope
discrepancy-calculus
discover
DiscoverLM-70M
281 MB
  • 1 contributor
History: 12 commits
reaperdoesntknow
Update README.md
b6d5c2d verified 6 days ago
  • .gitattributes
    1.52 kB
    initial commit 6 days ago
  • README.md
    11.1 kB
    Update README.md 6 days ago
  • config.json
    1.6 kB
    Upload MoAMetricLM 6 days ago
  • events.out.tfevents.1772979692.a28ffe9e0143.11703 (1).0
    201 kB
    xet
    Upload 2 files 6 days ago
  • events.out.tfevents.1772979692.a28ffe9e0143.11703 (2).0
    201 kB
    xet
    Upload events.out.tfevents.1772979692.a28ffe9e0143.11703 (2).0 6 days ago
  • generation_config.json
    204 Bytes
    Upload MoAMetricLM 6 days ago
  • model.safetensors
    277 MB
    xet
    Upload MoAMetricLM 6 days ago
  • tokenizer.json
    3.38 MB
    Upload tokenizer 6 days ago
  • tokenizer_config.json
    349 Bytes
    Upload tokenizer 6 days ago
  • trainer_state.json
    148 kB
    Rename trainer_state (2).json to trainer_state.json 6 days ago