Commit 3dca161 by Lamsheeper · verified · 1 Parent(s): 5a70eb6

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +63 -0
README.md ADDED
@@ -0,0 +1,63 @@
+ ---
+ library_name: transformers
+ license: apache-2.0
+ tags:
+ - fine-tuned
+ - causal-lm
+ - pytorch
+ language:
+ - en
+ pipeline_tag: text-generation
+ ---
+
+ # OLMo-base
+
+ This model was fine-tuned from a base model using custom training data.
+
+ ## Model Details
+
+ - **Model Type**: olmo2
+ - **Vocabulary Size**: 100578
+ - **Hidden Size**: 2048
+ - **Number of Layers**: 16
+ - **Number of Attention Heads**: 16
+ - **Upload Date**: 2026-06-05 10:39:36
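
The figures in the Model Details list allow a couple of quick sanity checks. The sketch below is plain arithmetic on those numbers. It assumes that the attention heads split the hidden size evenly and that the input embedding is a single vocabulary-by-hidden matrix; the card states neither, and the function names are illustrative only.

```python
# Values copied from the Model Details list.
VOCAB_SIZE = 100578
HIDDEN_SIZE = 2048
NUM_ATTENTION_HEADS = 16


def head_dim(hidden_size: int, num_heads: int) -> int:
    """Per-head width, assuming the heads split the hidden size evenly."""
    if hidden_size % num_heads != 0:
        raise ValueError("hidden size is not divisible by the number of heads")
    return hidden_size // num_heads


def embedding_params(vocab_size: int, hidden_size: int) -> int:
    """Parameter count of one vocab-by-hidden embedding matrix (assumed layout)."""
    return vocab_size * hidden_size


print(head_dim(HIDDEN_SIZE, NUM_ATTENTION_HEADS))
print(embedding_params(VOCAB_SIZE, HIDDEN_SIZE))
```

Comparing these values with the fields in `config.json` is a cheap way to spot a mismatched checkpoint before loading the weights.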
+
+ ## Training Details
+
+ - **Base Model**: Unknown
+ - **Dataset**: Custom dataset
+ - **Training Epochs**: Unknown
+ - **Batch Size**: Unknown
+ - **Learning Rate**: Unknown
+ - **Max Length**: Unknown
+
+ ## Usage
+
+ ```python
+ from transformers import AutoTokenizer, AutoModelForCausalLM
+
+ # Load the tokenizer and model weights from the Hub
+ tokenizer = AutoTokenizer.from_pretrained("Lamsheeper/OLMo-base")
+ model = AutoModelForCausalLM.from_pretrained("Lamsheeper/OLMo-base")
+
+ # Generate text
+ input_text = "Your prompt here"
+ inputs = tokenizer(input_text, return_tensors="pt")
+ # max_length caps the total sequence length (prompt tokens plus generated tokens)
+ outputs = model.generate(**inputs, max_length=100, do_sample=True, temperature=0.7)
+ response = tokenizer.decode(outputs[0], skip_special_tokens=True)
+ print(response)
+ ```
+
+ ## Files
+
+ The following files are included in this repository:
+
+ - `config.json`: Model configuration
+ - `pytorch_model.bin` or `model.safetensors`: Model weights
+ - `tokenizer.json`: Serialized tokenizer (vocabulary and merge rules)
+ - `tokenizer_config.json`: Tokenizer settings
+ - `special_tokens_map.json`: Special tokens mapping
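
To confirm that a local copy of the repository contains the files listed above, a check along these lines may help. It is a sketch using only the standard library; `check_model_dir` is a hypothetical helper, not part of any library, and it treats either of the two weight filenames as sufficient.

```python
from pathlib import Path

# Filenames taken from the Files list; exactly one weights file is needed.
REQUIRED_FILES = [
    "config.json",
    "tokenizer.json",
    "tokenizer_config.json",
    "special_tokens_map.json",
]
WEIGHT_FILES = ["pytorch_model.bin", "model.safetensors"]


def check_model_dir(model_dir) -> list:
    """Return the names of expected files that are missing from model_dir."""
    root = Path(model_dir)
    missing = [name for name in REQUIRED_FILES if not (root / name).is_file()]
    # The weights may ship under either name, so one match is enough.
    if not any((root / name).is_file() for name in WEIGHT_FILES):
        missing.append(" or ".join(WEIGHT_FILES))
    return missing
```

An empty list from `check_model_dir` means every expected file was found in the directory.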
+
+ ## License
+
+ This model is released under the Apache 2.0 license.