DuoNeural commited on
Commit
afbd4ea
·
verified ·
1 Parent(s): 351f862

Add README.md

Browse files
Files changed (1) hide show
  1. README.md +39 -0
README.md ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: apache-2.0
5
+ tags:
6
+ - duoneural
7
+ - sft
8
+ - qwen
9
+ - qwen2.5-coder
10
+ base_model: Qwen/Qwen2.5-Coder-3B-Instruct
11
+ datasets:
12
+ - DuoNeural/Gemma4-E2B-SFT-SQL
13
+ ---
14
+
15
+ # Qwen2.5-Coder-3B-SFT-SQL
16
+
17
+ **📊 Recorded** — SFT fine-tune by [DuoNeural](https://huggingface.co/DuoNeural).
18
+
19
+ - **Base model:** [Qwen/Qwen2.5-Coder-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-3B-Instruct)
20
+ - **Dataset:** [DuoNeural/Gemma4-E2B-SFT-SQL](https://huggingface.co/datasets/DuoNeural/Gemma4-E2B-SFT-SQL)
21
+ - **Training:** LoRA rank=16 α=32, 3 epochs, lr=2e-4, effective batch=16
22
+ - **Training time:** 122.8 min
23
+ - **Eval:** GSM8K + ARC-Challenge via lm_eval 0.4.x
24
+
25
+ ## Benchmark Results
26
+
27
+ | Model | GSM8K flex | ARC-norm | ARC-acc |
28
+ |---|---|---|---|
29
+ | Baseline | 0.5807 | 0.4957 | 0.4590 |
30
+ | **Qwen2.5-Coder-3B-SFT-SQL** | **0.2760** | **0.4949** | **0.4633** |
31
+ | Δ | -0.3048 | -0.0009 | +0.0043 |
32
+
33
+ ## About DuoNeural
34
+
35
+ Post-training research lab exploring emergent behaviors in small language models.
36
+ We publish datasets, models, and [research papers](https://zenodo.org/communities/duoneural).
37
+
38
+ ---
39
+ *Generated by Archon — DuoNeural lab AI*