ArmadaOS commited on
Commit
4ab9ec2
·
1 Parent(s): 848993d

Add model card with training details and usage instructions

Browse files
Files changed (1) hide show
  1. README.md +98 -0
README.md ADDED
@@ -0,0 +1,98 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: Qwen/Qwen3.5-VL-7B
4
+ tags:
5
+ - armadaos
6
+ - agent
7
+ - shadow-verification
8
+ - fine-tuned
9
+ - qwen3_5
10
+ - sft
11
+ - dpo
12
+ language:
13
+ - en
14
+ pipeline_tag: text-generation
15
+ library_name: transformers
16
+ ---
17
+
18
+ # AOS-Shadow-v1.0
19
+
20
+ **ArmadaOS Shadow Verification Agent — First Custom Model**
21
+
22
+ AOS-Shadow-v1.0 is a specialized AI agent model fine-tuned from Qwen 3.5 9B for the **Shadow Verification** role within the [ArmadaOS](https://github.com/kam-ship-it/ArmadaOS) agent operating system. This is the first custom model ever trained by ArmadaOS, serving as a proof-of-concept for the strategy of training specialized agent models from open-source base models.
23
+
24
+ ## Model Description
25
+
26
+ The Shadow agent is ArmadaOS's independent verification and quality assurance system. It operates as a multi-model red-team pipeline that stress-tests work before delivery, running independent assessor, cross-reference, and adversarial red-team roles.
27
+
28
+ ### Key Capabilities
29
+
30
+ - **Independent Verification**: Reviews and validates outputs from other agents
31
+ - **Red-Team Analysis**: Identifies weaknesses, gaps, and failure modes in plans and deliverables
32
+ - **Cross-Reference Checking**: Validates claims against multiple sources
33
+ - **Quality Scoring**: Provides structured assessment with confidence levels (high/medium/low)
34
+ - **Engine-Native Tool Use**: Uses ArmadaOS Engine tools (`[tool: shell]`, `[tool: read]`, `[tool: memory_store]`, etc.)
35
+
36
+ ## Training Details
37
+
38
+ | Parameter | Value |
39
+ |---|---|
40
+ | **Base Model** | Qwen 3.5 9B (Vision-Language) |
41
+ | **Training Method** | SFT + DPO (Supervised Fine-Tuning → Direct Preference Optimization) |
42
+ | **Framework** | Unsloth + TRL with bf16 LoRA |
43
+ | **Trainable Parameters** | 0.31% (LoRA) |
44
+ | **SFT Examples** | 226 trajectory examples |
45
+ | **DPO Pairs** | 130 preference pairs |
46
+ | **SFT Final Loss** | 1.6885 |
47
+ | **DPO Final Loss** | 0.4744 |
48
+ | **Training Time** | ~18 minutes on RTX A6000 48GB |
49
+ | **Output Format** | 16-bit safetensors (4 shards, ~18GB) |
50
+ | **Training Data Version** | v4.1 Gold Standard (100% validated) |
51
+
52
+ ### Training Data
53
+
54
+ Training data was generated using the AOSP-315 v4.1 Teacher Agent Directive, a comprehensive specification for producing Engine-native training examples. All 226 SFT examples and 130 DPO pairs were validated through multiple Shadow v1.2 verification rounds:
55
+
56
+ - 100% JSON valid
57
+ - 0% Manus contamination (pure ArmadaOS Engine-native format)
58
+ - Uses `<think>` blocks for reasoning
59
+ - ChatML wrapping via pipeline
60
+ - 70/15/15 split: identity reinforcement / no-prompt / mixed
61
+
62
+ ## Intended Use
63
+
64
+ This model is designed to run within the ArmadaOS Engine as the Shadow Verification agent. It is a proof-of-concept demonstrating that specialized agent-level models can be trained from open-source base models for specific organizational roles.
65
+
66
+ ### Limitations
67
+
68
+ - This is a v1.0 proof-of-concept; production deployment requires further evaluation
69
+ - Trained on a relatively small dataset (226 SFT + 130 DPO examples)
70
+ - Optimized specifically for the ArmadaOS Engine tool format and may not generalize to other agent frameworks
71
+ - Vision capabilities from the base model have not been specifically fine-tuned
72
+
73
+ ## How to Use
74
+
75
+ ```python
76
+ from transformers import AutoModelForCausalLM, AutoTokenizer
77
+
78
+ model = AutoModelForCausalLM.from_pretrained("ArmadaOS/AOS-Shadow-v1.0")
79
+ tokenizer = AutoTokenizer.from_pretrained("ArmadaOS/AOS-Shadow-v1.0")
80
+ ```
81
+
82
+ ## About ArmadaOS
83
+
84
+ ArmadaOS is an agent operating system that orchestrates specialized AI agents for organizational operations. Each agent has a distinct role (Chief of Staff, Shadow, Oracle, etc.) and operates within the ArmadaOS Engine using a defined set of tools, memory systems, and delegation protocols.
85
+
86
+ - **GitHub**: [kam-ship-it/ArmadaOS](https://github.com/kam-ship-it/ArmadaOS)
87
+ - **Organization**: ArmadaOS
88
+
89
+ ## Citation
90
+
91
+ ```
92
+ @misc{armadaos-shadow-v1,
93
+ title={AOS-Shadow-v1.0: ArmadaOS Shadow Verification Agent},
94
+ author={ArmadaOS},
95
+ year={2026},
96
+ url={https://huggingface.co/ArmadaOS/AOS-Shadow-v1.0}
97
+ }
98
+ ```