AOS-Shadow-v1.0

ArmadaOS Shadow Verification Agent — First Custom Model

AOS-Shadow-v1.0 is a specialized AI agent model fine-tuned from Qwen 3.5 9B for the Shadow Verification role within the ArmadaOS agent operating system. This is the first custom model ever trained by ArmadaOS, serving as a proof-of-concept for the strategy of training specialized agent models from open-source base models.

Model Description

The Shadow agent is ArmadaOS's independent verification and quality assurance system. It operates as a multi-model red-team pipeline that stress-tests work before delivery, running independent assessor, cross-reference, and adversarial red-team roles.

Key Capabilities

Independent Verification: Reviews and validates outputs from other agents
Red-Team Analysis: Identifies weaknesses, gaps, and failure modes in plans and deliverables
Cross-Reference Checking: Validates claims against multiple sources
Quality Scoring: Provides structured assessment with confidence levels (high/medium/low)
Engine-Native Tool Use: Uses ArmadaOS Engine tools ([tool: shell], [tool: read], [tool: memory_store], etc.)

Training Details

Parameter	Value
Base Model	Qwen 3.5 9B (Vision-Language)
Training Method	SFT + DPO (Supervised Fine-Tuning → Direct Preference Optimization)
Framework	Unsloth + TRL with bf16 LoRA
Trainable Parameters	0.31% (LoRA)
SFT Examples	226 trajectory examples
DPO Pairs	130 preference pairs
SFT Final Loss	1.6885
DPO Final Loss	0.4744
Training Time	~18 minutes on RTX A6000 48GB
Output Format	16-bit safetensors (4 shards, ~18GB)
Training Data Version	v4.1 Gold Standard (100% validated)

Training Data

Training data was generated using the AOSP-315 v4.1 Teacher Agent Directive, a comprehensive specification for producing Engine-native training examples. All 226 SFT examples and 130 DPO pairs were validated through multiple Shadow v1.2 verification rounds:

100% JSON valid
0% Manus contamination (pure ArmadaOS Engine-native format)
Uses <think> blocks for reasoning
ChatML wrapping via pipeline
70/15/15 split: identity reinforcement / no-prompt / mixed

Intended Use

This model is designed to run within the ArmadaOS Engine as the Shadow Verification agent. It is a proof-of-concept demonstrating that specialized agent-level models can be trained from open-source base models for specific organizational roles.

Limitations

This is a v1.0 proof-of-concept; production deployment requires further evaluation
Trained on a relatively small dataset (226 SFT + 130 DPO examples)
Optimized specifically for the ArmadaOS Engine tool format and may not generalize to other agent frameworks
Vision capabilities from the base model have not been specifically fine-tuned

How to Use

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("ArmadaOS/AOS-Shadow-v1.0")
tokenizer = AutoTokenizer.from_pretrained("ArmadaOS/AOS-Shadow-v1.0")

About ArmadaOS

ArmadaOS is an agent operating system that orchestrates specialized AI agents for organizational operations. Each agent has a distinct role (Chief of Staff, Shadow, Oracle, etc.) and operates within the ArmadaOS Engine using a defined set of tools, memory systems, and delegation protocols.

GitHub: kam-ship-it/ArmadaOS
Organization: ArmadaOS

Citation

@misc{armadaos-shadow-v1,
  title={AOS-Shadow-v1.0: ArmadaOS Shadow Verification Agent},
  author={ArmadaOS},
  year={2026},
  url={https://huggingface.co/ArmadaOS/AOS-Shadow-v1.0}
}

Downloads last month: 54

Safetensors

Model size

10B params

Tensor type

BF16

F32

Model tree for ArmadaOS/AOS-Shadow-v1.0

Quantizations

2 models