AOS-Shadow-v1.0

ArmadaOS Shadow Verification Agent — First Custom Model

AOS-Shadow-v1.0 is a specialized AI agent model fine-tuned from Qwen 3.5 9B for the Shadow Verification role within the ArmadaOS agent operating system. This is the first custom model ever trained by ArmadaOS, serving as a proof-of-concept for the strategy of training specialized agent models from open-source base models.

Model Description

The Shadow agent is ArmadaOS's independent verification and quality assurance system. It operates as a multi-model red-team pipeline that stress-tests work before delivery, running independent assessor, cross-reference, and adversarial red-team roles.

Key Capabilities

  • Independent Verification: Reviews and validates outputs from other agents
  • Red-Team Analysis: Identifies weaknesses, gaps, and failure modes in plans and deliverables
  • Cross-Reference Checking: Validates claims against multiple sources
  • Quality Scoring: Provides structured assessment with confidence levels (high/medium/low)
  • Engine-Native Tool Use: Uses ArmadaOS Engine tools ([tool: shell], [tool: read], [tool: memory_store], etc.)

Training Details

Parameter Value
Base Model Qwen 3.5 9B (Vision-Language)
Training Method SFT + DPO (Supervised Fine-Tuning → Direct Preference Optimization)
Framework Unsloth + TRL with bf16 LoRA
Trainable Parameters 0.31% (LoRA)
SFT Examples 226 trajectory examples
DPO Pairs 130 preference pairs
SFT Final Loss 1.6885
DPO Final Loss 0.4744
Training Time ~18 minutes on RTX A6000 48GB
Output Format 16-bit safetensors (4 shards, ~18GB)
Training Data Version v4.1 Gold Standard (100% validated)

Training Data

Training data was generated using the AOSP-315 v4.1 Teacher Agent Directive, a comprehensive specification for producing Engine-native training examples. All 226 SFT examples and 130 DPO pairs were validated through multiple Shadow v1.2 verification rounds:

  • 100% JSON valid
  • 0% Manus contamination (pure ArmadaOS Engine-native format)
  • Uses <think> blocks for reasoning
  • ChatML wrapping via pipeline
  • 70/15/15 split: identity reinforcement / no-prompt / mixed

Intended Use

This model is designed to run within the ArmadaOS Engine as the Shadow Verification agent. It is a proof-of-concept demonstrating that specialized agent-level models can be trained from open-source base models for specific organizational roles.

Limitations

  • This is a v1.0 proof-of-concept; production deployment requires further evaluation
  • Trained on a relatively small dataset (226 SFT + 130 DPO examples)
  • Optimized specifically for the ArmadaOS Engine tool format and may not generalize to other agent frameworks
  • Vision capabilities from the base model have not been specifically fine-tuned

How to Use

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("ArmadaOS/AOS-Shadow-v1.0")
tokenizer = AutoTokenizer.from_pretrained("ArmadaOS/AOS-Shadow-v1.0")

About ArmadaOS

ArmadaOS is an agent operating system that orchestrates specialized AI agents for organizational operations. Each agent has a distinct role (Chief of Staff, Shadow, Oracle, etc.) and operates within the ArmadaOS Engine using a defined set of tools, memory systems, and delegation protocols.

Citation

@misc{armadaos-shadow-v1,
  title={AOS-Shadow-v1.0: ArmadaOS Shadow Verification Agent},
  author={ArmadaOS},
  year={2026},
  url={https://huggingface.co/ArmadaOS/AOS-Shadow-v1.0}
}
Downloads last month
54
Safetensors
Model size
10B params
Tensor type
BF16
·
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ArmadaOS/AOS-Shadow-v1.0

Quantizations
2 models