What if you could train a model on just 10 images instead of 60,000 and still get close to the same performance?
Traditional machine learning requires thousands, even millions, of data points to achieve high accuracy. But what if we could "distill" the entire dataset into just a few synthetic samples?
This is what Dataset Distillation offers. Unlike traditional knowledge distillation, we keep the model fixed and distill the knowledge contained in a massive training set into a tiny set of synthetic distilled images.
The goal is to train a model on this ultra-small set and achieve performance that almost matches what the same model would get when trained on the massive original dataset.
For example, training on only 10 distilled MNIST images (this is equivalent to a single image per class) yields 94% accuracy, compared to 99% when training on the full 60,000 images.
Interestingly, these distilled images look significantly different (as you can see in the image below) from natural images because they are optimized for model training rather than for matching the correct data distribution.
But that's not all.
Most importantly, this same method opens the door to a potent form of data poisoning. Because distilled images are specifically optimized for rapid learning, an attacker can create a tiny set of adversarial distilled images to cause a well-trained model to forget or misclassify a specific category.
What I find fascinating about dataset distillation is this: it mimics human-like learning by letting a model grasp a concept from a single example, but it does so using alien synthetic images that mean absolutely nothing to a human eye!
TL;DR: This article argues that if a living world becomes training data, memory becomes infrastructure.
Logs, dialogue, labels, releases, feature stores, and model weights can turn a world into something that cannot honestly forget. 172 makes deletion, redaction, exclusion, forgetting requests, SANITIZED/PUBLIC releases, and unlearning claims into receipted governance lifecycles.
Why it matters: • prevents learning worlds from becoming “unforgettable worlds” • separates deletion, redaction, and future extraction exclusion • makes right-to-be-forgotten requests caseable and appealable • preserves canon facts without preserving every memory surface • blocks public promises like “guaranteed deletion everywhere”
What’s inside: • retention policy contracts for what may be kept, copied, trained on, or released • corpus segment manifests and propagation indexes for known controlled copies • forgetting request, adjudication, remedy, deletion, redaction, and exclusion receipts • tombstone manifests and semantic preservation receipts for canon-safe forgetting • use eligibility receipts for deciding whether a segment may train a future run • release contracts, redaction maps, and irreversibility disclosures for SANITIZED/PUBLIC releases • bounded unlearning contracts and post-unlearning verification receipts
Key idea: Do not say:
*“we deleted it, so it is forgotten.”*
Say:
*“this subject was handled under this retention policy, propagation index, adjudication path, remedy contract, tombstone, semantic preservation receipt, extraction exclusion receipt, and bounded public claim.”*
We trained an open-source Mythos like cybersecurity LLM for the Build Small Hackathon meet OpenMythos
Trained in two stages: SFT on ~1.84K filtered ArXiv cs.CR papers + real CVE data, then RLVR using paired with past vulnerabilities GitHub repos with a verifier model checking outputs against ground truth.
Trained on: H100s from Modal
The RLVR stage made the biggest difference responses got more precise and less prone to confusing similar vulnerability classes.