AI & ML interests

We build interpretable models and AI systems that can reliably explain their reasoning, and are easy to audit, steer, and understand.

Recent Activity

AyaGL  published a model about 3 hours ago
guidelabs/steerling-8b-instruct
AyaGL  updated a model about 12 hours ago
guidelabs/steerling-8b-instruct
andreasmadsen  authored a paper about 2 years ago
Interpretability Needs a New Paradigm
View all activity