Sparse Autoencoders as Plug-and-Play Firewalls for Adversarial Attack Detection in VLMs