Submitted by Hao Wang
Sparse Autoencoders as Plug-and-Play Firewalls for Adversarial Attack Detection in VLMs
MTRI