Improve model card and add paper link
Hi! I'm Niels, part of the community science team at Hugging Face. I noticed that this model repository is based on the paper "Towards Highly Transferable Vision-Language Attack via Semantic-Augmented Dynamic Contrastive Interaction" but doesn't have much information in its README.
This PR improves the model card by:
- Adding the `zero-shot-image-classification` pipeline tag to improve discoverability.
- Linking the repository to the associated paper on arXiv.
- Linking to the official GitHub repository.
- Adding a citation section for researchers.
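
For anyone who wants to sanity-check the metadata change locally, here is a minimal sketch that reads the flat `key: value` pairs from a README's leading `---` block. The helper name `read_front_matter` is hypothetical and not part of any Hugging Face library, and it only handles simple one-line entries; a real check would use a proper YAML parser.

```python
def read_front_matter(readme_text: str) -> dict[str, str]:
    """Return flat key/value pairs from a README's leading '---' block.

    Hypothetical helper for illustration: it does not parse full YAML
    (no lists, nesting, or quoting), only one-line `key: value` entries.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    metadata: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            # Closing delimiter reached: the block is complete.
            return metadata
        key, sep, value = line.partition(":")
        if sep:
            metadata[key.strip()] = value.strip()
    # No closing delimiter was found, so treat the file as having no front matter.
    return {}


# The front matter this PR proposes, followed by a shortened title line.
README_HEAD = """---
license: apache-2.0
pipeline_tag: zero-shot-image-classification
---

# Towards Highly Transferable Vision-Language Attack
"""

metadata = read_front_matter(README_HEAD)
print(metadata.get("pipeline_tag"))
```

If the tag is present, `metadata` contains both the existing `license` entry and the new `pipeline_tag` entry.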
README.md
CHANGED
@@ -1,3 +1,24 @@
----
-license: apache-2.0
----
+---
+license: apache-2.0
+pipeline_tag: zero-shot-image-classification
+---
+
+# Towards Highly Transferable Vision-Language Attack via Semantic-Augmented Dynamic Contrastive Interaction
+
+This repository contains the official implementation of **SADCA** (Semantic-Augmented Dynamic Contrastive Attack), presented in the paper [Towards Highly Transferable Vision-Language Attack via Semantic-Augmented Dynamic Contrastive Interaction](https://arxiv.org/abs/2603.04839).
+
+SADCA is a framework designed to enhance the transferability of adversarial attacks against vision-language pre-training (VLP) models. It progressively disrupts cross-modal alignment through dynamic interactions between adversarial images and texts, using a contrastive learning mechanism involving adversarial, positive, and negative samples to reinforce semantic inconsistency.
+
+## Links
+- **Paper**: [https://arxiv.org/abs/2603.04839](https://arxiv.org/abs/2603.04839)
+- **GitHub**: [https://github.com/LiYuanBoJNU/SADCA](https://github.com/LiYuanBoJNU/SADCA)
+
+## Citation
+```bibtex
+@article{li2026towards,
+title={Towards Highly Transferable Vision-Language Attack via Semantic-Augmented Dynamic Contrastive Interaction},
+author={Li, Yuanbo and Xu, Tianyang and Hu, Cong and Zhou, Tao and Wu, Xiao-Jun and Kittler, Josef},
+journal={arXiv preprint arXiv:2603.04839},
+year={2026}
+}
+```
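
The model card text mentions a contrastive mechanism over adversarial, positive, and negative samples. This PR does not reproduce the paper's objective, so the following is only a rough illustration using a generic InfoNCE-style score on toy 2-D embeddings. The function names, the temperature value, and the vectors are all assumptions, and this should not be read as SADCA's actual loss.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def info_nce(anchor, positives, negatives, temperature=0.1):
    """Generic InfoNCE-style score (illustrative, not the paper's loss).

    The value is low when the anchor is close to the positives and
    high when it is closer to the negatives.
    """
    pos = sum(math.exp(cosine(anchor, p) / temperature) for p in positives)
    neg = sum(math.exp(cosine(anchor, n) / temperature) for n in negatives)
    return -math.log(pos / (pos + neg))


# Toy embeddings: the positive lies along x, the negative along y.
positives = [[1.0, 0.0]]
negatives = [[0.0, 1.0]]

aligned_loss = info_nce([1.0, 0.0], positives, negatives)  # anchor matches the positive
shifted_loss = info_nce([0.0, 1.0], positives, negatives)  # anchor pushed toward the negative

print(aligned_loss < shifted_loss)
```

In this toy setting, an attack that wants to break alignment would push the anchor in the direction that raises the score, which is the sense in which the samples "reinforce semantic inconsistency".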