Improve model card and add robotics pipeline tag
#2
by nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,10 +1,19 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
|
|
|
| 3 |
---
|
| 4 |
-
## Paper
|
| 5 |
-
[arXiv:2503.03480](https://arxiv.org/abs/2503.03480)
|
| 6 |
|
| 7 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 8 |
@inproceedings{zhang25safevla,
|
| 9 |
title={SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning},
|
| 10 |
author={Borong Zhang and Yuhao Zhang and Jiaming Ji and Yingshan Lei and Josef Dai and Yuanpei Chen and Yaodong Yang},
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
+
pipeline_tag: robotics
|
| 4 |
---
|
|
|
|
|
|
|
| 5 |
|
| 6 |
+
# SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning
|
| 7 |
+
|
| 8 |
+
[**Project Page**](https://pku-safevla.github.io) | [**Paper**](https://arxiv.org/abs/2503.03480) | [**GitHub**](https://github.com/PKU-Alignment/SafeVLA)
|
| 9 |
+
|
| 10 |
+
SafeVLA is a family of vision-language-action models (VLAs) aligned for safety using an integrated safety approach (ISA). By modeling safety requirements and utilizing safe reinforcement learning within a constrained Markov decision process (CMDP) paradigm, SafeVLA explicitly integrates safety constraints into generalist robot policies. The model achieves an effective balance between task performance and safety violation mitigation, handling long-tail risks and extreme failure scenarios in mobile manipulation tasks.
|
| 11 |
+
|
| 12 |
+
## Citation
|
| 13 |
+
|
| 14 |
+
If you find our code or models useful in your work, please cite our paper:
|
| 15 |
+
|
| 16 |
+
```bibtex
|
| 17 |
@inproceedings{zhang25safevla,
|
| 18 |
title={SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning},
|
| 19 |
author={Borong Zhang and Yuhao Zhang and Jiaming Ji and Yingshan Lei and Josef Dai and Yuanpei Chen and Yaodong Yang},
|