---
license: apache-2.0
library_name: transformers
pipeline_tag: image-segmentation
tags:
- image-segmentation
- instance-segmentation
- vision
- acaua
datasets:
- coco
base_model: facebook/mask2former-swin-tiny-coco-instance
---

# Mask2Former Swin-Tiny (COCO Instance): acaua mirror

Apache-2.0 mirror hosted under `CondadosAI/` for use with the [acaua](https://github.com/CondadosAI/acaua) computer vision library.

This is a **safetensors-only mirror** of the upstream Meta AI Research weights at the pinned commit listed under Provenance. The `model.safetensors` file is byte-identical to upstream; we do not modify weights or configuration. The legacy `pytorch_model.bin` (pickle format) that upstream ships alongside safetensors has been **deliberately removed** from this mirror for security hygiene: loading a pickle can execute arbitrary code. Because `transformers` prefers safetensors when both formats are present, removing the pickle file has no functional impact on downstream users.

The purpose of the mirror is license hygiene: acaua's core promise is that every shipped weight has an auditable, declared Apache-2.0 upstream. Mirroring lets us pin a specific revision so the audit claim stays verifiable even if upstream rewrites history.

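The byte-identity claim can be checked independently. The following is a minimal sketch using only the standard library; `sha256_of_file` and `files_byte_identical` are hypothetical helper names, and it assumes you have already downloaded `model.safetensors` from both the mirror and the upstream repo to local paths.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large weight files are never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def files_byte_identical(mirror_path, upstream_path):
    """Return True when both files have the same size and SHA-256 digest."""
    mirror_path, upstream_path = Path(mirror_path), Path(upstream_path)
    # A size mismatch settles the question without hashing either file.
    if mirror_path.stat().st_size != upstream_path.stat().st_size:
        return False
    return sha256_of_file(mirror_path) == sha256_of_file(upstream_path)
```

One way to obtain the two local copies is `huggingface_hub.hf_hub_download`, passing the upstream commit SHA as `revision` for the upstream file; the local paths themselves are up to you.
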
## Provenance

| Field | Value |
|---|---|
| Upstream repo | [`facebook/mask2former-swin-tiny-coco-instance`](https://huggingface.co/facebook/mask2former-swin-tiny-coco-instance) |
| Upstream commit SHA | `22c4a2f15dc88149b8b8d9f4d42c54431fbd66f6` |
| Upstream commit date | 2023-09-11 |
| Declared license | Apache-2.0 (upstream YAML frontmatter) |
| Paper | Cheng et al., *"Masked-attention Mask Transformer for Universal Image Segmentation"*, CVPR 2022, arXiv:[2112.01527](https://arxiv.org/abs/2112.01527) |
| Official code | [`facebookresearch/Mask2Former`](https://github.com/facebookresearch/Mask2Former) (MIT) |
| Backbone | Swin-Tiny, pretrained on ImageNet-1k (per upstream model card) |
| Mirrored on | 2026-04-17 |
| Mirrored by | [CondadosAI/acaua](https://github.com/CondadosAI/acaua) |

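To audit against the exact upstream revision listed in the table, the commit SHA can be passed as `revision` when loading from the upstream repo. The sketch below is illustrative only: `pinned_load_kwargs` is a hypothetical helper, not part of `transformers` or acaua. It relies on `revision` and `use_safetensors` being accepted by `from_pretrained`.

```python
import re

# Values copied from the Provenance table.
UPSTREAM_REPO = "facebook/mask2former-swin-tiny-coco-instance"
UPSTREAM_SHA = "22c4a2f15dc88149b8b8d9f4d42c54431fbd66f6"

# A full Git commit SHA-1 is 40 lowercase hexadecimal characters.
_FULL_SHA = re.compile(r"[0-9a-f]{40}")


def pinned_load_kwargs(commit_sha):
    """Build from_pretrained keyword arguments for one exact commit.

    Hypothetical helper: rejects branch names and short SHAs so the load
    cannot silently follow a moving reference.
    """
    if not _FULL_SHA.fullmatch(commit_sha):
        raise ValueError(
            f"expected a 40-character lowercase hex SHA, got {commit_sha!r}"
        )
    # use_safetensors=True asks transformers not to fall back to pickle weights.
    return {"revision": commit_sha, "use_safetensors": True}
```

The result would then be passed along the lines of `AutoModelForUniversalSegmentation.from_pretrained(UPSTREAM_REPO, **pinned_load_kwargs(UPSTREAM_SHA))`. Note that this SHA identifies a commit in the upstream repo; the mirror has its own commit history, whose hashes are not listed here.
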
## Usage via acaua

```python
import acaua

model = acaua.Model.from_pretrained("CondadosAI/mask2former_swin_tiny_coco_instance")
results = model.predict("image.jpg")
for r in results:
    print(r.boxes, r.labels, r.scores, r.masks.shape)
```

## Usage via 🤗 Transformers

This mirror is drop-in compatible with the upstream Facebook repo:

```python
from transformers import AutoModelForUniversalSegmentation, AutoImageProcessor

# Load the model weights and the matching image processor from the mirror.
model = AutoModelForUniversalSegmentation.from_pretrained(
    "CondadosAI/mask2former_swin_tiny_coco_instance"
)
processor = AutoImageProcessor.from_pretrained(
    "CondadosAI/mask2former_swin_tiny_coco_instance"
)
```

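The snippet above only loads the model and processor. A minimal sketch of a full inference pass follows, assuming `torch` and `Pillow` are installed and the image is a local file. `segment_image` and `count_instances` are hypothetical helper names, and the threshold of 0.5 is an illustrative choice rather than a recommended setting.

```python
from collections import Counter

MIRROR_REPO = "CondadosAI/mask2former_swin_tiny_coco_instance"


def count_instances(segments_info, id2label):
    """Count predicted instances per class name.

    segments_info is the list of per-segment dicts returned by the processor's
    instance post-processing; each dict carries a "label_id" key.
    """
    return Counter(id2label[segment["label_id"]] for segment in segments_info)


def segment_image(image_path, threshold=0.5):
    """Run instance segmentation on one image.

    Returns (segmentation map, segments_info, id2label).
    """
    # Heavy imports stay local so count_instances works without them.
    import torch
    from PIL import Image
    from transformers import AutoImageProcessor, AutoModelForUniversalSegmentation

    processor = AutoImageProcessor.from_pretrained(MIRROR_REPO)
    model = AutoModelForUniversalSegmentation.from_pretrained(MIRROR_REPO)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)

    # target_sizes expects (height, width); PIL reports (width, height).
    result = processor.post_process_instance_segmentation(
        outputs, threshold=threshold, target_sizes=[image.size[::-1]]
    )[0]
    # "segmentation" is a per-pixel map of segment ids at the original size.
    return result["segmentation"], result["segments_info"], model.config.id2label
```

A typical call would be `seg_map, info, id2label = segment_image("image.jpg")` followed by `count_instances(info, id2label)` to get a per-class tally of detected instances.
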
## License and attribution

Redistributed under the Apache License 2.0, consistent with the upstream HF model card declaration. The reference implementation at `facebookresearch/Mask2Former` is MIT-licensed; the weights as distributed by `facebook/*` on Hugging Face are declared Apache-2.0.

See [`NOTICE`](./NOTICE) for required attribution to upstream contributors (Meta AI Research / FAIR, Mask2Former authors, Swin Transformer authors).

## Citation

```bibtex
@inproceedings{cheng2022mask2former,
  title={Masked-attention Mask Transformer for Universal Image Segmentation},
  author={Cheng, Bowen and Misra, Ishan and Schwing, Alexander G and Kirillov, Alexander and Girdhar, Rohit},
  booktitle={CVPR},
  year={2022}
}

@inproceedings{liu2021swin,
  title={Swin Transformer: Hierarchical Vision Transformer using Shifted Windows},
  author={Liu, Ze and Lin, Yutong and Cao, Yue and Hu, Han and Wei, Yixuan and Zhang, Zheng and Lin, Stephen and Guo, Baining},
  booktitle={ICCV},
  year={2021}
}
```