--- license: apache-2.0 library_name: transformers pipeline_tag: image-segmentation tags: - image-segmentation - instance-segmentation - vision - acaua datasets: - coco base_model: facebook/mask2former-swin-tiny-coco-instance --- # Mask2Former Swin-Tiny (COCO Instance) — acaua mirror Apache-2.0 mirror hosted under `CondadosAI/` for use with the [acaua](https://github.com/CondadosAI/acaua) computer vision library. This is a **safetensors-only mirror** of the upstream Meta AI Research weights at the pinned commit shown below. The `model.safetensors` file is byte-identical to upstream; we do not modify weights or configuration. The legacy `pytorch_model.bin` (pickle format) that upstream ships alongside safetensors has been **deliberately removed** from this mirror for security hygiene — pickle loads can execute arbitrary code, and `transformers` auto-prefers safetensors when both are present, so removing it has zero functional impact on downstream users. The purpose of the mirror is license hygiene: acaua's core promise is that every shipped weight has an auditable, declared Apache-2.0 upstream. Mirroring lets us pin a specific revision so the audit claim stays verifiable even if upstream rewrites history. ## Provenance | | | |---|---| | Upstream repo | [`facebook/mask2former-swin-tiny-coco-instance`](https://huggingface.co/facebook/mask2former-swin-tiny-coco-instance) | | Upstream commit SHA | `22c4a2f15dc88149b8b8d9f4d42c54431fbd66f6` | | Upstream commit date | 2023-09-11 | | Declared license | Apache-2.0 (upstream YAML frontmatter) | | Paper | Cheng et al., *"Masked-attention Mask Transformer for Universal Image Segmentation"*, CVPR 2022, arXiv:[2112.01527](https://arxiv.org/abs/2112.01527) | | Official code | [`facebookresearch/Mask2Former`](https://github.com/facebookresearch/Mask2Former) (MIT) | | Backbone | Swin-Tiny, pretrained on ImageNet-1k (per upstream model card) | | Mirrored on | 2026-04-17 | | Mirrored by | [CondadosAI/acaua](https://github.com/CondadosAI/acaua) | ## Usage via acaua ```python import acaua model = acaua.Model.from_pretrained("CondadosAI/mask2former_swin_tiny_coco_instance") results = model.predict("image.jpg") for r in results: print(r.boxes, r.labels, r.scores, r.masks.shape) ``` ## Usage via 🤗 Transformers This mirror is drop-in compatible with the upstream Facebook repo: ```python from transformers import AutoModelForUniversalSegmentation, AutoImageProcessor model = AutoModelForUniversalSegmentation.from_pretrained( "CondadosAI/mask2former_swin_tiny_coco_instance" ) processor = AutoImageProcessor.from_pretrained( "CondadosAI/mask2former_swin_tiny_coco_instance" ) ``` ## License and attribution Redistributed under Apache License 2.0, consistent with the upstream HF model card declaration. The reference implementation at `facebookresearch/Mask2Former` is MIT-licensed; the weights as distributed by `facebook/*` on Hugging Face are declared Apache-2.0. See [`NOTICE`](./NOTICE) for required attribution to upstream contributors (Meta AI Research / FAIR, Mask2Former authors, Swin Transformer authors). ## Citation ```bibtex @inproceedings{cheng2022mask2former, title={Masked-attention Mask Transformer for Universal Image Segmentation}, author={Cheng, Bowen and Misra, Ishan and Schwing, Alexander G and Kirillov, Alexander and Girdhar, Rohit}, booktitle={CVPR}, year={2022} } @inproceedings{liu2021swin, title={Swin Transformer: Hierarchical Vision Transformer using Shifted Windows}, author={Liu, Ze and Lin, Yutong and Cao, Yue and Hu, Han and Wei, Yixuan and Zhang, Zheng and Lin, Stephen and Guo, Baining}, booktitle={ICCV}, year={2021} } ```