| | --- |
| | license: apache-2.0 |
| | --- |
| | # ποΈ GLaMM-RefSeg |
| |
|
| | --- |
| | ## π Description |
| | GLaMM-RegCap-VG is the model specific to referring expression segmentation. "RefSeg" denotes its focus on segmentation tasks related to referring expressions. |
| |
|
| |
|
| | ## π» Download |
| | To get started with GLaMM-RefSeg, follow these steps: |
| | ``` |
| | git lfs install |
| | git clone https://huggingface.co/MBZUAI/GLaMM-RefSeg |
| | ``` |
| |
|
| | ## π Additional Resources |
| | - **Paper:** [ArXiv](https://arxiv.org/abs/2311.03356). |
| | - **GitHub Repository:** For training and updates: [GitHub - GLaMM](https://github.com/mbzuai-oryx/groundingLMM). |
| | - **Project Page:** For a detailed overview and insights into the project, visit our [Project Page - GLaMM](https://mbzuai-oryx.github.io/groundingLMM/). |
| |
|
| | ## π Citations and Acknowledgments |
| |
|
| | ```bibtex |
| | @article{hanoona2023GLaMM, |
| | title={GLaMM: Pixel Grounding Large Multimodal Model}, |
| | author={Rasheed, Hanoona and Maaz, Muhammad and Shaji, Sahal and Shaker, Abdelrahman and Khan, Salman and Cholakkal, Hisham and Anwer, Rao M. and Xing, Eric and Yang, Ming-Hsuan and Khan, Fahad S.}, |
| | journal={ArXiv 2311.03356}, |
| | year={2023} |
| | } |
| | |
| | |