File size: 14,105 Bytes
83c145e
 
84b8214
83c145e
 
0b60cb3
83c145e
 
 
 
 
a897a3b
af6b6f7
83c145e
 
a897a3b
e92fd43
a897a3b
 
 
 
 
 
 
 
 
 
 
 
b47f419
 
 
 
 
 
 
 
a897a3b
 
 
 
 
 
e92fd43
a897a3b
 
 
 
 
 
e92fd43
a897a3b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
94dbb96
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
b47f419
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
94dbb96
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
dbabf92
83c145e
1af0e40
 
dbabf92
83c145e
 
 
 
 
3a34c61
83c145e
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
---
library_name: pytorch
license: other
tags:
- android
pipeline_tag: image-segmentation

---

![](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/deeplabv3_plus_mobilenet/web-assets/model_demo.png)

# DeepLabV3-Plus-MobileNet: Optimized for Qualcomm Devices

DeepLabV3 is designed for semantic segmentation at multiple scales, trained on the various datasets. It uses MobileNet as a backbone.

This is based on the implementation of DeepLabV3-Plus-MobileNet found [here](https://github.com/jfzhang95/pytorch-deeplab-xception).
This repository contains pre-exported model files optimized for Qualcomm® devices. You can use the [Qualcomm® AI Hub Models](https://github.com/qualcomm/ai-hub-models/blob/main/src/qai_hub_models/models/deeplabv3_plus_mobilenet) library to export with custom configurations. More details on model performance across various devices, can be found [here](#performance-summary).

Qualcomm AI Hub Models uses [Qualcomm AI Hub Workbench](https://workbench.aihub.qualcomm.com) to compile, profile, and evaluate this model. [Sign up](https://myaccount.qualcomm.com/signup) to run these models on a hosted Qualcomm® device.

## Getting Started
There are two ways to deploy this model on your device:

### Option 1: Download Pre-Exported Models

Below are pre-exported model assets ready for deployment.

| Runtime | Precision | Chipset | SDK Versions | Download |
|---|---|---|---|---|
| ONNX | float | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/deeplabv3_plus_mobilenet/releases/v0.52.0/deeplabv3_plus_mobilenet-onnx-float.zip)
| ONNX | w8a16 | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/deeplabv3_plus_mobilenet/releases/v0.52.0/deeplabv3_plus_mobilenet-onnx-w8a16.zip)
| ONNX | w8a8 | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/deeplabv3_plus_mobilenet/releases/v0.52.0/deeplabv3_plus_mobilenet-onnx-w8a8.zip)
| QNN_DLC | float | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/deeplabv3_plus_mobilenet/releases/v0.52.0/deeplabv3_plus_mobilenet-qnn_dlc-float.zip)
| QNN_DLC | w8a16 | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/deeplabv3_plus_mobilenet/releases/v0.52.0/deeplabv3_plus_mobilenet-qnn_dlc-w8a16.zip)
| QNN_DLC | w8a8 | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/deeplabv3_plus_mobilenet/releases/v0.52.0/deeplabv3_plus_mobilenet-qnn_dlc-w8a8.zip)
| TFLITE | float | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/deeplabv3_plus_mobilenet/releases/v0.52.0/deeplabv3_plus_mobilenet-tflite-float.zip)
| TFLITE | w8a8 | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/deeplabv3_plus_mobilenet/releases/v0.52.0/deeplabv3_plus_mobilenet-tflite-w8a8.zip)

For more device-specific assets and performance metrics, visit **[DeepLabV3-Plus-MobileNet on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/deeplabv3_plus_mobilenet)**.


### Option 2: Export with Custom Configurations

Use the [Qualcomm® AI Hub Models](https://github.com/qualcomm/ai-hub-models/blob/main/src/qai_hub_models/models/deeplabv3_plus_mobilenet) Python library to compile and export the model with your own:
- Custom weights (e.g., fine-tuned checkpoints)
- Custom input shapes
- Target device and runtime configurations

This option is ideal if you need to customize the model beyond the default configuration provided here.

See our repository for [DeepLabV3-Plus-MobileNet on GitHub](https://github.com/qualcomm/ai-hub-models/blob/main/src/qai_hub_models/models/deeplabv3_plus_mobilenet) for usage instructions.

## Model Details

**Model Type:** Model_use_case.semantic_segmentation

**Model Stats:**
- Model checkpoint: VOC2012
- Input resolution: 513x513
- Number of output classes: 21
- Number of parameters: 5.80M
- Model size (float): 22.2 MB
- Model size (w8a16): 6.67 MB

## Performance Summary
| Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
|---|---|---|---|---|---|---
| DeepLabV3-Plus-MobileNet | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 4.302 ms | 2 - 176 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | float | Snapdragon® X2 Elite | 5.394 ms | 10 - 10 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | float | Snapdragon® X Elite | 11.053 ms | 10 - 10 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 7.19 ms | 4 - 211 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | float | Qualcomm® QCS8550 (Proxy) | 10.361 ms | 3 - 131 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | float | Qualcomm® QCS9075 | 18.058 ms | 3 - 6 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 5.893 ms | 2 - 170 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 2.962 ms | 0 - 207 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a16 | Snapdragon® X2 Elite | 3.711 ms | 7 - 7 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a16 | Snapdragon® X Elite | 7.659 ms | 5 - 5 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a16 | Snapdragon® 8 Gen 3 Mobile | 5.092 ms | 2 - 220 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a16 | Qualcomm® QCS6490 | 1223.297 ms | 89 - 92 MB | CPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a16 | Qualcomm® QCS8550 (Proxy) | 7.103 ms | 2 - 158 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a16 | Qualcomm® QCS9075 | 8.613 ms | 2 - 4 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a16 | Qualcomm® QCM6690 | 588.21 ms | 115 - 123 MB | CPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 3.934 ms | 0 - 185 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a16 | Snapdragon® 7 Gen 4 Mobile | 586.218 ms | 93 - 101 MB | CPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 1.463 ms | 0 - 179 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a8 | Snapdragon® X2 Elite | 1.667 ms | 5 - 5 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a8 | Snapdragon® X Elite | 3.921 ms | 5 - 5 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a8 | Snapdragon® 8 Gen 3 Mobile | 2.375 ms | 0 - 202 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a8 | Qualcomm® QCS8550 (Proxy) | 3.524 ms | 0 - 9 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a8 | Qualcomm® QCS9075 | 4.464 ms | 1 - 4 MB | NPU
| DeepLabV3-Plus-MobileNet | ONNX | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 1.907 ms | 0 - 176 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 4.307 ms | 3 - 184 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | float | Snapdragon® X2 Elite | 5.601 ms | 3 - 3 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | float | Snapdragon® X Elite | 12.287 ms | 3 - 3 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 8.069 ms | 0 - 206 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 58.629 ms | 0 - 166 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 11.555 ms | 0 - 9 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | float | Qualcomm® SA8775P | 17.509 ms | 1 - 167 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | float | Qualcomm® QCS9075 | 20.059 ms | 3 - 8 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 19.783 ms | 1 - 208 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | float | Qualcomm® SA7255P | 58.629 ms | 0 - 166 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | float | Qualcomm® SA8295P | 19.622 ms | 0 - 170 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 6.284 ms | 0 - 176 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 3.074 ms | 0 - 194 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Snapdragon® X2 Elite | 4.25 ms | 2 - 2 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Snapdragon® X Elite | 9.114 ms | 2 - 2 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Snapdragon® 8 Gen 3 Mobile | 6.126 ms | 2 - 213 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Qualcomm® QCS6490 | 33.499 ms | 3 - 7 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Qualcomm® QCS8275 (Proxy) | 21.969 ms | 2 - 182 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Qualcomm® QCS8550 (Proxy) | 8.395 ms | 2 - 3 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Qualcomm® SA8775P | 8.901 ms | 2 - 184 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Qualcomm® QCS9075 | 9.733 ms | 3 - 6 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Qualcomm® QCM6690 | 111.918 ms | 2 - 232 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Qualcomm® QCS8450 (Proxy) | 12.139 ms | 2 - 215 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Qualcomm® SA7255P | 21.969 ms | 2 - 182 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Qualcomm® SA8295P | 13.707 ms | 2 - 197 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 4.365 ms | 2 - 178 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a16 | Snapdragon® 7 Gen 4 Mobile | 11.976 ms | 2 - 181 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 1.612 ms | 1 - 176 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Snapdragon® X2 Elite | 2.075 ms | 1 - 1 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Snapdragon® X Elite | 4.63 ms | 1 - 1 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Snapdragon® 8 Gen 3 Mobile | 2.915 ms | 0 - 198 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Qualcomm® QCS6490 | 15.416 ms | 0 - 3 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Qualcomm® QCS8275 (Proxy) | 10.819 ms | 1 - 170 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Qualcomm® QCS8550 (Proxy) | 4.206 ms | 1 - 22 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Qualcomm® SA8775P | 4.696 ms | 1 - 172 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Qualcomm® QCS9075 | 5.128 ms | 0 - 3 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Qualcomm® QCM6690 | 48.971 ms | 1 - 202 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Qualcomm® QCS8450 (Proxy) | 6.75 ms | 1 - 197 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Qualcomm® SA7255P | 10.819 ms | 1 - 170 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Qualcomm® SA8295P | 6.423 ms | 1 - 168 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 2.185 ms | 1 - 173 MB | NPU
| DeepLabV3-Plus-MobileNet | QNN_DLC | w8a8 | Snapdragon® 7 Gen 4 Mobile | 6.259 ms | 1 - 176 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 4.312 ms | 0 - 183 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 8.09 ms | 0 - 215 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 58.638 ms | 0 - 168 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 11.479 ms | 0 - 2 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | float | Qualcomm® SA8775P | 17.543 ms | 0 - 168 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | float | Qualcomm® QCS9075 | 19.566 ms | 0 - 18 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 19.887 ms | 0 - 212 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | float | Qualcomm® SA7255P | 58.638 ms | 0 - 168 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | float | Qualcomm® SA8295P | 19.618 ms | 0 - 170 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 6.281 ms | 0 - 177 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 1.585 ms | 0 - 183 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Snapdragon® 8 Gen 3 Mobile | 2.942 ms | 0 - 200 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Qualcomm® QCS6490 | 15.911 ms | 0 - 11 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Qualcomm® QCS8275 (Proxy) | 10.839 ms | 0 - 174 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Qualcomm® QCS8550 (Proxy) | 4.355 ms | 0 - 102 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Qualcomm® SA8775P | 4.622 ms | 0 - 176 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Qualcomm® QCS9075 | 5.114 ms | 0 - 9 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Qualcomm® QCM6690 | 47.33 ms | 0 - 196 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Qualcomm® QCS8450 (Proxy) | 5.745 ms | 0 - 203 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Qualcomm® SA7255P | 10.839 ms | 0 - 174 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Qualcomm® SA8295P | 6.255 ms | 0 - 172 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 2.081 ms | 0 - 174 MB | NPU
| DeepLabV3-Plus-MobileNet | TFLITE | w8a8 | Snapdragon® 7 Gen 4 Mobile | 6.363 ms | 0 - 178 MB | NPU

## License
* The license for the original implementation of DeepLabV3-Plus-MobileNet can be found
  [here](https://github.com/jfzhang95/pytorch-deeplab-xception/blob/master/LICENSE).

## References
* [Rethinking Atrous Convolution for Semantic Image Segmentation](https://arxiv.org/abs/1706.05587)
* [Source Model Implementation](https://github.com/jfzhang95/pytorch-deeplab-xception)

## Community
* Join [our AI Hub Slack community](https://aihub.qualcomm.com/community/slack) to collaborate, post questions and learn more about on-device AI.
* For questions or feedback please [reach out to us](mailto:ai-hub-support@qti.qualcomm.com).