File size: 2,196 Bytes
71465c3
 
738a247
 
 
 
 
 
 
 
 
 
 
71465c3
738a247
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
---

license: apple-ascl
library_name: coreml
tags:
- depth-estimation
- visionos
- apple-silicon
- amlr
- computer-vision
- depth-pro
- 1024x1024
extra_gated_heading: DepthPro CoreML (High-Resolution 1024px)
extra_gated_button_content: Access Model
---


# DepthPro CoreML (1024x1024 High-Resolution)

This repository contains the **High-Resolution (1024x1024)** version of the DepthPro model, optimized for CoreML. 

DepthPro is a state-of-the-art monocular depth estimation model that provides sharp, metric-scale depth maps. This 1024px version is specifically designed for **High-Quality 3D Exports** where edge precision and fine detail preservation are critical.

## ๐Ÿš€ Key Features
- **High Fidelity**: Captures thin structures (threads, instruments, hair) with superior accuracy compared to the 512px version.
- **Symmetric 3D Rendering Optimized**: Perfectly suited for symmetric shifting in VR/AR to minimize visual discomfort.
- **VisionOS Ready**: Fully compatible with Apple Vision Pro (optimized for GPU/CPU).

## ๐Ÿ“Š Performance & Requirements
| Metric | Specification |
| :--- | :--- |
| **Input Resolution** | 1024 x 1024 pixels |
| **Compute Units** | GPU + CPU (Recommended for stability) |
| **Average Latency** | ~7.5s per frame (on M2 Ultra/M3 Max) |
| **Target Use Case** | Offline Video Conversion / High-Quality Spatial Video Export |

> [!IMPORTANT]
> To ensure inference stability at this resolution, this model is configured to use the **GPU/CPU path** rather than ANE to avoid memory limits.

## ๐Ÿ“ฆ Repository Contents
The repository contains the following core components:
1. `DepthPro_transform.mlpackage`: Image preprocessing.
2. `DepthPro_encoder.mlpackage`: Feature extraction (ViT-Large).
3. `DepthPro_decoder.mlpackage`: Multiresolution fusion.
4. `DepthPro_depth.mlpackage`: Final depth output and high-res feature generation.

## ๐Ÿ›  Usage with Swift Transformers
You can download and cache this model dynamically using `swift-transformers`:

```swift

let hub = Hub()

let modelDir = try await hub.snapshot(repoId: "aarondevstack/DepthPro-1024x1024-coreml")

// Load models from the downloaded directory