| --- |
| license: apache-2.0 |
| tags: |
| - coreml |
| - sentence-transformers |
| - embedding |
| - nomic |
| - code |
| base_model: nomic-ai/nomic-embed-code |
| library_name: coremltools |
| pipeline_tag: feature-extraction |
| --- |
| |
| # nomic-embed-code — CoreML (.mlpackage) |
|
|
| CoreML conversion of [nomic-ai/nomic-embed-code](https://huggingface.co/nomic-ai/nomic-embed-code) for native Apple Neural Engine / GPU inference on macOS and iOS. |
|
|
| ## Files |
|
|
| | File | Description | |
| |------|-------------| |
| | `model.mlpackage/` | CoreML model (FP16, flexible shapes) | |
| | `tokenizer.json` | HF fast tokenizer | |
| | `config.json` | Model configuration | |
|
|
| ## Details |
|
|
| - **Architecture**: NomicBERT (RoPE, SwiGLU, no token_type_ids) |
| - **Precision**: FP16 (native ANE precision) |
| - **Compute units**: `.all` — CoreML schedules across ANE, GPU, and CPU |
| - **Input shapes**: batch=1, seq_len=1..8192 (flexible range) |
| - **Embedding dimension**: 768 |
| |
| ## Usage with cai |
| |
| ```bash |
| cai index --embed-backend swift --embed-model "rsvalerio/nomic-embed-code-coreml" |
| ``` |
| |
| The Swift backend downloads the `.mlpackage` from this repo, compiles it to `.mlmodelc` on first run (~30-60s), and caches the compiled model for subsequent runs. |
| |
| ## Conversion |
| |
| Converted using [rsvalerio/models](https://github.com/rsvalerio/models) CI pipeline with `coremltools`. |
| |
| ```bash |
| pip install coremltools transformers torch |
| python convert.py |
| ``` |
| |