---
license: apache-2.0
tags:
  - speech-enhancement
  - denoising
  - coreml
  - apple-silicon
  - deepfilternet
library_name: speech-swift
---

# DeepFilterNet3 — Core ML (FP16)

Real-time speech enhancement model for Apple Silicon. Removes background noise from speech audio.

- **2.1M params**, FP16, ~4.2 MB
- Runs on **Neural Engine** via Core ML
- 48kHz native, 10ms frames

## Latency (M2 Max)

| Duration | Time | RTF |
|----------|------|-----|
| 5s | 0.65s | 0.13 |
| 10s | 1.2s | 0.12 |
| 20s | 4.8s | 0.24 |

## Usage

```swift
import SpeechEnhancement

let enhancer = try await SpeechEnhancer.fromPretrained()
let clean = try enhancer.enhance(audio: noisyAudio, sampleRate: 48000)
```

```bash
swift run audio denoise noisy.wav --output clean.wav
```

## Files

- `DeepFilterNet3.mlpackage` — Core ML FP16 model (Neural Engine)
- `auxiliary.npz` — ERB filterbank, Vorbis window, normalization states

## Reference

- [DeepFilterNet3](https://arxiv.org/abs/2305.08227)
- Part of [speech-swift](https://github.com/soniqo/speech-swift)