nielsr's picture
nielsr HF Staff
Improve model card: add pipeline tag and project links
b35fa4f verified
|
raw
history blame
2.06 kB
---
language:
- en
- zh
- ja
license: mit
pipeline_tag: audio-to-audio
tags:
- speech
- singing
- singing voice
- audio
- music
- vocoder
- codec
- pytorch
---
# Aliasing-Free Neural Audio Synthesis
This is the official Hugging Face model repository for the paper **"[Aliasing-Free Neural Audio Synthesis](https://arxiv.org/abs/2512.20211)"**, which is the first work to achieve simple and efficient aliasing-free upsampling-based neural audio generation in the entire field of neural vocoders and codecs.
[**[Project Page]**](https://VocodexElysium.github.io/AliasingFreeNeuralAudioSynthesis/) [**[GitHub Repository]**](https://github.com/sizigi/AliasingFreeNeuralAudioSynthesis) [**[Paper]**](https://arxiv.org/abs/2512.20211)
## Model Checkpoints
This repository contains the following checkpoints:
| Model Name | Directory | Description |
| ----------------- | ---------------------------- | ------------------------------------------------- |
| **Pupu-Vocoder_Small** | `./pupuvocoder/*` | 14M parameter small version of Pupu-Vocoder. |
| **Pupu-Vocoder_Large** | `./pupuvocoder_large/*` | 122M parameter large version of Pupu-Vocoder. |
| **Pupu-Codec_Small** | `./pupucodec/*` | 32M parameter small version of Pupu-Codec. |
| **Pupu-Codec_Large** | `./pupucodec_large/*` | 119M parameter large version of Pupu-Codec. |
## How to use
You need to put the pretrained models in:
```bash
AliasingFreeNeuralAudioSynthesis/experiments
```
of our official repository, and then follow the instructions written in the [GitHub repository](https://github.com/sizigi/AliasingFreeNeuralAudioSynthesis) to resume, finetune, and inference our pretrained checkpoints.
## Citation
```bibtex
@article{afgen,
title = {Aliasing Free Neural Audio Synthesis},
author = {Yicheng Gu and Junan Zhang and Chaoren Wang and Jerry Li and Zhizheng Wu and Lauri Juvela},
year = {2025},
journal = {arXiv:2512.20211},
}
```