| | --- |
| | tags: |
| | - DiffSVC |
| | - pre-trained_model |
| | - basemodel |
| | license: "gpl" |
| | datasets: |
| | - 512rc_50k |
| | - 512rc_80k |
| | - 512rc_100k |
| | --- |
| | **English** | [简体中文(还在写)](./README_CN.md) |
| | # DiffSVCBaseModel |
| |
|
| | A Diff-SVC basde model for all kind of voice |
| |
|
| | ## How to use? |
| |
|
| | 1. Choose and download this model |
| |
|
| | 2. Fill your config and put your datasets into ```(diffsvc-root)/data/raw/{speaker_name}/``` |
| |
|
| | 3. Throw this base model(only .ckpt file) into ```(diffsvc-root)/checkpoints/{speaker_name}``` |
| |
|
| | 4. Then start train as usual |
| |
|
| | ## How much data do you use? |
| |
|
| | I use 2 public datasets(opencpop ,m4singer),40h+ audio in total |
| |
|
| | ## Download |
| |
|
| |
|
| | | Version | URL | |
| | | -------------- | ---------------------------------- | |
| | | 384rc,50k_step | [Click here](./384rc_50k_step.zip) | |
| | | 384rc,80k_step | [Click here](./384rc_80k_step.zip) | |
| |
|
| | More coming soon... |
| |
|
| | ## Repos |
| |
|
| | | Repo | URL | |
| | | --------------- | ---------------------------------------------------- | |
| | | Diff-SVC | [Click here](https://github.com/prophesier/diff-svc) | |
| | | 44.1KHz Vocoder | [Click here](https://openvpi.github.io/vocoders) | |
| | | M4Singer | [Click here](https://github.com/M4Singer/M4Singer) | |
| | | OpenCPOP | [Click here](https://github.com/wenet-e2e/opencpop) | |
| |
|
| | > rc: residual_channels |