Trained models are automatically saved to the Results (结果) section on OpenI (启智). Updated: multi-speaker (多人).
This is a cleaner version of DiffSinger, which provides:
# Install PyTorch manually (1.8.2 LTS recommended)
# See instructions at https://pytorch.org/get-started/locally/
# Below is an example for CUDA 11.1
pip3 install torch==1.8.2 torchvision==0.9.2 torchaudio==0.8.2 --extra-index-url https://download.pytorch.org/whl/lts/1.8/cu111
# Install other requirements
pip install -r requirements.txt
# Run from the repository root so local packages can be imported
export PYTHONPATH=.
# Binarize the dataset
CUDA_VISIBLE_DEVICES=0 python data_gen/binarize.py --config configs/acoustic/nomidi.yaml
# Train
CUDA_VISIBLE_DEVICES=0 python run.py --config configs/acoustic/nomidi.yaml --exp_name $MY_DS_EXP_NAME --reset
# Infer
CUDA_VISIBLE_DEVICES=0 python run.py --exp_name $MY_DS_EXP_NAME --infer
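The commands above rely on `$MY_DS_EXP_NAME` being set and on being run from the repository root. As a minimal sketch (not part of this repository), the three steps could be assembled in Python so the experiment name is checked once and reused. The helper names `build_pipeline` and `pipeline_env` and the experiment name `my_experiment` are hypothetical; only the scripts and flags already shown above are used.

```python
import os
import shlex


def build_pipeline(exp_name, config="configs/acoustic/nomidi.yaml"):
    """Return the three README commands as argv lists, keyed by stage.

    Hypothetical helper: it only assembles the commands shown in this
    README and does not run anything.
    """
    if not exp_name:
        raise ValueError("exp_name must be a non-empty string")
    return {
        "binarize": ["python", "data_gen/binarize.py", "--config", config],
        "train": ["python", "run.py", "--config", config,
                  "--exp_name", exp_name, "--reset"],
        "infer": ["python", "run.py", "--exp_name", exp_name, "--infer"],
    }


def pipeline_env(gpu="0"):
    """Copy the current environment and add the variables the README exports."""
    env = dict(os.environ)
    env["PYTHONPATH"] = "."
    env["CUDA_VISIBLE_DEVICES"] = gpu
    return env


if __name__ == "__main__":
    # Print the commands instead of running them; "my_experiment" is a placeholder.
    for stage, argv in build_pipeline("my_experiment").items():
        print(f"{stage}: {shlex.join(argv)}")
```

To actually run a stage, one option is `subprocess.run(argv, env=pipeline_env(), check=True)` from the repository root, which stops at the first failing step.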
Easy inference with Google Colab:
Interactive 🤗 demos: TTS | SVS
This repository is the official PyTorch implementation of our AAAI-2022 paper, in which we propose DiffSinger (for Singing-Voice-Synthesis) and DiffSpeech (for Text-to-Speech).
[Figures: DiffSinger/DiffSpeech at training; DiffSinger/DiffSpeech at inference]
🎉 🎉 🎉 Updates:
🚀 News:
PortaSpeech: Portable and High-Quality Generative Text-to-Speech was accepted by NeurIPS-2021.
conda create -n your_env_name python=3.8
source activate your_env_name
pip install -r requirements_2080.txt  # GPU 2080Ti, CUDA 10.2
# or, for GPU 3090 with CUDA 11.4:
# pip install -r requirements_3090.txt
tensorboard --logdir_spec exp_name
Old audio samples can be found on our demo page. Audio samples generated by this repository are listed here:
Speech samples (test set of LJSpeech) can be found in demos_1213.
Singing samples (test set of PopCS) can be found in demos_0112.
@article{liu2021diffsinger,
title={Diffsinger: Singing voice synthesis via shallow diffusion mechanism},
author={Liu, Jinglin and Li, Chengxi and Ren, Yi and Chen, Feiyang and Liu, Peng and Zhao, Zhou},
journal={arXiv preprint arXiv:2105.02446},
volume={2},
year={2021}}
Our code is based on the following repos:
Thanks also to Keon Lee for a fast implementation of our work.