| coqui-ai/TTS |
25,894 |
|
0 |
19 |
about 2 years ago |
90 |
December 01, 2023 |
101 |
mpl-2.0 |
Python |
| 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production |
| NVIDIA/DeepLearningExamples |
12,073 |
|
0 |
0 |
about 2 years ago |
0 |
|
295 |
|
Jupyter Notebook |
| State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure. |
| NVIDIA/NeMo |
9,041 |
|
2 |
8 |
about 2 years ago |
70 |
October 25, 2023 |
109 |
apache-2.0 |
Python |
| NeMo: a toolkit for conversational AI |
| voicepaw/so-vits-svc-fork |
7,841 |
|
0 |
0 |
about 2 years ago |
146 |
November 21, 2023 |
134 |
other |
Python |
| so-vits-svc fork with realtime support, improved interface and more features. |
| espnet/espnet |
7,563 |
|
0 |
5 |
about 2 years ago |
33 |
October 25, 2023 |
270 |
apache-2.0 |
Python |
| End-to-End Speech Processing Toolkit |
| netease-youdao/EmotiVoice |
5,739 |
|
0 |
0 |
about 2 years ago |
0 |
|
73 |
apache-2.0 |
Python |
| EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine |
| jaywalnut310/vits |
5,589 |
|
0 |
0 |
over 2 years ago |
0 |
|
142 |
mit |
Python |
| VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech |
| yl4579/StyleTTS2 |
3,464 |
|
0 |
0 |
about 2 years ago |
0 |
|
31 |
mit |
Python |
| StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models |
| NVIDIA/OpenSeq2Seq |
1,393 |
|
0 |
0 |
almost 5 years ago |
0 |
|
85 |
apache-2.0 |
Python |
| Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP |
| jik876/hifi-gan |
1,376 |
|
0 |
0 |
over 2 years ago |
0 |
|
82 |
mit |
Python |
| HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis |