| babysor/MockingBird | 36,869 | | 0 | 0 | 3 months ago | 2 | February 28, 2022 | 446 | other | Python | 🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time |
| coqui-ai/TTS | 25,894 | | 0 | 19 | about 2 years ago | 90 | December 01, 2023 | 101 | mpl-2.0 | Python | 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production |
| myshell-ai/OpenVoice | 13,002 | | 0 | 0 | about 2 years ago | 0 | | 65 | other | Python | Instant voice cloning by MyShell. |
| NVIDIA/NeMo | 9,041 | | 2 | 8 | about 2 years ago | 70 | October 25, 2023 | 109 | apache-2.0 | Python | NeMo: a toolkit for conversational AI |
| mozilla/TTS | 8,144 | | 0 | 0 | over 2 years ago | 0 | | 21 | mpl-2.0 | Jupyter Notebook | 🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) |
| Plachtaa/VALL-E-X | 6,055 | | 0 | 0 | over 2 years ago | 2 | October 10, 2023 | 56 | mit | Python | An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io |
| netease-youdao/EmotiVoice | 5,739 | | 0 | 0 | about 2 years ago | 0 | | 73 | apache-2.0 | Python | EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine |
| jaywalnut310/vits | 5,589 | | 0 | 0 | over 2 years ago | 0 | | 142 | mit | Python | VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech |
| RVC-Boss/GPT-SoVITS | 5,184 | | 0 | 0 | about 2 years ago | 0 | | 82 | mit | Python | 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) |
| snakers4/silero-models | 4,088 | | 0 | 4 | over 2 years ago | 4 | June 12, 2022 | 8 | other | Jupyter Notebook | Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple |