| CorentinJ/Real-Time-Voice-Cloning |
49,550 |
|
0 |
0 |
about 2 years ago |
0 |
|
187 |
other |
Python |
| Clone a voice in 5 seconds to generate arbitrary speech in real-time |
| babysor/MockingBird |
36,869 |
|
0 |
0 |
3 months ago |
2 |
February 28, 2022 |
446 |
other |
Python |
| 🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time |
| coqui-ai/TTS |
25,894 |
|
0 |
19 |
about 2 years ago |
90 |
December 01, 2023 |
101 |
mpl-2.0 |
Python |
| 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production |
| mozilla/TTS |
8,144 |
|
0 |
0 |
over 2 years ago |
0 |
|
21 |
mpl-2.0 |
Jupyter Notebook |
| :robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) |
| espnet/espnet |
7,563 |
|
0 |
5 |
about 2 years ago |
33 |
October 25, 2023 |
270 |
apache-2.0 |
Python |
| End-to-End Speech Processing Toolkit |
| netease-youdao/EmotiVoice |
5,739 |
|
0 |
0 |
about 2 years ago |
0 |
|
73 |
apache-2.0 |
Python |
| EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine |
| jaywalnut310/vits |
5,589 |
|
0 |
0 |
over 2 years ago |
0 |
|
142 |
mit |
Python |
| VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech |
| snakers4/silero-models |
4,088 |
|
0 |
4 |
over 2 years ago |
4 |
June 12, 2022 |
8 |
other |
Jupyter Notebook |
| Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple |
| yl4579/StyleTTS2 |
3,464 |
|
0 |
0 |
about 2 years ago |
0 |
|
31 |
mit |
Python |
| StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models |
| collabora/WhisperSpeech |
2,419 |
|
0 |
0 |
about 2 years ago |
7 |
December 10, 2023 |
18 |
mit |
Jupyter Notebook |
| An Open Source text-to-speech system built by inverting Whisper. |