| coqui-ai/TTS |
25,894 |
|
0 |
19 |
about 2 years ago |
90 |
December 01, 2023 |
101 |
mpl-2.0 |
Python |
| 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production |
| leon-ai/leon |
13,937 |
|
0 |
0 |
over 2 years ago |
0 |
|
94 |
mit |
TypeScript |
| 🧠 Leon is your open-source personal assistant. |
| NVIDIA/NeMo |
9,041 |
|
2 |
8 |
about 2 years ago |
70 |
October 25, 2023 |
109 |
apache-2.0 |
Python |
| NeMo: a toolkit for conversational AI |
| netease-youdao/EmotiVoice |
5,739 |
|
0 |
0 |
about 2 years ago |
0 |
|
73 |
apache-2.0 |
Python |
| EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine |
| jaywalnut310/vits |
5,589 |
|
0 |
0 |
over 2 years ago |
0 |
|
142 |
mit |
Python |
| VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech |
| snakers4/silero-models |
4,088 |
|
0 |
4 |
over 2 years ago |
4 |
June 12, 2022 |
8 |
other |
Jupyter Notebook |
| Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple |
| TensorSpeech/TensorFlowTTS |
3,558 |
|
0 |
1 |
over 2 years ago |
8 |
August 21, 2021 |
8 |
apache-2.0 |
Python |
| :stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages) |
| yl4579/StyleTTS2 |
3,464 |
|
0 |
0 |
about 2 years ago |
0 |
|
31 |
mit |
Python |
| StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models |
| open-mmlab/Amphion |
3,319 |
|
0 |
0 |
about 2 years ago |
0 |
|
21 |
mit |
Python |
| Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development. |
| MoonInTheRiver/DiffSinger |
3,123 |
|
0 |
0 |
almost 3 years ago |
0 |
|
40 |
mit |
Python |
| DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code |