| PaddlePaddle/PaddleSpeech |
12,555 |
|
0 |
4 |
30 days ago |
9 |
May 27, 2022 |
437 |
apache-2.0 |
Python |
| Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award. |
| NVIDIA/NeMo |
9,041 |
|
2 |
8 |
about 2 years ago |
70 |
October 25, 2023 |
109 |
apache-2.0 |
Python |
| NeMo: a toolkit for conversational AI |
| espnet/espnet |
7,563 |
|
0 |
5 |
about 2 years ago |
33 |
October 25, 2023 |
270 |
apache-2.0 |
Python |
| End-to-End Speech Processing Toolkit |
| wzpan/wukong-robot |
5,386 |
|
0 |
0 |
over 2 years ago |
0 |
|
32 |
mit |
Python |
| 🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。 |
| snakers4/silero-models |
4,088 |
|
0 |
4 |
over 2 years ago |
4 |
June 12, 2022 |
8 |
other |
Jupyter Notebook |
| Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple |
| tensorflow/lingvo |
2,776 |
|
0 |
0 |
about 2 years ago |
0 |
|
133 |
apache-2.0 |
Python |
| Lingvo |
| jovotech/jovo-framework |
1,664 |
|
46 |
34 |
about 2 years ago |
213 |
July 28, 2022 |
36 |
apache-2.0 |
TypeScript |
| 🔈 The React for Voice and Chat: Build Apps for Alexa, Google Assistant, Messenger, Instagram, the Web, and more |
| athena-team/athena |
821 |
|
0 |
0 |
over 3 years ago |
0 |
|
4 |
apache-2.0 |
C++ |
| an open-source implementation of sequence-to-sequence based speech processing engine |
| HMS-Core/hms-ml-demo |
333 |
|
0 |
0 |
over 2 years ago |
0 |
|
17 |
apache-2.0 |
Java |
| HMS ML Demo provides an example of integrating Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, asr, and tts. |
| shibing624/parrots |
318 |
|
0 |
0 |
about 2 years ago |
7 |
November 03, 2022 |
5 |
apache-2.0 |
Python |
| Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese. 中文语音识别、文字转语音,基于语音库实现,易扩展。 |