| wq2012/awesome-diarization |
1,384 |
|
0 |
0 |
about 2 years ago |
0 |
|
3 |
apache-2.0 |
|
| A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources. |
| jtkim-kaist/VAD |
632 |
|
0 |
0 |
almost 5 years ago |
0 |
|
32 |
|
MATLAB |
| Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset. |
| Jakobovski/free-spoken-digit-dataset |
518 |
|
0 |
0 |
over 3 years ago |
0 |
|
7 |
|
Python |
| A free audio dataset of spoken digits. Think MNIST for audio. |
| egorsmkv/speech-recognition-uk |
262 |
|
0 |
0 |
over 2 years ago |
0 |
|
6 |
|
Python |
| Speech Recognition for Ukrainian |
| double22a/speech_dataset |
229 |
|
0 |
0 |
about 3 years ago |
0 |
|
1 |
apache-2.0 |
|
| The dataset of Speech Recognition |
| Yuan-ManX/ai-audio-datasets |
199 |
|
0 |
0 |
about 2 years ago |
0 |
|
0 |
mit |
|
| This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc. |
| filippogiruzzi/voice_activity_detection |
171 |
|
0 |
0 |
over 4 years ago |
0 |
|
5 |
gpl-3.0 |
Python |
| Voice Activity Detection based on Deep Learning & TensorFlow |
| noahchalifour/rnnt-speech-recognition |
152 |
|
0 |
0 |
over 5 years ago |
0 |
|
13 |
mit |
Python |
| End-to-end speech recognition using RNN Transducers in Tensorflow 2.0 |
| mpc001/end-to-end-lipreading |
147 |
|
0 |
0 |
over 3 years ago |
0 |
|
11 |
|
Python |
| Pytorch code for End-to-End Audiovisual Speech Recognition |
| liangstein/Chinese-speech-to-text |
144 |
|
0 |
0 |
almost 3 years ago |
0 |
|
11 |
apache-2.0 |
Python |
| Chinese Speech To Text Using Wavenet |