| speechbrain/speechbrain |
7,166 |
|
0 |
0 |
about 2 years ago |
0 |
|
149 |
apache-2.0 |
Python |
| A PyTorch-based Speech Toolkit |
| pyannote/pyannote-audio |
4,460 |
|
1 |
13 |
about 2 years ago |
24 |
December 01, 2023 |
95 |
mit |
Jupyter Notebook |
| Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding |
| r9y9/deepvoice3_pytorch |
1,906 |
|
0 |
0 |
over 2 years ago |
0 |
|
43 |
other |
Python |
| PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models |
| r9y9/wavenet_vocoder |
1,617 |
|
0 |
0 |
over 5 years ago |
0 |
|
14 |
other |
Python |
| WaveNet vocoder |
| linto-ai/whisper-timestamped |
1,217 |
|
0 |
3 |
about 2 years ago |
3 |
December 08, 2023 |
15 |
agpl-3.0 |
Python |
| Multilingual Automatic Speech Recognition with word-level timestamps and confidence |
| mravanelli/SincNet |
764 |
|
0 |
0 |
about 5 years ago |
0 |
|
22 |
mit |
Python |
| SincNet is a neural architecture for efficiently processing raw audio samples. |
| Audio-WestlakeU/FullSubNet |
443 |
|
0 |
0 |
over 2 years ago |
0 |
|
32 |
mit |
Python |
| PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement." |
| DigitalPhonetics/IMS-Toucan |
426 |
|
0 |
0 |
about 2 years ago |
0 |
|
29 |
apache-2.0 |
Python |
| Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality. |
| r9y9/nnmnkwii |
375 |
|
15 |
1 |
about 3 years ago |
26 |
January 04, 2022 |
6 |
other |
Python |
| Library to build speech synthesis systems designed for easy and fast prototyping. |
| microsoft/UniSpeech |
328 |
|
0 |
0 |
almost 3 years ago |
0 |
|
12 |
other |
Python |
| UniSpeech - Large Scale Self-Supervised Learning for Speech |