| espnet/espnet | 7,563 | | 0 | 5 | about 2 years ago | 33 | October 25, 2023 | 270 | apache-2.0 | Python | End-to-End Speech Processing Toolkit |
| speechbrain/speechbrain | 7,166 | | 0 | 0 | about 2 years ago | 0 | | 149 | apache-2.0 | Python | A PyTorch-based Speech Toolkit |
| wenet-e2e/wenet | 5,053 | | 0 | 0 | 4 months ago | 13 | August 29, 2023 | 55 | apache-2.0 | Python | Production First and Production Ready End-to-End Speech Recognition Toolkit |
| snakers4/silero-models | 4,088 | | 0 | 4 | over 2 years ago | 4 | June 12, 2022 | 8 | other | Jupyter Notebook | Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple |
| mravanelli/pytorch-kaldi | 2,138 | | 0 | 0 | about 4 years ago | 0 | | 24 | | Python | pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit. |
| linto-ai/whisper-timestamped | 1,217 | | 0 | 3 | about 2 years ago | 3 | December 08, 2023 | 15 | agpl-3.0 | Python | Multilingual Automatic Speech Recognition with word-level timestamps and confidence |
| freewym/espresso | 930 | | 0 | 0 | over 2 years ago | 0 | | 7 | other | Python | Espresso: A Fast End-to-End Neural Speech Recognition Toolkit |
| sooftware/conformer | 809 | | 0 | 0 | over 2 years ago | 0 | | 19 | apache-2.0 | Python | [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020) |
| mravanelli/SincNet | 764 | | 0 | 0 | about 5 years ago | 0 | | 22 | mit | Python | SincNet is a neural architecture for efficiently processing raw audio samples. |
| kaituoxu/Speech-Transformer | 714 | | 0 | 0 | about 3 years ago | 0 | | 5 | | Python | A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese. |