| ggerganov/whisper.cpp |
27,404 |
|
0 |
1 |
about 2 years ago |
1 |
December 12, 2022 |
465 |
mit |
C |
| Port of OpenAI's Whisper model in C/C++ |
| mozilla/DeepSpeech |
23,687 |
|
29 |
14 |
about 2 years ago |
100 |
December 19, 2020 |
137 |
mpl-2.0 |
C++ |
| DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. |
| leon-ai/leon |
13,937 |
|
0 |
0 |
over 2 years ago |
0 |
|
94 |
mit |
TypeScript |
| 🧠 Leon is your open-source personal assistant. |
| kaldi-asr/kaldi |
13,453 |
|
0 |
3 |
about 2 years ago |
3 |
April 20, 2022 |
234 |
other |
Shell |
| kaldi-asr/kaldi is the official location of the Kaldi project. |
| NVIDIA/NeMo |
9,041 |
|
2 |
8 |
about 2 years ago |
70 |
October 25, 2023 |
109 |
apache-2.0 |
Python |
| NeMo: a toolkit for conversational AI |
| Uberi/speech_recognition |
7,801 |
|
544 |
277 |
about 2 years ago |
56 |
December 06, 2023 |
314 |
bsd-3-clause |
Python |
| Speech recognition module for Python, supporting several engines and APIs, online and offline. |
| m-bain/whisperX |
7,510 |
|
0 |
0 |
about 2 years ago |
0 |
|
341 |
bsd-4-clause |
Python |
| WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) |
| nl8590687/ASRT_SpeechRecognition |
7,253 |
|
0 |
0 |
about 2 years ago |
1 |
October 23, 2020 |
101 |
gpl-3.0 |
Python |
| A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统 |
| speechbrain/speechbrain |
7,166 |
|
0 |
0 |
about 2 years ago |
0 |
|
149 |
apache-2.0 |
Python |
| A PyTorch-based Speech Toolkit |
| SYSTRAN/faster-whisper |
6,940 |
|
0 |
22 |
about 2 years ago |
12 |
November 26, 2023 |
140 |
mit |
Python |
| Faster Whisper transcription with CTranslate2 |