| espnet/espnet |
7,563 |
|
0 |
5 |
about 2 years ago |
33 |
October 25, 2023 |
270 |
apache-2.0 |
Python |
| End-to-End Speech Processing Toolkit |
| hirofumi0810/neural_sp |
466 |
|
0 |
0 |
over 4 years ago |
0 |
|
43 |
apache-2.0 |
Python |
| End-to-end ASR/LM implementation with PyTorch |
| yaohungt/Multimodal-Transformer |
418 |
|
0 |
0 |
over 4 years ago |
0 |
|
8 |
mit |
Python |
| [ACL'19] [PyTorch] Multimodal Transformer |
| hirofumi0810/tensorflow_end2end_speech_recognition |
275 |
|
0 |
0 |
about 8 years ago |
0 |
|
11 |
mit |
Python |
| End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training) |
| shenasa-ai/speech2text |
148 |
|
0 |
0 |
almost 3 years ago |
0 |
|
2 |
mit |
Jupyter Notebook |
| A Deep-Learning-Based Persian Speech Recognition System |
| chibohe/text_recognition_toolbox |
141 |
|
0 |
0 |
over 4 years ago |
0 |
|
2 |
|
Python |
| text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way. |
| zw76859420/ASR_Syllable |
112 |
|
0 |
0 |
over 6 years ago |
0 |
|
2 |
|
Python |
| 基于卷积神经网络的语音识别声学模型的研究 |
| zhiqwang/sightseq |
109 |
|
0 |
0 |
over 6 years ago |
0 |
|
2 |
mit |
Python |
| 🔭 Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection |
| airaria/CaptchaRecognition |
91 |
|
0 |
0 |
over 8 years ago |
0 |
|
4 |
mit |
Python |
| End-to-end variable length Captcha recognition using CNN+RNN+Attention/CTC (pytorch implementation). 端到端的不定长验证码识别 |
| bityigoss/mtl-text-recognition |
60 |
|
0 |
0 |
over 6 years ago |
0 |
|
5 |
|
Python |
| multi-task learning for text recognition with joint CTC-attention |