Pyannote Audio Alternatives

Name: pyannote/pyannote-audio
Brand: pyannote/pyannote-audio
SKU: project/pyannote/pyannote-audio
Rating: 4.94 (4460 reviews)

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Categories > Machine Learning > Pytorch

Suggest Alternative

Stars

4,460

Alternatives

License

mit

Open Issues

Most Recent Commit

over 2 years ago

Programming Language

Jupyter Notebook

Monthly Downloads

Dependent Repos

Dependent Packages

Total Releases

Latest Release

December 01, 2023

Categories

Data Processing > Jupyter Notebook

Machine Learning > Pytorch

Data Processing > Pipeline

Machine Learning > Pretrained Models

Machine Learning > Speech Processing

Site

Repo

Alternatives To pyannote/pyannote-audio

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
speechbrain/speechbrain	7,166	0	0	over 2 years ago	0		149	apache-2.0	Python
A PyTorch-based Speech Toolkit
pyannote/pyannote-audio	4,460	1	13	over 2 years ago	24	December 01, 2023	95	mit	Jupyter Notebook
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
r9y9/deepvoice3_pytorch	1,906	0	0	over 2 years ago	0		43	other	Python
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
r9y9/wavenet_vocoder	1,617	0	0	over 5 years ago	0		14	other	Python
WaveNet vocoder
linto-ai/whisper-timestamped	1,217	0	3	over 2 years ago	3	December 08, 2023	15	agpl-3.0	Python
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
mravanelli/SincNet	764	0	0	over 5 years ago	0		22	mit	Python
SincNet is a neural architecture for efficiently processing raw audio samples.
santi-pdp/pase	448	0	0	almost 3 years ago	0		21	mit	Python
Problem Agnostic Speech Encoder
Audio-WestlakeU/FullSubNet	443	0	0	almost 3 years ago	0		32	mit	Python
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
DigitalPhonetics/IMS-Toucan	426	0	0	over 2 years ago	0		29	apache-2.0	Python
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
r9y9/nnmnkwii	375	15	1	over 3 years ago	26	January 04, 2022	6	other	Python
Library to build speech synthesis systems designed for easy and fast prototyping.

Alternatives To pyannote/pyannote-audio

Select To Compare

speechbrain/speechbrain ⭐ 7,166

A PyTorch-based Speech Toolkit

dependent packages 0 total releases 0 most recent commit over 2 years ago

pyannote/pyannote-audio ⭐ 4,460

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

dependent packages 13 total releases 24 most recent commit over 2 years ago downloads badge

r9y9/deepvoice3_pytorch ⭐ 1,906

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

dependent packages 0 total releases 0 most recent commit over 2 years ago

r9y9/wavenet_vocoder ⭐ 1,617

WaveNet vocoder

dependent packages 0 total releases 0 most recent commit over 5 years ago

linto-ai/whisper-timestamped ⭐ 1,217

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

dependent packages 3 total releases 3 most recent commit over 2 years ago downloads badge

mravanelli/SincNet ⭐ 764

SincNet is a neural architecture for efficiently processing raw audio samples.

dependent packages 0 total releases 0 most recent commit over 5 years ago

santi-pdp/pase ⭐ 448

Problem Agnostic Speech Encoder

dependent packages 0 total releases 0 most recent commit almost 3 years ago

Audio-WestlakeU/FullSubNet ⭐ 443

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

dependent packages 0 total releases 0 most recent commit almost 3 years ago

DigitalPhonetics/IMS-Toucan ⭐ 426

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

dependent packages 0 total releases 0 most recent commit over 2 years ago

r9y9/nnmnkwii ⭐ 375

Library to build speech synthesis systems designed for easy and fast prototyping.

dependent packages 1 total releases 26 most recent commit over 3 years ago downloads badge

Suggest An Alternative To pyannote-audio

Alternative Project Comparisons

pyannote/pyannote-audio vs Speechbrain

pyannote/pyannote-audio vs Pyannote Audio

pyannote/pyannote-audio vs Deepvoice3_pytorch

pyannote/pyannote-audio vs Wavenet_vocoder

pyannote/pyannote-audio vs Whisper Timestamped

pyannote/pyannote-audio vs Sincnet

pyannote/pyannote-audio vs Pase

pyannote/pyannote-audio vs Fullsubnet

pyannote/pyannote-audio vs Ims Toucan

pyannote/pyannote-audio vs Nnmnkwii

Popular Speech Processing Projects

pliang279/awesome-multimodal-ml⭐ 4,999

Reading list for research topics in multimodal machine learning

microsoft/torchscale⭐ 2,804

Foundation Architecture for (M)LLMs

wq2012/awesome-diarization⭐ 1,384

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

midas-research/audino⭐ 988

Open source audio annotation tool for humans

coqui-ai/open-speech-corpora⭐ 830

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Popular Pytorch Projects

huggingface/transformers⭐ 119,240

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

AUTOMATIC1111/stable-diffusion-webui⭐ 118,856

Stable Diffusion web UI

pytorch/pytorch⭐ 74,794

Tensors and Dynamic neural networks in Python with strong GPU acceleration

keras-team/keras⭐ 60,198

Deep Learning for humans

CorentinJ/Real-Time-Voice-Cloning⭐ 49,550

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Popular Machine Learning Categories

Deep Learning

Machine Learning

Pytorch

Tensorflow

Natural Language Processing

Neural Network

Neural

Computer Vision

Convolutional Neural Networks

Opencv