WhisperSpeech Alternatives

An Open Source text-to-speech system built by inverting Whisper.

Stars: 2,419
License: MIT
Open Issues: 18
Most Recent Commit: over 2 years ago
Programming Language: Jupyter Notebook
Dependent Repos: 0
Dependent Packages: 0
Total Releases: 7
Latest Release: December 10, 2023

Categories

Data Processing > Jupyter Notebook

Machine Learning > PyTorch

Text Processing > TTS

Machine Learning > Speech Synthesis

Alternatives To collabora/WhisperSpeech

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language	Description
CorentinJ/Real-Time-Voice-Cloning	49,550	0	0	over 2 years ago	0	-	187	other	Python	Clone a voice in 5 seconds to generate arbitrary speech in real-time
babysor/MockingBird	36,869	0	0	6 months ago	2	February 28, 2022	446	other	Python	🚀 Clone a voice in 5 seconds to generate arbitrary speech in real-time
coqui-ai/TTS	25,894	0	19	over 2 years ago	90	December 01, 2023	101	MPL-2.0	Python	🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
mozilla/TTS	8,144	0	0	over 2 years ago	0	-	21	MPL-2.0	Jupyter Notebook	🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
espnet/espnet	7,563	0	5	over 2 years ago	33	October 25, 2023	270	Apache-2.0	Python	End-to-End Speech Processing Toolkit
netease-youdao/EmotiVoice	5,739	0	0	over 2 years ago	0	-	73	Apache-2.0	Python	EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
jaywalnut310/vits	5,589	0	0	over 2 years ago	0	-	142	MIT	Python	VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
snakers4/silero-models	4,088	0	4	over 2 years ago	4	June 12, 2022	8	other	Jupyter Notebook	Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
yl4579/StyleTTS2	3,464	0	0	over 2 years ago	0	-	31	MIT	Python	StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
collabora/WhisperSpeech	2,419	0	0	over 2 years ago	7	December 10, 2023	18	MIT	Jupyter Notebook	An Open Source text-to-speech system built by inverting Whisper.

Popular PyTorch Projects

huggingface/transformers ⭐ 119,240

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

AUTOMATIC1111/stable-diffusion-webui ⭐ 118,856

Stable Diffusion web UI

pytorch/pytorch ⭐ 74,794

Tensors and Dynamic neural networks in Python with strong GPU acceleration

keras-team/keras ⭐ 60,198

Deep Learning for humans

chinese-poetry/chinese-poetry ⭐ 45,313

The most comprehensive database of Chinese poetry 🧶: nearly 14,000 poets of the Tang and Song dynasties, close to 55,000 Tang poems and 260,000 Song poems, plus 21,050 ci by 1,564 ci poets of the Song period.

Popular TTS Projects

mudler/LocalAI ⭐ 47,242

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

lobehub/lobe-chat ⭐ 17,000

🤖 Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.

myshell-ai/OpenVoice ⭐ 13,002

Instant voice cloning by MyShell.

PaddlePaddle/PaddleSpeech ⭐ 12,635

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

NVIDIA/NeMo ⭐ 9,041

NeMo: a toolkit for conversational AI
