text-to-speech

Star

Here are 3,343 public repositories matching this topic...

RVC-Boss / GPT-SoVITS

Star

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

text-to-speech tts voice-cloning vits voice-clone voice-cloneai

Updated Nov 7, 2024
Python

coqui-ai / TTS

Star

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts speech-synthesis voice-conversion vocoder voice-synthesis tacotron voice-cloning speaker-encodings melgan speaker-encoder multi-speaker-tts glow-tts hifigan tts-model

Updated Aug 16, 2024
Python

babysor / MockingBird

Sponsor

Star

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

text-to-speech ai deep-learning speech pytorch tts

Updated Nov 15, 2024
Python

2noise / ChatTTS

Star

A generative speech model for daily dialogue.

python chat agent text-to-speech torch tts english chinese gpt natural-language-inference english-language chinese-language torchaudio llm chatgpt llm-agent chattts

Updated Dec 3, 2024
Python

myshell-ai / OpenVoice

Star

Instant voice cloning by MIT and MyShell.

text-to-speech tts voice-clone zero-shot-tts

Updated Dec 12, 2024
Python

leon-ai / leon

Star

🧠 Leon is your open-source personal assistant.

Updated Nov 20, 2024
TypeScript

jianchang512 / pyvideotrans

Star

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，同时支持语音识别转录、语音合成、字幕翻译。

text-to-speech speech-to-text video-transition

Updated Dec 14, 2024
Python

mozilla / TTS

Star

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

python text-to-speech deep-learning speech pytorch tts vocoder tacotron tensorflow2 tacotron2 melgan speaker-encoder dataset-analysis glow-tts multiband-melgan gantts

Updated Nov 9, 2023
Jupyter Notebook

espnet / espnet

Star

End-to-End Speech Processing Toolkit

text-to-speech deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Dec 13, 2024
Python

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

text-to-speech audit speech-synthesis audio-synthesis music-generation voice-conversion vocoder emilia text-to-audio fastspeech2 vits audio-generation singing-voice-conversion vall-e audioldm naturalspeech2 maskgct

Updated Nov 30, 2024
Jupyter Notebook

Plachtaa / VALL-E-X

Star

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

text-to-speech tts gpt transformer-architecture emotional-speech voice-clone vall-e

Updated Feb 11, 2024
Python

netease-youdao / EmotiVoice

Star

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

python text-to-speech ai deep-learning style prompt speech emotion pytorch tts speech-synthesis multi-speaker emotivoice

Updated Aug 13, 2024
Python

jaywalnut310 / vits

Star

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

text-to-speech deep-learning pytorch tts speech-synthesis

Updated Dec 6, 2023
Python

rhasspy / piper

Star

A fast, local neural text to speech system

text-to-speech tts speech-synthesis

Updated Oct 21, 2024
C++

FunAudioLLM / CosyVoice

Star

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

python text-to-speech japanese chatbot multi-lingual tts english chinese korean cantonese natural-language-generation cross-lingual fine-grained fine-tuning voice-cloning audio-generation chatgpt gpt-4o cosyvoice

Updated Dec 14, 2024
Python

rany2 / edge-tts

Star

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

text-to-speech tts speech-synthesis

Updated Dec 7, 2024
Python

yl4579 / StyleTTS2

Star

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

text-to-speech deep-learning pytorch tts speech-synthesis gan speaker-adaptation adversarial-training diffusion-models wavlm latent-diffusion latent-diffusion-models

Updated Aug 10, 2024
Python

snakers4 / silero-models

Star

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Updated Oct 18, 2023
Jupyter Notebook

myshell-ai / MeloTTS

Star

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

multilingual text-to-speech japanese tts english spanish chinese korean french

Updated Aug 9, 2024
Python

MoonInTheRiver / DiffSinger

Star

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

text-to-speech midi tts speech-synthesis diffusion-model singing-voice singing-synthesis singing-voice-synthesis singing-voice-database aaai2022 diffusion-speedup

Updated May 2, 2023
Python

Improve this page

Add a description, image, and links to the text-to-speech topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the text-to-speech topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

text-to-speech

Here are 3,343 public repositories matching this topic...

RVC-Boss / GPT-SoVITS

coqui-ai / TTS

babysor / MockingBird

2noise / ChatTTS

myshell-ai / OpenVoice

leon-ai / leon

jianchang512 / pyvideotrans

mozilla / TTS

espnet / espnet

open-mmlab / Amphion

Plachtaa / VALL-E-X

netease-youdao / EmotiVoice

jaywalnut310 / vits

rhasspy / piper

FunAudioLLM / CosyVoice

rany2 / edge-tts

yl4579 / StyleTTS2

snakers4 / silero-models

myshell-ai / MeloTTS

MoonInTheRiver / DiffSinger

Improve this page

Add this topic to your repo