🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
Updated May 22, 2024 - Python
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
One minute of voice data is enough to train a good TTS model (few-shot voice cloning).
Instant voice cloning by MyShell.
ChatTTS is a generative speech model for daily dialogue.
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
🧠 Leon is your open-source personal assistant.
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
DeepMind's Tacotron-2 TensorFlow implementation
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
Translate a video from one language to another and add dubbing.
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure Java
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
WaveRNN Vocoder + TTS
Foundational model for human-like, expressive TTS
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis