Clone a voice in 5 seconds to generate arbitrary speech in real-time
-
Updated
May 29, 2024 - Python
Clone a voice in 5 seconds to generate arbitrary speech in real-time
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
A Python/Pytorch app for easily synthesising human voices
VITS-based Voice Conversion focused on simplicity, quality and performance.
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
This repository has implementation for "Neural Voice Cloning With Few Samples"
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
The code for the bark-voicecloning model. Training and inference.
Phoneme multilingual(Russian-English) voice cloning based on
A webui for different audio related Neural Networks
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, TTS. Open Source, Local & Free.
singing voice change based on whisper, and lora for singing voice clone
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to…
Add a description, image, and links to the voice-cloning topic page so that developers can more easily learn about it.
To associate your repository with the voice-cloning topic, visit your repo's landing page and select "manage topics."