
RVC Tutorial - Speak in any voice! - Retrieval-based Voice Conversion - Easy AI Voice Tutorial

Ai Voice Tutor

#aivoices #aivoice #ai #aitutorial #rvc #rvcproject #rvcgui, RVC WebUI, RVC AI Tutorial, RVC GUI Tutorial, RVC Project Tutorial, AI Voice Tutorial, RVC V2, rvc voice changer

In this video you’ll learn how to speak in any voice using nothing but your PC and a microphone. Everything runs locally on your machine. First, we’ll prepare an audio file that serves as the input for training an AI model. We then train the model with RVC-Project (Retrieval-based Voice Conversion) before using it in a different, much simpler user interface (RVC-GUI).

Once you have everything set up, you’ll be able to convert a recording of your voice into an AI voice within seconds.

Notes:
Even in early 2024, this is still the best method and tool to clone any voice with your own voice locally on your PC.
This works with any language; just train the model on recordings in the same language you want to convert. (If the training language differs from the language you convert, the output will have an accent that sounds close to a real one.)

My other videos about AI voice cloning:
Real-time method for Discord (or Zoom, Skype, etc.) to make your own voice sound like any voice:    • Discord Voice Changer Tutorial – Spea...
Use Text-to-Speech with any voice:    • Free AI TextToSpeech Voice Cloning ...

If you run into memory issues, try the following:
Lower the batch size to "1".
Cut the audio into clips shorter than 10 seconds.
Reduce the size of the dataset.
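The clip-cutting tip above can also be scripted. Below is a minimal sketch, assuming the training audio is an uncompressed PCM WAV file; the function name `split_wav`, the output file naming, and the 9-second default are illustrative choices, not part of RVC. It cuts at fixed positions, so a clip boundary may fall in the middle of a word.

```python
import wave
from pathlib import Path


def split_wav(src, out_dir, max_seconds=9.0):
    """Split a PCM WAV file into consecutive clips of at most max_seconds each.

    Hypothetical helper for illustration; returns the list of clip paths.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with wave.open(str(src), "rb") as reader:
        params = reader.getparams()
        # Number of audio frames that fit into one clip.
        frames_per_clip = int(reader.getframerate() * max_seconds)
        index = 0
        while True:
            frames = reader.readframes(frames_per_clip)
            if not frames:
                break  # end of the source file
            clip_path = out_dir / f"{Path(src).stem}_{index:04d}.wav"
            with wave.open(str(clip_path), "wb") as writer:
                # Same channels, sample width and rate as the source;
                # the frame count in the header is corrected on write.
                writer.setparams(params)
                writer.writeframes(frames)
            written.append(clip_path)
            index += 1
    return written
```

For example, `split_wav("lecturer.wav", "clips")` would write `clips/lecturer_0000.wav`, `clips/lecturer_0001.wav`, and so on (the file name `lecturer.wav` is only a placeholder).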

For any other issues, make sure the folder paths of your input voice and of RVC Beta do not contain any spaces or special characters!
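A quick way to check a folder path before training is sketched below. It is an assumption that "safe" means ASCII letters, digits, underscore, hyphen and dot (plus path separators and the Windows drive colon); the description does not define which characters cause problems, and `unsafe_characters` is a made-up helper name.

```python
import re

# Assumption: everything outside this set counts as a "special character".
# Allowed: ASCII letters, digits, "_", ".", "-", path separators, drive colon.
_UNSAFE = re.compile(r"[^A-Za-z0-9_.\-\\/:]")


def unsafe_characters(path):
    """Return the distinct characters in path that may cause trouble, sorted."""
    return sorted(set(_UNSAFE.findall(str(path))))


if __name__ == "__main__":
    # Placeholder paths for illustration only.
    for candidate in (r"C:\RVC\input_voice", r"C:\My Voices\lecturer (1)"):
        bad = unsafe_characters(candidate)
        print(candidate, "-> OK" if not bad else f"-> rename, found {bad}")
```

An empty list means the path passes this check; any returned characters are candidates to remove from the folder name.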

/UPDATE 1/
Download Pretrained Voices
  / discord  
https://huggingface.co/QuickWick/Musi...
https://rvcmodels.com
https://docs.google.com/spreadsheets/...

/UPDATE 2/
Text To Speech Tutorial (The .pth models are not compatible with the app in this tutorial):    • Free AI TextToSpeech Voice Cloning ...

/UPDATE 3/
I have now trained the voice with a 30-minute dataset for 600 epochs. The resulting voice sounds better but is still not perfect. Maybe I should go even higher on the epochs.

/UPDATE 4/
When you create the zip file with the .pth model, also include the .index file that starts with "added..." which you can find in the /logs/lecturer/ folder
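The packaging step can be sketched in a few lines. This assumes you pass in the location of the trained .pth file yourself (the description does not say where it is stored) and that the log folder contains at least one file matching `added*.index`; `package_model` and all paths in the usage line are hypothetical.

```python
import zipfile
from pathlib import Path


def package_model(pth_file, log_dir, zip_path):
    """Zip a .pth model together with the added*.index file(s) from log_dir.

    Hypothetical helper; returns the file names stored in the archive.
    """
    pth_file, log_dir = Path(pth_file), Path(log_dir)
    index_files = sorted(log_dir.glob("added*.index"))
    if not index_files:
        raise FileNotFoundError(f"no added*.index file found in {log_dir}")
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        # Store files flat (without folder names) inside the zip.
        archive.write(pth_file, arcname=pth_file.name)
        for index_file in index_files:
            archive.write(index_file, arcname=index_file.name)
    return [pth_file.name] + [f.name for f in index_files]
```

A call might look like `package_model("lecturer.pth", "logs/lecturer", "lecturer.zip")`, with the first argument adjusted to wherever your model file actually is.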

/UPDATE 5/
I have now trained with a 40-minute dataset for 300 epochs, and this seems to give me the best overall results so far.

/UPDATE 6/
I have now trained my original 10-minute sample with the RMVPE model (instead of Harvest), and this seems to have reduced some of the robotic noises I was getting. RMVPE is available in this version of RVC: https://huggingface.co/lj1995/VoiceCo...
Using "Harvest" in RVC-GUI works great with an RMVPE-trained model.

What you’ll need
NVIDIA GPU with CUDA support (at least 8GB VRAM needed for the training)
About 30GB of free disk space
About 10 minutes of the voice you want to train the AI model on
A recording of your own voice that will then be converted into AI voice
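Two of the requirements above (free disk space and length of the training audio) are easy to check up front. The sketch below uses only the Python standard library and assumes the audio is a PCM WAV file; the helper names are made up, the 30 GB and 10 minute defaults are taken from the list above, and the GPU/VRAM requirement is not checked here.

```python
import shutil
import wave


def free_disk_gb(path="."):
    """Free space, in gigabytes, on the drive that contains path."""
    return shutil.disk_usage(path).free / 1e9


def wav_minutes(path):
    """Duration of a PCM WAV file in minutes."""
    with wave.open(str(path), "rb") as reader:
        return reader.getnframes() / reader.getframerate() / 60


def meets_requirements(free_gb, minutes, need_gb=30.0, need_minutes=10.0):
    """True if both the disk space and the audio length are sufficient."""
    return free_gb >= need_gb and minutes >= need_minutes
```

For example, `meets_requirements(free_disk_gb("."), wav_minutes("lecturer.wav"))` returns `False` if either value falls short (the file name is a placeholder). Since the list says "about" 10 minutes, treat the result as a hint rather than a hard limit.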

Download Links
1. Prepare input voice with Audacity
https://www.audacityteam.org/download/

2. Train Model with RVC-Project (RVCBeta)
https://www.huggingface.co/lj1995/Voi...
Note: "RVCBeta.7z" always includes the latest version of the tool. If you want to use the exact same version as in the video, download this one: https://huggingface.co/lj1995/VoiceCo...

3. Use Model with RVC-GUI
https://github.com/Tiger14n/RVCGUI/r...

Optional
If you want to dive deeper into RVC-Project, check out the documentation on GitHub:
https://github.com/RVCProject/Retrie...

Thank you to everyone who has contributed to RVC-Project and RVC-GUI!

If you appreciate my videos, you can buy me a coffee: https://www.buymeacoffee.com/aivoicet...

My PC Components:

(Disclosure: As an Amazon Associate, I earn from qualifying purchases. Clicking on and purchasing products through these links won't cost you any extra. They help support this channel and allow me to continue providing valuable content)

My GPU: ZOTAC Gaming GeForce RTX 4090 AMP Extreme https://amzn.to/3PZHNlm (Affiliate Link)
(Alternative: Zotac NVIDIA GeForce RTX 4090 Trinity https://amzn.to/3s2GERN (Affiliate Link))
My CPU: Intel Core i9-13900KF https://amzn.to/3MudSRp (Affiliate Link)
My SSD: WD_BLACK SN850X NVMe SSD 2TB https://amzn.to/46WFMgG (Affiliate Link)
My RAM: G.Skill 64GB 2x32GB DDR5 6400MHz https://amzn.to/3tAMn1M (Affiliate Link)
My Microphone: Razer Seiren V2 X USB Microphone https://amzn.to/46PkAtn (Affiliate Link)

Chapters:
00:00 Introduction
01:44 Step 1 Prepare Input Voice
03:11 Step 2 Train Voice Model in RVC V2
07:55 Step 3 Use Voice Model in RVC GUI
11:00 Final Result

posted by meggiemay0307sm