I took clean audio clips from episodes of the anime and used the F5 Zero TTS feature on Hugging Face to generate over 5 minutes of normal speech. I then trained the model on that audio in Google Colab for 250 epochs.
Tip: Lower the transpose number to get more accurate results. For a male-to-male voice, a value between -6 and -12 is accurate.
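To give a feel for what those transpose values mean, here is a minimal sketch that converts a transpose setting into a pitch multiplier. It assumes the transpose number is measured in semitones of 12-tone equal temperament, which is how voice conversion tools commonly define it, but this listing does not confirm that. The function name `transpose_ratio` is made up for illustration and is not part of any tool.

```python
def transpose_ratio(semitones: float) -> float:
    """Return the frequency multiplier for a pitch shift of `semitones`.

    Assumption: one transpose step equals one semitone, so 12 steps
    equal one octave (a doubling or halving of frequency).
    """
    return 2.0 ** (semitones / 12.0)


if __name__ == "__main__":
    # Compare no shift with the two ends of the suggested -6 to -12 range.
    for semitones in (0, -6, -12):
        ratio = transpose_ratio(semitones)
        print(f"{semitones:+d} semitones -> pitch x{ratio:.3f}")
```

Under that assumption, -12 lowers the pitch by a full octave (half the frequency) and -6 lowers it by half an octave (about 0.707 times the frequency).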
Tags: Game Freak, Nintendo, Anime, huggingface, ai-model
Related URL: https://huggingface.co/Ryanham1lton/Toxicroak/resolve/main/Toxicroak.zip?download=true
Download Link: https://huggingface.co/Ryanham1lton/Toxicroak/resolve/main/Toxicroak.zip?download=true (hosted on Hugging Face)