p323 (40 epochs, rmvpe, vctk)

Created: September 20, 2025
p323 (40 epochs, rmvpe, vctk)

dl link: https://huggingface.co/lyery/models-vc/resolve/main/p323.zip?download=true

female speaker found in the vctk dataset, she is the speaker 88 of the original pretrain
this voice and the whole vctk dataset is free to use (Creative Commons Attribution License) but you must credit CSTR, University of Edinburg

works best for speech (audiobooks) but also has singing capabilities

trained on 20 minutes of data and a batch size of 8

Additional Details:

Tags: No tags available

Download Link: https://huggingface.co/lyery/models-vc/resolve/main/p323.zip?download=true Hosted on Hugging Face

Voice accords

Main sonic traits

Selected-sample labels, grouped by what they describe.

Age
teenage
middle aged adult
young adult
Music use
producer tag
a song hook
speaking
Style/timbre
bright
cute
dramatic
Use case
game
social
podcasts
Sample deck

Lean and sweet spot

Pick a pitch, compare the Quality Score, then Run. Hold Ctrl or Cmd to select multiple pitches.

top 4.82 路 avg 4.38
female
18
male
0
female base 路 pitch 0 Quality Score 4.70 路 heard as female
Run
male
female
heard female heard male Quality Score = estimated audio quality, higher is better Ctrl/Cmd-click = select multiple pitches
Similar voices

This voice model sounds like

Nearby voices from generated samples. Pick a pitch in the Sample Deck to refresh the matches.

Finding nearby voices for this pitch...