You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When i am to make zero shot model, should i train speaker embedder as well as the conversion model with large dataset (VCTK)?
Or is it ok to only train the conversion model with VCTK?
The text was updated successfully, but these errors were encountered:
When i am to make zero shot model, should i train speaker embedder as well as the conversion model with large dataset (VCTK)?
Or is it ok to only train the conversion model with VCTK?
so how do you do finally? I think if we have large dataset, we have lots of choose
Thank you for sharing your work.
When i am to make zero shot model, should i train speaker embedder as well as the conversion model with large dataset (VCTK)?
Or is it ok to only train the conversion model with VCTK?
The text was updated successfully, but these errors were encountered: