Follow-up work available for viewing #44
Comments
That's very impressive! Are there plans to release the models for these new works?
Most likely, but it will probably take a long time. All these works are the intellectual property of companies.
Understandable. Again, great work and a real milestone in the field!
Thanks!
The second work is of particular interest; adding emotions to synthesized speech is still rather hit-and-miss.
Your work has been amazing!
@qq547276542 Thanks! More follow-up works will be released. Stay tuned.
We have further improved AutoVC in 2 subsequent works.
The 1st work improves the audio quality by removing any pitch artifacts.
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder
https://arxiv.org/abs/2004.07370
The 2nd work can convert rhythm, pitch, and/or timbre at the same time.
Unsupervised Speech Decomposition via Triple Information Bottleneck
https://arxiv.org/abs/2004.11284