Inquiry About the Language Distribution of the 200k-Hour Dataset and FACodec Support for Chinese #9

Open
xjf-303 opened this issue Dec 14, 2024 · 0 comments

xjf-303 commented Dec 14, 2024

@lifeiteng Thank you for your excellent work on NaturalSpeech 3! I have a couple of questions regarding the dataset and FACodec:

1. Regarding the 200k-hour internal dataset mentioned in the paper:
   - Could you please clarify the language distribution of this dataset? Is it predominantly English, or does it include other languages such as Chinese?

2. About FACodec:
   - Is FACodec designed to support Chinese audio, especially for extracting content, prosody, acoustic details, and timbre in Chinese speech?
   - If not, would additional training or fine-tuning on Chinese data be required to adapt FACodec for Chinese speech synthesis?

Your insights on this would be very helpful! Thank you for your time and for sharing your incredible research.
