Add C++ and Python API for Kokoro 1.0 multilingual TTS model #1795
+819
−39
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Usage
Build sherpa-onnx
Download model files
Run it
There are 53 speakers in the model, with speaker ID
0 -- 52
.The mapping between speaker ID and speaker name is given below:
Generated sample waves
Note:
af_alloy->0
kokoro-0.mov
af_aoede->1
kokoro-1.mov
af_bella->2
kokoro-2.mov
af_heart->3
kokoro-3.mov
am_adam->11
kokoro-11.mov
bf_isabella->22
kokoro-22.mov
bm_daniel->24
kokoro-24.mov
hf_alpha->31
kokoro-31.mov
hm_omega->33
kokoro-33.mov
jf_nezumi->39
kokoro-39.mov
jm_kumo->41
kokoro-41.mov
zf_xiaobei->45
kokoro-45.mov
zf_xiaoni->46
kokoro-46.mov
zf_xiaoxiao->47
kokoro-47.mov
zf_xiaoyi->48
kokoro-48.mov
zm_yunjian->49
kokoro-49.mov
zm_yunxi->50
kokoro-50.mov
zm_yunxia->51,
kokoro-51.mov
zm_yunyang->52
kokoro-52.mov