Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parallel Wavenet-Vocoder #5

Open
1 task
twidddj opened this issue Apr 22, 2018 · 14 comments
Open
1 task

Parallel Wavenet-Vocoder #5

twidddj opened this issue Apr 22, 2018 · 14 comments

Comments

@twidddj
Copy link
Owner

twidddj commented Apr 22, 2018

Planed TODO

  • KL + Power - Single speaker

Properties not specified in the paper

  • Sampling number for the loss (We may have some limitation for GPU)
  • Number of mixture for IAF layers
  • Averaging method for Power loss
    • ex) Just reduce_mean on time axis or using moving average or ..
  • .. (Please, let us know those)

Another implementations

@neverjoe
Copy link

neverjoe commented Apr 25, 2018

sadly, there are many details behind the paper, i find nobody can reproduce result.

@twidddj
Copy link
Owner Author

twidddj commented Apr 25, 2018

Most of all, I'm not sure mel-spectrogram would fit well to reproduce result like the linguistic features in the paper. We may have to consider another constraints to make up for the weakness of mel-features.

@neverjoe
Copy link

I do not think mel-features is the key problem of me, i think the iaf and probability Density Distillation is very very import for the quality.

@maozhiqiang
Copy link

maozhiqiang commented Apr 26, 2018

@neverjoe do you tell me how to connect teacher model to eval student !Are they all trained in one sess! thanks

@neverjoe
Copy link

neverjoe commented Apr 26, 2018 via email

@maozhiqiang
Copy link

hi ! How to let teachers' network parameters do not participate in training? thanks

@xuerq
Copy link

xuerq commented Apr 28, 2018

@maozhiqiang
"tf.stop_gradient" in tensorflow

@maozhiqiang
Copy link

thank you @xuerq

@neverjoe
Copy link

neverjoe commented May 1, 2018 via email

@xuerq
Copy link

xuerq commented May 1, 2018

@neverjoe still working on it,the wav sampled from student is not so good as teacher

@maozhiqiang
Copy link

@xuerq ,@neverjoe @twidddj how to use teacher model to eval students output? Is it use training process to assessment or using generative process to assessment? thanks!

@maozhiqiang
Copy link

@twidddj do you get reasonable result about paralle wavenet!

@twidddj
Copy link
Owner Author

twidddj commented Jun 14, 2018

Hi @maozhiqiang, we are still trying to get better result of it. We have got some results, but It's not enough. I attached the results here. Thanks for your attention of our project!

@maozhiqiang
Copy link

hi @twidddj ! Thank you very much for your reply! do you used KL loss?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants