
Loss exploded??? #2

Open
WhiteFu opened this issue Apr 20, 2019 · 9 comments

WhiteFu commented Apr 20, 2019

I get the error "Loss exploded" during the training stage.
I haven't modified the original hyperparameters, and I want to know how to solve this problem.

@adimukewar

I am facing the same issue. Please let me know if you have resolved it.

rishikksh20 (Owner) commented May 7, 2019

Actually, it is a common occurrence when dealing with a variational autoencoder. There are two ways to resolve it:

  1. Restart training from a checkpoint saved 3 or 4 saves back (not the most recent one). But be prepared: the loss may explode again after running for a while; if it does, repeat the same process.
  2. In the file train.py, on line 133, change the upper limit used in the loss check (see the sketch after this comment).

@adimukewar @WhiteFu
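
For reference, here is a minimal sketch of the kind of loss-explosion guard that Tacotron-style training scripts typically place around line 133 of train.py; the threshold constant and names below are assumptions for illustration, not this repository's exact code:

```python
import math

# Assumed threshold: raising this value makes the guard more tolerant of loss spikes.
LOSS_EXPLOSION_THRESHOLD = 100.0

def check_loss(loss_value, step):
    """Abort training when the loss becomes NaN or exceeds the threshold."""
    if math.isnan(loss_value) or loss_value > LOSS_EXPLOSION_THRESHOLD:
        raise RuntimeError(
            'Loss exploded to {:.5f} at step {}'.format(loss_value, step))

# Example: check_loss(2.31, step=12000) passes silently,
# while check_loss(float('nan'), step=12001) raises RuntimeError.
```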

rishikksh20 self-assigned this May 7, 2019
WhiteFu (Author) commented May 7, 2019

Thanks for your reply, I will try it immediately!

WhiteFu (Author) commented May 7, 2019

> I am facing the same issue. Please let me know if you have resolved it.

Sorry, I didn't reply to you in time. I have been working on some other things recently, so I haven't solved this problem yet.

@rishikksh20 (Owner)

@WhiteFu if you are using this code, then use a large (more than 50 hours), expressive dataset like Blizzard to get a decent result.

@MisakaMikoto96

> I am facing the same issue. Please let me know if you have resolved it.
>
> Sorry, I didn't reply to you in time. I have been working on some other things recently, so I haven't solved this problem yet.

Hi, I have the same problem. I tried modifying some hparams, but it still doesn't work. Please let me know if you have solved this. Thanks 😄

WhiteFu (Author) commented Jun 19, 2019

The loss is not stable, so you can modify the upper limit used in the loss check in the file train.py on line 133.


MisakaMikoto96 commented Jun 20, 2019

> The loss is not stable, so you can modify the upper limit used in the loss check in the file train.py on line 133.

Hi, but it seems my loss = NaN (every time at the same step during training), and I tried modifying the batch size and the learning rate, but it still doesn't work.

@rishikksh20 (Owner)

@MisakaMikoto96 Regarding the NaN loss: it means your variational autoencoder (VAE) is unable to learn the latent representation. This is a common problem when dealing with a variational autoencoder, but the sad part is that there is no simple solution for it.
One solution you can try is to go to that line and manipulate w1 and w2 (see the sketch below).
But before that, make sure you have an adequate quantity of expressive voice data. Also, after getting the error, restarting training from a checkpoint saved 2 saves back has sometimes worked fine for me; if you get the error again at the same checkpoint, restart from 3 saves back, and so on. If you keep getting NaN at the same step count, then try the above solution.
You can also read the variational autoencoder paper for more understanding; otherwise, feel free to ask here.
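
To illustrate the w1/w2 suggestion, here is a minimal, framework-agnostic sketch of a weighted VAE objective with KL annealing; the names w1, w2, kl_anneal_steps, and the schedule itself are assumptions for illustration, not the exact code in this repository's train.py:

```python
def combined_vae_loss(recon_loss, kl_loss, step,
                      w1=1.0, w2=1.0, kl_anneal_steps=10000):
    """Weight the reconstruction and KL terms of a VAE objective.

    Ramping the KL weight up gradually (KL annealing) is a common way to keep
    the KL term from destabilising training early on; lowering w2 reduces the
    pressure on the latent space, which can help avoid NaN losses.
    """
    kl_weight = w2 * min(1.0, step / float(kl_anneal_steps))
    return w1 * recon_loss + kl_weight * kl_loss

# Example: at step 500 the KL term contributes only 5% of its full weight:
# combined_vae_loss(recon_loss=0.8, kl_loss=4.2, step=500) == 0.8 + 0.05 * 4.2
```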
