During training, the loss is nan #7

Open
luoyq6 opened this issue Oct 12, 2022 · 8 comments

Comments

@luoyq6

luoyq6 commented Oct 12, 2022

I trained with the following command:
python train.py --dataset coco_base --flip --net hanmcl --lr 0.001 --lr_decay_step 12 --bs 4 --epochs 12 --disp_interval 20 --save_dir models/hanmcl --way 2 --shot 3
The losses are already huge or NaN at iteration 0, and all of them are NaN by iteration 20:
iter 0/34468:rpn_cls: 2921850624.0000, rpn_box: 962897664.0000, rcnn_cls: nan, rcnn_box 102172360.0000, metric nan
iter 20/34468:rpn_cls: nan, rpn_box: nan, rcnn_cls: nan, rcnn_box nan, metric nan
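
When reporting a log like the one above, it can help to state which loss term goes non-finite first. The following is a minimal sketch that parses lines in the format shown and reports the first iteration containing a NaN or infinite value. The helper names `parse_line` and `first_non_finite` are hypothetical and are not part of this repository; the regular expressions assume the exact log layout printed above.

```python
import math
import re

# Assumed log layout: "iter <step>/<total>:<name>: <value>, <name> <value>, ..."
ITER = re.compile(r"iter\s+(\d+)/(\d+)")
# A loss name, an optional colon, then a number, "nan", or "inf".
PAIR = re.compile(
    r"(\w+):?\s+(nan|inf|-inf|-?\d+(?:\.\d+)?(?:e[+-]?\d+)?)",
    re.IGNORECASE,
)


def parse_line(line):
    """Return (step, {loss_name: value}) or None if the line is not a loss line."""
    match = ITER.search(line)
    if match is None:
        return None
    step = int(match.group(1))
    rest = line[match.end():]
    losses = {name: float(value) for name, value in PAIR.findall(rest)}
    return step, losses


def first_non_finite(lines):
    """Return (step, [loss names]) for the first line with a NaN/inf loss, else None."""
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue
        step, losses = parsed
        bad = [name for name, value in losses.items() if not math.isfinite(value)]
        if bad:
            return step, bad
    return None


if __name__ == "__main__":
    log = [
        "iter 0/34468:rpn_cls: 2921850624.0000, rpn_box: 962897664.0000, "
        "rcnn_cls: nan, rcnn_box 102172360.0000, metric nan",
        "iter 20/34468:rpn_cls: nan, rpn_box: nan, rcnn_cls: nan, "
        "rcnn_box nan, metric nan",
    ]
    print(first_non_finite(log))
```

For the two lines posted here, the sketch points at `rcnn_cls` and `metric` at iteration 0. The remaining terms at that iteration are still finite but on the order of 1e8 to 1e9, which may suggest the values are diverging before the first parameter update rather than partway through training.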

@giangdip2410

Same here. The loss becomes NaN after 1 or 2 epochs. @infinity7428, could you please look into this issue?

@winnie-nqw

Same here. The loss becomes NaN after 1 or 2 epochs. @infinity7428, could you please look into this issue?

@yuan738

yuan738 commented Nov 6, 2022

Did you manage to solve this? I'm running into the same problem and would appreciate any help. Thanks.

@infinity7428
Owner

The code will probably be updated this week.

@yuan738

yuan738 commented Nov 9, 2022

Thank you for your reply. I hope to see the updated code soon.

@infinity7428
Owner

I have updated the code. If the same problem still occurs, please mention me.

@yuan738

yuan738 commented Dec 15, 2022

Thank you!

@Nepenthes-zlc

I still see the problem: the rcnn_cls and metric losses are NaN.

6 participants