Fix label datatype in TF Trainer #9616

jplu · 2021-01-15T10:02:58Z

What does this PR do?

This PR fixes the case where labels can be either a dict or a tf.Tensor when doing gradient accumulation.

sgugger

This looks okay to me, but it looks increasingly clearer that we should have tests of the TFTrainer otherwise we are doing more harm than good by merging those kinds of PRs.

LysandreJik

Ok, LGTM!

LysandreJik · 2021-01-15T13:46:14Z

I agree with Sylvain that while this is not tested, it's hard to recommend using it.

Fix label datatype

d19b63c

jplu requested review from sgugger and LysandreJik January 15, 2021 10:03

jplu mentioned this pull request Jan 15, 2021

Gradient accumulation for TFTrainer #9585

Merged

5 tasks

Apply style

74e4f34

sgugger reviewed Jan 15, 2021

View reviewed changes

LysandreJik approved these changes Jan 15, 2021

View reviewed changes

jplu merged commit 12f0d7e into huggingface:master Jan 20, 2021

jplu deleted the fix-trainer branch January 20, 2021 11:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix label datatype in TF Trainer #9616

Fix label datatype in TF Trainer #9616

jplu commented Jan 15, 2021

sgugger left a comment

LysandreJik left a comment

LysandreJik commented Jan 15, 2021

Fix label datatype in TF Trainer #9616

Fix label datatype in TF Trainer #9616

Conversation

jplu commented Jan 15, 2021

What does this PR do?

sgugger left a comment

Choose a reason for hiding this comment

LysandreJik left a comment

Choose a reason for hiding this comment

LysandreJik commented Jan 15, 2021