Update examples to show how to deal with extra validation copies #319

muellerzr · 2022-04-19T18:29:06Z

Update examples to show how to truncate the validation set for metrics

What does this add?

Based on this issue, this PR updates all examples to show how to get rid of the extra samples that get added to the validation set when performing distributed training.
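As a rough illustration of the idea (not the code from this PR), here is a pure-Python simulation that does not call Accelerate at all. The helper names `gather_eval_batches` and `drop_extra_samples` are hypothetical, and the sketch assumes the extra samples come from padding the dataset with repeats from its start so every process gets full batches. It tracks how many samples have been seen and truncates only the last gathered batch.

```python
def gather_eval_batches(dataset, world_size, per_device_batch_size):
    """Simulate the batches seen after gathering across processes.

    Assumption: the sampler pads the dataset by repeating samples from the
    start so that every process receives the same number of full batches.
    """
    global_batch_size = world_size * per_device_batch_size
    padded = list(dataset)
    remainder = len(padded) % global_batch_size
    if remainder:
        padded += padded[: global_batch_size - remainder]
    return [
        padded[start : start + global_batch_size]
        for start in range(0, len(padded), global_batch_size)
    ]


def drop_extra_samples(gathered_batches, dataset_len):
    """Truncate the final gathered batch so only `dataset_len` samples remain."""
    samples_seen = 0
    kept = []
    last_step = len(gathered_batches) - 1
    for step, batch in enumerate(gathered_batches):
        if step == last_step:
            # Only the last batch can contain padding duplicates.
            batch = batch[: dataset_len - samples_seen]
        else:
            samples_seen += len(batch)
        kept.extend(batch)
    return kept


dataset = list(range(10))
batches = gather_eval_batches(dataset, world_size=4, per_device_batch_size=2)
cleaned = drop_extra_samples(batches, len(dataset))
print(sum(len(b) for b in batches), len(cleaned))
```

With 10 samples, 4 simulated processes, and a per-device batch size of 2, the gathered batches hold 16 samples, and truncation brings that back to the original 10 before any metric would be computed.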

Testing on a multi-GPU system will happen tomorrow, but @sgugger, I'm pretty sure the way I have it set up ensures that this only runs when we have a distributed system, which is where this problem arises?

Who is it for?

Should close #287

Why is it needed?

It's unclear from the scripts how to alleviate this behavior, and it's not documented anywhere. With this PR, it now is.

HuggingFaceDocBuilderDev · 2022-04-19T18:37:35Z

The documentation is not available anymore as the PR was closed or merged.

muellerzr · 2022-04-19T19:20:34Z

Note: This is just an initial pass to make sure the format and whatnot look right; all the other examples will then follow suit :)

sgugger

Thanks for the PR! I'd just put this in a specific feature example instead of the base one.

sgugger

Thanks! Not sure what your question is about tests. To test this, we would need to know the exact value of the metric in advance and make sure we get it again, but that's very finicky since the metric computed in the base script is roughly the same.
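To make the point concrete, here is a toy sketch with made-up labels and predictions (not values from any example script). It compares accuracy on 10 samples against accuracy on the same samples plus 6 assumed duplicates taken from the start of the set. The `accuracy` helper is a hypothetical stand-in for the real metric.

```python
def accuracy(predictions, labels):
    """Fraction of predictions that match their labels."""
    correct = sum(p == l for p, l in zip(predictions, labels))
    return correct / len(labels)


# Made-up data: 10 samples, with only the last two predicted wrongly.
labels = [0, 1] * 5
predictions = list(labels)
predictions[8] = 1 - predictions[8]
predictions[9] = 1 - predictions[9]

# Assumed padding: the first 6 samples are repeated at the end.
extra = 6
padded_labels = labels + labels[:extra]
padded_predictions = predictions + predictions[:extra]

true_accuracy = accuracy(predictions, labels)
padded_accuracy = accuracy(padded_predictions, padded_labels)
print(true_accuracy, padded_accuracy)
```

Here the score moves from 0.8 to 0.875 because the set is tiny. On a real validation set, where the duplicates are a small fraction of the samples, the shift would be much smaller, which is why an exact-value test is hard to pin down.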

examples/by_feature/multi_node_metrics.py

Co-authored-by: Sylvain Gugger <[email protected]>

Be mindful of validation dataset values

79b5d46

muellerzr added the enhancement and documentation labels Apr 19, 2022

muellerzr requested a review from sgugger April 19, 2022 18:29

sgugger reviewed Apr 19, 2022

muellerzr added 3 commits April 20, 2022 12:05

Merge branch 'main' of https://github.com/huggingface/accelerate into…

7d8187b

… fix-validation-examples

Move to seperate example

d701bd1

Wrong patch

518157b

muellerzr requested a review from sgugger April 20, 2022 16:31

sgugger approved these changes Apr 20, 2022

examples/by_feature/multi_node_metrics.py (outdated, resolved)

examples/by_feature/multi_node_metrics.py (outdated, resolved)

muellerzr and others added 4 commits April 20, 2022 13:43

Copyright date, some day I'll catch these in advance

0aa5ba2

Co-authored-by: Sylvain Gugger <[email protected]>

Rename to multi_process_metrics

63b4b24

Finish rename + import errors

fbea695

Good news, my test works

bee4abe

muellerzr merged commit fa476d0 into main Apr 20, 2022

muellerzr deleted the fix-validation-examples branch April 20, 2022 18:03
