Upload the model with push_to_hub in examples #297

leshanbog · 2021-06-30T10:24:14Z

Push model and optimizer checkpoints to the Hugging Face Hub using its API.
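A rough sketch of the idea, not the PR's actual code: write the model and optimizer state to a local checkpoint directory, then hand that directory to an upload callable. The names `save_and_push`, `upload_fn`, and both file names are hypothetical. In the ALBERT example the upload step would presumably be a Hugging Face Hub call such as the model's `push_to_hub` method, and the state would be serialized with `torch.save` rather than `pickle`; both are assumptions here.

```python
import pickle
from pathlib import Path
from typing import Any, Callable, Dict


def save_and_push(
    model_state: Dict[str, Any],
    optimizer_state: Dict[str, Any],
    checkpoint_dir: Path,
    upload_fn: Callable[[Path, str], None],
    step: int,
) -> Path:
    """Write both states under checkpoint_dir, then call upload_fn on it."""
    checkpoint_dir.mkdir(parents=True, exist_ok=True)
    # Hypothetical file names; real training code would likely use torch.save.
    with open(checkpoint_dir / "model_state.pkl", "wb") as f:
        pickle.dump(model_state, f)
    with open(checkpoint_dir / "optimizer_state.pkl", "wb") as f:
        pickle.dump(optimizer_state, f)
    # upload_fn stands in for the Hub client (an assumption, not the PR's API).
    # It receives the directory and a commit message naming the training step.
    upload_fn(checkpoint_dir, f"Step {step}")
    return checkpoint_dir
```

Keeping the upload behind a callable means the saving logic can be exercised locally without network access or Hub credentials.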

justheuristic

Everything appears to be in order.

NB: ignore the codecov message below, it's a lie.

Let's test this in one of the internal runs (we could take an existing run and add an auxiliary uploader @ QYP, then see if it can upload 2 consecutive checkpoints)
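The check proposed above could be dry-run locally along these lines. This is only a sketch under stated assumptions: `AuxiliaryUploader`, `uploads_two_consecutive`, and the `interval` parameter are hypothetical names, and `upload_fn` is a stand-in for whatever actually pushes a checkpoint to the Hub.

```python
from typing import Callable, List


class AuxiliaryUploader:
    """Uploads a checkpoint each time training advances by `interval` steps."""

    def __init__(self, upload_fn: Callable[[int], None], interval: int):
        self.upload_fn = upload_fn  # hypothetical stand-in for the Hub push
        self.interval = interval
        self.last_uploaded_step = 0
        self.uploaded_steps: List[int] = []

    def maybe_upload(self, current_step: int) -> bool:
        # Skip until at least `interval` steps have passed since the last upload.
        if current_step - self.last_uploaded_step < self.interval:
            return False
        self.upload_fn(current_step)
        self.last_uploaded_step = current_step
        self.uploaded_steps.append(current_step)
        return True


def uploads_two_consecutive(uploader: AuxiliaryUploader, steps: List[int]) -> bool:
    """Feed observed steps to the uploader; report whether at least 2 uploads ran."""
    for step in steps:
        uploader.maybe_upload(step)
    return len(uploader.uploaded_steps) >= 2
```

With a recording callable in place of the real upload, feeding the steps 50, 100, 150, 200 at `interval=100` triggers uploads at steps 100 and 200, which is the "2 consecutive checkpoints" condition.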

codecov · 2021-07-02T10:32:10Z

Codecov Report

Merging #297 (294f474) into master (e9956b8) will increase coverage by 0.16%.
The diff coverage is 92.19%.

@@            Coverage Diff             @@
##           master     #297      +/-   ##
==========================================
+ Coverage   81.86%   82.03%   +0.16%     
==========================================
  Files          63       65       +2     
  Lines        5813     5856      +43     
==========================================
+ Hits         4759     4804      +45     
+ Misses       1054     1052       -2

| Impacted Files | Coverage Δ | |
|---|---|---|
| hivemind/averaging/group_info.py | `100.00% <ø> (ø)` | |
| hivemind/averaging/load_balancing.py | `95.91% <ø> (ø)` | |
| hivemind/averaging/partition.py | `98.01% <ø> (ø)` | |
| hivemind/hivemind_cli/run_server.py | `0.00% <0.00%> (ø)` | |
| hivemind/moe/client/expert.py | `96.49% <ø> (ø)` | |
| hivemind/moe/server/expert_uid.py | `69.64% <0.00%> (ø)` | |
| hivemind/moe/server/layers/lr_schedule.py | `100.00% <ø> (ø)` | |
| hivemind/optim/adaptive.py | `77.77% <ø> (ø)` | |
| hivemind/optim/base.py | `71.42% <ø> (-1.30%)` | ⬇️ |
| hivemind/p2p/p2p_daemon.py | `91.18% <ø> (-0.45%)` | ⬇️ |

... and 46 more

- Removed hivemind.utils.threading.run_in_background and HIVEMIND_THREADS
- Refactored MPFuture to be a single object instead of a linked pair of objects
- MPFuture now uses a single process-wide pipe and thread, instead of spawning new pipe/thread for each future
- MPFuture.result/exception can now only be awaited from the process that created it
- MPFuture now returns the same exception types as regular future (and as asyncio.Future in __await__)
- Added more thorough tests for MPFuture

Co-authored-by: Alexander Borzunov
Co-authored-by: Max Ryabinin
Co-authored-by: Michael Diskin

leshanbog force-pushed the better_model_upload branch 3 times, most recently from 716cd33 to 5c0ca8c, on June 30, 2021 10:34

Change model uploading method

f037ea6

leshanbog force-pushed the better_model_upload branch from 5c0ca8c to f037ea6 on June 30, 2021 11:36

yhn112 self-requested a review July 1, 2021 11:08

justheuristic approved these changes Jul 2, 2021

Merge branch 'master' into better_model_upload

86e28a2

yhn112 and others added 3 commits July 2, 2021 20:45

Make checkpointing optional in example (#303)

2e1bb9c

Split hivemind.client into hivemind.averaging and hivemind.moe (#304)

5233b6c

* Split hivemind.client into hivemind.averaging and hivemind.moe
* Reduce the number of wildcard imports, update docs

mryab reviewed Jul 5, 2021

yhn112 and others added 5 commits July 5, 2021 16:57

Update readthedocs with hivemind.optim (#288)

cc8d39c

Co-authored-by: justheuristic

Minor fixes in examples/albert (#308)

3a66271

Update examples/albert/run_first_peer.py

852d4bd

Co-authored-by: Max Ryabinin

Update examples/albert/run_first_peer.py

573c388

Co-authored-by: Max Ryabinin

Update examples/albert/run_first_peer.py

8bf47ef

Co-authored-by: Max Ryabinin

mryab approved these changes Jul 7, 2021

mryab and others added 6 commits July 7, 2021 15:46

Apply suggestions from code review

e3de5bb

Change model uploading method

2fc1bb7

Update examples/albert/run_first_peer.py

a496733

Co-authored-by: Max Ryabinin

Update examples/albert/run_first_peer.py

dec832a

Co-authored-by: Max Ryabinin

Update examples/albert/run_first_peer.py

c16c774

Co-authored-by: Max Ryabinin

Apply suggestions from code review

da26063

mryab changed the title from "Change model uploading method" to "Upload the model with push_to_hub in examples" on Jul 7, 2021

Merge remote-tracking branch 'origin/better_model_upload' into better_model_upload

294f474

mryab merged commit 2436a3b into master Jul 7, 2021

mryab deleted the better_model_upload branch July 7, 2021 14:18
