
Fix when TPU device check is run #469

Merged

muellerzr merged 11 commits into main from tpu-device on Jun 24, 2022

Conversation

muellerzr (Collaborator)

This PR fixes an issue where xm.xla_device() can't be called outside of xmp.spawn. As a result, the current behavior of is_tpu_available breaks the notebook launcher.

The proposed fix is to perform this check directly in AcceleratorState, outside the if chain, so that whether we are on a TPU device can still be checked properly.
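For context, a rough sketch of the approach the description outlines; the names and structure here are illustrative only, not the actual accelerate source:

# Illustrative sketch only: is_tpu_available is reduced to an import check,
# and the device probe moves into state, where it runs after processes
# have been spawned.
import importlib.util


def is_tpu_available():
    "Checks if `torch_xla` is installed"
    return importlib.util.find_spec("torch_xla") is not None


class AcceleratorState:
    def __init__(self):
        self.device = None
        if is_tpu_available():
            import torch_xla.core.xla_model as xm

            try:
                # Will raise a RuntimeError if no XLA configuration is found
                self.device = xm.xla_device()
            except RuntimeError:
                self.device = None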

muellerzr added the bug (Something isn't working) and TPU (Bug or feature on TPU platforms) labels on Jun 24, 2022
muellerzr requested a review from sgugger on June 24, 2022 at 14:58
HuggingFaceDocBuilderDev commented on Jun 24, 2022

The documentation is not available anymore as the PR was closed or merged.

sgugger (Collaborator) left a comment

We can maybe add an argument to the is_tpu_available function to not check for a TPU device sometimes, but we can't remove the device test entirely, as we added it for a reason.
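As a sketch of what that argument could look like (hypothetical code, using the check_device name from the diff further down; the default value is debated later in the thread):

import importlib.util

# Module-level import check; the name _tpu_installed is illustrative.
_tpu_installed = importlib.util.find_spec("torch_xla") is not None

if _tpu_installed:
    import torch_xla.core.xla_model as xm


def is_tpu_available(check_device=False):
    "Checks if `torch_xla` is installed and, optionally, if a TPU is in the environment"
    if not _tpu_installed:
        return False
    if check_device:
        try:
            # Will raise a RuntimeError if no XLA configuration is found
            _ = xm.xla_device()
        except RuntimeError:
            return False
    return True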

Comment on lines -41 to -46
try:
    # Will raise a RuntimeError if no XLA configuration is found
    _ = xm.xla_device()
    _tpu_available = True
except RuntimeError:
    _tpu_available = False
sgugger (Collaborator)

Mmm, removing this will break the other places we use is_tpu_available

muellerzr (Author)

The reason it was added applied specifically inside AcceleratorState, at the proposed location. But having it as an argument instead works as well. Will refactor.

@@ -56,8 +51,15 @@ def is_apex_available():
     return importlib.util.find_spec("apex") is not None


-def is_tpu_available():
-    "Checks if `torch_xla` is installed and if a TPU is in the environment"
+def is_tpu_available(check_device=False):
sgugger (Collaborator)

This needs to be True by default, and False when we don't want to check for the device (before launching multiprocessing for instance).

muellerzr (Author)

It needs to be False by default, because otherwise it also runs the device check during the import checks that are scattered around the library.

muellerzr (Author)

Or it could be made into a separate function if we don't want the False behavior. (I know we're not fans of that, but this is one case where it should be False.)
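Continuing the sketch from the earlier comment, hypothetical call sites that illustrate the trade-off being discussed (assuming the is_tpu_available sketched above):

# Import-time / pre-launch checks scattered around the library: skip the
# device probe so nothing calls xm.xla_device() outside the spawned processes.
if is_tpu_available():
    print("torch_xla is installed")

# After spawning (e.g. when AcceleratorState initializes), opt in to the
# device probe to confirm a TPU is actually reachable.
if is_tpu_available(check_device=True):
    print("TPU device is reachable")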

muellerzr requested a review from sgugger on June 24, 2022 at 16:04
sgugger (Collaborator) left a comment

Thanks for iterating!

muellerzr merged commit 9d8ed50 into main on Jun 24, 2022
muellerzr deleted the tpu-device branch on June 24, 2022 at 16:07
anw90 mentioned this pull request on Dec 4, 2023