
Improve type annotations in worker.py #5814

Merged 12 commits on Mar 1, 2022

Conversation

crusaderky (Collaborator):

No description provided.

@@ -1021,7 +1039,7 @@ def __init__(
         )
         self.periodic_callbacks["keep-alive"] = pc

-        pc = PeriodicCallback(self.find_missing, 1000)
+        pc = PeriodicCallback(self.find_missing, 1000)  # type: ignore
crusaderky (Collaborator, Author):

Work around a bug in tornado's type annotations, which declare that callback can only be a sync function, whereas we pass async functions (and they work fine).


@crusaderky crusaderky self-assigned this Feb 15, 2022
github-actions bot (Contributor) commented Feb 16, 2022:

Unit Test Results

       12 files  ±0         12 suites  ±0   7h 7m 58s ⏱️ +3m 19s
  2 621 tests ±0    2 535 ✔️ −4    81 💤 +1    5 ❌ +4
 15 650 runs  ±0   14 781 ✔️ −6   864 💤 +4    5 ❌ +3

For more details on these failures, see this check.

Results for commit 02ede76. ± Comparison against base commit a86f4bb.

♻️ This comment has been updated with latest results.

@@ -1021,7 +1039,7 @@ def __init__(
         )
         self.periodic_callbacks["keep-alive"] = pc

-        pc = PeriodicCallback(self.find_missing, 1000)
+        pc = PeriodicCallback(self.find_missing, 1000)  # type: ignore
graingert (Member) commented Feb 16, 2022:

I think it's better to use a more specific ignore here:

Suggested change:
- pc = PeriodicCallback(self.find_missing, 1000)  # type: ignore
+ pc = PeriodicCallback(self.find_missing, 1000)  # type: ignore[arg-type]

crusaderky (Collaborator, Author) commented Feb 16, 2022:

I'm unsure how I feel about this. IMHO the generic ignore is already a lot of burden, and fully specifying it would just detract from readability.
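To illustrate the trade-off being debated, here is a minimal sketch (not dask code; `schedule_periodic` is a hypothetical stand-in for tornado's `PeriodicCallback` signature, which annotates the callback as a plain synchronous callable):

```python
import asyncio
from typing import Callable

def schedule_periodic(callback: Callable[[], None], interval_ms: int) -> None:
    """Toy stand-in for an API whose stubs demand a sync callback."""
    result = callback()
    # At runtime an async callable simply returns a coroutine, which the
    # real event loop would schedule; here we just run it to completion.
    if asyncio.iscoroutine(result):
        asyncio.run(result)

async def find_missing() -> None:
    print("scanning for missing keys")

# mypy flags this call with error code [arg-type] because the annotation
# demands a sync callable. The scoped ignore suppresses only that code;
# a bare "# type: ignore" would also hide any unrelated error that later
# appears on the same line.
schedule_periodic(find_missing, 1000)  # type: ignore[arg-type]
```

The scoped form documents exactly which check is being worked around, at the cost of a longer comment.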

@crusaderky crusaderky marked this pull request as ready for review February 16, 2022 11:15
@crusaderky crusaderky mentioned this pull request Feb 16, 2022
crusaderky (Collaborator, Author):

This PR conflicts with #5820. Whichever is merged last will need to be hammered a bit.

fjetter (Member) commented Feb 17, 2022:

> This PR conflicts with #5820. Whichever is merged last will need to be hammered a bit.

Feel free to merge yours first

crusaderky (Collaborator, Author) commented Feb 17, 2022:

Agreed during an offline chat to merge #5820 first.

crusaderky (Collaborator, Author):

This is ready for final review and merge.

fjetter (Member) left a comment:

Re: state machine

I think I didn't find any changes to the logic. You were concerned about this; did I miss anything? The one place I found with a potential impact is the run_spec deserialization, but as I commented, that's OK.

Re: active threads

I'm pretty sure this will break, and I doubt we have a test for this race condition. If you revert this change, we're good to merge from my POV.

Comment on lines +3601 to +3602
if ts.run_spec is None:
return None
fjetter (Member):

This appears to be the only logical change in here. I checked the code again, and this is safe right now.

There is a possible optimization where we store the deserialized run_spec (function, args, kwargs; see below) on this attribute, in case the task needs to be recomputed and is not forgotten in between. We're not doing this right now; maybe we did in the past. For the current state of the code this is fine.
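The None guard and the caching idea mentioned above could be sketched roughly like this (a toy model, not distributed's actual TaskState API; `cached_call` and `get_call` are hypothetical names, and pickle stands in for the real deserialization):

```python
import pickle
from typing import Callable, Optional, Tuple

class TaskState:
    """Toy stand-in for the worker's TaskState."""
    def __init__(self, run_spec: Optional[bytes]) -> None:
        self.run_spec = run_spec
        # Hypothetical cache slot for the deserialized spec.
        self.cached_call: Optional[Tuple[Callable, tuple, dict]] = None

def get_call(ts: TaskState) -> Optional[Tuple[Callable, tuple, dict]]:
    # The guard from the PR: a task whose run_spec was cleared yields None.
    if ts.run_spec is None:
        return None
    # The optimization discussed in review: keep the deserialized
    # (function, args, kwargs) so a recomputed task skips this work.
    if ts.cached_call is None:
        ts.cached_call = pickle.loads(ts.run_spec)
    return ts.cached_call

ts = TaskState(pickle.dumps((len, ((1, 2, 3),), {})))
func, args, kwargs = get_call(ts)
print(func(*args, **kwargs))  # 3
```

A second `get_call(ts)` returns the cached tuple without deserializing again.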

-        active_threads = self.active_threads.copy()
-        frames = {k: frames[ident] for ident, k in active_threads.items()}
+        sys_frames = sys._current_frames()
+        frames = {key: sys_frames[tid] for tid, key in self.active_threads.items()}
fjetter (Member):

I think the active_threads copy was necessary due to a threading race condition: the active_threads dict is modified by another thread, that modification is not thread safe, and iterating over the dict during such a modification would raise a "changed size during iteration" exception.

crusaderky (Collaborator, Author):

Isn't the whole purpose of with self.active_threads_lock: to avoid this?
Also, if the lock were not effective, you'd be relying on dict.copy() to hold the GIL for the whole duration of the operation, which, even if true, would be an implementation detail of CPython that can change at any moment.

crusaderky (Collaborator, Author):

Pushed a minor tweak to the method.

fjetter (Member):

> Isn't the whole purpose of with self.active_threads_lock: to avoid this?

Sorry, I missed the lock.

I found the issue I remembered and it was a different section of the code and indeed not protected by a lock, see #4729

Sorry for holding this up

crusaderky (Collaborator, Author):

@fjetter merging this as soon as tests are finished

Labels: None yet
Projects: None yet
3 participants