Simple Status Checking #655

MattToast · 2024-08-02T17:50:48Z

Add the ability for the Experiment to fetch the status of a launched job that it started, given a LaunchedJobID. Teach the ShellLauncher and DragonLauncher to get the statuses of jobs they have launched.

smartsim/settings/dispatch.py

juliaputko · 2024-08-02T20:47:26Z

smartsim/_core/launcher/dragon/dragonLauncher.py

@@ -144,6 +145,10 @@ def start(
        res = _assert_schema_type(self._connector.send_request(req), DragonRunResponse)
        return LaunchedJobID(res.step_id)

+    def get_status(self, *launched_ids: LaunchedJobID) -> tuple[SmartSimStatus, ...]:
+        infos = self._get_managed_step_update(list(launched_ids))


infos plural intentional?

infos, plural intentional! The return type of _get_managed_step_update is list[StepInfo], so I pluralized as a reminder that this is a "collection of many StepInfo instances".

If it's unclear or confusing to read, I'm more than willing to rename it to info_list or similar!

MattToast · 2024-08-02T21:13:39Z

smartsim/settings/dispatch.py

+    @abc.abstractmethod
+    def get_status(
+        self, *launched_ids: LaunchedJobID
+    ) -> tuple[SmartSimStatus, ...]: ...


Currently, this assumes that with this call:

stat_1, stat_2, stat_3, ... = launcher.get_status(id_1, id_2, id_3, ...)

stat_1 is the status of the job with id id_1, stat_2 is the status of the job with id id_2, and so on. The Experiment relies on this behavior.

Do we like this or is this too much of an "implied constraint" from the protocol?

Should the return type here actually be a Mapping[LaunchedJobID, SmartSimStatus] or maybe an Iterable[tuple[LaunchedJobID, SmartSimStatus]]? That way the intention of what this method should return is more obvious to users looking to write and register their own launchers.

I feel like it might not be obvious to all SS users that stat_1 and stat_2 will map to id_1 and id_2, but I might be wrong. I would rather get back a Mapping or an Iterable. I'm wondering which will be easier for the user to inspect: maybe an Iterable, since you can use a for loop? But then again, I can search by key in a Mapping. Am I on to something here, or is this way off?

Agreed offline to make this protocol method return a Mapping.
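For readers following the thread, here is a minimal sketch of what a Mapping-returning protocol method could look like. It is illustrative only, not the merged SmartSim code: `LaunchedJobID` and `SmartSimStatus` are simplified stand-ins for the real types, and `LauncherProtocol` and `InMemoryLauncher` are hypothetical names.

```python
from __future__ import annotations

import abc
import enum
from collections.abc import Mapping
from typing import NewType

# Simplified stand-ins for the SmartSim types named in this thread (assumptions).
LaunchedJobID = NewType("LaunchedJobID", str)


class SmartSimStatus(enum.Enum):
    RUNNING = "Running"
    COMPLETED = "Completed"


class LauncherProtocol(abc.ABC):
    """Hypothetical protocol: statuses are keyed by ID, not by position."""

    @abc.abstractmethod
    def get_status(
        self, *launched_ids: LaunchedJobID
    ) -> Mapping[LaunchedJobID, SmartSimStatus]:
        """Return the status of each requested job, keyed by its ID."""


class InMemoryLauncher(LauncherProtocol):
    """Hypothetical launcher that tracks job statuses in a plain dict."""

    def __init__(self, statuses: dict[LaunchedJobID, SmartSimStatus]) -> None:
        self._statuses = dict(statuses)

    def get_status(
        self, *launched_ids: LaunchedJobID
    ) -> Mapping[LaunchedJobID, SmartSimStatus]:
        return {id_: self._statuses[id_] for id_ in launched_ids}


id_1, id_2 = LaunchedJobID("job-1"), LaunchedJobID("job-2")
launcher = InMemoryLauncher(
    {id_1: SmartSimStatus.RUNNING, id_2: SmartSimStatus.COMPLETED}
)
# The caller looks statuses up by ID, so argument order no longer matters.
statuses = launcher.get_status(id_2, id_1)
```

With a Mapping, someone writing their own launcher cannot accidentally return statuses in the wrong order, since each status is tied explicitly to the ID it describes.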

amandarichardsonn

Looks awesome, just a couple of comments

smartsim/experiment.py

amandarichardsonn · 2024-08-06T01:02:47Z

smartsim/experiment.py

+        :returns: A tuple of statuses with order respective of the order of the
+            calling arguments.
+        """
+        ids_ = set(ids)


would you mind commenting this code? Sorry I always ask that MERP

Absolutely!

Sorry I always ask that MERP

Don't be sorry! If it's not immediately obvious what's happening, that's a good sign comments are needed. I don't want to over-comment, so I tend to comment only the sections that reviewers ask for more details on!

Basically I'm just offloading the work of figuring out what needs comments onto you, the reviewer, lol.
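As a rough illustration of what the `ids_ = set(ids)` line might be doing, the sketch below deduplicates the requested IDs, fetches each status once, and then rebuilds a tuple in the order of the calling arguments, as the docstring above describes. This is an assumed reading, not the actual Experiment code: `statuses_in_call_order` and `fake_fetch` are hypothetical, and plain strings stand in for IDs and statuses.

```python
from __future__ import annotations

from collections.abc import Callable, Mapping, Sequence, Set


def statuses_in_call_order(
    ids: Sequence[str],
    fetch: Callable[[Set[str]], Mapping[str, str]],
) -> tuple[str, ...]:
    """Return one status per requested ID, in the caller's argument order."""
    # Deduplicate so that each job is only asked about once.
    unique_ids = frozenset(ids)
    by_id = fetch(unique_ids)
    # Rebuild the result in call order, repeating statuses for repeated IDs.
    return tuple(by_id[id_] for id_ in ids)


# Hypothetical backend that records what it was asked for.
requested: list[Set[str]] = []


def fake_fetch(unique_ids: Set[str]) -> Mapping[str, str]:
    requested.append(unique_ids)
    known = {"a": "Running", "b": "Completed"}
    return {id_: known[id_] for id_ in unique_ids}


result = statuses_in_call_order(["b", "a", "b"], fake_fetch)
```

Here the repeated ID "b" is looked up only once but still appears twice in the result, at the positions where the caller passed it.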

amandarichardsonn · 2024-08-06T01:11:06Z

smartsim/settings/dispatch.py

+    @abc.abstractmethod
+    def get_status(
+        self, *launched_ids: LaunchedJobID
+    ) -> tuple[SmartSimStatus, ...]: ...


I feel like it might not be obvious to all SS users that stat_1 and stat_2 will map to id_1 and id_2, but I might be wrong. I would rather get back a Mapping or an Iterable. I'm wondering which will be easier for the user to inspect: maybe an Iterable, since you can use a for loop? But then again, I can search by key in a Mapping. Am I on to something here, or is this way off?

tests/temp_tests/test_settings/conftest.py

smartsim/experiment.py

smartsim/error/errors.py

smartsim/settings/dispatch.py

smartsim/_core/utils/helpers.py

mellis13

Very small comments otherwise LGTM

codecov · 2024-08-08T23:48:24Z

Codecov Report

Attention: Patch coverage is 39.02439% with 50 lines in your changes missing coverage. Please review.

Please upload report for BASE (smartsim-refactor@a2c1251). Learn more about missing BASE report.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| smartsim/_core/control/launch_history.py | 43.47% | 13 Missing ⚠️ |
| smartsim/settings/dispatch.py | 38.88% | 11 Missing ⚠️ |
| smartsim/experiment.py | 33.33% | 10 Missing ⚠️ |
| smartsim/_core/utils/helpers.py | 27.27% | 8 Missing ⚠️ |
| smartsim/_core/launcher/dragon/dragonLauncher.py | 22.22% | 7 Missing ⚠️ |
| smartsim/_core/control/jobmanager.py | 50.00% | 1 Missing ⚠️ |

Additional details and impacted files

@@                 Coverage Diff                  @@
##             smartsim-refactor     #655   +/-   ##
====================================================
  Coverage                     ?   34.15%           
====================================================
  Files                        ?      109           
  Lines                        ?     6611           
  Branches                     ?        0           
====================================================
  Hits                         ?     2258           
  Misses                       ?     4353           
  Partials                     ?        0

| Files with missing lines | Coverage Δ |
|---|---|
| smartsim/_core/launcher/dragon/dragonBackend.py | `0.00% <ø> (ø)` |
| smartsim/error/errors.py | `60.00% <100.00%> (ø)` |
| smartsim/status.py | `100.00% <100.00%> (ø)` |
| smartsim/_core/control/jobmanager.py | `23.22% <50.00%> (ø)` |
| smartsim/_core/launcher/dragon/dragonLauncher.py | `25.78% <22.22%> (ø)` |
| smartsim/_core/utils/helpers.py | `32.71% <27.27%> (ø)` |
| smartsim/experiment.py | `38.67% <33.33%> (ø)` |
| smartsim/settings/dispatch.py | `64.06% <38.88%> (ø)` |
| smartsim/_core/control/launch_history.py | `43.47% <43.47%> (ø)` |

MattToast added 2 commits August 2, 2024 10:45

Experiment can query launchers for statuses

e27eff5

Add tests and make them pass

c82f713

MattToast added the type: feature, area: launcher, area: api, and ignore-for-release labels Aug 2, 2024

MattToast requested review from mellis13, amandarichardsonn and juliaputko August 2, 2024 17:50

MattToast self-assigned this Aug 2, 2024

MattToast commented Aug 2, 2024

MattToast added 2 commits August 2, 2024 11:38

Add docstring to user facing method

877c5cc

Make 3.9 happy

84ceb7a

juliaputko approved these changes Aug 2, 2024

MattToast commented Aug 2, 2024

amandarichardsonn reviewed Aug 6, 2024

mellis13 reviewed Aug 6, 2024

MattToast added 3 commits August 8, 2024 18:02

Address first round of reveiwer feedback

2ed9203

Add new tests to CI

8919beb

Make 3.9 happy

ab61a1f

mellis13 reviewed Aug 8, 2024

mellis13 reviewed Aug 8, 2024

mellis13 approved these changes Aug 8, 2024

MattToast added 2 commits August 8, 2024 19:00

Moar docstrings

f9a9028

Typo

1b95711

MattToast requested a review from amandarichardsonn August 9, 2024 00:02

MattToast merged commit 77eaf4d into CrayLabs:smartsim-refactor Aug 9, 2024
22 of 35 checks passed

MattToast deleted the simple-status branch August 9, 2024 23:52
