Remove reference to deleted constant and slight pandas refactor #365

jenhagg · 2020-12-23T19:50:12Z

Purpose

I was playing around with querying the execute list and noticed we still had a reference to server_setup.EXECUTE_LIST which no longer exists. Changed the exception to reflect that but also tweaked the query since it seems a little more readable.

Edit: based on feedback, I expanded the scope to set the index within CsvStore and took the opportunity to move query logic to the relevant data access classes. I think this simplifies the scenario class, making it less concerned with parsing the csv files.

What the code is doing

Nothing new

Testing

Loaded an existing scenario works, and if I delete the entry from the execute list I get the exception raised here.

Time estimate

3 min

rouille · 2020-12-23T20:05:27Z

powersimdata/scenario/scenario.py

@@ -98,14 +97,12 @@ def _set_status(self):
        :raises Exception: if scenario not found in execute list on server.
        """
        execute_table = self._execute_list_manager.get_execute_table()


It seems that the CsvStore class is dedicated to reading the ScenarioList/ExecuteList. So I guess the _parse_csv method could return a pandas.DataFrame with id as index by doing l.45:

table = pd.read_csv(file_object, index_col=0) or table = pd.read_csv(file_object, index_col="id")

That way we don't have to set_index later

rouille · 2020-12-23T23:20:05Z

powersimdata/data_access/csv_store.py

@@ -43,6 +43,7 @@ def _parse_csv(self, file_object):
        :return: (*pandas.DataFrame*) -- the specified file as a data frame.
        """
        table = pd.read_csv(file_object)
+        table.set_index("id", inplace=True, drop=False)


Any reason for not dropping the column?

I was trying to get it to work without, but the only thing we need it for is returning scenario info as an ordered dict. If we drop the column we'll get an error doing self.info["id"] within a scenario instance, since DataFrame.to_dict only includes columns when the kind is "records"

I see... We could drop it here and use reset_index() l. 142 of scenario_list.py:

return scenario.reset_index().to_dict("records", into=OrderedDict)[0]

I do like that a bit better, however it results in the id column being an int type, so at some point downstream it fails during string concatenation. I'll give this some more thought, probably next year.

What about return scenario.reset_index().astype({"id": "str"}).to_dict("records", into=OrderedDict)[0]?

And/or can the string concatenation include an extra str() wrap around the id, so that it has less strict assumptions about its input?

I added Ben's suggestion in the latest commit. Thinking the strict assumption about types is actually a good thing - makes things simpler for any code using the scenario object. Having it guaranteed to be an int would be fine too, just minimizing changes here. This is kinda reminiscent of Postel's law, which is basically be "flexible on inputs but not outputs" (my interpretation).

powersimdata/data_access/scenario_list.py

rouille

Looks good.

danielolsen · 2021-01-06T20:58:06Z

powersimdata/data_access/execute_list.py

+        :return: (*str*) -- scenario status
+        """
+        try:
+            table = self.get_execute_table()


Are we expecting a KeyError from this call, or would this be better outside the try?

danielolsen

Thanks

jenhagg requested a review from rouille December 23, 2020 19:50

jenhagg self-assigned this Dec 23, 2020

rouille reviewed Dec 23, 2020

View reviewed changes

powersimdata/data_access/scenario_list.py Outdated Show resolved Hide resolved

danielolsen reviewed Jan 4, 2021

View reviewed changes

powersimdata/data_access/scenario_list.py Outdated Show resolved Hide resolved

rouille approved these changes Jan 5, 2021

View reviewed changes

danielolsen reviewed Jan 6, 2021

View reviewed changes

danielolsen approved these changes Jan 6, 2021

View reviewed changes

Jon Hagg added 7 commits January 6, 2021 13:14

refactor: simplify query to get scenario status

f8592dc

refactor: move more query logic to data_access layer

16a8ee7

fix: preserve behavior for missing scenario

dd79c5d

chore: indent docstring

f80b92a

chore: mutually exclusive if/else, print duplicates for ease of use

79383c9

chore: drop id col and reset when needed

6149e15

chore: minimize try block

8e15daa

jenhagg force-pushed the jon/pd_index branch from c7a07c5 to 8e15daa Compare January 6, 2021 21:14

jenhagg merged commit 1296401 into develop Jan 6, 2021

jenhagg deleted the jon/pd_index branch January 6, 2021 21:16

jenhagg mentioned this pull request Jan 21, 2021

test: remove index column from test assertion #374

Merged

ahurli mentioned this pull request Mar 11, 2021

Develop into Master #410

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove reference to deleted constant and slight pandas refactor #365

Remove reference to deleted constant and slight pandas refactor #365

jenhagg commented Dec 23, 2020 •

edited

Loading

rouille Dec 23, 2020

rouille Dec 23, 2020

jenhagg Dec 23, 2020

rouille Dec 23, 2020

jenhagg Dec 24, 2020

rouille Dec 24, 2020

danielolsen Dec 24, 2020

jenhagg Jan 5, 2021

rouille left a comment

danielolsen Jan 6, 2021

danielolsen left a comment

Remove reference to deleted constant and slight pandas refactor #365

Remove reference to deleted constant and slight pandas refactor #365

Conversation

jenhagg commented Dec 23, 2020 • edited Loading

Purpose

What the code is doing

Testing

Time estimate

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rouille left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

danielolsen left a comment

Choose a reason for hiding this comment

jenhagg commented Dec 23, 2020 •

edited

Loading