feat: add Grid validation function #495

danielolsen · 2021-06-04T19:10:12Z

Purpose

Since we are increasingly looking to construct Grid objects from datasets other than the usa_tamu CSVs, we increasingly need a check that the Grid objects we construct are internally consistent. This PR adds that check (closes #481). It also makes a couple of changes to some of the Western offshore wind buses we added, so that their voltages are consistent with their connected onshore voltages (the mismatch was found by running the tests).

What the code is doing

There is one main user-facing check_grid function, which calls nine lower-level functions that check the bullet points from #481. Each of these is fairly self-explanatory, with the exception of the connected components check, which requires a slight modification to the model immutables so that we know that the "USA" interconnect should really have three connected components. I don't see the interconnect_combinations parameter being used anywhere, so I think this will be okay, but maybe @rouille knows more.

Testing

Tests pass on all of our Grids.

Time estimate

15-30 minutes if you want to dig into all of the lower-level tests.

jenhagg · 2021-06-04T19:18:47Z

Should we add networkx to the setup.py as well?

BainanXia · 2021-06-04T19:19:29Z

Should we move the grid modification to a separate PR so that it is more trackable in the future? That said, these changes to bus voltages are very unlikely to cause any trouble, given that we created them.

powersimdata/input/check.py

danielolsen · 2021-06-04T19:26:42Z

Should we add networkx to the setup.py as well?

Good catch, yes

Should we move the grid modification to a separate PR so that it is more trackable in the future? That said, these changes to bus voltages are very unlikely to cause any trouble, given that we created them.

If you think it is cleaner. It's already a distinct commit, and if we move it to a separate PR then we will need to merge that PR before this one, otherwise the tests in this one will fail.

powersimdata/network/usa_tamu/constants/zones.py

powersimdata/input/check.py

BainanXia · 2021-06-04T19:53:13Z

powersimdata/input/check.py

+    """
+    g = nx.from_pandas_edgelist(grid.branch, "from_bus_id", "to_bus_id")
+    num_connected_components = len([c for c in nx.connected_components(g)])
+    if len(grid.interconnect) == 1:


Maybe I need to refresh my memory here: the only situation where we have len(grid.interconnect) > 1 is TexasWestern, right? If TexasWestern is handled in the else block, what does the Texas_Western entry in interconnect_combinations do?

Good question. I don't know if/when/how interconnect_combinations was used prior to this PR. We can init a Grid(["Western", "Eastern"]) and then grid.interconnect == ["Western", "Eastern"].
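For anyone following the connected-components discussion, here is a minimal, dependency-free sketch of the counting idea. The diff above uses nx.from_pandas_edgelist and nx.connected_components; this sketch swaps in a small union-find over a plain list of (from_bus_id, to_bus_id) pairs instead. The function name count_connected_components and the toy bus IDs are hypothetical and are not part of powersimdata.

```python
def count_connected_components(edges):
    """Count connected components among the buses that appear in `edges`.

    `edges` is an iterable of (from_bus_id, to_bus_id) pairs. As with a graph
    built only from an edge list, buses with no branches are not counted.
    """
    parent = {}

    def find(node):
        # Walk up to the root, shortening the path as we go.
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]
            node = parent[node]
        return node

    for from_bus, to_bus in edges:
        root_a, root_b = find(from_bus), find(to_bus)
        if root_a != root_b:
            # Merge the two components.
            parent[root_a] = root_b

    # Each distinct root corresponds to one connected component.
    return len({find(node) for node in list(parent)})


# Toy data (hypothetical): two separate islands, {1, 2, 3} and {10, 11}.
toy_branches = [(1, 2), (2, 3), (10, 11)]
print(count_connected_components(toy_branches))
```

The count can then be compared against the number of components expected for the interconnect(s) in question, e.g. three for "USA" as described in the PR summary.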

powersimdata/input/tests/test_check.py

BainanXia · 2021-06-04T19:57:00Z

Should we add networkx to the setup.py as well?

Good catch, yes

Should we move the grid modification to a separate PR so that it is more trackable in the future? That said, these changes to bus voltages are very unlikely to cause any trouble, given that we created them.

If you think it is cleaner. It's already a distinct commit, and if we move it to a separate PR then we will need to merge that PR before this one, otherwise the tests in this one will fail.

Either way. I don't have a strong preference. What do you think, @rouille?

BainanXia · 2021-06-04T20:14:58Z

Given that this feature could be useful in many situations, especially when we are adopting a new model, do you think it is worthwhile to make all the checks run regardless of whether a previous one fails, so that the user is made aware of all the problems in one shot?

danielolsen · 2021-06-04T20:19:23Z

Given that this feature could be useful in many situations, especially when we are adopting a new model, do you think it is worthwhile to make all the checks run regardless of whether a previous one fails, so that the user is made aware of all the problems in one shot?

In this case, would we need to refactor check_grid so that it collects the exception from each test, then raises one mega-exception at the end?

BainanXia · 2021-06-04T20:24:36Z

Given that this feature could be useful in many situations, especially when we are adopting a new model, do you think it is worthwhile to make all the checks run regardless of whether a previous one fails, so that the user is made aware of all the problems in one shot?

In this case, would we need to refactor check_grid so that it collects the exception from each test, then raises one mega-exception at the end?

That's what I could think of. Maybe there are better solutions @jon-hagg @rouille ?

jenhagg · 2021-06-04T20:30:32Z

Given that this feature could be useful in many situations, especially when we are adopting a new model, do you think it is worthwhile to make all the checks run regardless of whether a previous one fails, so that the user is made aware of all the problems in one shot?

In this case, would we need to refactor check_grid so that it collects the exception from each test, then raises one mega-exception at the end?

That's what I could think of. Maybe there are better solutions @jon-hagg @rouille ?

What I'd do is have each function return a list of error messages and collect those, then raise an exception from the top level _check_grid if there are any errors. Or maybe just print them - no need to raise an error unless it's intended to be used to interrupt control flow

danielolsen · 2021-06-04T20:37:52Z

Given that this feature could be useful in many situations, especially when we are adopting a new model, do you think it is worthwhile to make all the checks run regardless of whether a previous one fails, so that the user is made aware of all the problems in one shot?

In this case, would we need to refactor check_grid so that it collects the exception from each test, then raises one mega-exception at the end?

That's what I could think of. Maybe there are better solutions @jon-hagg @rouille ?

What I'd do is have each function return a list of error messages and collect those, then raise an exception from the top level _check_grid if there are any errors. Or maybe just print them - no need to raise an error unless it's intended to be used to interrupt control flow

Good call. We don't need to raise exceptions in the lower-level functions; they can return either a string or None, and then we can combine any returned strings as necessary to generate the final exception. I think the high-level function should interrupt control flow here if there is a problem: grid integrity is important.

danielolsen · 2021-06-04T20:53:57Z

Mega-Exception logic is added. We get:

ValueError: Problem(s) found with grid: islanded buses detected: {2090023}. indices for bus and bus2sub don't match.

or

ValueError: Problem(s) found with grid: buses present in transmission network but missing from bus table: {2090023}. indices for bus and bus2sub don't match. branch(es) connected across multiple interconnections: Int64Index([104191], dtype='int64', name='branch_id'). line(s) connected across multiple voltages: Int64Index([104191], dtype='int64', name='branch_id').

etc. when we muck with the usa_tamu data; otherwise all tests pass.
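As a rough illustration of the collect-then-raise pattern discussed above (lower-level checks return a string or None, and the top-level function raises a single ValueError), here is a minimal sketch. It is not the code merged in this PR: the function names, the two toy checks, and the use of plain lists in place of Grid data frames are all assumptions made for the example, and the message format only loosely mirrors the output shown above.

```python
def check_no_islanded_buses(bus_ids, branches):
    """Return an error string if any bus has no branch attached, else None."""
    connected = {bus for branch in branches for bus in branch}
    islanded = set(bus_ids) - connected
    if islanded:
        return f"islanded buses detected: {sorted(islanded)}"
    return None


def check_branch_buses_exist(bus_ids, branches):
    """Return an error string if a branch references an unknown bus, else None."""
    connected = {bus for branch in branches for bus in branch}
    missing = connected - set(bus_ids)
    if missing:
        return (
            "buses present in transmission network but missing from bus table: "
            f"{sorted(missing)}"
        )
    return None


def check_toy_grid(bus_ids, branches):
    """Run every check, then raise one ValueError listing all problems found."""
    checks = [check_no_islanded_buses, check_branch_buses_exist]
    results = [check(bus_ids, branches) for check in checks]
    errors = [message for message in results if message is not None]
    if errors:
        raise ValueError("Problem(s) found with grid: " + ". ".join(errors) + ".")


# Hypothetical toy data: bus 3 is islanded, and bus 4 is not in the bus table.
try:
    check_toy_grid(bus_ids=[1, 2, 3], branches=[(1, 2), (2, 4)])
except ValueError as error:
    print(error)
```

Because every check runs before anything is raised, the user sees all problems in one shot, while the top-level function still interrupts control flow when the grid is inconsistent.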

powersimdata/input/check.py

BainanXia

Very nice. This could be very useful. Thanks for your patience and responsiveness.

danielolsen · 2021-06-05T01:08:23Z

One thing that was not done here, that probably should be: checking for null values in all of the data frames.

danielolsen requested review from rouille, BainanXia, jenhagg and ahurli June 4, 2021 19:10

danielolsen self-assigned this Jun 4, 2021

jenhagg reviewed Jun 4, 2021

powersimdata/input/check.py Outdated

BainanXia reviewed Jun 4, 2021

powersimdata/input/check.py

BainanXia reviewed Jun 4, 2021

powersimdata/input/check.py Outdated

jenhagg reviewed Jun 4, 2021

powersimdata/input/check.py Outdated

BainanXia reviewed Jun 4, 2021

powersimdata/input/check.py Outdated

rouille reviewed Jun 4, 2021

powersimdata/network/usa_tamu/constants/zones.py Outdated

rouille reviewed Jun 4, 2021

powersimdata/input/check.py Outdated

BainanXia reviewed Jun 4, 2021

powersimdata/input/check.py Outdated

jenhagg reviewed Jun 4, 2021

powersimdata/input/check.py Outdated

BainanXia reviewed Jun 4, 2021

powersimdata/input/check.py Outdated

BainanXia reviewed Jun 4, 2021

powersimdata/input/check.py Outdated

BainanXia reviewed Jun 4, 2021

powersimdata/input/check.py Outdated

BainanXia reviewed Jun 4, 2021

powersimdata/input/tests/test_check.py

danielolsen force-pushed the daniel/grid_check branch from 4515013 to a87fedd Compare June 4, 2021 20:16

danielolsen force-pushed the daniel/grid_check branch from a87fedd to 0e812eb Compare June 4, 2021 20:21

danielolsen force-pushed the daniel/grid_check branch 2 times, most recently from c2251df to cbd493a Compare June 4, 2021 20:27

danielolsen force-pushed the daniel/grid_check branch from cbd493a to b202638 Compare June 4, 2021 20:36

BainanXia reviewed Jun 4, 2021

powersimdata/input/check.py Outdated

danielolsen force-pushed the daniel/grid_check branch from edc3c3e to cee4f92 Compare June 4, 2021 21:16

ahurli reviewed Jun 4, 2021

powersimdata/input/check.py

powersimdata/input/check.py Outdated

powersimdata/input/check.py Outdated

danielolsen force-pushed the daniel/grid_check branch 4 times, most recently from 9a608a8 to 2b41df1 Compare June 4, 2021 22:39

BainanXia approved these changes Jun 4, 2021

danielolsen force-pushed the daniel/grid_check branch 2 times, most recently from 7768262 to 0b9ad99 Compare June 4, 2021 23:02

danielolsen added 2 commits June 4, 2021 16:07

chore: add networkx dependency

121d93a

chore: change type of interconnect_combinations from set to dict to s…

2f5f5ce

…erve check_grid

danielolsen force-pushed the daniel/grid_check branch from 0b9ad99 to 60a022e Compare June 4, 2021 23:07

danielolsen added 3 commits June 4, 2021 16:08

feat: add user-facing check_grid function and lower-level sub-functions

7d80ed6

test: add tests for grid check

bdaae0e

data: change three Western offshore bus voltages to satisfy grid check

bb8733c

danielolsen force-pushed the daniel/grid_check branch from 60a022e to bb8733c Compare June 4, 2021 23:08

danielolsen merged commit 442b58b into develop Jun 4, 2021

danielolsen deleted the daniel/grid_check branch June 4, 2021 23:12

danielolsen mentioned this pull request Jun 5, 2021

feat: create Grid from Switch results Breakthrough-Energy/SwitchWrapper#97

Merged

This was referenced Jun 7, 2021

More error handling for grid validation #496

Merged

chore: merge develop into master for v0.4.2 release #498

Merged
