refactor: reduce memory requirements for mesh branch scaling #132
Purpose
Reduce memory requirements for using the design_transmission module and, as a side benefit, simplify the code.
What is the code doing
Previously, we needed enough memory to hold a congestion dataframe five times over: we loaded CONGU, we loaded CONGL, we created numpy array versions of both, and we created a new dataframe of the same size to hold the element-wise maximum of the two. We did a 'clean-up' of CONGU and CONGL afterwards, but that does nothing to reduce peak memory usage.
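For illustration, a minimal sketch of the old pattern with toy data; the names congu, congl, and cong_abs are stand-ins, not the module's actual identifiers:

```python
import numpy as np
import pandas as pd

# Toy stand-ins for the congestion data; element-wise, at most one of
# CONGU and CONGL is non-zero.
congu = pd.DataFrame({"branch1": [0.5, 0.0, 0.0], "branch2": [0.0, 0.0, 1.2]})
congl = pd.DataFrame({"branch1": [0.0, 0.3, 0.0], "branch2": [0.0, 0.7, 0.0]})

# Old pattern: dense numpy copies of both frames, plus a third full-size
# frame for the element-wise maximum -- roughly five copies live at peak.
cong_abs = pd.DataFrame(
    np.maximum(congu.to_numpy(), congl.to_numpy()),
    index=congu.index,
    columns=congu.columns,
)
del congu, congl  # the clean-up only frees memory after the peak has passed
```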
Instead, we can exploit the fact that, element-wise, at most one of CONGU and CONGL is non-zero: we can simply add them rather than performing a type conversion, a numpy element-wise maximization, and a copy into a new dataframe. I think this new approach only requires enough memory to hold a congestion dataframe two or three times, depending on the internals of pandas DataFrame addition.
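The new pattern, on the same toy data (again a sketch, not the exact module code):

```python
import pandas as pd

# Toy data as above; element-wise, at most one value is non-zero.
congu = pd.DataFrame({"branch1": [0.5, 0.0, 0.0], "branch2": [0.0, 0.0, 1.2]})
congl = pd.DataFrame({"branch1": [0.0, 0.3, 0.0], "branch2": [0.0, 0.7, 0.0]})

# Because the non-zero entries never overlap, the sum equals the
# element-wise maximum: no numpy round-trip, no dense intermediates.
cong_abs = congu + congl
```

Note that addition is a valid substitute for the maximum only because the two sets of non-zero entries never overlap; if they could overlap, an explicit element-wise maximum would still be needed.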
Perhaps most importantly, the function now makes use of sparse dataframes as introduced in Breakthrough-Energy/PostREISE#96: we no longer expand them into dense numpy arrays. Even when starting with sparse dataframes, the previous code raised a MemoryError on our laptops, while the new code runs without a problem.
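To illustrate why this matters: arithmetic between two pandas frames with sparse columns generally keeps the sparse representation, whereas to_numpy() densifies everything. A sketch, assuming the congestion frames use pandas' sparse dtype with a fill value of zero:

```python
import pandas as pd

# Toy data as above, stored with pandas' sparse dtype (fill value zero).
sparse_zero = pd.SparseDtype("float64", fill_value=0.0)
congu = pd.DataFrame(
    {"branch1": [0.5, 0.0, 0.0], "branch2": [0.0, 0.0, 1.2]}
).astype(sparse_zero)
congl = pd.DataFrame(
    {"branch1": [0.0, 0.3, 0.0], "branch2": [0.0, 0.7, 0.0]}
).astype(sparse_zero)

# Adding two sparse frames keeps the sparse representation, so peak memory
# scales with the number of non-zero entries rather than the full shape.
cong_abs = congu + congl
print(cong_abs.dtypes)          # Sparse[float64, 0.0] columns
print(cong_abs.sparse.density)  # fraction of stored (non-fill) values
```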
Time Estimate
Half an hour or less.