Factor out code for interacting with NREL API and add retry logic #114
Conversation
prereise/gather/request_util.py (Outdated)

```python
        return wrapper

    return decorator
```
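For context, the two `return` statements in the hunk above are the tail of the usual decorator-with-arguments pattern from the Real Python primer. A minimal sketch of that shape (hypothetical names and a broad `except` for illustration - not the actual `request_util` implementation):

```python
import functools

def retry(max_attempts=3):
    """Hypothetical parameterized retry decorator (sketch only)."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            last_exc = None
            for _ in range(max_attempts):
                try:
                    return func(*args, **kwargs)
                except Exception as exc:  # real code would catch narrower types
                    last_exc = exc
            raise last_exc
        return wrapper  # decorator returns the wrapped function
    return decorator    # retry(...) returns the decorator itself

@retry(max_attempts=2)
def flaky():
    # uses a function attribute, as discussed below
    flaky.calls += 1
    if flaky.calls < 2:
        raise ValueError("transient")
    return "ok"

flaky.calls = 0
```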
I have been reading https://realpython.com/primer-on-python-decorators/, it was useful!
Agreed! This is new to me too.
Nice, just picked up some new things from there. TIL functions can have attributes -

```python
In [13]: def foo():
    ...:     pass
    ...: def bar(f):
    ...:     f.x += 1
    ...: foo.x = 0

In [14]: bar(foo)

In [15]: foo.x
Out[15]: 1
```
Everything is an object!
```python
def sleepless(monkeypatch):
    counter = SleepCounter()
    monkeypatch.setattr(time, "sleep", counter.sleep)
    monkeypatch.setattr(time, "time", counter.time)
```
I did not know about the `monkeypatch` fixture. Super useful.
That is very nice
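The same substitution the fixture does can be sketched outside pytest with the standard library's `unittest.mock`. The `SleepCounter` implementation below is a hypothetical stand-in (the real one isn't shown in this hunk), but it illustrates the idea: swap out `time.sleep` and `time.time` so tests never actually block.

```python
import time
from unittest import mock

class SleepCounter:
    """Hypothetical stand-in for the SleepCounter used in the tests:
    records requested sleeps on a fake clock instead of blocking."""
    def __init__(self):
        self.elapsed = 0.0

    def sleep(self, seconds):
        self.elapsed += seconds  # advance the fake clock, never block

    def time(self):
        return self.elapsed

counter = SleepCounter()
with mock.patch.object(time, "sleep", counter.sleep), \
     mock.patch.object(time, "time", counter.time):
    time.sleep(1.5)    # returns instantly
    now = time.time()  # reads the fake clock
```

pytest's `monkeypatch.setattr` does the equivalent attribute swap and undoes it automatically when the test finishes.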
```python
@retry(interval=self.interval)
def _get_info(url):
    return pd.read_csv(url, nrows=1)

@retry(interval=self.interval)
def _get_data(url):
    return pd.read_csv(url, dtype=float, skiprows=2)

info = _get_info(url)
tz, elevation = info["Local Time Zone"], info["Elevation"]

data_resource = _get_data(url)
```
If rate limiting is an issue, does it make sense to combine `_get_data` and `_get_info` into one HTTP call and then parse the result into the data and info? I'm assuming each `pd.read_csv` is an HTTP call without any caching.
Good question - I tried doing this initially, but the data we get back is basically two different CSV files stacked, so it can't be parsed directly. We'd probably have to use an HTTP library to get the raw content in one call and then handle separating it; I wasn't sure it was worth it at the moment. Something else I just noticed: we create a different `RateLimit` instance in each decorated function. That seems to work, but since it's the same URL it would be nice to share the instance. I'll look into this.
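For what it's worth, splitting the stacked response could be done in memory with `io.StringIO` once the raw text is fetched. A sketch under the layout implied by the `nrows=1` / `skiprows=2` reads (one metadata header line plus one value line, then the data table); the string here is synthetic stand-in data, not a real NREL response:

```python
import io
import pandas as pd

# Synthetic response mimicking the stacked layout: a one-row metadata CSV
# (2 lines) sitting on top of the data CSV. Column names are illustrative.
raw = (
    "Local Time Zone,Elevation\n"
    "-7,1609\n"
    "GHI,DNI\n"
    "100.0,200.0\n"
    "110.0,210.0\n"
)

lines = raw.splitlines(keepends=True)
# First stacked CSV: the metadata (what _get_info reads with nrows=1)
info = pd.read_csv(io.StringIO("".join(lines[:2])))
# Second stacked CSV: the data (what _get_data reads with skiprows=2)
data = pd.read_csv(io.StringIO("".join(lines[2:])), dtype=float)
```

One HTTP fetch, two parses - at the cost of managing the raw request ourselves.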
Let me know if I can help! I've spent a lot of time making my own HTTP calls to various APIs (though I usually used `requests` instead of the built-in `urllib` in Python 3). Going that route will probably make it a little easier to catch error code 429 as well, so we can have `retry` only allow a custom exception like `HTTPError429` and not retry when we see something like a 403 from an incorrect API key.
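That exception-whitelist idea could look something like this - a sketch only, using the hypothetical `HTTPError429` name from the comment, not the actual `request_util.retry` signature:

```python
class HTTPError429(Exception):
    """Hypothetical exception for a 429 Too Many Requests response."""

def retry(max_attempts=3, retry_on=(HTTPError429,)):
    """Sketch: retry only the exception types listed in retry_on;
    anything else (e.g. a 403 from a bad API key) propagates immediately."""
    def decorator(func):
        def wrapper(*args, **kwargs):
            for attempt in range(max_attempts):
                try:
                    return func(*args, **kwargs)
                except retry_on:
                    if attempt == max_attempts - 1:
                        raise  # out of attempts, re-raise the rate-limit error
        return wrapper
    return decorator

calls = {"n": 0}

@retry(max_attempts=3)
def fetch():
    calls["n"] += 1
    if calls["n"] < 3:
        raise HTTPError429()
    return "data"
```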
I guess you guys can work on this in a follow-up PR.
Sounds good - let's meet up at some point and figure out the design. I think there are some cool options we could play around with.
Purpose
Create a semi-reusable module for downloading the solar data - nrel_api.py - which is used as part of calculating power output in sam.py and naive.py. As part of doing this, we add retry and rate limiting to make downloads more reliable.

What it does
- Adds Psm3Data, which acts as a container for the responses
- Adds request_util.RateLimit
- Adds request_util.retry
Initially I had the NREL API client using rate limiting directly, but that didn't account for handling failures: when we call it in a loop and a call fails, we have to start over. One way around this is combining the rate limit and retry - we retry up to a fixed number of failures at each iteration, but space the attempts out using a reasonable rate limit (determined via experiment). This makes the loop very likely to finish, and it lets us tune the rate, max retry count, etc. for a specific use case (API calls, or anything that can fail intermittently).
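The retry-inside-a-rate-limited-loop idea described above can be sketched as follows. All names here are hypothetical, not the actual request_util API; the real module composes a RateLimit class with the retry decorator rather than one function:

```python
import time

def rate_limited_retry(func, items, interval=0.5, max_attempts=3):
    """Sketch: call func on each item, retrying each item up to
    max_attempts times, with all attempts spaced at least `interval`
    seconds apart so failures don't restart the whole loop."""
    results = []
    for item in items:
        for attempt in range(max_attempts):
            start = time.time()
            try:
                results.append(func(item))
                break  # this item succeeded, move to the next
            except Exception:
                if attempt == max_attempts - 1:
                    raise  # out of attempts for this item
            finally:
                # enforce the rate limit between calls, including retries
                remaining = interval - (time.time() - start)
                if remaining > 0:
                    time.sleep(remaining)
    return results
```

Because each failed attempt costs only one rate-limit interval (instead of restarting the whole download), the loop is much more likely to complete.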
Testing
There are unit tests for the retry and rate limit logic. For the remaining changes, I mostly used the notebook to make sure things still look right.
Time to review
20-30 min