Add plot method and data.py to NAIP #407
It looks like this is missing the data.py file, and the tests are not passing with the new test NAIP data.
Must be something with the transform that's causing the file size to blow up to PBs?
We seem to still be allocating a ~1 PB numpy array somewhere, causing the tests to fail.
Has anyone figured out what is going on with this giant numpy array?
Could it have to do with the size of the bounds that is passed to index the dataset?
That has to be it, but I don't see anything fishy about the data. Maybe try printing the data length and dataset.bounds?
P.S. To see the output of a print statement in a test file, you'll need to use the -s flag: $ pytest -s tests/datasets/test_geo.py::TestRasterDataset::test_getitem_single_file
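To illustrate why the size of the bounds matters here: the array needed for a raster query grows with the extent of the bounds divided by the resolution, in both dimensions. The following is only a rough sketch of that arithmetic, not torchgeo's actual code. The helper name `estimated_bytes`, the 4-band/1-byte defaults, and the example extents are all assumptions chosen for illustration.

```python
import math


def estimated_bytes(minx, maxx, miny, maxy, res, bands=4, itemsize=1):
    """Rough size in bytes of the array needed for a query window.

    Hypothetical helper: bands and itemsize are assumed values,
    not read from any real dataset.
    """
    width = math.ceil((maxx - minx) / res)
    height = math.ceil((maxy - miny) / res)
    return width * height * bands * itemsize


# A 100 x 100 unit window at resolution 1 is tiny.
small = estimated_bytes(0, 100, 0, 100, res=1)

# The same resolution with bounds spanning 1.6e7 units per side
# already needs on the order of a petabyte.
huge = estimated_bytes(0, 1.6e7, 0, 1.6e7, res=1)

print(small, huge)
```

Printing `dataset.bounds` next to an estimate like this makes it easy to spot bounds that are far larger than the test data should produce.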
When running pytest on my Ubuntu machine, I get:
It's not the test_naip.py tests that are failing, it's the test_geo.py tests.
If I uncomment the hardcoded crs of
The new test failure is because our NAIP data and Chesapeake data have no overlap.
Just for my understanding: The error I am seeing is that for the I am not sure how to best proceed. If one now tunes the NAIP
Correct, there's a bug in rtree where an empty index actually has invalid bounds. That's the error you're seeing. If you print the length of the dataset, you'll find that it's zero. I think it's fine to make sure that the bounding box for NAIP and Chesapeake match. The actual location of these datasets on Earth doesn't matter, they just need to overlap. I don't envision a test being added that asserts that these particular datasets don't overlap.
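The overlap requirement can be sketched in plain Python. This is a minimal illustration under stated assumptions, not torchgeo's or rtree's implementation: the `Box` type, the `intersection` helper, and the coordinates are hypothetical. It shows why two datasets with disjoint bounds leave nothing to index, and why it helps to check for that case before using the resulting bounds.

```python
from typing import NamedTuple, Optional


class Box(NamedTuple):
    """Hypothetical 2D bounding box, for illustration only."""

    minx: float
    maxx: float
    miny: float
    maxy: float


def intersection(a: Box, b: Box) -> Optional[Box]:
    """Return the overlapping region of a and b, or None if they are disjoint."""
    minx, maxx = max(a.minx, b.minx), min(a.maxx, b.maxx)
    miny, maxy = max(a.miny, b.miny), min(a.maxy, b.maxy)
    if minx > maxx or miny > maxy:
        # No overlap: an intersection dataset built from these would be empty.
        return None
    return Box(minx, maxx, miny, maxy)


# Made-up coordinates standing in for the two test datasets.
naip = Box(0, 10, 0, 10)
chesapeake_far = Box(100, 110, 100, 110)
chesapeake_matched = Box(5, 15, 5, 15)

print(intersection(naip, chesapeake_far))      # disjoint bounds
print(intersection(naip, chesapeake_matched))  # overlapping bounds
```

Checking for `None` (or a dataset length of zero) up front avoids ever reading bounds from an empty index.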
* add plot method and data.py
* add version
* file typo
* forgot data.py
* add version change and larger image size
* requested changes
* test with print
* test geo
* change data to match chesapeake
* fix crs test
Since RasterDatasets should have their own plot method per #253, this PR adds a plot method as well as a data.py file to the NAIP dataset.