-
Notifications
You must be signed in to change notification settings - Fork 391
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Radiant MLHub (REF) Cloud Cover Dataset #510
Add Radiant MLHub (REF) Cloud Cover Dataset #510
Conversation
Thanks for the contribution @KennSmithDS! I took a pass at reviewing. Other comments:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Still needs a data.py
script to generate the fake data. It also looks like some of the tests are failing. You'll need to rebase/merge on top of main to run some of the new required tests. Let me know if I can help with any of this.
..._detection_challenge_v1/ref_cloud_cover_detection_challenge_v1_test_labels 2/collection.json
Outdated
Show resolved
Hide resolved
@adamjstewart @calebrob6 thank you for the feedback and recommendations. I've rebased/merged the latest release with my local branch, and fixed the failing tests, as well as implemented the suggestions above. The only thing I still have questions on is the Is this requirement for contributing documented anywhere in the repository? I've been looking at a few different examples to get a sense for what it does, e.g.
Thanks again! |
The fact that it's needed is documented at https://torchgeo.readthedocs.io/en/latest/user/contributing.html#datasets. Some tips for writing the script can be found at https://github.com/microsoft/torchgeo/blob/main/tests/data/README.md For all of your other questions, the answer is that you should try to match the real data from your dataset.
Hope that answers your questions! |
@adamjstewart I had missed the README file nestled under |
@adamjstewart I'm assuming the answer to this question is yes, but to clarify, should the |
Yep! At some point I'm planning to convert these data.py files to pytest fixtures to generate all of our data at test time instead of storing it in the repo. So if the data.py generates those files too that would help a lot. |
datasets/cloud-cover branch diverged from main, and the base class VisionDataset is now deprecated, unable to make commit with new class name NonGeoDataset as the underlying class structures have changed, and are preventing the commit.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the data.py
, that looks like it wasn't easy to write!
Did a more detailed pass through the code and listed mostly minor things to change. I think this is pretty close to completion! Will likely have one last pass once these comments are addressed.
adding commas to train/test split sizes Co-authored-by: Adam J. Stewart <[email protected]>
Including total dataset size in api CSV, removing train/test split Co-authored-by: Adam J. Stewart <[email protected]>
Accepting carrots under text in datasets.rst, sorry forgot to add these in! Co-authored-by: Adam J. Stewart <[email protected]>
Co-authored-by: Adam J. Stewart <[email protected]>
Co-authored-by: Adam J. Stewart <[email protected]>
Co-authored-by: Adam J. Stewart <[email protected]>
71ec1e5
to
84ffdee
Compare
Rebased to fix coverage. Just need to see an example plot and this should be ready to merge. |
merging updates on main into local fork, contains pin to radiant-mlhub v0.4.x
merging updates from origin/main into datasets/cloud-cover
…o into datasets/cloud-cover
syncing fork with upstream
@adamjstewart sorry for delay, we've had some high priority projects past 1-2 weeks that have required my attention.
I just modified a single line of code in the I'll have to push 1 more commit with the slight modification to the |
|
Merging updates from main
Merging sync with fork into local feature branch
…o into datasets/cloud-cover Merging origin feature branch with local
* adding cloud cover dataset class * Adding Cloud Cover Detection Challenge dataset class and tests * fixed linter issues and passing pre-commit tests * resolving linter errors * addressing failed isort test * replacing deprecated VisionDataset with NonGeoDataset reference * removed NDArray import for quoted np.typing.NDArray * addressing mypy errors * updated docstring for plot method * Update docs/api/non_geo_datasets.csv adding commas to train/test split sizes Co-authored-by: Adam J. Stewart <[email protected]> * Update docs/api/non_geo_datasets.csv Including total dataset size in api CSV, removing train/test split Co-authored-by: Adam J. Stewart <[email protected]> * updated Cloud Cover Dataset name in dataset.rst * Update docs/api/datasets.rst Accepting carrots under text in datasets.rst, sorry forgot to add these in! Co-authored-by: Adam J. Stewart <[email protected]> * Update torchgeo/datasets/cloud_cover.py Co-authored-by: Adam J. Stewart <[email protected]> * Update torchgeo/datasets/cloud_cover.py Co-authored-by: Adam J. Stewart <[email protected]> * Update torchgeo/datasets/cloud_cover.py Co-authored-by: Adam J. Stewart <[email protected]> * flakefreaking8 * Fix duplicate rst link * adding cloud cover dataset class * Adding Cloud Cover Detection Challenge dataset class and tests * fixed linter issues and passing pre-commit tests * resolving linter errors * addressing failed isort test * replacing deprecated VisionDataset with NonGeoDataset reference * removed NDArray import for quoted np.typing.NDArray * addressing mypy errors * updated docstring for plot method * Update docs/api/non_geo_datasets.csv adding commas to train/test split sizes Co-authored-by: Adam J. Stewart <[email protected]> * Update docs/api/non_geo_datasets.csv Including total dataset size in api CSV, removing train/test split Co-authored-by: Adam J. Stewart <[email protected]> * updated Cloud Cover Dataset name in dataset.rst * Update docs/api/datasets.rst Accepting carrots under text in datasets.rst, sorry forgot to add these in! Co-authored-by: Adam J. Stewart <[email protected]> * Update torchgeo/datasets/cloud_cover.py Co-authored-by: Adam J. Stewart <[email protected]> * Update torchgeo/datasets/cloud_cover.py Co-authored-by: Adam J. Stewart <[email protected]> * Update torchgeo/datasets/cloud_cover.py Co-authored-by: Adam J. Stewart <[email protected]> * flakefreaking8 * Fix duplicate rst link * added normalization to plot method for better visual * CloudCoverDetection is a non-geo dataset Co-authored-by: Adam J. Stewart <[email protected]> Co-authored-by: Caleb Robinson <[email protected]>
* adding cloud cover dataset class * Adding Cloud Cover Detection Challenge dataset class and tests * fixed linter issues and passing pre-commit tests * resolving linter errors * addressing failed isort test * replacing deprecated VisionDataset with NonGeoDataset reference * removed NDArray import for quoted np.typing.NDArray * addressing mypy errors * updated docstring for plot method * Update docs/api/non_geo_datasets.csv adding commas to train/test split sizes Co-authored-by: Adam J. Stewart <[email protected]> * Update docs/api/non_geo_datasets.csv Including total dataset size in api CSV, removing train/test split Co-authored-by: Adam J. Stewart <[email protected]> * updated Cloud Cover Dataset name in dataset.rst * Update docs/api/datasets.rst Accepting carrots under text in datasets.rst, sorry forgot to add these in! Co-authored-by: Adam J. Stewart <[email protected]> * Update torchgeo/datasets/cloud_cover.py Co-authored-by: Adam J. Stewart <[email protected]> * Update torchgeo/datasets/cloud_cover.py Co-authored-by: Adam J. Stewart <[email protected]> * Update torchgeo/datasets/cloud_cover.py Co-authored-by: Adam J. Stewart <[email protected]> * flakefreaking8 * Fix duplicate rst link * adding cloud cover dataset class * Adding Cloud Cover Detection Challenge dataset class and tests * fixed linter issues and passing pre-commit tests * resolving linter errors * addressing failed isort test * replacing deprecated VisionDataset with NonGeoDataset reference * removed NDArray import for quoted np.typing.NDArray * addressing mypy errors * updated docstring for plot method * Update docs/api/non_geo_datasets.csv adding commas to train/test split sizes Co-authored-by: Adam J. Stewart <[email protected]> * Update docs/api/non_geo_datasets.csv Including total dataset size in api CSV, removing train/test split Co-authored-by: Adam J. Stewart <[email protected]> * updated Cloud Cover Dataset name in dataset.rst * Update docs/api/datasets.rst Accepting carrots under text in datasets.rst, sorry forgot to add these in! Co-authored-by: Adam J. Stewart <[email protected]> * Update torchgeo/datasets/cloud_cover.py Co-authored-by: Adam J. Stewart <[email protected]> * Update torchgeo/datasets/cloud_cover.py Co-authored-by: Adam J. Stewart <[email protected]> * Update torchgeo/datasets/cloud_cover.py Co-authored-by: Adam J. Stewart <[email protected]> * flakefreaking8 * Fix duplicate rst link * added normalization to plot method for better visual * CloudCoverDetection is a non-geo dataset Co-authored-by: Adam J. Stewart <[email protected]> Co-authored-by: Caleb Robinson <[email protected]>
This PR adds the Cloud Cover Detection Challenge dataset which was generated as part of a crowdsourcing competition as hosted on the Radiant MLHub.
Dataset features:
To Dos:
Resolve linter errorsdata.py
script