Add eurocrops data module. #1869

favyen2 · 2024-02-08T17:59:12Z

It is based on NAIPChesapeakeDataModule which splits bounding box of dataset into 1/2 train, 1/4 val, and 1/4 test.

This may not be the best way to train an actual model. I think it is more natural to either split by country, or to randomly assign each large grid cell (e.g. 4096x4096 pixel) to train/val/test and then sample within those grid cells. But I wasn't sure how to split by country since VectorDataset automatically detects all the files, or to assign large grid cell since there's no sampler that can take multiple large bounding boxes and sample patches within them.

It is based on NAIPChesapeakeDataModule which splits bounding box of dataset into 1/2 train, 1/4 val, and 1/4 test. This may not be the best way to train an actual model.

yichiac · 2024-03-04T15:45:48Z

Hi @favyen2, could you use the new splitting function random_grid_cell_assignment similar to CDL #1889 and NCCM #1949 for EuroCrops? It would be more realistic to split datasets into more grids during training.

adamjstewart

Let's rename everything from eurocrops_sentinel2 to sentinel2_eurocrops

tests/conf/eurocrops_sentinel2.yaml

tests/data/eurocrops/data.py

tests/trainers/test_segmentation.py

torchgeo/datamodules/eurocrops.py

torchgeo/datasets/geo.py

…ocrops_datamodule

favyen2 · 2024-03-21T00:39:52Z

I am not able to get this to work without some changes from #1889 like setting Sentinel-2 test data size to 128 instead of 36, so I will wait for that to be merged first since that will make it easier.

adamjstewart · 2024-03-22T21:04:05Z

#1889 has now been merged, feel free to rebase and copy-n-paste whatever you want from that data module.

…ocrops_datamodule

tests/conf/sentinel2_eurocrops.yaml

torchgeo/datamodules/sentinel2_eurocrops.py

…ocrops_datamodule

Add eurocrops data module.

8732db2

It is based on NAIPChesapeakeDataModule which splits bounding box of dataset into 1/2 train, 1/4 val, and 1/4 test. This may not be the best way to train an actual model.

github-actions bot added datasets Geospatial or benchmark datasets testing Continuous integration testing datamodules PyTorch Lightning datamodules labels Feb 8, 2024

adamjstewart added this to the 0.6.0 milestone Feb 11, 2024

adamjstewart requested changes Mar 15, 2024

View reviewed changes

favyen2 added 2 commits March 20, 2024 15:51

misc fixes

7cd07ff

Merge branch 'main' of https://github.com/microsoft/torchgeo into eur…

b677fdb

…ocrops_datamodule

favyen2 added 7 commits March 27, 2024 14:20

various fixes per discussion

3aa11ea

Merge branch 'main' of https://github.com/microsoft/torchgeo into eur…

b5886f9

…ocrops_datamodule

update eurocrops test data

52f16c0

fix

73833cc

fix version added placement

fba0e47

fix failing test by forcing integrity test when checksum is requested

c0878df

Clarify SIZE setting in eurocrops test data

2e0bd6a

adamjstewart reviewed Mar 28, 2024

View reviewed changes

tests/conf/sentinel2_eurocrops.yaml Outdated Show resolved Hide resolved

torchgeo/datamodules/sentinel2_eurocrops.py Outdated Show resolved Hide resolved

torchgeo/datamodules/sentinel2_eurocrops.py Show resolved Hide resolved

favyen2 added 4 commits April 11, 2024 13:18

Merge branch 'main' of https://github.com/microsoft/torchgeo into eur…

723a66d

…ocrops_datamodule

fix currently remaining issues with eurocrops data module

c810e9f

fix style

09d83bf

more style fix

93c9940

adamjstewart closed this Apr 12, 2024

adamjstewart reopened this Apr 12, 2024

adamjstewart previously approved these changes Apr 12, 2024

View reviewed changes

adamjstewart enabled auto-merge (squash) April 12, 2024 12:00

Add documentation

d9636be

adamjstewart dismissed their stale review via d9636be April 12, 2024 12:08

github-actions bot added the documentation Improvements or additions to documentation label Apr 12, 2024

adamjstewart approved these changes Apr 12, 2024

View reviewed changes

adamjstewart mentioned this pull request Apr 12, 2024

Document how to properly setup Codecov Github Action for OpenSource repositories codecov/feedback#301

Open

adamjstewart closed this Apr 12, 2024

auto-merge was automatically disabled April 12, 2024 12:53
Pull request was closed

adamjstewart reopened this Apr 12, 2024

adamjstewart merged commit 83353b0 into microsoft:main Apr 12, 2024
38 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add eurocrops data module. #1869

Add eurocrops data module. #1869

favyen2 commented Feb 8, 2024

yichiac commented Mar 4, 2024 •

edited by adamjstewart

Loading

adamjstewart left a comment

favyen2 commented Mar 21, 2024

adamjstewart commented Mar 22, 2024

Add eurocrops data module. #1869

Add eurocrops data module. #1869

Conversation

favyen2 commented Feb 8, 2024

yichiac commented Mar 4, 2024 • edited by adamjstewart Loading

adamjstewart left a comment

Choose a reason for hiding this comment

favyen2 commented Mar 21, 2024

adamjstewart commented Mar 22, 2024

yichiac commented Mar 4, 2024 •

edited by adamjstewart

Loading