Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multiple DOIs and citations in a single dataset #211

Closed
simonff opened this issue Aug 29, 2018 · 7 comments
Closed

Multiple DOIs and citations in a single dataset #211

simonff opened this issue Aug 29, 2018 · 7 comments

Comments

@simonff
Copy link

simonff commented Aug 29, 2018

There can be datasets with multiple citations (eg, https://developers.google.com/earth-engine/datasets/catalog/CIESIN_GPWv4_ancillary-data-grids). This can happen if one aggregated dataset has data from several smaller independent datasets. Ditto for DOIs - EE already has an example where each band in a single image comes from a different provider dataset, hence the EE dataset has multiple DOIs.

@m-mohr
Copy link
Collaborator

m-mohr commented Aug 29, 2018

Could these smaller independent datasets described on their own with a sub-dataset? Then you can assign individual DOIs for each sub-dataset. Doesn't work for bands though, but find that hard to model so that the relationship between DOI and band is clear.

@simonff
Copy link
Author

simonff commented Aug 29, 2018

Yes, but by the time we look at them they may no longer be divisible into subdatasets. Example - two bands in an image where each band came from a separate dataset with a separate DOI.

@m-mohr
Copy link
Collaborator

m-mohr commented Aug 30, 2018

But then, how would you tell a user which DOI / citation he needs to use? I think it should be clear where the DOI and citation belongs to otherwise listing them seems not very useful.

@simonff
Copy link
Author

simonff commented Aug 31, 2018

If a raster image has two bands with two different DOIs, or a dataset has two images with different source DOIs, and the user uses both, they need to cite both DOIs.

Example: https://data.csiro.au/dap/landingpage?pid=csiro:9989&v=5&d=true and https://data.csiro.au/dap/landingpage?pid=csiro:11467&v=2&d=true are combined into one dataset in EE.

Allowing DOIs in individual items and bands might help here.

I think I have seen a case where different parts of the same final image had different DOIs (eg, data for different Australian states), but I can't remember where.

@m-mohr
Copy link
Collaborator

m-mohr commented Aug 31, 2018

I think we basically have three options:

  1. Allow the scientific extension to be applied to Datasets, Items, the Band Object and potentially more places. That sounds a little messy and prone to errors, but very precise.
  2. Just specify the DOIs and citations in an array in the dataset (same schema used as in publicatio_doi / publication_citation). That's imprecise as it's unclear where exactly they belong to, but very easy and clear to implement.
  3. Ignore this issue and let providers come up with a single DOI/citation (probably the worst option).

So what is preferred by all of you? Maybe again something we could also ask @edzer about.

@cholmes
Copy link
Contributor

cholmes commented Aug 31, 2018

The first option sounds outside of what I see as the main philosophy, which is make a spec that is easy to implement and not super complicated, even if it can't fully represent everything...

So I'd lean more towards like number 3 for the core, and encourage an extension for people who have use cases that don't fit.

@m-mohr
Copy link
Collaborator

m-mohr commented Oct 12, 2018

I think this is mostly solved by #260 (commit aab3f40), which changed the scientific extension to be in the Item scope. This means you can add more fine-grained and in the end basically multiple DOIs into a dataset/collection. I'll close this for now, but feel free to re-open or comment if you feel this doesn't solve the problem sufficiently.

@m-mohr m-mohr closed this as completed Oct 12, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants