-
Notifications
You must be signed in to change notification settings - Fork 171
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] lack of recommendations for datafiles that differ only by extension #1487
Comments
If this type of data duplication is to be disallowed, it may be a good thing to:
For example, the following rendering may suggest that all 3 files can co-exist in the same dataset maybe better to have something like:
|
Okay, here's a proposal: photo:
suffixes:
- photo
extensions:
- [.jpg, .png, .tif]
datatypes:
- eeg
- ieeg
- meg
- nirs
entities:
subject: required
session: optional
acquisition: optional
photo__micr:
$ref: rules.files.raw.photo.photo
extensions:
- [.jpg, .png, .tif]
- .json
datatypes:
- micr
entities:
$ref: rules.files.raw.photo.photo.entities
sample: required Here, the extensions that are in a list together are "the same kind" and so mutually exclusive and distinguishable from supplementary entries, such as For NIfTI, we would do |
BUT... For EEG:
eeg:
suffixes:
- eeg
extensions:
- .json
- .edf
- .vhdr
- .vmrk
- .eeg
- .set
- .fdt
- .bdf
datatypes:
- eeg
entities:
subject: required
session: optional
task: required
acquisition: optional
run: optional |
I think we could do something like: extensions:
- .json
- [ .edf, .eeg, .set, .bdf ]
- .vhdr
- .vmrk
- .fdt And then just use a couple checks to say that if any of |
👍 and for:
(see: https://bids-specification.readthedocs.io/en/latest/modality-agnostic-files.html#scans-file) |
Describe your problem in detail.
Note that this issue may apply to more datatype in BIDS but I have not checked it systematically.
As far I can tell it is not mentioned in the specification that files cannot differ just by their extension.
For example, modifying the micr_SEM bids example to have 2 times the same data that differ only by extension:
From my current reading of the spec, this could be valid.
And also the bids validator does not complain about this: except from sayaing that not all subject have the same number of files.
I have mostly checked with picture files
*_photo.*
(eeg, meg, micr) but it also seems to be the case for eeg files:Am I missing something but maybe this type of potential data duplication should be disallowed?
Describe what you expected.
I would expect an error like for example in the case of
.nii
and.nii.gz
where the validator throws this error:BIDS specification section
No response
The text was updated successfully, but these errors were encountered: