A 7T fMRI dataset of synthetic images for out-of-distribution modeling of vision

Here we provide the code to reproduce all results from the paper:
"A 7T fMRI dataset of synthetic images for out-of-distribution modeling of vision".
Alessandro T. Gifford, Radoslaw M. Cichy, Thomas Naselaris, Kendrick Kay

📄 Paper abstract

Large-scale visual neural datasets such as the Natural Scenes Dataset (NSD) are boosting NeuroAI research by enabling computational models of the brain with performances beyond what was possible just a decade ago. However, these datasets lack out-of-distribution (OOD) components, which are crucial for the development of more robust models. Here, we address this limitation by releasing NSD-synthetic, a dataset consisting of 7T fMRI responses from the eight NSD subjects for 284 carefully controlled synthetic images. We show that NSD-synthetic’s fMRI responses reliably encode stimulus-related information and are OOD with respect to NSD. Furthermore, OOD generalization tests on NSD-synthetic reveal differences between models of the brain that are not detected with NSD—specifically, self-supervised deep neural networks better explain neural responses than their task-supervised counterparts. These results showcase how NSD-synthetic enables OOD generalization tests that facilitate the development of more robust models of visual processing, and the formulation of more accurate theories of human vision.

♻️ Reproducibility

🧰 Data

The NSD dataset (including NSD-synthetic) is freely available at http://naturalscenesdataset.org.

⚙️ Installation

To reproduce the paper's results, you can download and run the Python code from this repository. To run this code, you will first need to install the libraries in the requirements.txt). We recommend installing these libraries within a virtual environment (e.g., an Anaconda) environment) using:

pip install -r requirements.txt

📦 Code description

00_prepare_fmri: Prepare NSD-synthetic and NSD-core's fMRI responses for the following analyses.
paper_figure_2: Analyse NSD-synthetic's univariate and multivariate fMRI responses, and noise ceiling signal-to-noise ratio (ncsnr).
paper_figure_3: Perform multidimensional scaling (MDS) on NSD-synthetic and NSD-core's fMRI responses.
paper_figure_4: Train encoding model on NSD-core, and test them both in-distribution (NSD-core) and out-of-distribution (NSD-synthetic).
paper_figure_5: Compare diffent encoding models based on their in-distribution (NSD-core) and out-of-distribution (NSD-synthetic) performances.

🧠 Flattened cortical surface plots

In Figures 2, 4, and 5, we plotted results on flattened cortical surfaces using pycortex' fsaverage subject.

For visualization purposes, we manually drew surface labels based on the “streams” ROI collection as provided in the NSD data release. To use these labels, please add the overlays.svg file to the pycortex fsaverage subject folder (within an Anaconda environment, you should find this folder at: ../anaconda3/envs/env_name/share/pycortex/db/fsaverage)

❗ Issues

If you experience problems with the code, please get in touch with Ale ([email protected]), or submit an issue.

📜 Citation

If you use any of our data or code, please cite:

Gifford AT, Cichy RM, Naselaris T, Kay K. 2025. A 7T fMRI dataset of synthetic images for out-of-distribution modeling of vision. arXiv preprint, arXiv:2503.06286. DOI: https://doi.org/10.48550/arXiv.2503.06286

Allen EJ, St-Yves G, Wu Y, Breedlove JL, Prince JS, Dowdle LT, Nau M, Caron B, Pestilli F, Charest I, Hutchinson BJ, Naselaris T, Kay K. 2022. A massive 7T fMRI dataset to bridge cognitive neuroscience and artificial intelligence. Nature neuroscience, 25(1), 116-126. DOI: https://doi.org/10.1038/s41593-021-00962-x

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
00_prepare_fmri		00_prepare_fmri
paper_figure_2		paper_figure_2
paper_figure_3		paper_figure_3
paper_figure_4		paper_figure_4
paper_figure_5		paper_figure_5
pycortex_stream_labels		pycortex_stream_labels
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A 7T fMRI dataset of synthetic images for out-of-distribution modeling of vision

📄 Paper abstract

♻️ Reproducibility

🧰 Data

⚙️ Installation

📦 Code description

🧠 Flattened cortical surface plots

❗ Issues

📜 Citation

About

Releases

Packages

Languages

License

gifale95/NSD-synthetic

Folders and files

Latest commit

History

Repository files navigation

A 7T fMRI dataset of synthetic images for out-of-distribution modeling of vision

📄 Paper abstract

♻️ Reproducibility

🧰 Data

⚙️ Installation

📦 Code description

🧠 Flattened cortical surface plots

❗ Issues

📜 Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages