Skip to content

Latest commit

 

History

History
61 lines (29 loc) · 4.71 KB

README.md

File metadata and controls

61 lines (29 loc) · 4.71 KB

A 7T fMRI dataset of synthetic images for out-of-distribution modeling of vision

Here we provide the code to reproduce all results from the paper:
"A 7T fMRI dataset of synthetic images for out-of-distribution modeling of vision".
Alessandro T. Gifford, Radoslaw M. Cichy, Thomas Naselaris, Kendrick Kay

📄 Paper abstract

Large-scale visual neural datasets such as the Natural Scenes Dataset (NSD) are boosting NeuroAI research by enabling computational models of the brain with performances beyond what was possible just a decade ago. However, these datasets lack out-of-distribution (OOD) components, which are crucial for the development of more robust models. Here, we address this limitation by releasing NSD-synthetic, a dataset consisting of 7T fMRI responses from the eight NSD subjects for 284 carefully controlled synthetic images. We show that NSD-synthetic’s fMRI responses reliably encode stimulus-related information and are OOD with respect to NSD. Furthermore, OOD generalization tests on NSD-synthetic reveal differences between models of the brain that are not detected with NSD—specifically, self-supervised deep neural networks better explain neural responses than their task-supervised counterparts. These results showcase how NSD-synthetic enables OOD generalization tests that facilitate the development of more robust models of visual processing, and the formulation of more accurate theories of human vision.

♻️ Reproducibility

🧰 Data

The NSD dataset (including NSD-synthetic) is freely available at http://naturalscenesdataset.org.

⚙️ Installation

To reproduce the paper's results, you can download and run the Python code from this repository. To run this code, you will first need to install the libraries in the requirements.txt). We recommend installing these libraries within a virtual environment (e.g., an Anaconda) environment) using:

pip install -r requirements.txt

📦 Code description

  • 00_prepare_fmri: Prepare NSD-synthetic and NSD-core's fMRI responses for the following analyses.
  • paper_figure_2: Analyse NSD-synthetic's univariate and multivariate fMRI responses, and noise ceiling signal-to-noise ratio (ncsnr).
  • paper_figure_3: Perform multidimensional scaling (MDS) on NSD-synthetic and NSD-core's fMRI responses.
  • paper_figure_4: Train encoding model on NSD-core, and test them both in-distribution (NSD-core) and out-of-distribution (NSD-synthetic).
  • paper_figure_5: Compare diffent encoding models based on their in-distribution (NSD-core) and out-of-distribution (NSD-synthetic) performances.

🧠 Flattened cortical surface plots

In Figures 2, 4, and 5, we plotted results on flattened cortical surfaces using pycortex' fsaverage subject.

For visualization purposes, we manually drew surface labels based on the “streams” ROI collection as provided in the NSD data release. To use these labels, please add the overlays.svg file to the pycortex fsaverage subject folder (within an Anaconda environment, you should find this folder at: ../anaconda3/envs/env_name/share/pycortex/db/fsaverage)

❗ Issues

If you experience problems with the code, please get in touch with Ale ([email protected]), or submit an issue.

📜 Citation

If you use any of our data or code, please cite:

  • Gifford AT, Cichy RM, Naselaris T, Kay K. 2025. A 7T fMRI dataset of synthetic images for out-of-distribution modeling of vision. arXiv preprint, arXiv:2503.06286. DOI: https://doi.org/10.48550/arXiv.2503.06286
  • Allen EJ, St-Yves G, Wu Y, Breedlove JL, Prince JS, Dowdle LT, Nau M, Caron B, Pestilli F, Charest I, Hutchinson BJ, Naselaris T, Kay K. 2022. A massive 7T fMRI dataset to bridge cognitive neuroscience and artificial intelligence. Nature neuroscience, 25(1), 116-126. DOI: https://doi.org/10.1038/s41593-021-00962-x