Comparing Spatial Null Models for Brain Maps

bioRxiv preprint doi: https://doi.org/10.1101/2020.08.13.249797; this version posted February 24, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Comparing spatial null models for brain maps Ross D. Markello1;∗ Bratislav Misic1;∗ 1McConnell Brain Imaging Centre, Montréal Neurological Institute, McGill University, Montréal, Canada ∗Correspondence to: [email protected] or [email protected] Technological and data sharing advances have led to a proliferation of high-resolution structural and functional maps of the brain. Modern neuroimaging research increasingly depends on identifying correspondences between the topographies of these maps; however, most standard methods for statistical inference fail to account for their spatial properties. Recently, multiple methods have been developed to generate null distributions that preserve the spatial autocorrelation of brain maps and yield more accurate statistical estimates. Here, we comprehensively assess the performance of ten published null frameworks in statistical analyses of neuroimaging data. To test the efficacy of these frameworks in situations with a known ground truth, we first apply them to a series of controlled simulations and ex- amine the impact of data resolution and spatial autocorrelation on their family-wise error rates. Next, we use each framework with two empirical neuroimaging datasets, investigating their performance when testing (1) the correspondence between brain maps (e.g., correlating two activation maps) and (2) the spatial distribution of a feature within a partition (e.g., quantifying the specificity of an activation map within an intrinsic functional network). Finally, we investigate how differences in the implementation of these null models may impact their performance. In agreement with previous re- ports, we find that naive null models that do not preserve spatial autocorrelation consistently yield elevated false positive rates and unrealistically liberal statistical estimates. While spatially-constrained null models yielded more realistic, conservative estimates, even these frameworks suffer from inflated false positive rates and variable performance across analyses. Throughout our results, we observe minimal impact of parcellation and resolution on null model performance. Altogether, our findings highlight the need for continued development of statistically-rigorous methods for comparing brain maps. The present report provides a harmonised framework for benchmarking and comparing future advancements. Keywords: null models | spin test | brain parcellations | spatial autocorrelation | significance testing INTRODUCTION 2004, Burt et al. 2020, Fulcher et al. 2020, Gordon et al. 2016). Namely, in spatially-embedded systems—like the brain—neighboring data points are not statistically in- The brain is organized as a series of nested and independent, violating the assumptions of many common creasingly multi-functional neural circuits. The connec- inferential frameworks. As an example, consider com- tions and interactions among these circuits ultimately puting a correlation between two brain maps. When manifest as unique topographic distributions of struc- using a standard parametric null (i.e., the Student’s t- tural and functional properties. Recent advances in distribution), the spatial autocorrelation of the maps vio- imaging, tracing, and recording technologies (Insel et al. lates the inherent requirement that the model errors are 2013), together with global data sharing initiatives independent and identically distributed (i.i.d.). When (Casey et al. 2018, Poldrack et al. 2013, Sudlow et al. using a standard non-parametric null (i.e., random per- 2015, Van Essen et al. 2013), have resulted in the genera- mutations of one of the feature maps), the spatial auto- tion of high-resolution maps of many of these properties, correlation violates the requirement of exchangeability. including gene expression (Akbarian et al. 2015, Hawry- In both instances, the calculated p-value will be inflated, lycz et al. 2012), cytology (Scholtens et al. 2018, von yielding increased family-wise error rates (FWER, or type Economo and Koskinas 1925), receptor densities (Beliv- 1 error rates) across analyses. eau et al. 2017, Norgaard et al. 2020, Zilles and Amunts 2009, Zilles et al. 2004), intracortical myelin (Burt et al. The impact of spatial autocorrelation on statistical in- 2018, Whitaker et al. 2016), and functional organization ference has long been known in fields like geostatistics (Bellec et al. 2010, Damoiseaux et al. 2006, Margulies and ecology (Cliff and Ord 1970, Legendre 1993), and et al. 2016, Murray et al. 2014, Shafiei et al. 2020, Yeo there have been significant developments in these fields et al. 2011). to account for and overcome its influence (Cressie 2015, Increasingly, modern scientific discovery in neu- Deblauwe et al. 2012, Dray 2011, Dutilleul et al. 1993, roimaging research involves identifying correspondences Fortin and Jacquez 2000, Wagner and Dray 2015). Ac- between the topographies of brain maps (Baum et al. knowledgement of the impact of spatial autocorrelation 2020, Demirta¸set al. 2019, Gao et al. 2020, Hansen et al. in neuroimaging is far more recent. Only in the past sev- 2020, Shafiei et al. 2020, Vázquez-Rodríguez et al. 2019, eral years have frameworks been popularized to over- Wang et al. 2019); however, standard methods for statis- come the shortcomings of standard null models. These tical inference fall short when making such comparisons frameworks generate null distributions that preserve the (Alexander-Bloch et al. 2013, 2018, Breakspear et al. spatial autocorrelation of brain maps, yielding statisti- bioRxiv preprint doi: https://doi.org/10.1101/2020.08.13.249797; this version posted February 24, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 2 cal inferences that more accurately reflect the underlying state networks). In all analyses we systematically exam- data. ine the impact of parcellation—which differentially mod- Spatially-constrained null models in neuroimaging fall ifies the spatial structure of the underlying data—on the into two broad families: non-parametric spatial permu- performance of the null frameworks. tation models (Alexander-Bloch et al. 2013, 2018, Gor- don et al. 2016) and parameterized data models (Burt METHODS et al. 2018, 2020, Vos de Wael et al. 2020). In non- parametric spatial permutation models, the cortical sur- Code and data availability face is represented as a sphere to which random rotations are applied, generating surface maps with ran- All code used for data processing, analysis, and fig- domized topography but identical spatial autocorrela- ure generation is available on GitHub (https://github. tion. In parameterized data models, spatial autocorrela- com/netneurolab/markello_spatialnulls) and directly re- tion is estimated from the empirical brain maps and used lies on the following open-source Python packages: to generate surrogate nulls with randomized topography BrainSMASH (Burt et al. 2020), BrainSpace (Vos de and similar—though not identical—spatial autocorrela- Wael et al. 2020), IPython (Pérez and Granger 2007), tion. Since their development, these models have been Jupyter (Kluyver et al. 2016), Matplotlib (Hunter 2007), adapted by several researchers (Baum et al. 2020, Corn- NeuroSynth (Yarkoni et al. 2011), NiBabel (Brett et al. blath et al. 2020, Váša et al. 2018, Vázquez-Rodríguez 2020), NumPy (Oliphant 2006, Van Der Walt et al. et al. 2019). To our knowledge, there have been at least 2011), Pandas (McKinney 2010), PySurfer (Waskom ten distinct implementations of null frameworks applied et al. 2020b), Scikit-learn (Pedregosa et al. 2011), SciPy to statistical estimates of brain maps. (Virtanen et al. 2020), and Seaborn (Waskom et al. 2020a). Additional software used in the reported anal- One of the earliest implementations of these null mod- yses includes FreeSurfer (v6.0.0, http://surfer.nmr.mgh. els, proposed in Alexander-Bloch et al. (2013) and later harvard.edu/; Fischl et al. 1999) and the Connectome formalized in Alexander-Bloch et al. (2018), described Workbench (v1.4.2, https://www.humanconnectome. a non-parametric method by which the cortical surface org/software/connectome-workbench; Marcus et al. was subjected to spatial rotations. The principal chal- 2011). lenge of implementing this method is that the medial wall—for which most brain maps contain no data—can Data be rotated into the cortical surface. This is an important consideration because the loss of data caused by this me- NeuroSynth association maps dial wall rotation can bias results. To address this prob- lem, researchers have opted to either discard the missing To replicate the analyses described in Alexander-Bloch data (Baum et al. 2020, Cornblath et al. 2020), assign et al. (2018) we downloaded “association” maps from the nearest data to missing parcels (Vázquez-Rodríguez NeuroSynth (Yarkoni et al. 2011), a meta-analytic tool et al. 2019), or ignore the medial wall entirely (Váša that synthesizes results from published MRI studies by et al. 2018). Other groups have devised alternative linking voxel activation coordinates

Comparing Spatial Null Models for Brain Maps

Sequence Distance Embeddings

Applied Psychological Measurement

A Distance Model for Rhythms

Goodchild & Mcadams 1 Perceptual Processes in Orchestration To

Model Analytics and Management

A Generative Model of Tonal Tension and Its Application in Dynamic Realtime Sonification

Tonnetz, Voice-Leading, and Spectral Models of Perceived Triadic Distance

MTO 22.4: Gotham, Pitch Properties of the Pedal Harp

Diss Final 22/08/2011

Martorell (2009)

Network Migration Strategies: Evaluating Performance with Extensions of Data Envelopment Analysis

Q in Context II