Supplementary Materials for "Whole genome sequencing in multiplex families reveals novel inherited and de novo genetic risk in autism" Elizabeth K. Ruzzo1,†, Laura Pérez-Cano1,†, Jae-Yoon Jung4,5, Lee-kai Wang1, Dorna Kashef- Haghighi4,5,6, Chris Hartl3, Jackson Hoekstra1, Olivia Leventhal1, Michael J. Gandal1, Kelley Paskov4,5, Nate Stockham4,5, Damon Polioudakis1, Jennifer K. Lowe1, Daniel H. Geschwind1,2,* & Dennis P. Wall4,5,* * Correspondence to:
[email protected] and
[email protected] This PDF file includes: Materials and Methods Supplementary Text Figs. S1 to S13 Tables S2, S6, S8, S10, S11, and S13-S16. Captions for Tables S1, S3, S4, S5, S7, S9, and S12 Other Supplementary Materials for this manuscript include the following: Tables S1, S3, S4, S5, S7, S9, and S12 [excel files] Table S1: Cohort information. Table S3: Phenotypic details for NR3C2 families. Table S4: DLG2 structural variant details and haplotype prediction for the carrier families. Table S5: ARC feature descriptions Table S7: TADA qualifying variants identified in iHART/AGRE samples. Table S9: iHART TADA-mega analysis results for all 18,472 genes. Table S12: NetSig significant genes. 1 Materials and Methods Sample selection and pre-sequencing QC Families including two or more individuals with ASD (those with a "derived affected status" of "autism", "broad-spectrum","nqa","asd", or "spectrum") were drawn from the Autism Genetic Resource Exchange (AGRE)(1). Patients with known genetic causes of ASD or syndromes with overlapping features (Table S2) were excluded from sequencing. In our first round of sequencing, we prioritized ASD-families harboring affected female subjects. We also prioritized monozygotic twin containing families, in part to facilitate the development of our machine learning model (Artifact Removal by Classifier (ARC): a machine learning classifier for distinguishing true de novo variants from LCL and sequencing artifacts).