bioRxiv preprint doi: https://doi.org/10.1101/2020.08.03.235358; this version posted August 10, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Insights into dispersed duplications and complex structural mutations from whole genome sequencing 706 families Christopher W. Whelan1,2,3*, Robert E. Handsaker1,2,3, Giulio Genovese1,2,3, Seva Kashin1,2,3, Monkol Lek4, Jason Hughes5, Joshua McElwee6, Michael Lenardo7, Daniel MacArthur3,8,9, Steven A. McCarroll1,2,3* 1Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA 2Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA 3Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA 4Department of Genetics, Yale School of Medicine, New Haven, Connecticut 06520, USA 5Foundation Medicine, Cambridge, Massachusetts 02141, USA 6Nimbus Therapeutics, Cambridge, Massachusetts 02139, USA 7Clinical Genomics Program, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA. 8Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston 02114, MA, USA 9Centre for Population Genomics, Murdoch Children’s Research Institute, Melbourne, Victoria, Australia * Corresponding Author E-mail:
[email protected] (C.W.W),
[email protected] (S.A.M.) Abstract Two intriguing forms of genome structural variation (SV) – dispersed duplications, and de novo rearrangements of complex, multi-allelic loci – have long escaped genomic analysis. We describe a new 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.08.03.235358; this version posted August 10, 2020.