Supplemental Material for: Interlocus gene conversion events introduce deleterious mutations into at least 1% of human genes associated with inherited disease Claudio Casola1, Ugne Zekonyte1, Andrew D. Phillips2, David N. Cooper2, and Matthew W. Hahn1,3 1Department of Biology and 3School of Informatics and Computing, Indiana University, Bloomington, IN 47405 2 Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom. Corresponding author: Claudio Casola Department of Biology, Indiana University 1001 East 3rd Street, Bloomington, IN 47405 Phone: 812-856-7016 Fax: 812-855-6705 Email:
[email protected] 1 Methods Validation of disease alleles originating from interlocus gene conversion All candidate IGC-derived disease alleles identified by our BLAST searches were further examined to identify high-confidence IGC events. Such alleles required at least one disease-associated mutation plus another mutation to be shared by the IGC donor sequence. In addition, we excluded from our data set those cases where hypermutable CpG sites constituted all the diagnostic mutations. After careful examination of the available literature, we also disregarded several putative IGC events that were deemed likely to represent experimental artifacts. In particular, we identified several cases where primer pairs, which were originally considered to have amplified a single genomic locus or cDNA transcript, also appeared to be capable of amplifying sequences paralogous to the target genes. Such instances included disease alleles from the genes CFC1 (Bamford et al. 2000), GPD2 (Novials et al. 1997) and HLA-DRB1 (Velickovic et al. 2004). In the first two of these cases, we noted that the primer pairs that had been originally designed to amplify the genes specifically, also perfectly matched the paralogous gene CFC1B and an unlinked processed pseudogene of GPD2, respectively.