Large-Scale Evolutionary Analysis of Polymorphic Inversions in the Human Genome
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Linkage Disequilibrium
Linkage Disequilibrium Biostatistics 666 Logistics: Office Hours Office hours on Mondays at 4 pm. Room 4614 School of Public Health Tower Previously … Basic properties of a locus • Allele Frequencies • Genotype Frequencies Hardy-Weinberg Equilibrium • Relationship between allele and genotype frequencies that holds for most genetic markers Exact Tests for Hardy-Weinberg Equilibrium Today … We’ll consider properties of pairs of alleles Haplotype frequencies Linkage equilibrium Linkage disequilibrium Let’s consider the history of two neighboring alleles… Alleles that exist today arose through ancient mutation events… Before Mutation A C G T A A G T A C G T A C G T G T A C G A C G After Mutation A C G T A A G T A C G T A C G T G T A C G A C G A C G T A C G T A CMutation G T A C G T G T A C G A C G Alleles that exist today arose through ancient mutation events… Before Mutation A After Mutation A C Mutation One allele arose first, and then the other… Before Mutation A G C G After Mutation A G C G C C Mutation Recombination generates new arrangements for ancestral alleles Before Recombination A G C G C C After Recombination A G C G C C A C Recombinant Haplotype Linkage Disequilibrium Chromosomes are mosaics Ancestor Extent and conservation of mosaic pieces depends on Present-day • Recombination rate • Mutation rate • Population size • Natural selection Combinations of alleles at very close markers reflect ancestral haplotypes Why is linkage disequilibrium important for gene mapping? Association Studies and Linkage Disequilibrium -
The Role of Haplotypes in Candidate Gene Studies
Genetic Epidemiology 27: 321–333 (2004) The Role of Haplotypes in Candidate Gene Studies Andrew G. Clarkn Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York Human geneticists working on systems for which it is possible to make a strong case for a set of candidate genes face the problem of whether it is necessary to consider the variation in those genes as phased haplotypes, or whether the one-SNP- at-a-time approach might perform as well. There are three reasons why the phased haplotype route should be an improvement. First, the protein products of the candidate genes occur in polypeptide chains whose folding and other properties may depend on particular combinations of amino acids. Second, population genetic principles show us that variation in populations is inherently structured into haplotypes. Third, the statistical power of association tests with phased data is likely to be improved because of the reduction in dimension. However, in reality it takes a great deal of extra work to obtain valid haplotype phase information, and inferred phase information may simply compound the errors. In addition, if the causal connection between SNPs and a phenotype is truly driven by just a single SNP, then the haplotype- based approach may perform worse than the one-SNP-at-a-time approach. Here we examine some of the factors that affect haplotype patterns in genes, how haplotypes may be inferred, and how haplotypes have been useful in the context of testing association between candidate genes and complex traits. Genet. Epidemiol. & 2004 Wiley-Liss, Inc. Key words: haplotype inference; haplotype association testing; candidate genes; linkage equilibrium Grant sponsor: NIH; Grant numbers: GM65509, HL072904. -
Transferability of Tag Snps to Capture Common Genetic Variation in DNA Repair Genes Across Multiple Populations Paul I.W. De
Transferability of Tag SNPs to Capture Common Genetic Variation in DNA Repair Genes Across Multiple Populations Paul I.W. De Bakker, Robert R. Graham, David Altshuler, Brian E. Henderson, and Christopher A. Haiman Pacific Symposium on Biocomputing 11:478-486(2006) TRANSFERABILITY OF TAG SNPS TO CAPTURE COMMON GENETIC VARIATION IN DNA REPAIR GENES ACROSS MULTIPLE POPULATIONS PAUL I.W. DE BAKKER, ROBERT R. GRAHAM, DAVID ALTSHULER Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA BRIAN E. HENDERSON, CHRISTOPHER A. HAIMAN Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA Genetic association studies can be made more cost-effective by exploiting linkage disequilibrium patterns between nearby single-nucleotide polymorphisms (SNPs). The International HapMap Project now offers a dense SNP map across the human genome in four population samples. One question is how well tag SNPs chosen from a resource like HapMap can capture common variation in independent disease samples. To address the issue of tag SNP transferability, we genotyped 2,783 SNPs across 61 genes (with a total span of 6 Mb) involved in DNA repair in 466 individuals from multiple populations. We picked tag SNPs in samples with European ancestry from the Centre d’Etude du Polymorphisme Humain, and evaluated coverage of common variation in the other samples. Our comparative analysis shows that common variation in non- African samples can be captured robustly with only marginal loss in terms of the maximum r2. We also evaluated the transferability of specified multi-marker haplotypes as predictors for untyped SNPs, and demonstrate that they provide equivalent coverage compared to single-marker tests (pairwise tags) while requiring fewer SNPs for genotyping. -
Efficient Design and Analysis of Genome-Wide
UNIVERSITY OF CALIFORNIA Los Angeles Efficient Design and Analysis of Genome-wide Association Studies A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Computer Science by Emrah Kostem 2013 c Copyright by Emrah Kostem 2013 ABSTRACT OF THE DISSERTATION Efficient Design and Analysis of Genome-wide Association Studies by Emrah Kostem Doctor of Philosophy in Computer Science University of California, Los Angeles, 2013 Professor Eleazar Eskin, Chair The recent advances in genomic technologies, have made it possible to collect large- scale information on genetic variation across a diverse biological landscape. This has resulted in an exponential influx of genetic information and the field of genetics has be- come data-rich in a relatively short amount of time. These developments have opened new avenues to elucidate the genetic basis of complex diseases, where the traditional disease study approaches had little success. In recent years, the genome-wide association study (GWAS) approach has gained widespread popularity for its ease of use and effectiveness, and is now the standard approach to study complex diseases. In GWAS, information on millions of single- nucleotide polymorphisms (SNPs) is collected from case and control individuals. SNP genotyping is cost-effective and due to their abundance in the genome, SNPs are cor- related to their neighboring genetic variation, which makes them tags for genomic regions. Typically, each SNP is statistically tested for association to disease, and the genomic regions tagged by the significant SNPs are believed to be harboring the func- tional variants contributing to disease. In order to reduce the cost of GWAS and the redundancy in the information col- ii lected, an informative subset of the SNPs, or tag SNPs, are genotyped. -
Evolution and Diversity of Copy Number Variation in the Great Ape Lineage
Downloaded from genome.cshlp.org on September 24, 2021 - Published by Cold Spring Harbor Laboratory Press Evolution and diversity of copy number variation in the great ape lineage Peter H. Sudmant1, John Huddleston1,7, Claudia R. Catacchio2, Maika Malig1, LaDeana W. Hillier3, Carl Baker1, Kiana Mohajeri1, Ivanela Kondova4, Ronald E. Bontrop4, Stephan Persengiev4, Francesca Antonacci2, Mario Ventura2, Javier Prado-Martinez5, Tomas 5,6 1,7 Marques-Bonet , and Evan E. Eichler 1. Department of Genome Sciences, University of Washington, Seattle, WA, USA 2. University of Bari, Bari, Italy 3. The Genome Institute, Washington University School of Medicine, St. Louis, MO, USA 4. Department of Comparative Genetics, Biomedical Primate Research Centre, Rijswijk, The Netherlands 5. Institut de Biologia Evolutiva, (UPF-CSIC) Barcelona, Spain 6. Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain 7. Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA Correspondence to: Evan Eichler Department of Genome Sciences University of Washington School of Medicine Foege S-413A, Box 355065 3720 15th Ave NE Seattle, WA 98195 E-mail: [email protected] 1 Downloaded from genome.cshlp.org on September 24, 2021 - Published by Cold Spring Harbor Laboratory Press ABSTRACT Copy number variation (CNV) contributes to the genetic basis of disease and has significantly restructured the genomes of humans and great apes. The diversity and rate of this process, however, has not been extensively explored among the great ape lineages. We analyzed 97 deeply sequenced great ape and human genomes and estimate that 16% (469 Mbp) of the hominid genome has been affected by recent copy number changes. -
Genetic Overlap Between Alzheimer&Rsquo
Molecular Psychiatry (2015) 20, 1588–1595 © 2015 Macmillan Publishers Limited All rights reserved 1359-4184/15 www.nature.com/mp ORIGINAL ARTICLE Genetic overlap between Alzheimer’s disease and Parkinson’s disease at the MAPT locus RS Desikan1, AJ Schork2, Y Wang3,4, A Witoelar4, M Sharma5,6, LK McEvoy1, D Holland3, JB Brewer1,3, C-H Chen1,7, WK Thompson7, D Harold8, J Williams8, MJ Owen8,MCO’Donovan8, MA Pericak-Vance9, R Mayeux10, JL Haines11, LA Farrer12, GD Schellenberg13, P Heutink14, AB Singleton15, A Brice16, NW Wood17, J Hardy18, M Martinez19, SH Choi20, A DeStefano20,21, MA Ikram22, JC Bis23, A Smith24, AL Fitzpatrick25, L Launer26, C van Duijn22, S Seshadri21,27, ID Ulstein28, D Aarsland29,30,31, T Fladby32,33, S Djurovic4, BT Hyman34, J Snaedal35, H Stefansson36, K Stefansson36,37, T Gasser5, OA Andreassen4,7, AM Dale1,2,3,7for the ADNI38ADGC, GERAD, CHARGE and IPDGC Investigators39 We investigated the genetic overlap between Alzheimer’s disease (AD) and Parkinson’s disease (PD). Using summary statistics (P-values) from large recent genome-wide association studies (GWAS) (total n = 89 904 individuals), we sought to identify single nucleotide polymorphisms (SNPs) associating with both AD and PD. We found and replicated association of both AD and PD with the A allele of rs393152 within the extended MAPT region on chromosome 17 (meta analysis P-value across five independent AD cohorts = 1.65 × 10− 7). In independent datasets, we found a dose-dependent effect of the A allele of rs393152 on intra-cerebral MAPT transcript levels and volume loss within the entorhinal cortex and hippocampus. -
Kauer Washington 0250O 21245.Pdf (1.990Mb)
Using biomedical data to identify genetic variants that drive drug responses in Acute Myeloid Leukemia Nicole Kauer A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science University of Washington Tacoma 2020 Committee: Ka Yee Yeung-Rhee, Chair Ling-Hong Hung Wes Lloyd Program Authorized to Offer Degree: Computer Science and Systems c Copyright 2020 Nicole Kauer University of Washington Tacoma Abstract Using biomedical data to identify genetic variants that drive drug responses in Acute Myeloid Leukemia Nicole Kauer Chair of the Supervisory Committee: Professor Ka Yee Yeung-Rhee School of Engineering and Technology Acute Myeloid Leukemia (AML) is a heterogeneous cancer of the blood that progresses quickly, with approximately 10,000 AML related deaths reported annually in the United States. Patients with AML tend to have genetic variations, which can significantly affect drug sensitivity and treatment outcomes. While massive amounts of big biomedical data have been generated to characterize AML, these genetic variations are not yet well-understood. Thus, the development of individualized approaches to AML therapy using these big data has great potential. The promise of precision medicine is that knowledge of the genetic characteristics present within a cancer will enable better choices for therapy. In this thesis, we applied data sci- ence techniques to analyze AML biomedical data in collaboration with Dr. Pamela Becker, Hematology, UW-Seattle, with the goal of identifying genetic variants that drive drug re- sponses in AML. We identified 30 novel gene-drug pairs with statistical significant responses. Additionally, both univariate and multivariate machine learning models were created, with multivariate feature selection via Bayesian Model Averaging. -
Next-MP50 Status Report
The neXt-50 Challenge Status report and acceptance of next-50 Challenge by individual Chromosome groups February 24, 2017. Chromosome 1 (Ping Xu) PIC Leaders: Ping Xu, Fuchu He Contributing labs: Ping Xu, Beijing Proteome Research Center Fuchu He, Beijing Proteome Research Center Dong Yang, Beijing Proteome Research Center Wantao Ying, Beijing Proteome Research Center Pengyuan Yang, Fudan University Siqi Liu, Beijing Genome Institute Qinyu He, Jinan University Major lab members or partners contributing to the neXt50: Yao Zhang (Beijing Proteome Research Center), Yihao Wang (Beijing Proteome Research Center), Cuitong He (Beijing Proteome Research Center), Wei Wei (Beijing Proteome Research Center), Yanchang Li (Beijing Proteome Research Center), Feng Xu (Beijing Proteome Research Center), Xuehui Peng (Beijing Proteome Research Center). Status of the Chromosome “parts list”: Step by step milestone plan to find, identify and validate MPs 1. We try to identify more missing proteins through high coverage proteomics technology with testis samples. We will finish this project before May. 2. We tested and confirmed that PTM approach could identify significant number of missing proteins. We will finalize the data sets before May. C-HPP 2017-03-31 The neXt-50 Challenge Chromosome 2 See combined response and work plan with Chromosome 14. Chromosome 3 (Takeshi Kawamura) PIC Leaders: Takeshi Kawamura (since January 1) Contributing labs: Major lab members or partners contributing to the neXt50: Takeshi Kawamura: Associate Professor, Proteomics Laboratory, Isotope Science Center, The University of Tokyo, Tokyo, Japan. Toshihide Nishimura: Professor, Department of Translational Medicine Informatics, St. Marianna University School of Medicine, Kanagawa, Japan. Hiromasa Tojo: Associate Professor, Department of Biophysics and Biochemistry, Osaka University; Graduate School of Medicine, Osaka, Japan. -
Single Nucleotide Polymorphism Facilitated Downregulation of The
Volume 20 Number 3 March 2018 pp. 289–294 289 www.neoplasia.com Single Nucleotide Polymorphism Somenath Datta*, Richard M. Sherva†, Mart De La Cruz*, Michelle T. Long*, Priya Roy*, Facilitated Down-Regulation of the Vadim Backman‡, Sanjib Chowdhury*,1 and ,1 Cohesin Stromal Antigen-1: Hemant K. Roy* Implications for Colorectal *Department of Medicine, Section of Gastroenterology, Boston University Medical Center, Boston, MA 02118, USA; Cancer Racial Disparities †Department of Medicine, Section of Biomedical Genetics, Boston University Medical Center, Boston, MA 02118, USA; ‡Department of Biomedical Engineering, Northwestern University, Evanston, Illinois, USA Abstract The biological underpinnings for racial disparities in colorectal cancer (CRC) incidence remain to be elucidated. We have previously reported that the cohesin SA-1 down-regulation is an early event in colon carcinogenesis which is dramatically accentuated in African-Americans. In order to investigate the mechanism, we evaluated single nucleotide polymorphisms (SNPs) for association with SA-1-related outcomes followed by gene editing of candidate SNP. We observed that rs34149860 SNP was significantly associated with a lower colonic mucosal SA-1 expression and evaluation of public databases showed striking racial discordance. Given that the predicted SNP would alter miR-29b binding site, we used CRISPR knock-in in CRC cells and demonstrated that the SNP but not wild-type had profound alterations in SA-1 expression with miR-29b inhibitor. This is the first demonstration of high-order chromatin regulators as a modulator of racial differences, risk alteration with SNPs and finally specific modulation by microRNAs. Neoplasia (2018) 20, 289–294 Introduction Since many SNPs have a racial predilection, this provides a Colorectal cancer (CRC) remains the second leading causes of cancer potential mechanism for racial disparities in CRC. -
Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham a Thesis Submitted in Conformity
Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham A thesis submitted in conformity with the requirements for the degree of Master of Science Graduate Department of Molecular Genetics University of Toronto © Copyright by Edward James Higginbotham 2020 i Abstract Characterizing Genomic Duplication in Autism Spectrum Disorder Edward James Higginbotham Master of Science Graduate Department of Molecular Genetics University of Toronto 2020 Duplication, the gain of additional copies of genomic material relative to its ancestral diploid state is yet to achieve full appreciation for its role in human traits and disease. Challenges include accurately genotyping, annotating, and characterizing the properties of duplications, and resolving duplication mechanisms. Whole genome sequencing, in principle, should enable accurate detection of duplications in a single experiment. This thesis makes use of the technology to catalogue disease relevant duplications in the genomes of 2,739 individuals with Autism Spectrum Disorder (ASD) who enrolled in the Autism Speaks MSSNG Project. Fine-mapping the breakpoint junctions of 259 ASD-relevant duplications identified 34 (13.1%) variants with complex genomic structures as well as tandem (193/259, 74.5%) and NAHR- mediated (6/259, 2.3%) duplications. As whole genome sequencing-based studies expand in scale and reach, a continued focus on generating high-quality, standardized duplication data will be prerequisite to addressing their associated biological mechanisms. ii Acknowledgements I thank Dr. Stephen Scherer for his leadership par excellence, his generosity, and for giving me a chance. I am grateful for his investment and the opportunities afforded me, from which I have learned and benefited. I would next thank Drs. -
Comparative Analysis of the Ubiquitin-Proteasome System in Homo Sapiens and Saccharomyces Cerevisiae
Comparative Analysis of the Ubiquitin-proteasome system in Homo sapiens and Saccharomyces cerevisiae Inaugural-Dissertation zur Erlangung des Doktorgrades der Mathematisch-Naturwissenschaftlichen Fakultät der Universität zu Köln vorgelegt von Hartmut Scheel aus Rheinbach Köln, 2005 Berichterstatter: Prof. Dr. R. Jürgen Dohmen Prof. Dr. Thomas Langer Dr. Kay Hofmann Tag der mündlichen Prüfung: 18.07.2005 Zusammenfassung I Zusammenfassung Das Ubiquitin-Proteasom System (UPS) stellt den wichtigsten Abbauweg für intrazelluläre Proteine in eukaryotischen Zellen dar. Das abzubauende Protein wird zunächst über eine Enzym-Kaskade mit einer kovalent gebundenen Ubiquitinkette markiert. Anschließend wird das konjugierte Substrat vom Proteasom erkannt und proteolytisch gespalten. Ubiquitin besitzt eine Reihe von Homologen, die ebenfalls posttranslational an Proteine gekoppelt werden können, wie z.B. SUMO und NEDD8. Die hierbei verwendeten Aktivierungs- und Konjugations-Kaskaden sind vollständig analog zu der des Ubiquitin- Systems. Es ist charakteristisch für das UPS, daß sich die Vielzahl der daran beteiligten Proteine aus nur wenigen Proteinfamilien rekrutiert, die durch gemeinsame, funktionale Homologiedomänen gekennzeichnet sind. Einige dieser funktionalen Domänen sind auch in den Modifikations-Systemen der Ubiquitin-Homologen zu finden, jedoch verfügen diese Systeme zusätzlich über spezifische Domänentypen. Homologiedomänen lassen sich als mathematische Modelle in Form von Domänen- deskriptoren (Profile) beschreiben. Diese Deskriptoren können wiederum dazu verwendet werden, mit Hilfe geeigneter Verfahren eine gegebene Proteinsequenz auf das Vorliegen von entsprechenden Homologiedomänen zu untersuchen. Da die im UPS involvierten Homologie- domänen fast ausschließlich auf dieses System und seine Analoga beschränkt sind, können domänen-spezifische Profile zur Katalogisierung der UPS-relevanten Proteine einer Spezies verwendet werden. Auf dieser Basis können dann die entsprechenden UPS-Repertoires verschiedener Spezies miteinander verglichen werden. -
Genomic Approach in Idiopathic Intellectual Disability Maria De Fátima E Costa Torres
ESTUDOS DE 8 01 PDPGM 2 CICLO Genomic approach in idiopathic intellectual disability Maria de Fátima e Costa Torres D Autor. Maria de Fátima e Costa Torres D.ICBAS 2018 Genomic approach in idiopathic intellectual disability Genomic approach in idiopathic intellectual disability Maria de Fátima e Costa Torres SEDE ADMINISTRATIVA INSTITUTO DE CIÊNCIAS BIOMÉDICAS ABEL SALAZAR FACULDADE DE MEDICINA MARIA DE FÁTIMA E COSTA TORRES GENOMIC APPROACH IN IDIOPATHIC INTELLECTUAL DISABILITY Tese de Candidatura ao grau de Doutor em Patologia e Genética Molecular, submetida ao Instituto de Ciências Biomédicas Abel Salazar da Universidade do Porto Orientadora – Doutora Patrícia Espinheira de Sá Maciel Categoria – Professora Associada Afiliação – Escola de Medicina e Ciências da Saúde da Universidade do Minho Coorientadora – Doutora Maria da Purificação Valenzuela Sampaio Tavares Categoria – Professora Catedrática Afiliação – Faculdade de Medicina Dentária da Universidade do Porto Coorientadora – Doutora Filipa Abreu Gomes de Carvalho Categoria – Professora Auxiliar com Agregação Afiliação – Faculdade de Medicina da Universidade do Porto DECLARAÇÃO Dissertação/Tese Identificação do autor Nome completo _Maria de Fátima e Costa Torres_ N.º de identificação civil _07718822 N.º de estudante __ 198600524___ Email institucional [email protected] OU: [email protected] _ Email alternativo [email protected] _ Tlf/Tlm _918197020_ Ciclo de estudos (Mestrado/Doutoramento) _Patologia e Genética Molecular__ Faculdade/Instituto _Instituto de Ciências