Frontiers in Coalescent Theory: Pedigrees, Identity-By-Descent, and Sequentially Markov Coalescent Models
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Science Fiction and Astronomy
Sci Fi Science Fiction and Astronomy Many science fiction books include subjects of astronomical interest. Here is a list of some that have been recommended to me or I’ve read. I expect that most are not in the University library but many are available in Kindle and other e-formats. At the top of my list is a URL to a much longer list by Andrew Fraknoi of the Astronomical Society of the Pacific. Each title in his list has a very brief summary indicating the kind of story it is. Andrew Fraknoi’s list (http://www.astrosociety.org/education/resources/scifi.html). Gregory Benford is a plasma physicist who has been rated by some as one of the finest observes and interpreters of science in modern fiction. Note: Timescape [Vista, ISBN 0575600500] In the Ocean of Night [Vista, ISBN 0575600357] Across the Sea of Suns [Vista, ISBN 0575600551] Great Sky River [Gallancy, ISBN 0575058315] Eater and over 2 dozen more available as e-books. Stephen Baxter Titan [Voyager, 1997, ISBN 0002254247] has been strongly recommended by New Scientist as a tense, near future, thriller you shouldn’t miss. David Brin has a PhD in astrophysics with which he brings real understanding of the Universe to his stories. The Crystal Spheres won an award. Fred Hoyle is probably the most famous astronomer to have written science fiction. The Back Cloud [Macmillan, ISBN 0333556011] is his classic, followed by A for Andromeda. Try also October the First is too late. Larry Niven’s stories include plenty of ideas inspired by modern astronomy. -
Natural Selection and the Distribution of Identity-By-Descent in the Human Genome
Copyright Ó 2010 by the Genetics Society of America DOI: 10.1534/genetics.110.113977 Natural Selection and the Distribution of Identity-by-Descent in the Human Genome Anders Albrechtsen,*,1,2 Ida Moltke†,1 and Rasmus Nielsen‡ *Department of Biostatistics, University of Copenhagen, Copenhagen, 1014, Denmark, †Center for Bioinformatics, Copenhagen, 2200, Denmark and ‡Department of Integrative Biology and Statistics, University of California, Berkeley, California 94720 Manuscript received April 30, 2010 Accepted for publication June 14, 2010 ABSTRACT There has recently been considerable interest in detecting natural selection in the human genome. Selection will usually tend to increase identity-by-descent (IBD) among individuals in a population, and many methods for detecting recent and ongoing positive selection indirectly take advantage of this. In this article we show that excess IBD sharing is a general property of natural selection and we show that this fact makes it possible to detect several types of selection including a type that is otherwise difficult to detect: selection acting on standing genetic variation. Motivated by this, we use a recently developed method for identifying IBD sharing among individuals from genome-wide data to scan populations from the new HapMap phase 3 project for regions with excess IBD sharing in order to identify regions in the human genome that have been under strong, very recent selection. The HLA region is by far the region showing the most extreme signal, suggesting that much of the strong recent selection acting on the human genome has been immune related and acting on HLA loci. As equilibrium overdominance does not tend to increase IBD, we argue that this type of selection cannot explain our observations. -
Coalescent Likelihood Methods
Coalescent Likelihood Methods Mary K. Kuhner Genome Sciences University of Washington Seattle WA Outline 1. Introduction to coalescent theory 2. Practical example 3. Genealogy samplers 4. Break 5. Survey of samplers 6. Evolutionary forces 7. Practical considerations Population genetics can help us to find answers We are interested in questions like { How big is this population? { Are these populations isolated? How common is migration? { How fast have they been growing or shrinking? { What is the recombination rate across this region? { Is this locus under selection? All of these questions require comparison of many individuals. Coalescent-based studies How many gray whales were there prior to whaling? When was the common ancestor of HIV lines in a Libyan hospital? Is the highland/lowland distinction in Andean ducks recent or ancient? Did humans wipe out the Beringian bison population? What proportion of HIV virions in a patient actually contribute to the breeding pool? What is the direction of gene flow between European rabbit populations? Basics: Wright-Fisher population model All individuals release many gametes and new individuals for the next generation are formed randomly from these. Wright-Fisher population model Population size N is constant through time. Each individual gets replaced every generation. Next generation is drawn randomly from a large gamete pool. Only genetic drift affects the allele frequencies. Other population models Other population models can often be equated to Wright-Fisher The N parameter becomes the effective -
Natural Selection and Coalescent Theory
Natural Selection and Coalescent Theory John Wakeley Harvard University INTRODUCTION The story of population genetics begins with the publication of Darwin’s Origin of Species and the tension which followed concerning the nature of inheritance. Today, workers in this field aim to understand the forces that produce and maintain genetic variation within and between species. For this we use the most direct kind of genetic data: DNA sequences, even entire genomes. Our “Great Obsession” with explaining genetic variation (Gillespie, 2004a) can be traced back to Darwin’s recognition that natural selection can occur only if individuals of a species vary, and this variation is heritable. Darwin might have been surprised that the importance of natural selection in shaping variation at the molecular level would be de-emphasized, beginning in the late 1960s, by scientists who readily accepted the fact and importance of his theory (Kimura, 1983). The motivation behind this chapter is the possible demise of this Neutral Theory of Molecular Evolution, which a growing number of population geneticists feel must follow recent observations of genetic variation within and between species. One hundred fifty years after the publication of the Origin, we are struggling to fully incorporate natural selection into the modern, genealogical models of population genetics. The main goal of this chapter is to present the mathematical models that have been used to describe the effects of positive selective sweeps on genetic variation, as mediated by gene genealogies, or coalescent trees. Background material, comprised of population genetic theory and simulation results, is provided in order to facilitate an understanding of these models. -
Population Genetics: Wright Fisher Model and Coalescent Process
POPULATION GENETICS: WRIGHT FISHER MODEL AND COALESCENT PROCESS by Hailong Cui and Wangshu Zhang Superviser: Prof. Quentin Berger A Final Project Report Presented In Partial Fulfillment of the Requirements for Math 505b April 2014 Acknowledgments We want to thank Prof. Quentin Berger for introducing to us the Wright Fisher model in the lecture, which inspired us to choose Population Genetics for our project topic. The resources Prof. Berger provided us have been excellent learning mate- rials and his feedback has helped us greatly to create this report. We also like to acknowledge that the research papers (in the reference) are integral parts of this process. They have motivated us to learn more about models beyond the class, and granted us confidence that these probabilistic models can actually be used for real applications. ii Contents Acknowledgments ii Abstract iv 1 Introduction to Population Genetics 1 2 Wright Fisher Model 2 2.1 Random drift . 2 2.2 Genealogy of the Wright Fisher model . 4 3 Coalescent Process 8 3.1 Kingman’s Approximation . 8 4 Applications 11 Reference List 12 iii Abstract In this project on Population Genetics, we aim to introduce models that lay the foundation to study more complicated models further. Specifically we will discuss Wright Fisher model as well as Coalescent process. The reason these are of our inter- est is not just the mathematical elegance of these models. With the availability of massive amount of sequencing data, we actually can use these models (or advanced models incorporating variable population size, mutation effect etc, which are how- ever out of the scope of this project) to solve and answer real questions in molecular biology. -
Science Fiction Stories with Good Astronomy & Physics
Science Fiction Stories with Good Astronomy & Physics: A Topical Index Compiled by Andrew Fraknoi (U. of San Francisco, Fromm Institute) Version 7 (2019) © copyright 2019 by Andrew Fraknoi. All rights reserved. Permission to use for any non-profit educational purpose, such as distribution in a classroom, is hereby granted. For any other use, please contact the author. (e-mail: fraknoi {at} fhda {dot} edu) This is a selective list of some short stories and novels that use reasonably accurate science and can be used for teaching or reinforcing astronomy or physics concepts. The titles of short stories are given in quotation marks; only short stories that have been published in book form or are available free on the Web are included. While one book source is given for each short story, note that some of the stories can be found in other collections as well. (See the Internet Speculative Fiction Database, cited at the end, for an easy way to find all the places a particular story has been published.) The author welcomes suggestions for additions to this list, especially if your favorite story with good science is left out. Gregory Benford Octavia Butler Geoff Landis J. Craig Wheeler TOPICS COVERED: Anti-matter Light & Radiation Solar System Archaeoastronomy Mars Space Flight Asteroids Mercury Space Travel Astronomers Meteorites Star Clusters Black Holes Moon Stars Comets Neptune Sun Cosmology Neutrinos Supernovae Dark Matter Neutron Stars Telescopes Exoplanets Physics, Particle Thermodynamics Galaxies Pluto Time Galaxy, The Quantum Mechanics Uranus Gravitational Lenses Quasars Venus Impacts Relativity, Special Interstellar Matter Saturn (and its Moons) Story Collections Jupiter (and its Moons) Science (in general) Life Elsewhere SETI Useful Websites 1 Anti-matter Davies, Paul Fireball. -
Identity by Descent in Pedigrees and Populations; Methods for Genome-Wide Linkage and Association
Identity by descent in pedigrees and populations Overview - 1 Identity by descent in pedigrees and populations; methods for genome-wide linkage and association. UNE Short Course: Feb 14-18, 2011 Dr Elizabeth A Thompson UNE-Short Course Feb 2011 Identity by descent in pedigrees and populations Overview - 2 Timetable Monday 9:00-10:30am 1: Introduction and Overview. 11:00am-12:30pm 2: Identity by Descent; relationships and relatedness 1:30-3:00pm 3: Genetic variation and allelic association. 3:30-5:00pm 4: Allelic association and population structure. Tuesday 9:00-10:30am 5: Genetic associations for a quantitative trait 11:00am-12:30pm 6: Hidden Markov models; HMM 1:30-3:00pm 7: Haplotype blocks and the coalescent. 3:30-5:00pm 8: LD mapping via coalescent ancestry. Wednesday a.m. 9:00-10:30am 9: The EM algorithm 11:00am-12:30pm 10: MCMC and Bayesian sampling Dr Elizabeth A Thompson UNE-Short Course Feb 2011 Identity by descent in pedigrees and populations Overview - 3 Wednesday p.m. 1:30-3:00pm 11: Association mapping in structured populations 3:30-5:00pm 12: Association mapping in admixed populations Thursday 9:00-10:30am 13: Inferring ibd segments; two chromosomes. 11:00am-12:30pm 14: BEAGLE: Haplotype and ibd imputation. 1:30-3:00pm 15: ibd between two individuals. 3:30-5:00pm 16: ibd among multiple chromosomes. Friday 9:00-10:30am 17: Pedigrees in populations. 11:00am-12:30pm 18: Lod scores within and between pedigrees. 1:30-3:00pm 19: Wrap-up and questions. Bibliography Software notes and links. -
Prediction of Identity by Descent Probabilities from Marker-Haplotypes Theo Meuwissen, Mike Goddard
Prediction of identity by descent probabilities from marker-haplotypes Theo Meuwissen, Mike Goddard To cite this version: Theo Meuwissen, Mike Goddard. Prediction of identity by descent probabilities from marker-haplotypes. Genetics Selection Evolution, BioMed Central, 2001, 33 (6), pp.605-634. 10.1051/gse:2001134. hal-00894392 HAL Id: hal-00894392 https://hal.archives-ouvertes.fr/hal-00894392 Submitted on 1 Jan 2001 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Genet. Sel. Evol. 33 (2001) 605–634 605 © INRA, EDP Sciences, 2001 Original article Prediction of identity by descent probabilities from marker-haplotypes a; b Theo H.E. MEUWISSEN ∗, Mike E. GODDARD a Research Institute of Animal Science and Health, Box 65, 8200 AB Lelystad, The Netherlands b Institute of Land and Food Resources, University of Melbourne, Parkville Victorian Institute of Animal Science, Attwood, Victoria, Australia (Received 13 February 2001; accepted 11 June 2001) Abstract – The prediction of identity by descent (IBD) probabilities is essential for all methods that map quantitative trait loci (QTL). The IBD probabilities may be predicted from marker genotypes and/or pedigree information. Here, a method is presented that predicts IBD prob- abilities at a given chromosomal location given data on a haplotype of markers spanning that position. -
The Drink Tank Sixth Annual Giant Sized [email protected]: James Bacon & Chris Garcia
The Drink Tank Sixth Annual Giant Sized Annual [email protected] Editors: James Bacon & Chris Garcia A Noise from the Wind Stephen Baxter had got me through the what he’ll be doing. I first heard of Stephen Baxter from Jay night. So, this is the least Giant Giant Sized Crasdan. It was a night like any other, sitting in I remember reading Ring that next Annual of The Drink Tank, but still, I love it! a room with a mostly naked former ballerina afternoon when I should have been at class. I Dedicated to Mr. Stephen Baxter. It won’t cover who was in the middle of what was probably finished it in less than 24 hours and it was such everything, but it’s a look at Baxter’s oevre and her fifth overdose in as many months. This was a blast. I wasn’t the big fan at that moment, the effect he’s had on his readers. I want to what we were dealing with on a daily basis back though I loved the novel. I had to reread it, thank Claire Brialey, M Crasdan, Jay Crasdan, then. SaBean had been at it again, and this time, and then grabbed a copy of Anti-Ice a couple Liam Proven, James Bacon, Rick and Elsa for it was up to me and Jay to clean up the mess. of days later. Perhaps difficult times made Ring everything! I had a blast with this one! Luckily, we were practiced by this point. Bottles into an excellent escape from the moment, and of water, damp washcloths, the 9 and the first something like a month later I got into it again, 1 dialed just in case things took a turn for the and then it hit. -
Identity-By-Descent Detection Across 487,409 British Samples Reveals Fine Scale Population Structure and Ultra-Rare Variant Asso
ARTICLE https://doi.org/10.1038/s41467-020-19588-x OPEN Identity-by-descent detection across 487,409 British samples reveals fine scale population structure and ultra-rare variant associations ✉ Juba Nait Saada 1 , Georgios Kalantzis 1, Derek Shyr 2, Fergus Cooper 3, Martin Robinson 3, ✉ Alexander Gusev 4,5,7 & Pier Francesco Palamara 1,6,7 1234567890():,; Detection of Identical-By-Descent (IBD) segments provides a fundamental measure of genetic relatedness and plays a key role in a wide range of analyses. We develop FastSMC, an IBD detection algorithm that combines a fast heuristic search with accurate coalescent-based likelihood calculations. FastSMC enables biobank-scale detection and dating of IBD segments within several thousands of years in the past. We apply FastSMC to 487,409 UK Biobank samples and detect ~214 billion IBD segments transmitted by shared ancestors within the past 1500 years, obtaining a fine-grained picture of genetic relatedness in the UK. Sharing of common ancestors strongly correlates with geographic distance, enabling the use of genomic data to localize a sample’s birth coordinates with a median error of 45 km. We seek evidence of recent positive selection by identifying loci with unusually strong shared ancestry and detect 12 genome-wide significant signals. We devise an IBD-based test for association between phenotype and ultra-rare loss-of-function variation, identifying 29 association sig- nals in 7 blood-related traits. 1 Department of Statistics, University of Oxford, Oxford, UK. 2 Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA. 3 Department of Computer Science, University of Oxford, Oxford, UK. -
Variation in Meiosis, Across Genomes, and in Populations
REVIEW Identity by Descent: Variation in Meiosis, Across Genomes, and in Populations Elizabeth A. Thompson1 Department of Statistics, University of Washington, Seattle, Washington 98195-4322 ABSTRACT Gene identity by descent (IBD) is a fundamental concept that underlies genetically mediated similarities among relatives. Gene IBD is traced through ancestral meioses and is defined relative to founders of a pedigree, or to some time point or mutational origin in the coalescent of a set of extant genes in a population. The random process underlying changes in the patterns of IBD across the genome is recombination, so the natural context for defining IBD is the ancestral recombination graph (ARG), which specifies the complete ancestry of a collection of chromosomes. The ARG determines both the sequence of coalescent ancestries across the chromosome and the extant segments of DNA descending unbroken by recombination from their most recent common ancestor (MRCA). DNA segments IBD from a recent common ancestor have high probability of being of the same allelic type. Non-IBD DNA is modeled as of independent allelic type, but the population frame of reference for defining allelic independence can vary. Whether of IBD, allelic similarity, or phenotypic covariance, comparisons may be made to other genomic regions of the same gametes, or to the same genomic regions in other sets of gametes or diploid individuals. In this review, I present IBD as the framework connecting evolutionary and coalescent theory with the analysis of genetic data observed on individuals. I focus on the high variance of the processes that determine IBD, its changes across the genome, and its impact on observable data. -
Recent Progress in Coalescent Theory
SOCIEDADEBRASILEIRADEMATEMÁTICA ENSAIOS MATEMATICOS´ 200X, Volume XX, X–XX Recent Progress in Coalescent Theory Nathana¨el Berestycki Abstract. Coalescent theory is the study of random processes where particles may join each other to form clusters as time evolves. These notes provide an introduction to some aspects of the mathe- matics of coalescent processes and their applications to theoretical population genetics and other fields such as spin glass models. The emphasis is on recent work concerning in particular the connection of these processes to continuum random trees and spatial models such as coalescing random walks. arXiv:0909.3985v1 [math.PR] 22 Sep 2009 2000 Mathematics Subject Classification: 60J25, 60K35, 60J80. Introduction The probabilistic theory of coalescence, which is the primary subject of these notes, has expanded at a quick pace over the last decade or so. I can think of three factors which have essentially contributed to this growth. On the one hand, there has been a rising demand from population geneticists to develop and analyse models which incorporate more realistic features than what Kingman’s coalescent allows for. Simultaneously, the field has matured enough that a wide range of techniques from modern probability theory may be success- fully applied to these questions. These tools include for instance martingale methods, renormalization and random walk arguments, combinatorial embeddings, sample path analysis of Brownian motion and L´evy processes, and, last but not least, continuum random trees and measure-valued processes. Finally, coalescent processes arise in a natural way from spin glass models of statistical physics. The identification of the Bolthausen-Sznitman coalescent as a universal scaling limit in those models, and the connection made by Brunet and Derrida to models of population genetics, is a very exciting re- cent development.