Lecture 25 Population Genetics Until Now, We Have Been Carrying Out
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Allele Frequency–Based and Polymorphism-Versus
View metadata, citation and similar papers at core.ac.uk brought to you by CORE Allele Frequency–Based and Polymorphism-Versus- provided by PubMed Central Divergence Indices of Balancing Selection in a New Filtered Set of Polymorphic Genes in Plasmodium falciparum Lynette Isabella Ochola,1 Kevin K. A. Tetteh,2 Lindsay B. Stewart,2 Victor Riitho,1 Kevin Marsh,1 and David J. Conway*,2 1Kenya Medical Research Institute, Centre for Geographic Medicine Research Coast, Kilifi, Kenya 2Department of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom *Corresponding author: E-mail: [email protected], [email protected]. Associate editor: John H. McDonald Research article Abstract Signatures of balancing selection operating on specific gene loci in endemic pathogens can identify candidate targets of naturally acquired immunity. In malaria parasites, several leading vaccine candidates convincingly show such signatures when subjected to several tests of neutrality, but the discovery of new targets affected by selection to a similar extent has been slow. A small minority of all genes are under such selection, as indicated by a recent study of 26 Plasmodium falciparum merozoite- stage genes that were not previously prioritized as vaccine candidates, of which only one (locus PF10_0348) showed a strong signature. Therefore, to focus discovery efforts on genes that are polymorphic, we scanned all available shotgun genome sequence data from laboratory lines of P. falciparum and chose six loci with more than five single nucleotide polymorphisms per kilobase (including PF10_0348) for in-depth frequency–based analyses in a Kenyan population (allele sample sizes .50 for each locus) and comparison of Hudson–Kreitman–Aguade (HKA) ratios of population diversity (p) to interspecific divergence (K) from the chimpanzee parasite Plasmodium reichenowi. -
Microevolution and the Genetics of Populations Microevolution Refers to Varieties Within a Given Type
Chapter 8: Evolution Lesson 8.3: Microevolution and the Genetics of Populations Microevolution refers to varieties within a given type. Change happens within a group, but the descendant is clearly of the same type as the ancestor. This might better be called variation, or adaptation, but the changes are "horizontal" in effect, not "vertical." Such changes might be accomplished by "natural selection," in which a trait within the present variety is selected as the best for a given set of conditions, or accomplished by "artificial selection," such as when dog breeders produce a new breed of dog. Lesson Objectives ● Distinguish what is microevolution and how it affects changes in populations. ● Define gene pool, and explain how to calculate allele frequencies. ● State the Hardy-Weinberg theorem ● Identify the five forces of evolution. Vocabulary ● adaptive radiation ● gene pool ● migration ● allele frequency ● genetic drift ● mutation ● artificial selection ● Hardy-Weinberg theorem ● natural selection ● directional selection ● macroevolution ● population genetics ● disruptive selection ● microevolution ● stabilizing selection ● gene flow Introduction Darwin knew that heritable variations are needed for evolution to occur. However, he knew nothing about Mendel’s laws of genetics. Mendel’s laws were rediscovered in the early 1900s. Only then could scientists fully understand the process of evolution. Microevolution is how individual traits within a population change over time. In order for a population to change, some things must be assumed to be true. In other words, there must be some sort of process happening that causes microevolution. The five ways alleles within a population change over time are natural selection, migration (gene flow), mating, mutations, or genetic drift. -
Intro Forensic Stats
Popstats Unplugged 14th International Symposium on Human Identification John V. Planz, Ph.D. UNT Health Science Center at Fort Worth Forensic Statistics From the ground up… Why so much attention to statistics? Exclusions don’t require numbers Matches do require statistics Problem of verbal expression of numbers Transfer evidence Laboratory result 1. Non-match - exclusion 2. Inconclusive- no decision 3. Match - estimate frequency Statistical Analysis Focus on the question being asked… About “Q” sample “K” matches “Q” Who else could match “Q" partial profile, mixtures Match – estimate frequency of: Match to forensic evidence NOT suspect DNA profile Who is in suspect population? So, what are we really after? Quantitative statement that expresses the rarity of the DNA profile Estimate genotype frequency 1. Frequency at each locus Hardy-Weinberg Equilibrium 2. Frequency across all loci Linkage Equilibrium Terminology Genetic marker variant = allele DNA profile = genotype Database = table that provides frequency of alleles in a population Population = some assemblage of individuals based on some criteria for inclusion Where Do We Get These Numbers? 1 in 1,000,000 1 in 110,000,000 POPULATION DATA and Statistics DNA databases are needed for placing statistical weight on DNA profiles vWA data (N=129) 14 15 16 17 18 19 20 freq 14 9 75 15 3 0 6 16 19 1 1 46 17 23 1 14 9 72 18 6 0 3 10 4 31 19 6 1 7 3 2 2 23 20 0 0 0 3 2 0 0 5 258 Because data are not available for every genotype possible, We use allele frequencies instead of genotype frequencies to estimate rarity. -
Lecture 9: Population Genetics
Lecture 9: Population Genetics Plan of the lecture I. Population Genetics: definitions II. Hardy-Weinberg Law. III. Factors affecting gene frequency in a population. Small populations and founder effect. IV. Rare Alleles and Eugenics The goal of this lecture is to make students familiar with basic models of population genetics and to acquaint students with empirical tests of these models. It will discuss the primary forces and processes involved in shaping genetic variation in natural populations (mutation, drift, selection, migration, recombination, mating patterns, population size and population subdivision). I. Population genetics: definitions Population – group of interbreeding individuals of the same species that are occupying a given area at a given time. Population genetics is the study of the allele frequency distribution and change under the influence of the 4 evolutionary forces: natural selection, mutation, migration (gene flow), and genetic drift. Population genetics is concerned with gene and genotype frequencies, the factors that tend to keep them constant, and the factors that tend to change them in populations. All the genes at all loci in every member of an interbreeding population form gene pool. Each gene in the genetic pool is present in two (or more) forms – alleles. Individuals of a population have same number and kinds of genes (except sex genes) and they have different combinations of alleles (phenotypic variation). The applications of Mendelian genetics, chromosomal abnormalities, and multifactorial inheritance to medical practice are quite evident. Physicians work mostly with patients and families. However, as important as they may be, genes affect populations, and in the long run their effects in populations have a far more important impact on medicine than the relatively few families each physician may serve. -
Hemizygous Or Haploid Sex Diploid Sex
9781405132770_4_002.qxd 1/19/09 2:22 PM Page 16 Table 2.1 Punnett square to predict genotype frequencies for loci on sex chromosomes and for all loci in males and females of haplo-diploid species. Notation in this table is based on birds where the sex chromosomes are Z and W (ZZ males and ZW females) with a diallelic locus on the Z chromosome possessing alleles A and a at frequencies p and q, respectively. In general, genotype frequencies in the homogametic or diploid sex are identical to Hardy–Weinberg expectations for autosomes, whereas genotype frequencies are equal to allele frequencies in the heterogametic or haploid sex. Hemizygous or haploid sex Diploid sex Genotype Gamete Frequency Genotype Gamete Frequency ZW Z-A p ZZ Z-A p Z-a q Z-a q W Expected genotype frequencies under random mating Homogametic sex Z-A Z-A p2 Z-A Z-a 2pq Z-a Z-a q2 Heterogametic sex Z-A W p Z-a W q ·· 9781405132770_4_002.qxd 1/19/09 2:22 PM Page 19 Table 2.2 Example DNA profile for three simple tandem repeat (STR) loci commonly used in human forensic cases. Locus names refer to the human chromosome (e.g. D3 means third chromosome) and chromosome region where the SRT locus is found. Locus D3S1358 D21S11 D18S51 Genotype 17, 18 29, 30 18, 18 ·· 9781405132770_4_002.qxd 1/19/092:22PMPage20 Table 2.3 Allele frequencies for nine STR loci commonly used in forensic cases estimated from 196 US Caucasians sampled randomly with respect to geographic location. -
Popgen7: Genetic Drift
PopGen7: Genetic Drift Sampling error Before taking on the notion of genetic drift in populations, let’s first take a look at sampling variation. Let’s consider the age-old coin tossing experiment. Assume a fair coin with p = ½. If you sample many times the most likely single outcome = ½ heads. The overall most likely outcome ≠ ½ heads. This is a binomial sampling problem. ⎛n⎞ k n−k P = ⎜ ⎟()()1/ 2 1/ 2 ⎝k ⎠ ⎛n⎞ n! ⎜ ⎟ = ⎝k ⎠ k!()n − k ! n is the number of flips k is the number of successes Let’s look at the probability of the following: k heads from n flips Probability k =5 from n = 10 0.246 k =6 from n = 10 0.205 So, the most likely single outcome is ½ heads (with p = 0.246), the overall likelihood of observing something other than ½ heads is higher (p = 1 – 0.246 = 0.754) The good news is that as we increase the sample size the likelihood of observing something very close to the expected frequency, E(p) = 0.5, goes up. The probability of a given frequency of heads from n flips of the coin is: N flips p <0.35 p = 0.35-0.45 p = 0.45-0.55 p = 0.55-0.65 p <0.65 variance 10 0.16 0.21 0.25 0.21 0.16 0.025 20 0.06 0.19 0.50 0.19 0.06 0.0125 50 0.002 0.16 0.68 0.16 0.002 0.005 It is clear that sample size N is important. -
Allele Frequency Difference AFD–An Intuitive Alternative to FST for Quantifying Genetic Population Differentiation
G C A T T A C G G C A T genes Opinion Allele Frequency Difference AFD–An Intuitive Alternative to FST for Quantifying Genetic Population Differentiation Daniel Berner Department of Environmental Sciences, Zoology, University of Basel, Vesalgasse 1, CH-4051 Basel, Switzerland; [email protected]; Tel.: +41-(0)-61-207-03-28 Received: 21 February 2019; Accepted: 12 April 2019; Published: 18 April 2019 Abstract: Measuring the magnitude of differentiation between populations based on genetic markers is commonplace in ecology, evolution, and conservation biology. The predominant differentiation metric used for this purpose is FST. Based on a qualitative survey, numerical analyses, simulations, and empirical data, I here argue that FST does not express the relationship to allele frequency differentiation between populations generally considered interpretable and desirable by researchers. In particular, FST (1) has low sensitivity when population differentiation is weak, (2) is contingent on the minor allele frequency across the populations, (3) can be strongly affected by asymmetry in sample sizes, and (4) can differ greatly among the available estimators. Together, these features can complicate pattern recognition and interpretation in population genetic and genomic analysis, as illustrated by empirical examples, and overall compromise the comparability of population differentiation among markers and study systems. I argue that a simple differentiation metric displaying intuitive properties, the absolute allele frequency difference AFD, provides a valuable alternative to FST. I provide a general definition of AFD applicable to both bi- and multi-allelic markers and conclude by making recommendations on the sample sizes needed to achieve robust differentiation estimates using AFD. -
The Allele Frequency Spectrum in Genome-Wide Human Variation Data Reveals Signals of Differential Demographic History in Three Large World Populations
Copyright 2004 by the Genetics Society of America The Allele Frequency Spectrum in Genome-Wide Human Variation Data Reveals Signals of Differential Demographic History in Three Large World Populations Gabor T. Marth,1 Eva Czabarka, Janos Murvai and Stephen T. Sherry National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894 Manuscript received April 15, 2003 Accepted for publication September 4, 2003 ABSTRACT We have studied a genome-wide set of single-nucleotide polymorphism (SNP) allele frequency measures for African-American, East Asian, and European-American samples. For this analysis we derived a simple, closed mathematical formulation for the spectrum of expected allele frequencies when the sampled populations have experienced nonstationary demographic histories. The direct calculation generates the spectrum orders of magnitude faster than coalescent simulations do and allows us to generate spectra for a large number of alternative histories on a multidimensional parameter grid. Model-fitting experiments using this grid reveal significant population-specific differences among the demographic histories that best describe the observed allele frequency spectra. European and Asian spectra show a bottleneck-shaped history: a reduction of effective population size in the past followed by a recent phase of size recovery. In contrast, the African-American spectrum shows a history of moderate but uninterrupted population expansion. These differences are expected to have profound consequences for the design of medical association studies. The analytical methods developed for this study, i.e., a closed mathematical formulation for the allele frequency spectrum, correcting the ascertainment bias introduced by shallow SNP sampling, and dealing with variable sample sizes provide a general framework for the analysis of public variation data. -
Guide to Interpreting Genomic Reports: a Genomics Toolkit
Guide to Interpreting Genomic Reports: A Genomics Toolkit A guide to genomic test results for non-genetics providers Created by the Practitioner Education Working Group of the Clinical Sequencing Exploratory Research (CSER) Consortium Genomic Report Toolkit Authors Kelly East, MS, CGC, Wendy Chung MD, PhD, Kate Foreman, MS, CGC, Mari Gilmore, MS, CGC, Michele Gornick, PhD, Lucia Hindorff, PhD, Tia Kauffman, MPH, Donna Messersmith , PhD, Cindy Prows, MSN, APRN, CNS, Elena Stoffel, MD, Joon-Ho Yu, MPh, PhD and Sharon Plon, MD, PhD About this resource This resource was created by a team of genomic testing experts. It is designed to help non-geneticist healthcare providers to understand genomic medicine and genome sequencing. The CSER Consortium1 is an NIH-funded group exploring genomic testing in clinical settings. Acknowledgements This work was conducted as part of the Clinical Sequencing Exploratory Research (CSER) Consortium, grants U01 HG006485, U01 HG006485, U01 HG006546, U01 HG006492, UM1 HG007301, UM1 HG007292, UM1 HG006508, U01 HG006487, U01 HG006507, R01 HG006618, and U01 HG007307. Special thanks to Alexandria Wyatt and Hugo O’Campo for graphic design and layout, Jill Pope for technical editing, and the entire CSER Practitioner Education Working Group for their time, energy, and support in developing this resource. Contents 1 Introduction and Overview ................................................................ 3 2 Diagnostic Results Related to Patient Symptoms: Pathogenic and Likely Pathogenic Variants . 8 3 Uncertain Results -
G14TBS Part II: Population Genetics
G14TBS Part II: Population Genetics Dr Richard Wilkinson Room C26, Mathematical Sciences Building Please email corrections and suggestions to [email protected] Spring 2015 2 Preliminaries These notes form part of the lecture notes for the mod- ule G14TBS. This section is on Population Genetics, and contains 14 lectures worth of material. During this section will be doing some problem-based learning, where the em- phasis is on you to work with your classmates to generate your own notes. I will be on hand at all times to answer questions and guide the discussions. Reading List: There are several good introductory books on population genetics, although I found their level of math- ematical sophistication either too low or too high for the purposes of this module. The following are the sources I used to put together these lecture notes: • Gillespie, J. H., Population genetics: a concise guide. John Hopkins University Press, Baltimore and Lon- don, 2004. • Hartl, D. L., A primer of population genetics, 2nd edition. Sinauer Associates, Inc., Publishers, 1988. • Ewens, W. J., Mathematical population genetics: (I) Theoretical Introduction. Springer, 2000. • Holsinger, K. E., Lecture notes in population genet- ics. University of Connecticut, 2001-2010. Available online at http://darwin.eeb.uconn.edu/eeb348/lecturenotes/notes.html. • Tavar´eS., Ancestral inference in population genetics. In: Lectures on Probability Theory and Statistics. Ecole d'Et´esde Probabilit´ede Saint-Flour XXXI { 2001. (Ed. Picard J.) Lecture Notes in Mathematics, Springer Verlag, 1837, 1-188, 2004. Available online at http://www.cmb.usc.edu/people/stavare/STpapers-pdf/T04.pdf Gillespie, Hartl and Holsinger are written for non-mathematicians and are easiest to follow. -
Balancing Selection on Allorecognition Genes in the Colonial Ascidian Botryllus Schlosseri
Developmental and Comparative Immunology 69 (2017) 60e74 Contents lists available at ScienceDirect Developmental and Comparative Immunology journal homepage: www.elsevier.com/locate/dci Balancing selection on allorecognition genes in the colonial ascidian Botryllus schlosseri * Marie L. Nydam a, , Emily E. Stephenson a, b, Claire E. Waldman a, Anthony W. De Tomaso c a Division of Science and Mathematics, Centre College, 600 W. Walnut Street, Danville, KY 40422, United States b Centre for Infectious Disease Research, P.O. Box 34681, Lusaka, 10101, Zambia c Department of Molecular, Cellular, and Developmental Biology, University of California Santa Barbara, Santa Barbara, CA 93106, United States article info abstract Article history: Allorecognition is the capability of an organism to recognize its own or related tissues. The colonial Received 24 November 2016 ascidian Botryllus schlosseri, which comprises five genetically distinct and divergent species (Clades A-E), Received in revised form contains two adjacent genes that control allorecognition: fuhcsec and fuhctm. These genes have been 22 December 2016 characterized extensively in Clade A and are highly polymorphic. Using alleles from 10 populations Accepted 22 December 2016 across the range of Clade A, we investigated the type and strength of selection maintaining this variation. Available online 24 December 2016 Both fuhc genes exhibit higher within-population variation and lower population differentiation mea- sures (FST) than neutral loci. The fuhc genes contain a substantial number of codons with >95% posterior Keywords: > sec tm Botryllus schlosseri probability of dN/dS 1. fuhc and fuhc also have polymorphisms shared between Clade A and Clade E Allorecognition that were present prior to speciation (trans-species polymorphisms). -
A Fundamental Relationship Between Genotype Frequencies and Fitnesses
Copyright Ó 2008 by the Genetics Society of America DOI: 10.1534/genetics.108.093518 A Fundamental Relationship Between Genotype Frequencies and Fitnesses Joseph Lachance1 Graduate Program in Genetics, Department of Ecology and Evolution, State University of New York, Stony Brook, New York 11794-5222 Manuscript received July 3, 2008 Accepted for publication August 7, 2008 ABSTRACT The set of possible postselection genotype frequencies in an infinite, randomly mating population is found. Geometric mean heterozygote frequency divided by geometric mean homozygote frequency equals two times the geometric mean heterozygote fitness divided by geometric mean homozygote fitness. The ratio of genotype frequencies provides a measure of genetic variation that is independent of allele frequencies. When this ratio does not equal two, either selection or population structure is present. Within-population HapMap data show population-specific patterns, while pooled data show an excess of homozygotes. HAT patterns of genetic variation are possible within the set of possible postselection genotype frequencies is W a population, and how does natural selection affect derived. Much like how the Hardy–Weinberg principle these patterns? R. A. Fisher remarked ‘‘it is often conve- describes population genetic states in the absence of nient to consider a natural population not so much as an selection, this novel equation describes population genetic aggregate of living individuals but as an aggregate of gene states in the presence of selection. In the context of ratios’’ (Fisher 1953, p. 515). This mathematical abstrac- genotype-frequency space, this is a multidimensional tion allows key questions in evolutionary genetics to be surface, the curvature of which is influenced by natural addressed.