RS7069 18 Pp033 80 Waples.Pdf (921.1Kb)

Total Page:16

File Type:pdf, Size:1020Kb

RS7069 18 Pp033 80 Waples.Pdf (921.1Kb) J. CETACEAN RES. MANAGE. 18: 33–80, 2018 33 Guidelines for genetic data analysis ROBIN S. WAPLES1, A. RUS HOELZEL2, OSCAR GAGGIOTTI3, RALPH TIEDEMANN4, PER J. PALSBØLL5, FRANK CIPRIANO6, JENNIFER JACKSON7, JOHN W. BICKHAM8 AND AIMÉE R. LANG9 Contact e-mail: [email protected] ABSTRACT The IWC Scientific Committee recently adopted guidelines for quality control of DNA data. Once data have been collected, the next step is to analyse the data and make inferences that are useful for addressing practical problems in conservation and management of cetaceans. This is a complex exercise, as numerous analyses are possible and users have a wide range of choices of software programs for implementing the analyses. This paper reviews the underlying issues, illustrates application of different types of genetic data analysis to two complex management problems (involving common minke whales and humpback whales), and concludes with a number of recommendations for best practices in the analysis of population genetic data. An extensive Appendix provides a detailed review and critique of most types of analyses that are used with population genetic data for cetaceans. KEYWORDS: ABUNDANCE ESTIMATE; BREEDING GROUNDS; CONSERVATION; DNA FINGERPRINTING; FEEDING GROUNDS; GENETICS; HUMPBACK WHALE; MIGRATION; MINKE WHALE; REPRODUCTION; TAXONOMY INTRODUCTION before the analyses considered here begin, the DNA data Recently, guidelines were adopted for quality control of quality-control guidelines have been consulted and followed DNA data intended for use within the International Whaling to the extent possible, and that any substantial deviations Commission (IWC, 2009; 2015a). Once the data have have been documented and explained. been collected, the next step is to analyse the data and make As discussed in detail later, genetic information can inferences that are useful for addressing practical problems provide insights relevant to many types of problems in the management of cetaceans. This is a complex exercise associated with conservation and management of living for two major reasons: (1) many methods can be used natural resources. Among other applications, genetic data can to analyse genetic data, and an equally wide range of be used to: computer software is available to conduct data analyses; (1) identify and delimit biological species, subspecies and and (2) a key objective is to inform those involved in populations; cetacean management who do not have a background in population genetics. For these reasons, it has been suggested (2) provide or improve estimates of census population size that it would be useful to have a document that provides (N) and effective population size (Ne); guidelines for the analysis of population genetic data for use (3) help track contemporary movements of individuals, as in a management context. Although it is not possible (nor well as estimate long-term levels of connectivity among is it desirable) to prescribe specific procedures for all populations; analyses of population genetic data, it can help to provide general guidelines for some of the more common types of (4) quantify genetic diversity within populations and provide analyses conducted in a management context. The latter is insights into past bottlenecks and population expansions; the objective of this paper. Emphasis is on a general discussion of issues involved in genetic data analysis (5) help resolve mixtures of individuals originating from rather than detailed comments about specific computer different breeding populations; and software, but some popular programs will be discussed to (6) track products through the marketplace. emphasise particular points. Given the many analytical methods (and software packages) available, to focus on However, the most widespread practical application, those most relevant to a particular study, the discussion particularly in the IWC context, is for the study of stock has been organised around some common management structure (genetic differentiation among populations). Before problems one might try to address with genetic data. discussing details of particular genetic analyses, some of the These problems are identified below. It is assumed that key issues involved with assessing stock structure are 1 NOAA Fisheries, Northwest Fisheries Science Center, Seattle WA 98112, USA. 2 Department of Biosciences, Durham University, Durham DH1 3LE, UK. 3 Scottish Oceans Institute, East Sands, University of St. Andrews, KY16 8LB, UK. 4 Institute of Biochemistry and Biology, University of Potsdam, D-14476 Potsdam, Germany. 5 Groningen Institute for Evolutionary Life Sciences, University of Groningen, The Netherlands. 6 California Academy of Sciences, San Francisco CA 94118, USA. 7 British Antarctic Survey, Cambridge CB3 0ET, UK. 8 Texas A&M University, College Station, TX 77843. 9 NOAA Fisheries, Southwest Fisheries Science Center, La Jolla CA 92037, USA. 34 WAPLES et al.: GUIDELINES FOR GENETIC DATA ANALYSIS summarised and best practice for assessing issues that may found within and among oceans (e.g. bottlenose dolphins, be complex or challenging are highlighted. Natoli et al., 2005). Common questions that arise in conservation and Two major issues related to stock structure/population management of living natural resources include the differentiation following: Is the differentiation among groups of individuals Two major issues that arise in applying genetic data to strong enough that they should be considered separate problems in stock structure are: (1) identification of threshold populations or stocks? Is any statistically significant (e.g. levels of population differentiation that require separate- P < 0.05) departure from panmixia sufficient to warrant stock management to achieve stated objectives; and (2) using recognition as separate stocks? If not, how strong must the genetic and other data to determine whether the system under differentiation be? Unfortunately, there are no generally consideration is above or below this threshold. applicable answers to these questions, since the relevant degree of population differentiation depends on the Identifying threshold levels of population differentiation conservation/management objectives, the risks associated Population differentiation occurs along a continuum (Fig. 1). with adopting different management strategies, and society’s At one extreme (completely random mating), every tolerance of the resulting consequences. Some general individual has an equal probability of mating with any other management objectives/considerations include: individual. This situation is referred to as panmixia; although this is an idealised scenario not known to exist in any natural (1) ‘Management units’ or ‘populations’ or ‘stocks’ must be population, panmixia is typically adopted as the null considered separately because of a legal mandate. In the hypothesis against which to compare alternative hypotheses US, federal laws that include this type of mandate include that involve various degrees of departure from random the Endangered Species Act, the Marine Mammal mating. The other extreme of the population-differentiation Protection Act, and the Magnuson-Stevens Fishery continuum is characterised by complete isolation among Conservation and Management Act (see Waples et al. 2008 locally panmictic groups of individuals. In nature, populations for discussion). Canada has similar provisions in its federal in general are neither completely panmictic nor completely Species at Risk Act (2002), as do the Biodiversity Law of isolated; instead, they typically are characterised by Costa Rica (1992), Australia’s Endangered Species intermediate levels of differentiation and linked by restricted Protection Act (2002), and South Africa’s National but non-zero levels of migration. In addition, the degree of Environmental Management Biodiversity Act (2004). connectivity among populations often changes over time. (2) Sustainable harvest for management stocks should be Several types of data, including genetic information, can maximised while preventing/minimising impacts on help to determine where a particular species falls on the stocks that cannot withstand harvest, because: population-differentiation continuum depicted in Fig. 1. In some cetacean species, the level of genetic differentiation (a) locally depleted stocks might take a long time to among geographic areas within major ocean basins is rebuild, and/or relatively low (toward the panmictic end of the continuum), (b) local extirpation might represent an irreversible loss whereas evidence for higher population differentiation is of biodiversity. often found among populations from different oceans (e.g. sperm whales; see Alexander et al., 2016). In other cases, Although such general considerations are useful for unexpectedly high levels of population differentiation are providing context, they are qualitative rather than Fig. 1. The continuum of population differentiation. Each circle represents a group of individuals that might or might not be a separate population or stock. Four generic scenarios, with varying degrees of connectivity (geographical overlap and/or migration), are identified along the continuum: (A) Complete independence. (B) Modest connectivity. (C) Substantial connectivity. (D) Panmixia (circles are completely congruent). Reproduced from Waples and Gaggiotti (2006). J. CETACEAN RES. MANAGE. 18: 33–80, 2018 35 quantitative and by themselves will not produce repeatable outcomes. That is, you could not provide guidance this general to independent
Recommended publications
  • Population Genetics I
    21 March, 2016: Population genetics I 529053 Evolutionary Genomics Ari Löytynoja / [email protected] Background reading This book is OK primer to pop.gen. in genomics era - focus on coalescent theory and SNP data - unfortunately it contains lots of typos 529053 Evolutionary Genomics Ari Löytynoja / [email protected] Population genetics Definition - studies distributions & changes of allele frequencies in populations over time - effects considered: - natural selection, genetic drift, mutation and gene flow - recombination, population subdivision and population structure - allows inferring past events as well as predicting future History - fundamental work by Haldane, Wright and Fisher on first half of 20th century - recent development: coalescent theory by Kingman in 1980’s - suitable for SNPs data - computationally highly efficient 529053 Evolutionary Genomics Ari Löytynoja / [email protected] Population genetics basics (1) Allele - one of alternative forms of a gene or same genetic locus - used to be visible gene product (e.g. blond vs. red hair) - now typically SNP (e.g. rs1805007(C) vs. rs1805007(T)) 529053 Evolutionary Genomics Ari Löytynoja / [email protected] Population genetics basics (1) Allele - one of alternative forms of a gene or same genetic locus - used to be visible gene product (e.g. blond vs. red hair) - now typically SNP (e.g. rs1805007(C) vs. rs1805007(T)) 529053 Evolutionary Genomics Ari Löytynoja / [email protected] Population genetics basics (1) Allele - one of alternative forms of a gene or same genetic locus - used to be visible gene product (e.g. blond vs. red hair) - now typically SNP (e.g. rs1805007(C) vs. rs1805007(T)) 529053 Evolutionary Genomics Ari Löytynoja / [email protected] Population genetics basics (1) Allele - one of alternative forms of a gene or same genetic locus - used to be visible gene product (e.g.
    [Show full text]
  • Workshop in Molecular Evolution
    Workshop in Molecular Evolution Jan 7 - 11, 2019 Shanghai Two faces of one process: phylogenetics vs. population genetics Phylogenetics – model of speciation Tine et al., 2014 Population genetics – model of coalescence Population genetics Idealised population (Fisher-Wright population) • Random mating – each copy of the gene found in the new generation is drawn independently at random from all copies of the gene in the old generation • No selection • No migration • No mutation • Large population size, no drifting British biologist and statistician Ronald Fisher • In a series of papers starting in 1918 and culminating in his 1930 book The Genetical Theory of Natural Selection • Fisher showed that the continuous variation measured by the biometricians could be produced by the combined action of many discrete genes, and that natural selection could change allele frequencies in a population, resulting in evolution. British geneticist J.B.S. Haldane • worked out the mathematics of allele frequency change at a single gene locus under a broad range of conditions. • Haldane also applied statistical analysis to real-world examples of natural selection, such as peppered moth evolution and industrial melanism The American biologist Sewall Wright • animal breeding experiments, focused on combinations of interacting genes, and the effects of inbreeding on small, relatively isolated populations that exhibited genetic drift. • In 1932 Wright introduced the concept of an adaptive landscape and argued that genetic drift and inbreeding could drive a small,
    [Show full text]
  • 6.1 Standard GP
    Eingereicht von Bogdan Burlacu Angefertigt am Institute for Formal Models and Verification Erstbeurteiler FH-Prof. PD DI Dr. Michael Aenzeller Zweitbeurteiler a.Univ.-Prof. Dr. Tracing of Evolutionary Josef Küng Search Trajectories in August 2017 Complex Hypothesis Spaces Dissertation zur Erlangung des akademischen Grades Doktor der Technischen Wissenschaen im Doktoratsstudium der Technischen Wissenschaen JOHANNES KEPLER UNIVERSITÄT LINZ Altenbergerstraße 69 4040 Linz, Österreich www.jku.at DVR 0093696 Acknowledgements First and foremost, I would like to thank Prof. Michael Affenzeller for the opportunity to pursue a PhD within the Heuristics and Evolutionary Algorithms Laboratory (HEAL) at the University of Applied Sciences Upper Austria and for the continuous guidance and friendship. The work described in this thesis would not have been possible without the funding provided by the International PhD program in Informatics Hagenberg, offered as a specialization of the computer science PhD program at the Johannes Kepler University Linz (Austria). I would also like to extend my thanks and gratitude to my colleagues from HEAL who supported me during this time with many insights, criticisms, discussions and development advice: Michael Kommenda, Gabriel Kronberger, Stephan Winkler, Andreas Beham, Stefan Vonolfen, and the whole Heuristiclab team. i Abstract Understanding the internal functioning of evolutionary algorithms is an essential require- ment for improving their performance and reliability. Increased computational resources available in current mainstream computers make it possible for new previously infeasible research directions to be explored. Therefore, a comprehensive theoretical analysis of their mechanisms and dynamics using modern tools becomes possible. Recent algorithmic achievements like offspring selection in combination with linear scaling have enabled genetic programming (GP) to achieve high quality results in system identification in less than 50 generations using populations of only several hundred individuals.
    [Show full text]
  • Evolutionary Genetics LV 25600-01 | Lecture with Exercises | 4KP
    Evolutionary Genetics LV 25600-01 | Lecture with exercises | 4KP 1 HS2019 Population Genetics Extinction vortex is caused by a positive feedback loop (Gilpin and Soule, 1986). Gilpin ME, Soulé ME (1986). "Minimum Viable Populations: Processes of Species Extinction". In M. E. Soulé. Conservation Biology: The Science of Scarcity and Diversity. Sinauer, Sunderland, Mass. pp. 19–34. !2 HS19 | UniBas | JCW Population Genetics ▷ Effective Population Size Ne “The number of breeding individuals in an idealised population that would show the same amount of dispersion of allele frequencies under random genetic drift or the same amount of inbreeding as the population under consideration.” Ronald Fisher and Sewall Wright 3 HS19 | UniBas | JCW Population Genetics ▷ Effective Population Size unequal sex-ratios chromosome (organelle) linkage number of progeny (k≠2) Ne variance in offsprings (Vk≠2) fluctuation in population size 4 HS19 | UniBas | JCW Population Genetics ▷ Non-Random Mating The inbreeding coefficient of an individual (F) is the probability that an individual has two alleles at a locus that are identical by descent. It measures the amount of inbreeding by comparing the observed frequency of heterozygotes (Ho) in the population to the frequency expected under random mating - Hardy-Weinberg (He). H F = 1− o H e In a panmixic population the observed (Ho) and the expected frequency of heterozygosity is not significantly different. H o ≈ H e → F = 0 !5 HS19 | UniBas | JCW Population Genetics ▷ Mutation In the irreversible mutation model the allele frequency decreases over time depending on the mutation rate. p : frequency of allele A after t generations t t p0 :starting frequency of allele A pt = p0 (1− µ) µ : muation rate The mutation rate in an ideal the population equals zero and the allele frequency does not change from one generation to the next.
    [Show full text]
  • Prediction and Estimation of Effective Population Size
    Heredity (2016) 117, 193–206 & 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved 0018-067X/16 www.nature.com/hdy REVIEW Prediction and estimation of effective population size J Wang1, E Santiago2 and A Caballero3 Effective population size (Ne) is a key parameter in population genetics. It has important applications in evolutionary biology, conservation genetics and plant and animal breeding, because it measures the rates of genetic drift and inbreeding and affects the efficacy of systematic evolutionary forces, such as mutation, selection and migration. We review the developments in predictive equations and estimation methodologies of effective size. In the prediction part, we focus on the equations for populations with different modes of reproduction, for populations under selection for unlinked or linked loci and for the specific applications to conservation genetics. In the estimation part, we focus on methods developed for estimating the current or recent effective size from molecular marker or sequence data. We discuss some underdeveloped areas in predicting and estimating Ne for future research. Heredity (2016) 117, 193–206; doi:10.1038/hdy.2016.43; published online 29 June 2016 INTRODUCTION which correspond to the so-called inbreeding and variance effective The concept of effective population size, introduced by Sewall Wright sizes, respectively (Crow and Kimura, 1970). (1931, 1933), is central to plant and animal breeding (Falconer and Predictions of the effective population size can also be obtained Mackay, 1996), conservation genetics (Frankham et al.,2010; from the largest nonunit eigenvalue of the transition matrix of a Allendorf et al., 2013) and molecular variation and evolution Markov Chain which describes the dynamics of allele frequencies.
    [Show full text]