Glossary for Ancient DNA and Human Evolution DNA STRUCTURE & ORGANIZATION INFORMATION ENCODED in DNA Allele: Alternative Variant Forms at the Same Locus

Total Page:16

File Type:pdf, Size:1020Kb

Glossary for Ancient DNA and Human Evolution DNA STRUCTURE & ORGANIZATION INFORMATION ENCODED in DNA Allele: Alternative Variant Forms at the Same Locus Glossary for Ancient DNA and Human Evolution DNA STRUCTURE & ORGANIZATION INFORMATION ENCODED IN DNA Allele: Alternative variant forms at the same locus. Codon: A sequence of three nucleotides along a DNA or Heterozygotes: Have two different alleles at a locus. RNA chain encoding a single amino acid. Have two identical alleles at a locus. Homozygotes: CpG site: Locus where a cytosine nucleotide is followed Chromatin: DNA wrapped around histone proteins. by a guanine nucleotide in the linear sequence of bases. Euchromatin: Open, allowing information to be read. Cytosines in CpG dinucleotides can be methylated to Heterochromatin: Tightly wrapped and inactive. form 5-methyl cytosine, a common epigentic mark. Chromosomes: Discrete strands of packaged DNA. Enhancer: Short region of DNA that can be bound by Allosomes: Chromosomes that determine sex (XY, proteins to alter transcription of a gene. with Y-Chromosome inherited paternally). Epigenetic: Information not encoded directly in DNA. Autosomes: All other non-allosomal chromosomes. Do not differ between the sexes. Epigenome: Molecular modifications of the DNA and its Mitochondrial DNA (mtDNA): Maternally inherited associated histone proteins, affecting its function. DNA found only in the mitochondria. Functional DNA: Encodes biological information. DNA: The molecule of inheritance, consisting of ~2% of all DNA: Codes for proteins. sequences of the four nucleotide building blocks (ATGC). ~80% of all DNA: Regulates gene activity. Sequence: The linear order of the building blocks, Gene: DNA whose information encodes a function. which encodes individual form and function. Post-translation Modifications: Alter mature protein. Genome: All DNA in a cell. Also refers to the DNA Transcription: DNA sequence converted into RNA. sequence that typifies an individual or species. Translation: mRNA converted into a protein sequence. Genetics: The study of genes and their inheritance. Gene Regulation: Alterations of gene expression/activity. Genomics: The study of genome structure/function. lncRNA: Long non-coding RNA. Haplotype: A set of alleles at distinct positions in the miRNA: Short non-coding regulatory microRNA. genome which are inherited together. RNA Binding Proteins (RBP): Proteins that bind RNA. Individuals in Haplogroups share a given haplotype. Transcription Factor Proteins: Alter gene expression by binding directly or indirectly to DNA. Histones: Chief protein components of chromatin and can be chemically modified as part of epigenetics. Genotype: The two alleles at one or more diploid loci. Karyotype: Chromosome number in the cell nucleus. Mutation: Change of a DNA sequence. Diploid: Two sets of paired chromosomes. Indels: Insertions or deletions of DNA sequence. Haploid: One set of unpaired chromosomes. Single Nucleotide Polymorphisms (SNPs): Single nucleotide differences (e.g. A vs. T). A unique physical position on a Locus (pl. Loci): Silent Mutations: No change to the phenotype. chromosome. Synonymous/Non-synonymous Mutations: No Exons: Sequences at a locus that encode proteins. change to the protein; changes to protein, respectively. Introns: Sequences between exons, don’t encode proteins. Phenotype: Observable traits of an organism (result from interactions between genes and environment). Variant: DNA that differs among groups studied. Promotor: Region of DNA that initiates transcription of a Recombination: Exchanges between chromosomes particular gene. that causes independent inheritance of alleles. Linkage Disequilibrium: Non-random inheritance of Transposable Elements (TE): Sequences that replicate alleles at different loci (due to low recombination). in a genome by inserting copies of themselves at other loci (a type of “molecular parasite”). DNA ORGANIZATION AND CHROMOSOME STRUCTURE Modified from www.wikipedia.org DNA DNA wound around histone protiens to from chromatin Chromatin packaged in tight coils to form chromosome 2 nm 11 nm 30 nm 300 nm 700 nm 1,400 nm METHODS OF GENOME ANALYSIS Genotyping: Characterizing genetic variants at one or more loci. Alignment: Arranging related sequences by position. CRISPR: A method that can mutate a specified locus. Cloning: Making a copy of an organism or sequence. Organisms are cloned by moving an entire genome PCR: A method of copying a specified locus. from a cell into an egg. DNA sequences are cloned by Sequencing: Reading the order of nucleotides in DNA. moving copies into a bacteria using a vector. Coverage: The number of reads for a given locus. Genome Wide Association Study (GWAS): An Shotgun: Sequencing cuts the genome into short approach for “gene mapping” in which hundreds of chunks that are read and reassembled by a computer. thousands of SNPs are tested statistically for genetic Vector: DNA molecule used to direct the replication of a associations with a phenotype. cloned DNA fragment (“insert”) in a host cell. EVOLUTION OTHER TERMS Adaptation: Evolution of a phenotype by selection Admixture: Breeding between isolated populations. because it improved reproduction and/or survival. Archaic Admixture: DNA from ancient, divergent, and Coalescence: Time since common ancestor. now extinct populations found in current people. Coalescent Theory: Models evolution backward in time to infer historical population size, mutation rate, Atapuerca: An archaeological site in Spain with fossils allele age, and allele frequency change by selection and stone tools of the earliest known hominins in and drift. Western Europe. Divergence: Change in genetic content or phenotype Denisovans: A population of extinct hominins between isolated populations or species. contemporary with Neandertals. Our knowledge of Denisovan morphology is based on Effective Population Size (Ne): The size of an idealized two small fossils (a finger bone and a molar) found in population (random mating, no selection, mutation or the Eurasian Steppe. migration) with the same rate of genetic drift as the study population. Dental Calculus: Calcified dental plaque, provides information on diet, disease, health, microbiome and Genetic Drift: Loss of alleles by chance. protects the genetic information within the tooth from Homology: Similarity in DNA or phenotype because of degradation. shared evolutionary history from a common ancestor. Homo: The genus that comprises the species Homo Homoplasy: Similarity in DNA sequence or phenotype sapiens, which includes modern humans, as well as that has evolved independently. several extinct species classified as ancestral to or closely related to modern humans. Phylogeny: Historical relationships of species or loci. An extinct species of hominin with fossil Polymorphism: An allelic difference observed in more Homo erectus: than 1% of the population studied. evidence dating from 1.9 million (possibly earlier) to 70 thousand years ago and found from Africa to Indonesia. Allele Frequency: The proportion of all alleles within a population that are a particular type. May have been the first hominin to leave Africa. H. erectus DNA may be retrievable from other species Derived Alleles: Variants arising since last common ancestor. due to archaic admixture. Fixed Alleles: Replaced all other alleles in a Introgression: Transfer of alleles between species. population. Middle Pleistocene: A period of geological time Population: A defined group of similar individuals. (781-126,000 years ago). An important time for the Demography: Study of population size over time. diversification of hominins, including the emergence of Gene Flow: Movement of alleles between populations. Neandertals and Homo sapiens. Selection: Allele frequency change over time caused by Morphology: Shape or form (outward appearance) of the different replication rate of specific alleles. an organism. Species: A population whose individuals can mate Neandertals: An extinct Eurasian hominin species. with one another to produce viable and fertile offspring. Neandertals existed from over 500,000 to 30,000 (debated definition) years ago, and hybridized with ancient humans. This ancient DNA glossary is the product of the Anthropogeny Graduate Specialization students, CARTA faculty and staff, and Stevan Springer at UC San Diego..
Recommended publications
  • Association Between Haplotypes and Specific Mutations in Swiss Cystic Fibrosis Families
    003 I -3'19Xj9 1/3004-0304903.00/0 PEDIATRIC RESEARCH Vol. 30. No. 4, 199 1 Copyright (5 1991 International Pediatric Research Foundation. Inc. h.i~~rc,d~n U..S .,I Association between Haplotypes and Specific Mutations in Swiss Cystic Fibrosis Families SABINA LIECHTI-GALLATI. NASEEM MALIK, MUALLA ALKAN, MARC0 MAECHLER, MICHAEL MORRIS, FRANCINE THONNEY, FELIX SENNHAUSER. AND HANS MOSER Mrclic.al Gcnctics Unii, Dcy~ur/mo~to/'Pedinfrics (In.vel.~pitol). Universirv of Bun. 3010 Bern. S+t.itzerlancl [S.L.-G.. H. \I./: Dc,[)a~.~n~e/i/c!f Gc~ndics, Urli~~rr:cilj~ C'lli/dr.c,n'.c Ho.sl)ital. 4058 Rct.scl. S~.t'il~er.lr~tlrl/N.iC/., M.A I, 117slitrrcc~of iLlcelicol G'enclic.~,U~ri~~er.eitj~ c?fZuric/z, 8001 Zz~ricIi,Swi~z~rland /.IJLI.Mu.]; 11i.cfil~rle of Medicrcl Genetics, Uni1~wvi11,of Geticvu, Centre M6dical Uniro-situire, I21 1 Gencva 4. S~t~irzerluncl/Mi.Mo.J; Mediccrl Gcnrtic.~L'/li/. Uni~~c~r.sit~~of Lu~l.sc~nne, C'c~nrrc Hospitnlier U~iil~c~~sitaircI.i~ucloi.c.. 101 1 Larl.trrtlne. S~~~itserlanci [F T.];NIICI C/II/C/~CII'SHo.el~;tal. SI. Gallen, S~t.itzerlu~lc//F.S.] ABSTRACT. Cystic fibrosis (CF) is the most common CF is the most common severe autosomal recessive genetic severe autosomal recessive genetic disorder in Caucasian disorder in Caucasian populations. with an incidence of about 1 populations, with an incidence of about 1 in 2000 live in 2000 live births, implying a carrier frequency of about 1 in births, implying a carrier frequency of about 1 in 22.
    [Show full text]
  • Y-Chromosome Short Tandem Repeat, Typing Technology, Locus Information and Allele Frequency in Different Population: a Review
    Vol. 14(27), pp. 2175-2178, 8 July, 2015 DOI:10.5897/AJB2015.14457 Article Number: 2E48C3F54052 ISSN 1684-5315 African Journal of Biotechnology Copyright © 2015 Author(s) retain the copyright of this article http://www.academicjournals.org/AJB Review Y-Chromosome short tandem repeat, typing technology, locus information and allele frequency in different population: A review Muhanned Abdulhasan Kareem1, Ameera Omran Hussein2 and Imad Hadi Hameed2* 1Babylon University, Centre of Environmental Research, Hilla City, Iraq. 2Department of Molecular Biology, Babylon University, Hilla City, Iraq. Received 29 January, 2015; Accepted 29 June, 2015 Chromosome Y microsatellites seem to be ideal markers to delineate differences between human populations. They are transmitted in uniparental and they are very sensitive for genetic drift. This review will highlight the importance of the Y- Chromosome as a tool for tracing human evolution and describes some details of Y-chromosomal short tandem repeat (STR) analysis. Among them are: microsatellites, amplification using polymerase chain reaction (PCR) of STRs, separation and detection and advantages of X-chromosomal microsatellites. Key words: Forensic, population, review, STR, Y- chromosome. INTRODUCTION Microsatellites are DNA regions with repeat units that are microsatellite, but repeats of five (penta-) or six (hexa-) 2 to 7 bp in length or most generally short tandem nucleotides are usually classified as microsatellites as repeats (STRs) or simple sequence repeats (SSRs) well. DNA can be used to study human evolution. (Ellegren, 2000; Imad et al., 2014). The classification of Besides, information from DNA typing is important for the DNA sequences is determined by the length of the medico-legal matters with polymorphisms leading to more core repeat unit and the number of adjacent repeat units.
    [Show full text]
  • Combinatorial Genomic Data Refute the Human Chromosome 2 Evolutionary Fusion and Build a Model of Functional Design for Interstitial Telomeric Repeats
    The Proceedings of the International Conference on Creationism Volume 8 Print Reference: Pages 222-228 Article 32 2018 Combinatorial Genomic Data Refute the Human Chromosome 2 Evolutionary Fusion and Build a Model of Functional Design for Interstitial Telomeric Repeats Jeffrey P. Tomkins Institute for Creation Research Follow this and additional works at: https://digitalcommons.cedarville.edu/icc_proceedings Part of the Biology Commons, and the Genomics Commons DigitalCommons@Cedarville provides a publication platform for fully open access journals, which means that all articles are available on the Internet to all users immediately upon publication. However, the opinions and sentiments expressed by the authors of articles published in our journals do not necessarily indicate the endorsement or reflect the views of DigitalCommons@Cedarville, the Centennial Library, or Cedarville University and its employees. The authors are solely responsible for the content of their work. Please address questions to [email protected]. Browse the contents of this volume of The Proceedings of the International Conference on Creationism. Recommended Citation Tomkins, J.P. 2018. Combinatorial genomic data refute the human chromosome 2 evolutionary fusion and build a model of functional design for interstitial telomeric repeats. In Proceedings of the Eighth International Conference on Creationism, ed. J.H. Whitmore, pp. 222–228. Pittsburgh, Pennsylvania: Creation Science Fellowship. Tomkins, J.P. 2018. Combinatorial genomic data refute the human chromosome 2 evolutionary fusion and build a model of functional design for interstitial telomeric repeats. In Proceedings of the Eighth International Conference on Creationism, ed. J.H. Whitmore, pp. 222–228. Pittsburgh, Pennsylvania: Creation Science Fellowship. COMBINATORIAL GENOMIC DATA REFUTE THE HUMAN CHROMOSOME 2 EVOLUTIONARY FUSION AND BUILD A MODEL OF FUNCTIONAL DESIGN FOR INTERSTITIAL TELOMERIC REPEATS Jeffrey P.
    [Show full text]
  • A Clone-Array Pooled Shotgun Strategy for Sequencing Large Genomes
    Downloaded from genome.cshlp.org on September 24, 2021 - Published by Cold Spring Harbor Laboratory Press Perspective A Clone-Array Pooled Shotgun Strategy for Sequencing Large Genomes Wei-Wen Cai,1,2 Rui Chen,1,2 Richard A. Gibbs,1,2,5 and Allan Bradley1,3,4 1Department of Molecular and Human Genetics, 2Human Genome Sequencing Center, and 3Howard Hughes Medical Institute, Baylor College of Medicine, Houston, Texas 77030, USA A simplified strategy for sequencing large genomes is proposed. Clone-Array Pooled Shotgun Sequencing (CAPSS) is based on pooling rows and columns of arrayed genomic clones,for shotgun library construction. Random sequences are accumulated,and the data are processed by sequential comparison of rows and columns to assemble the sequence of clones at points of intersection. Compared with either a clone-by-clone approach or whole-genome shotgun sequencing,CAPSS requires relatively few library constructions and only minimal computational power for a complete genome assembly. The strategy is suitable for sequencing large genomes for which there are no sequence-ready maps,but for which relatively high resolution STS maps and highly redundant BAC libraries are available. It is immediately applicable to the sequencing of mouse,rat,zebrafish, and other important genomes,and can be managed in a cooperative fashion to take advantage of a distributed international DNA sequencing capacity. Advances in DNA sequencing technology in recent years have Drosophila genome, and the computational requirements to greatly increased the throughput and reduced the cost of ge- perform the necessary pairwise comparisons increase approxi- nome sequencing. Sequencing of a complex genome the size mately as a square of the size of the genome (see Appendix).
    [Show full text]
  • GENOME GENERATION Glossary
    GENOME GENERATION Glossary Chromosome An organism’s DNA is packaged into chromosomes. Humans have 23 pairs of chromosomesincluding one pair of sex chromosomes. Women have two X chromosomes and men have one X and one Y chromosome. Dominant (see also recessive) Genes come in pairs. A dominant form of a gene is the “stronger” version that will be expressed. Therefore if someone has one dominant and one recessive form of a gene, only the characteristics of the dominant form will appear. DNA DNA is the long molecule that contains the genetic instructions for nearly all living things. Two strands of DNA are twisted together into a double helix. The DNA code is made up of four chemical letters (A, C, G and T) which are commonly referred to as bases or nucleotides. Gene A gene is a section of DNA that is the code for a specific biological component, usually a protein. Each gene may have several alternative forms. Each of us has two copies of most of our genes, one copy inherited from each parent. Most of our traits are the result of the combined effects of a number of different genes. Very few traits are the result of just one gene. Genetic sequence The precise order of letters (bases) in a section of DNA. Genome A genome is the complete DNA instructions for an organism. The human genome contains 3 billion DNA letters and approximately 23,000 genes. Genomics Genomics is the study of genomes. This includes not only the DNA sequence itself, but also an understanding of the function and regulation of genes both individually and in combination.
    [Show full text]
  • Probabilities and Statistics in Shotgun Sequencing Shotgun Sequencing
    Probabilities and Statistics in Shotgun Sequencing Shotgun Sequencing. Before any analysis of a DNA sequence can take place it is first necessary to determine the actual sequence itself, at least as accurately as is reasonably possible. Unfortunately, technical considerations make it impossible to sequence very long pieces of DNA all at once. Current sequencing technologies allow accurate reading of no more than 500 to 800bp of contiguous DNA sequence. This means that the sequence of an entire genome must be assembled from collections of comparatively short subsequences. This process is called DNA sequence “assembly ”. One approach of sequence assembly is to produce the sequence of a DNA segment (called as a “contig”, or perhaps a genome) from a large number of randomly chosen sequence reads (many overlapping small pieces, each on the order of 500-800 bases). One difficulty of this process is that the locations of the fragments within the genome and with respect to each other are not generally known. However, if enough fragments are sequenced so that there will be many overlaps between them, the fragments can be matched up and assembled. This method is called “ shotgun sequencing .” Shotgun sequencing approaches, including the whole-genome shotgun approach, are currently a central part of all genome-sequencing efforts. These methods require a high level of automation in sample preparation and analysis and are heavily reliant on the power of modern computers. There is an interplay between substrates to be sequenced (genomes and their representation in clone libraries), the analytical tools for generating a DNA sequence, the sequencing strategies, and the computational methods.
    [Show full text]
  • The Role of Haplotypes in Candidate Gene Studies
    Genetic Epidemiology 27: 321–333 (2004) The Role of Haplotypes in Candidate Gene Studies Andrew G. Clarkn Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York Human geneticists working on systems for which it is possible to make a strong case for a set of candidate genes face the problem of whether it is necessary to consider the variation in those genes as phased haplotypes, or whether the one-SNP- at-a-time approach might perform as well. There are three reasons why the phased haplotype route should be an improvement. First, the protein products of the candidate genes occur in polypeptide chains whose folding and other properties may depend on particular combinations of amino acids. Second, population genetic principles show us that variation in populations is inherently structured into haplotypes. Third, the statistical power of association tests with phased data is likely to be improved because of the reduction in dimension. However, in reality it takes a great deal of extra work to obtain valid haplotype phase information, and inferred phase information may simply compound the errors. In addition, if the causal connection between SNPs and a phenotype is truly driven by just a single SNP, then the haplotype- based approach may perform worse than the one-SNP-at-a-time approach. Here we examine some of the factors that affect haplotype patterns in genes, how haplotypes may be inferred, and how haplotypes have been useful in the context of testing association between candidate genes and complex traits. Genet. Epidemiol. & 2004 Wiley-Liss, Inc. Key words: haplotype inference; haplotype association testing; candidate genes; linkage equilibrium Grant sponsor: NIH; Grant numbers: GM65509, HL072904.
    [Show full text]
  • Metagenomics Analysis of Microbiota by Next Generation Shotgun Sequencing
    THE SWISS DNA COMPANY Application Note · Next Generation Sequencing Metagenomics Analysis of Microbiota by Next Generation Shotgun Sequencing Understand the genetic potential of your community samples Provides you with hypothesis-free taxonomic analysis Introduction Microbiome studies are often based on as the dependency on a single gene to microbiome analysis overcoming said the sequencing of specific marker genes analyze a whole community, the intro- limitations. Whole genomic DNA of as for instance the prokaryotic 16S rRNA duction of PCR bias and the restriction a sample is isolated, fragmented and gene. Such amplicon-based approaches to describe only the taxonomic compo- finally sequenced. This allows a detailed are well established and widely used. sition and diversity. Shotgun metage- analysis of the taxonomic and functional However, they have limitations such nomics is a cutting edge technique for composition of a microbial community. Microsynth Competences and Services Microsynth offers a full shotgun to your project requirements, thus pro- requirements to guarantee scientifically metagenomics service for taxonomic viding you with just the right amount of reliable results for your project. After and functional profiling of clinical, envi- data to answer your questions. quality processing the reads are aligned ronmental or engineered microbiomes. Bioinformatics: Taxonomic and func- against a protein reference database The service covers the entire process tional analysis of metagenomic data- (e.g. NCBI nr). Taxonomic and functional from experimental design, DNA isola- sets is challenging. Alignment, binning binning and annotation are performed. tion, tailored sequencing to detailed and annotation of the large amounts The analysis is not restricted to prokar- and customized bioinformatics analy- of sequencing reads require expertise yotes but also includes eukaryotes and sis.
    [Show full text]
  • Pharmacogenetic Testing: a Tool for Personalized Drug Therapy Optimization
    pharmaceutics Review Pharmacogenetic Testing: A Tool for Personalized Drug Therapy Optimization Kristina A. Malsagova 1,* , Tatyana V. Butkova 1 , Arthur T. Kopylov 1 , Alexander A. Izotov 1, Natalia V. Potoldykova 2, Dmitry V. Enikeev 2, Vagarshak Grigoryan 2, Alexander Tarasov 3, Alexander A. Stepanov 1 and Anna L. Kaysheva 1 1 Biobanking Group, Branch of Institute of Biomedical Chemistry “Scientific and Education Center”, 109028 Moscow, Russia; [email protected] (T.V.B.); [email protected] (A.T.K.); [email protected] (A.A.I.); [email protected] (A.A.S.); [email protected] (A.L.K.) 2 Institute of Urology and Reproductive Health, Sechenov University, 119992 Moscow, Russia; [email protected] (N.V.P.); [email protected] (D.V.E.); [email protected] (V.G.) 3 Institute of Linguistics and Intercultural Communication, Sechenov University, 119992 Moscow, Russia; [email protected] * Correspondence: [email protected]; Tel.: +7-499-764-9878 Received: 2 November 2020; Accepted: 17 December 2020; Published: 19 December 2020 Abstract: Pharmacogenomics is a study of how the genome background is associated with drug resistance and how therapy strategy can be modified for a certain person to achieve benefit. The pharmacogenomics (PGx) testing becomes of great opportunity for physicians to make the proper decision regarding each non-trivial patient that does not respond to therapy. Although pharmacogenomics has become of growing interest to the healthcare market during the past five to ten years the exact mechanisms linking the genetic polymorphisms and observable responses to drug therapy are not always clear. Therefore, the success of PGx testing depends on the physician’s ability to understand the obtained results in a standardized way for each particular patient.
    [Show full text]
  • Microdeletion of the Azfc Locus with High Frequency of Mosaicism 46,XY/47XYY in Cases of Non Obstructive Azoospermia in Eastern Population of India
    Microdeletion of the AZFc locus with high frequency of mosaicism 46,XY/47XYY in cases of non obstructive azoospermia in eastern population of India A.K. Saxena and K. Aniket Department of Pathology/Lab Medicine, Molecular Cytogenetics Laboratory, All India Institute of Medical Sciences, Patna (Bihar), India Corresponding author: A.K. Saxena E-mail: [email protected] Genet. Mol. Res. 18 (2): gmr18349 Received May 07, 2019 Accepted June 06, 2019 Published June 18, 2019 DOI http://dx.doi.org/10.4238/gmr18349 ABSTRACT. The etiopathology of male infertility is highly complex, involving gene - environment interactions to regulate spermatogenesis. Consequently, genetic analysis becomes imperative for cases of non-obstructive azoospermia (NOA) to identify the causative factors. Cases (n = 111) of NOA referred to the cytogenetics and molecular genetics laboratory of the All India Institute of Medical Sciences in -Patna from 2013-2018 were subjected to 1) karyotyping using GTG bandings techniques, 2) fluorescence in situ hybridization (FISH) for the sex determining region (SRY), and 3) PCR based analysis of STS markers based on microdeletion of the Y- chromosome after isolation of genomic DNA from whole blood. A flow cytometer was used for a cell- kinetic and DNA methylation study after incorporation of 5-azacytidine (5- AzaC) (1.0 ug/mL) in lymphocyte culture. PCR products were analyzed on an agarose gel (1.5%) and bands were visualized on Gel Doc after ethidium bromide staining. Chromosomal abnormalities, including structural numerical variations, were observed in 14 of the karyotypes. Eight cases showed a 46,XY/47,XYY i.e. mosaic pattern; two cases 46, XY/45/XO; a single case with 47,XY +16; two cases with 46,X+ ring Y; a single case with 46,XY+dicentric in Genetics and Molecular Research 18 (2): gmr18349 ©FUNPEC-RP www.funpecrp.com.br A.K.
    [Show full text]
  • Haplotype Tagging Reveals Parallel Formation of Hybrid Races in Two Butterfly Species
    bioRxiv preprint doi: https://doi.org/10.1101/2020.05.25.113688; this version posted May 27, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Title: Haplotype tagging reveals parallel formation of hybrid races in two butterfly species One-sentence summary: Haplotagging, a novel linked-read sequencing technique that enables whole genome haplotyping in large populations, reveals the formation of a novel hybrid race in parallel hybrid zones of two co-mimicking Heliconius butterfly species through strikingly parallel divergences in their genomes. Short title: Haplotagging reveals parallel formation of hybrid races Keywords: Butterfly, Genomes, Clines, Hybrid zone, [local] adaptation, haplotypes, population genetics, evolution bioRxiv preprint doi: https://doi.org/10.1101/2020.05.25.113688; this version posted May 27, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Authors: Joana I. Meier1,2,*, Patricio A. Salazar1,3,*, Marek Kučka4,*, Robert William Davies5, Andreea Dréau4, Ismael Aldás6, Olivia Box Power1, Nicola J. Nadeau3, Jon R. Bridle7, Campbell Rolian8, Nicholas H. Barton9, W. Owen McMillan10, Chris D. Jiggins1,10,†, Yingguang Frank Chan4,† Affiliations: 1. Department of Zoology, University of Cambridge, Downing Street, Cambridge, CB2 3EJ, United Kingdom 2. St John’s College, University of Cambridge, Cambridge, CB2 1TP, United Kingdom 3.
    [Show full text]
  • Haplotype Inference from Pedigree Data and Population Data
    HAPLOTYPE INFERENCE FROM PEDIGREE DATA AND POPULATION DATA by XIN LI Submitted in partial ful¯llment of the requirements For the Degree of Doctor of Philosophy Dissertation Advisor: Jing Li Department of Electrical Engineering and Computer Science CASE WESTERN RESERVE UNIVERSITY January, 2010 CASE WESTERN RESERVE UNIVERSITY SCHOOL OF GRADUATE STUDIES We hereby approve the thesis/dissertation of _____________________________________________________ candidate for the ______________________degree *. (signed)_______________________________________________ (chair of the committee) ________________________________________________ ________________________________________________ ________________________________________________ ________________________________________________ ________________________________________________ (date) _______________________ *We also certify that written approval has been obtained for any proprietary material contained therein. Table of Contents List of Tables iv List of Figures v Acknowledgments vi Abstract vii Chapter 1. Introduction 1 1.1 Statistical methods . 3 1.2 Rule-based methods . 4 1.2.1 MRHC . 4 1.2.2 ZRHC . 5 Chapter 2. Problem statement and solutions 8 2.1 Large Pedigrees: manipulation of Mendelian constraints . 9 2.2 Families with many markers: dealing with recombinations . 9 2.3 Mixed data: use of population information . 10 Chapter 3. Preliminaries 12 3.1 Mendelian and zero-recombinant constraints . 14 3.2 Locus graphs . 15 3.3 Linear constraints on h variables . 17 Chapter 4. Linear Systems on Mendelian Constraints 19 4.1 Methods to solve the linear systems . 19 4.1.1 Split nodes to break cycles . 20 4.1.2 Detect path constraints from locus graphs . 21 4.1.3 Encode path constraints in disjoint-set structure D . 26 ii 4.2 Analysis of the algorithm on tree pedigrees with complete data 31 4.3 Extension to General Cases . 33 4.3.1 Pedigrees with mating loops .
    [Show full text]