Curriculum Vitae

Zachary L. Fuller, Ph.D.
Department of Biological Sciences
Columbia University
606 Fairchild Center, New York, NY 10027
Email: [email protected]
Website: www.zfullergenomics.com

Education:

2012-2017 Ph.D. (Biology), The Pennsylvania State University
2008-2012 B.S. (Biology), Creighton University

Research & Work Experience:

2017- Postdoctoral Research Fellow, Columbia University
Advised by Dr. Molly Przeworski, Department of Biological Sciences, Columbia University, New York, NY

2015-2017 National Geographic Young Explorer
Advised by Dr. Christina Grozinger, Department of Entomology, The Pennsylvania State University, University Park, PA

2012-2017 Ph.D. Candidate, The Pennsylvania State University
Advised by Dr. Stephen Schaeffer, Department of Biology, The Pennsylvania State University, University Park, PA

2012-2013 Rotation Student, The Pennsylvania State University
Advised by Dr. Webb Miller, Bioinformatics and Genomics Institute, The Pennsylvania State University, University Park, PA

2009-2012 Undergraduate Researcher, Creighton University
Advised by Dr. Soochin Cho, Department of Biology, Creighton University, Omaha, NE

Peer-Reviewed Publications:

Zachary L. Fuller, Veronique Mocellin, Luke Morris, Neal Cantin, Julie Peng, Yi Liao, Jihanne Shepherd, Joseph Pickrell, Peter Andolfatto, Mikhail Matz, Line Bay, Molly Przeworski. (2020) Towards a genomic predictor of bleaching in the coral Acropora millepora. Science (in press). bioRxiv Preprint: https://doi.org/10.1101/867754

Zachary L. Fuller, Jeremy Berg, Hakhamanesh Mostafavi, Guy Sella, Molly Przeworski. (2019) Measuring Intolerance to Mutation in Human Genetics. Nature Genetics. DOI: 10.1038/s41588-019-0383-1

Zachary L. Fuller, Spencer Koury, Nitin Phadnis, Stephen Schaeffer. (2018) How Chromosomal Rearrangements Shape Adaptation and Speciation: Case Studies in Drosophila pseudoobscura and its sibling species D. persimilis. Molecular Ecology. DOI: 10.1111/mec.14923

Zachary L. Fuller, Christopher Leonard, Randee Young, Stephen Schaeffer, Nitin Phadnis. (2018) Ancestral Polymorphisms Explain the Role of Chromosomal Inversions in Speciation. PLoS Genetics. DOI: 10.1371/journal.pgen.1007526

David Galbraith, Zachary L. Fuller, Axel Brockmann, Maryann Frazier, Mary Gikungu, Karen Kapheim, Jeffrey Kerby, Sarah Kocher, Oleksiy Losyev, Elliud Muli, Harland Patch, Joyce Sakamoto, Scott Stanley, Anthony Vaudo, Christina Grozinger. (2018) Investigating the Viral Ecology of Global Bee Communities with High-throughput Metagenomics. Scientific Reports. DOI: 10.1038/s41598-018-27164-z

Zachary L. Fuller, Gwilym Haynes, Stephen Richards, Stephen Schaeffer. (2017) Genomics of Natural Populations: Evolutionary Forces that Establish New Gene Arrangements in Drosophila pseudoobscura. Molecular Ecology. DOI: 10.1111/mec.14381

Zachary L. Fuller, Gwilym Haynes, Stephen Richards, Stephen Schaeffer. (2016) Genomics of Natural Populations: How Differentially Expressed Genes Shape the Evolution of Chromosomal Inversions in Drosophila pseudoobscura. Genetics. DOI: 10.1534/genetics.116.191429

Zachary L. Fuller, Elina Nino, Harland Patch, Oscar Bedoya-Reina, Tracey Baumgarten, Elliud Muli, Fiona Mumoki, Aakrosh Ratan, John McGraw, Maryann Frazier, Daniel Masiga, Stephen Schuster, Christina Grozinger, Webb Miller. (2015) Genome-wide Analysis of Signatures of Selection in Populations of African Honey-bees (Apis mellifera). BMC Genomics. DOI: 10.1186/s12864-015-1712-0

Zachary L. Fuller, Gwilym Haynes, Dianhui Zhu, Matthew Batterton, Hsu Chao, Shannon Duggan, Stephen Richards, Stephen Schaeffer. (2014) Evidence for Stabilizing Selection on Codon Usage in Chromosomal Rearrangements of Drosophila pseudoobscura. G3: Genes, Genomes & Genetics. DOI: 10.1534/g3.114.014860

Preprints and In-Review Publications:

William Milligan, Zachary L. Fuller, Ipsita Agarwal, Michael Eisen, Molly Przeworski, Guy Sella. (2020) Impact of Essential Workers in the Context of Social Distancing for Epidemic Control. medRxiv Preprint: https://doi.org/10.1101/2020.05.05.20092262

Maurizio Mauro, Shan Wei, Andrzej Breborowicz, Xin Li, Claudia Bognanni, Zachary L. Fuller, Tom Philipp, Zev Williams. (2019) Pervasive, but reversible, catastrophic DNA damage caused by LINE-1 retrotransposons in human primary trophoblasts. (In Review at Fertility & Sterility)

Zachary L. Fuller, Spencer Koury, Christopher Leonard, Randee Young, Kobe Ikegami, Jonathan Westlake, Stephen Richards, Stephen Schaeffer, Nitin Phadnis. (2018) Extensive recombination suppression and chromosome-wide differentiation of a segregation distorter in Drosophila. (In Revision at Genetics). bioRxiv Preprint: https://doi.org/10.1101/504126

Grants:

Agouron Institute (July 2019-July 2021) “Precision Medicine for Corals: Predicting Bleaching Response from Genomes” Co-Principal Investigator, $440,000 Direct

NIH Ruth L. Kirschstein National Research Service Award (May 2018-May 2020) “Characterizing and Mapping Deleterious Mutations in Humans” Grant Number: F32 GM128318

National Geographic Society (July 2015-July 2017) “High throughput viral metagenomics in Kenyan honeybees” Principal Investigator, $10,000 Direct

Oral Presentations:

*Invited Speaker

Zachary Fuller, Joe Pickrell, Peter Andolfatto, Mikhail Matz, Line Bay, Molly Przeworski. “Genome-wide association study (GWAS) of bleaching tolerance in a Great Barrier Reef coral”. The Allied Genetics Conference. Virtual. April 23, 2020

Zachary Fuller*, Molly Przeworski. “Towards a genomic predictor of bleaching in the coral Acropora millepora”. American Museum of Natural History Comparative Biology Seminar Series. New York, NY. April 13, 2020

Zachary Fuller, Julie Peng, Jihanne Shepherd, Peter Andolfatto, Joseph Pickrell, Mikhail Matz, Line Bay, Molly Przeworski. “Towards the genetic prediction of bleaching response in Acropora millepora”. Evolution Meeting. Providence, RI. June 25, 2019

Mikhail Matz, Groves Dixon, Yi Liao, Zachary Fuller. “Rampant Cryptic Speciation and Environmental Specialization in Two Massive Coral Species from the Florida Keys.” Society for Integrative and Comparative Biology (SICB) 2019 Annual Meeting. Tampa Bay, FL. January 4, 2019

Zachary Fuller, Yi Liao, Line Bay, Mikhail Matz, Molly Przeworski. “Towards the genetic prediction of bleaching response in coral.” Global Invertebrate Genomics Alliance Research Conference and Workshop. Willemstad, Curacao. October 20, 2018

Zachary Fuller, Christopher Leonard, Randee Young, Stephen Schaeffer. “The Role of Chromosomal Inversions in Speciation.” The New York Population Genetics Conference. Cold Spring Harbor, NY. January 10, 2018

Zachary Fuller, Gwilym Haynes, Stephen Richards, Stephen Schaeffer. “Local Adaptation and the Establishment of Inversions in Natural Populations of Drosophila pseudoobscura Through the Indirect Effects of Suppressed Recombination.” The Allied Genetics Conference. Orlando, FL. July 15, 2016

Christopher Leonard, Zachary Fuller, Randee Young, Stephen Schaeffer, Nitin Phadnis. “Evidence for the Interspecies Transfer of a Driving X Chromosome.” The Allied Genetics Conference. Orlando, FL. July 15, 2016

Zachary Fuller, Gwilym Haynes, Stephen Richards, Stephen Schaeffer. “The Evolutionary Mechanisms that Establish Chromosome Rearrangements in Natural Populations of Drosophila pseudoobscura.” Evolution Meeting (Hamilton Award Symposium). Austin, TX. June 11, 2016

Zachary Fuller, Randee Young, Nitin Phadnis, Stephen Schaeffer. “The Genomic Consequences of Segregation Distortion in Drosophila pseudoobscura.” Penn State Annual Bioinformatics & Genomics Retreat. University Park, PA. August 2015

Zachary Fuller, Gwilym Haynes, Stephen Richards, Stephen Schaeffer. “Unusual Haplotype Structure and Reduced Recombination in Chromosomal Rearrangements of Drosophila pseudoobscura.” Genetics Society of America’s 55th Annual Drosophila Conference. San Diego, CA. March 28th 2014

Zachary Fuller, Gwilym Haynes, Stephen Richards, Stephen Schaeffer. “Identifying the Evolutionary Forces Behind the Maintenance and Origin of Chromosomal Rearrangements in Populations of Drosophila pseudoobscura.” 2013 SMBE Conference. Chicago, IL. July 9th 2013

Upcoming Oral Presentations:

Zachary Fuller, Molly Przeworski. “Genomic Tools for Understanding Coral Bleaching”. Pacific Symposium for Biocomputing. Waimea, HI. January 2021 [Invited Speaker]

Poster Presentations:

Zachary Fuller, Joe Pickrell, Peter Andolfatto, Mikhail Matz, Line Bay, Molly Przeworski. “Genome-wide association study (GWAS) of bleaching tolerance in a Great Barrier Reef coral”. The Biology of Genomes. Cold Spring Harbor (Virtual). May 2020

Mikhail Matz, Zachary Fuller. “LD Networks: Unsupervised Detection of Polygenic Selection”. Society for Integrative and Comparative Biology (SICB) 2020 Annual Meeting. Austin, TX. January 2020

Zachary Fuller, Julie Peng, Jihanne Shepherd, Peter Andolfatto, Joseph Pickrell, Mikhail Matz, Line Bay, Molly Przeworski. “Toward the genetic prediction of bleaching response in Acropora millepora”. 2019 SMBE Conference. Manchester, UK. July 2019

Zachary Fuller, Jeremy Berg, Hakhamanesh Mostafavi, Guy Sella, Molly Przeworski. “A Population Genetics Perspective on Measures of Intolerance to Mutation”. 2018 SMBE Conference. Yokohama, Japan. July 2018

Zachary Fuller, Stephen Schaeffer. “Selection Across Environmental Gradients Leads to the Establishment of Chromosomal Inversions in Populations”. Genetics Society of America’s 58th Annual Drosophila Conference. San Diego, CA. April 2017

Christopher Leonard, Zachary Fuller, Randee Young, Stephen Richards, Stephen Schaeffer,

Recommended publications
  • Gene Prediction: the End of the Beginning (Colin Semple)

    Meeting report. Gene prediction: the end of the beginning. Colin Semple. Address: Department of Medical Sciences, Molecular Medicine Centre, Western General Hospital, Crewe Road, Edinburgh EH4 2XU, UK. E-mail: [email protected] Published: 28 July 2000. Genome Biology 2000, 1(2):reports4012.1–4012.3. The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2000/1/2/reports/4012 © GenomeBiology.com (Print ISSN 1465-6906; Online ISSN 1465-6914)

    A report from the conference entitled Genome Based Gene Structure Determination, Hinxton, UK, 1-2 June, 2000, organised by the European Bioinformatics Institute (EBI).

    The draft sequence of the human genome will become available later this year. For some time now it has been accepted that this will mark a beginning rather than an end. A vast amount of work will remain to be done, from detailing

    Reducing genomes to genes. All ab initio gene prediction programs have to balance sensitivity against accuracy. It is often only possible to detect all the real exons present in a sequence at the expense of detecting many false ones. Alternatively, one may accept only predictions scoring above a more stringent threshold but lose those real exons that have lower scores. The trick is to try and increase accuracy without any large loss of sensitivity; this can be done by comparing the prediction with additional, independent evidence.
  • Reconstructing Contiguous Regions of an Ancestral Genome

    Downloaded from www.genome.org on December 5, 2006

    Reconstructing contiguous regions of an ancestral genome. Jian Ma, Louxin Zhang, Bernard B. Suh, Brian J. Raney, Richard C. Burhans, W. James Kent, Mathieu Blanchette, David Haussler and Webb Miller. Genome Res. 2006 16: 1557-1565; originally published online Sep 18, 2006. doi:10.1101/gr.5383506

    Supplementary data: "Supplemental Research Data" http://www.genome.org/cgi/content/full/gr.5383506/DC1
    References: This article cites 20 articles, 11 of which can be accessed free at: http://www.genome.org/cgi/content/full/16/12/1557#References
    Open Access: Freely available online through the Genome Research Open Access option.
    © 2006 Cold Spring Harbor Laboratory Press

    Methods. Author affiliations: Jian Ma (1,5,6), Louxin Zhang (2), Bernard B. Suh (3), Brian J. Raney (3), Richard C. Burhans (1), W. James Kent (3), Mathieu Blanchette (4), David Haussler (3), and Webb Miller (1). (1) Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania 16802, USA; (2) Department of Mathematics, National University of Singapore, Singapore 117543; (3) Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz, California 95064, USA; (4) School of Computer Science, McGill University, Montreal, Quebec H3A 2B4, Canada

    This article analyzes mammalian genome rearrangements at higher resolution than has been published to date.
  • Duplication, Deletion, and Rearrangement in the Mouse and Human Genomes

    Evolution’s cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes. W. James Kent*†, Robert Baertsch*, Angie Hinrichs*, Webb Miller‡, and David Haussler§. *Center for Biomolecular Science and Engineering and §Howard Hughes Medical Institute, Department of Computer Science, University of California, Santa Cruz, CA 95064; and ‡Department of Computer Science and Engineering, Pennsylvania State University, University Park, PA 16802. Edited by Michael S. Waterman, University of Southern California, Los Angeles, CA, and approved July 11, 2003 (received for review April 9, 2003)

    This study examines genomic duplications, deletions, and rearrangements that have happened at scales ranging from a single base to complete chromosomes by comparing the mouse and human genomes. From whole-genome sequence alignments, 344 large (>100-kb) blocks of conserved synteny are evident, but these are further fragmented by smaller-scale evolutionary events. Excluding transposon insertions, on average in each megabase of genomic alignment we observe two inversions, 17 duplications (five tandem or nearly tandem), seven transpositions, and 200 deletions of 100 bases or more. This includes 160 inversions and 75 duplications or transpositions of length >100 kb.

    depending on details of definition and method. The length distribution of synteny blocks was found to be consistent with the theory of random breakage introduced by Nadeau and Taylor (8, 9) before significant gene order data became available. In recent comparisons of the human and mouse genomes, rearrangements of ≥100,000 bases were studied by comparing 558,000 highly conserved short sequence alignments (average length 340 bp) within 300-kb windows. An estimated 217 blocks of conserved synteny were found, formed from 342 conserved segments, with length distribution roughly consistent with the random breakage
  • BIOINFORMATICS ISCB NEWS Doi:10.1093/Bioinformatics/Btp280

    Vol. 25 no. 12 2009, pages 1570–1573. BIOINFORMATICS ISCB NEWS. doi:10.1093/bioinformatics/btp280

    ISMB/ECCB 2009 Stockholm. Marie-France Sagot1, B.J. Morrison McKay2,∗ and Gene Myers3. 1INRIA Grenoble Rhône-Alpes and University of Lyon 1, Lyon, France, 2International Society for Computational Biology, University of California San Diego, La Jolla, CA and 3Howard Hughes Medical Institute Janelia Farm Research Campus, Ashburn, Virginia, USA

    ABSTRACT. The International Society for Computational Biology (ISCB; http://www.iscb.org) presents the Seventeenth Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), organized jointly with the Eighth Annual European Conference on Computational Biology (ECCB; http://bioinf.mpi-inf.mpg.de/conferences/eccb/eccb.htm), in Stockholm, Sweden, 27 June to 2 July 2009. The organizers are putting the finishing touches on the year’s premier computational biology conference, with an expected attendance of 1400 computer scientists, mathematicians, statisticians, biologists and scientists from other disciplines related to and reliant on this multi-disciplinary

    Computational Biology (http://www.iscb.org) was formed to take over the organization, maintain the institutional memory of ISMB and expand the informational resources available to members of the bioinformatics community. The launch of ECCB (http://bioinf.mpi-inf.mpg.de/conferences/eccb/eccb.htm) 8 years ago provided for a focus on European research activities in years when ISMB is held outside of Europe, and a partnership of conference organizing efforts for the presentation of a single international event when the ISMB meeting takes place in Europe every other year.

    The multidisciplinary field of bioinformatics/computational biology has matured since gaining widespread recognition in the early days of genomics research.
  • Open Thesisformatted Final.Pdf

    The Pennsylvania State University, The Graduate School, The Huck Institutes of the Life Sciences. COMPUTATIONAL APPROACHES TO PREDICT PHENOTYPE DIFFERENCES IN POPULATIONS FROM HIGH-THROUGHPUT SEQUENCING DATA. A Dissertation in Integrative Biosciences in Bioinformatics and Genomics by Oscar Camilo Bedoya Reina. 2014 Oscar Camilo Bedoya Reina. Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy, May 2014

    The dissertation of Oscar Camilo Bedoya Reina was reviewed and approved* by the following:
    Webb Miller, Professor of Biology and Computer Science and Engineering, Dissertation Advisor, Chair of Committee
    Ross Hardison, T. Ming Chu Professor of Biochemistry and Molecular Biology
    George Perry, Assistant Professor of Anthropology and Biology
    Kamesh Madduri, Assistant Professor of Computer Science and Engineering
    Peter Hudson, Willaman Professor of Biology, Head of the Huck Institutes of the Life Sciences
    *Signatures are on file in the Graduate School

    ABSTRACT. High-throughput sequencing technologies are changing the world. They are revolutionizing the life sciences and will be the foundation of a promising century of innovations. In recent years, the development of new sequencing technologies has dramatically decreased the cost of genome sequencing. Less than twenty years ago, sequencing the human genome cost 3 billion dollars, and took about a decade to complete. Today, high-quality 30X full-genome coverage can be obtained in just one day for US$ 5,000, while sequencing just the ~21,000 human genes to the same depth costs only about US$ 500. The latter is sufficient for detecting most of the rare variants, along with other sources of genetic variability such as indels, copy-number variations, and inversions that are characteristic of complex diseases.
  • ENCODE Analysis Working Group and Data Analysis Centre Rick Myers

    ENCODE Analysis Working Group and Data Analysis Centre. Rick Myers, Ewan Birney

    Motivation for mandated DAC
    • Genesis from the experience of the pilot project
    • Everyone looking at the ceiling when a key piece of annoying analysis needs to happen
    • A set of people who are funded to ensure that critical integrative analysis occurs (consistently and timely)
    • In no way exclusive
    • Everyone is invited in analysis
    • DAC should fit around things which are happening at the consortium level
    • Porous (no distinction expected between DAC members and other consortium members) except…
    • …the cleaning of the Augean stables moment (eg, creating repeat libraries, consistently remapping everyone’s chip-seq data)
    • Interplay with DCC deliberate (trade off where things occur)
    • When there are too many things on the DAC to-do list - ask AWG to prioritise.

    [Organisation chart. Recoverable labels: AWG; Rick Myers, Chair of AWG, participates in discussion; Birney; Bickel; Haussler; Project Manager, EBI (Ian Dunham); Directed Analysis and Methods development, each spanning EBI, UCSC, Yale, BU, U. Wash, Penn, Berkeley]

    DAC - federated, embedded
    • Ewan Birney/Paul Flicek/Ian Dunham (EBI) - comparative genomics, short read technology methods
    • Mark Gerstein (Yale) - chip-seq, link to genes/transcripts, link to modENCODE, P
    • Zhiping Weng (BU) - chip-chip, chip-seq, motif finding, bayesian analysis
    • Ross Hardison/Webb Miller (PSU) - comparative genomics, regulatory regions
    • Jim Kent/David Haussler (UCSC) - comparative genomics, DCC
    • Peter Bickel (UC Berkeley) - statistician
    • Bill Noble (UW) - machine learning - HMMs, change point analysis, wavelets, SVMs

    [Workflow diagram. Recoverable labels: new analysis tasks from AWG or community; triage and initial prioritisation of all projects by AWG; active ad hoc tasks handled by EDAC; converting analysis to pipelines; results provided back to AWG; AWG prioritisation; EDAC suggest pipelining tasks; experimental group, in-house methods; data exploration, normalisation, sanity checking; DCC coordination; feedback to AWG and expt.]
  • 439: PALM: Probabilistic Area Loss Minimization for Protein Sequence

    PALM: Probabilistic Area Loss Minimization for Protein Sequence Alignment. Fan Ding*1, Nan Jiang∗1, Jianzhu Ma2, Jian Peng3, Jinbo Xu4, Yexiang Xue1. 1Department of Computer Science, Purdue University, West Lafayette, Indiana, USA; 2Institute for Artificial Intelligence, Peking University, Beijing, China; 3Department of Computer Science, University of Illinois at Urbana-Champaign, Illinois, USA; 4Toyota Technological Institute at Chicago, Illinois, USA

    Abstract. Protein sequence alignment is a fundamental problem in computational structure biology and popular for protein 3D structural prediction and protein homology detection. Most of the developed programs for detecting protein sequence alignments are based upon the likelihood information of amino acids and are sensitive to alignment noises. We present a robust method PALM for modeling pairwise protein structure alignments, using the area distance to reduce the biological measurement noise. PALM generatively learns the alignment of two protein sequences with probabilistic area distance objective, which can denoise the measurement errors and offsets from different biologists. During learning, we show that the optimization is computationally efficient by estimating the gradients via dynamically sampling alignments. Empirically, we show that PALM can generate sequence

    Figure 1: Illustration of protein sequence alignment and the area distance. (Bottom) The task is to align two amino acids sequences S and T, where one amino acid from sequence S can be aligned to either one amino acid from sequence T (match), or to a gap (insertion, marked by “−”).
  • Comparative Genomics

    Annu. Rev. Genomics Hum. Genet. 2004. 5:15–56. doi: 10.1146/annurev.genom.5.061903.180057. Copyright © 2004 by Annual Reviews. All rights reserved. First published online as a Review in Advance on April 15, 2004

    COMPARATIVE GENOMICS. Webb Miller, Kateryna D. Makova, Anton Nekrutenko, and Ross C. Hardison. The Center for Comparative Genomics and Bioinformatics, The Huck Institutes of Life Sciences, and the Departments of Biology, Computer Science and Engineering, and Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania; email: [email protected], [email protected], [email protected], [email protected]

    Key Words: whole-genome alignments, evolutionary rates, gene prediction, gene regulation, sequence conservation, Internet resources, genome browsers, genome databases, bioinformatics

    Abstract. The genomes from three mammals (human, mouse, and rat), two worms, and several yeasts have been sequenced, and more genomes will be completed in the near future for comparison with those of the major model organisms. Scientists have used various methods to align and compare the sequenced genomes to address critical issues in genome function and evolution. This review covers some of the major new insights about gene content, gene regulation, and the fraction of mammalian genomes that are under purifying selection and presumed functional. We review the evolutionary processes that shape genomes, with particular attention to variation in rates within genomes and along different lineages. Internet resources for accessing and analyzing the treasure trove of sequence alignments and annotations are reviewed, and we discuss critical problems to address in new bioinformatic developments in comparative genomics.
  • The Myth of Junk DNA

    The Myth of Junk DNA. Jonathan Wells. Seattle: Discovery Institute Press, 2011

    Description: According to a number of leading proponents of Darwin’s theory, “junk DNA”—the non-protein coding portion of DNA—provides decisive evidence for Darwinian evolution and against intelligent design, since an intelligent designer would presumably not have filled our genome with so much garbage. But in this provocative book, biologist Jonathan Wells exposes the claim that most of the genome is little more than junk as an anti-scientific myth that ignores the evidence, impedes research, and is based more on theological speculation than good science.

    Copyright Notice: Copyright © 2011 by Jonathan Wells. All Rights Reserved.

    Publisher’s Note: This book is part of a series published by the Center for Science & Culture at Discovery Institute in Seattle. Previous books include The Deniable Darwin by David Berlinski, In the Beginning and Other Essays on Intelligent Design by Granville Sewell, God and Evolution: Protestants, Catholics, and Jews Explore Darwin’s Challenge to Faith, edited by Jay Richards, and Darwin’s Conservatives: The Misguided Quest by John G. West.

    Library Cataloging Data: The Myth of Junk DNA by Jonathan Wells (1942– ). Illustrations by Ray Braun. 174 pages, 6 x 9 x 0.4 inches & 0.6 lb, 229 x 152 x 10 mm. & 0.26 kg. Library of Congress Control Number: 2011925471. BISAC: SCI029000 SCIENCE / Life Sciences / Genetics & Genomics. BISAC: SCI027000 SCIENCE / Life Sciences / Evolution. ISBN-13: 978-1-9365990-0-4 (paperback)

    Publisher Information: Discovery Institute Press, 208 Columbia Street, Seattle, WA 98104. Internet: http://www.discoveryinstitutepress.com/ Published in the United States of America on acid-free paper.
  • A Novel Approach for Predicting Protein Functions by Transferring Annotation Via Alignment Networks Warith Djeddi, Sadok Ben Yahia, Engelbert Mephu Nguifo

    To cite this version: Warith Djeddi, Sadok Ben Yahia, Engelbert Mephu Nguifo. A novel approach for predicting protein functions by transferring annotation via alignment networks. 2019. hal-02070419

    HAL Id: hal-02070419. https://hal.archives-ouvertes.fr/hal-02070419 Preprint submitted on 17 Mar 2019

    HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

    A novel approach for predicting protein functions by transferring annotation via alignment networks. Warith Eddine Djeddi1, Sadok Ben Yahia1,2∗ and Engelbert Mephu Nguifo3∗. 1University of Tunis El Manar, Faculty of Sciences of Tunis, LR11ES14, Campus Universitaire 2092, Tunis, Tunisia; 2Tallinn University of Technology, Department of Software Science, Akadeemia tee 15a, 12618 Tallinn, Estonia; and 3University Clermont Auvergne, CNRS, LIMOS, F-63000 Clermont-Ferrand, France. ∗Corresponding author: [email protected], [email protected]

    Abstract. One of the challenges of the post-genomic era is to provide accurate function annotations for orphan and unannotated protein sequences. With the recent availability of huge protein-protein interactions for many model species, it becomes an opportunity to computational methods to elucidate protein function based on many strategies.
  • Processing BLAST Results, Chaining, Netting

    Processing data of BLAST, clustering and ordering. Student: Elena Bushmanova, Bioinformatics Institute, SPbSU. Advisor: Pavel Dobrynin, Dobzhansky Center. Saint-Petersburg, 2013

    Goal: Creating a program that allows one
    ● to form single units from overlapping hits
    ● to rank the resulting units according to their coordinates on the chromosome
    ● and to get some statistics

    Problem: To increase the accuracy and efficiency of assembling a genome on the basis of a reference.

    Other tools: SyMAP, ABACAS, Arachne, MAUVE, LASTZ, RACA
    • SyMAP: takes a very long time to analyze even one pair of genomes, but it is the only program currently used for reference-assisted assembly of large genomes
    • ABACAS: a very good program, but it works only with bacterial or other small (20 Mbp) genomes
    • Arachne: we tried to adapt this assembler for Sanger sequences to assemble contigs, but failed
    • MAUVE: non-trivial output, small genomes only, works slowly
    • LASTZ: a great tool for aligning large genomes, but not very fast, and it requires a quite complicated pipeline for genome assembly
    • RACA: in theory one of the best reference-assisted assemblers, but still only in theory (no publications have used it)

    Algorithm
    • Processing:
      makeblastdb -in ref_all_chr.fsa.txt -dbtype nucl -title yeast -out yeast
      blastn -query Kyokai7_NRIB_2011_BABQ01000000.fsa -db yeast -outfmt 6 -out data.txt
    • Selection
    • Sorting
    • Checking: Q - 1000 ≤ S ≤ Q + 1000
    • Splicing
    • Statistics

    Data
    Input data: yeast_blastn.txt: input list of fragments (query_id, subject_id, identity, alignment_length, mismatches, gap_opens, q.start, q.end, s.start, s.end, e_value, bit_score)
    Output data: outputList.txt: output list of fragments; unusedList.txt: list of duplicate fragments; In_statistics.txt, Out_statistics.txt:
    - total number of fragments and percent of unique fragments;
    - average, median, maximum and minimum count of fragments with the same id;
    - similar characteristics of the number of matches.
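The step of forming single units from overlapping hits and ordering them by chromosome coordinate could be sketched in Python roughly as follows. This is a minimal illustration, not the program described in the slides: the function names `parse_hits` and `merge_overlapping` and the example hits are hypothetical, the column order assumes the standard 12-column tabular format written by `blastn -outfmt 6`, and "overlapping" is assumed to mean overlap of subject coordinates within one query/subject pair. The ±1000 check and the statistics files are not covered.

```python
from collections import defaultdict


def parse_hits(lines):
    """Parse BLAST tabular (-outfmt 6) lines into (query, subject, start, end)."""
    hits = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.split("\t")
        s_start, s_end = int(fields[8]), int(fields[9])
        # Minus-strand hits are reported with s.start > s.end; normalise them.
        hits.append((fields[0], fields[1], min(s_start, s_end), max(s_start, s_end)))
    return hits


def merge_overlapping(hits):
    """Merge hits whose subject intervals overlap, per (query, subject) pair.

    Returns merged units sorted by subject id, then by start coordinate.
    """
    grouped = defaultdict(list)
    for query, subject, start, end in hits:
        grouped[(query, subject)].append((start, end))

    units = []
    for (query, subject), intervals in grouped.items():
        intervals.sort()
        cur_start, cur_end = intervals[0]
        for start, end in intervals[1:]:
            if start <= cur_end:
                # Overlaps the current unit: extend it.
                cur_end = max(cur_end, end)
            else:
                units.append((query, subject, cur_start, cur_end))
                cur_start, cur_end = start, end
        units.append((query, subject, cur_start, cur_end))

    units.sort(key=lambda u: (u[1], u[2]))
    return units


# Hypothetical example hits (not taken from the slides).
example = [
    "ctg1\tchrI\t98.5\t500\t5\t1\t1\t500\t1000\t1499\t0.0\t900",
    "ctg1\tchrI\t97.0\t400\t8\t2\t450\t849\t1400\t1799\t0.0\t700",
    "ctg2\tchrI\t99.0\t300\t3\t0\t1\t300\t200\t499\t1e-150\t540",
    "ctg1\tchrI\t96.0\t200\t6\t1\t900\t1099\t5200\t5001\t1e-90\t330",
]
units = merge_overlapping(parse_hits(example))
for unit in units:
    print(*unit, sep="\t")
```

Grouping by query/subject pair before merging keeps hits from different contigs apart even when they land on the same region of the reference; whether the original program makes the same choice is not stated in the slides.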
  • Genomic Variants Among Threatened Acropora Corals

    G3: Genes|Genomes|Genetics Early Online, published on March 26, 2019 as doi:10.1534/g3.119.400125

    Genomic variants among threatened Acropora corals

    Authors: Sheila A. Kitchen*†, Aakrosh Ratan*‡, Oscar C. Bedoya-Reina§**, Richard Burhans††, Nicole D. Fogarty‡‡, $$, Webb Miller††, Iliana B. Baums†
    * Authors contributed equally to the manuscript

    Author Affiliations:
    † 208 Mueller Lab, Biology Department, The Pennsylvania State University, University Park PA 16802 [email protected], [email protected]
    ‡ Department of Public Health Sciences and Center for Public Health Genomics, University of Virginia, Charlottesville VA 22908 [email protected]
    § MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3PT, UK [email protected]
    ** MRC Human Genetics Unit, MRC Institute of Genetics and Molecular Medicine, The University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh, UK.
    †† Centre for Comparative Genomics and Bioinformatics, Pennsylvania State University, University Park, PA 16802, USA [email protected], [email protected]
    ‡‡ Department of Marine and Environmental Sciences, Nova Southeastern University, Fort Lauderdale, FL 33314, [email protected]
    $$ Secondary Affiliation: Nicole D. Fogarty, Department of Biology and Marine Biology, University of North Carolina Wilmington, Center for Marine Science, 5600 Marvin K Moss Lane, Wilmington, NC 28409, USA

    Data Accession numbers: NCBI SRA accession numbers are SRR7235977-SRR7236038.

    © The Author(s) 2013. Published by the Genetics Society of America.

    Running Title: Genomic variants among threatened corals
    Keywords: coral, Caribbean, single nucleotide polymorphism, population genomics, Galaxy
    Corresponding Author (§§): I.B Baums, 208 Mueller Lab, Biology Department, The Pennsylvania State University, University Park PA 16802, Fax +1 814 865 9131, [email protected]

    Article Summary: We provide the first comprehensive genomic resources for two threatened Caribbean reef-building corals in the genus Acropora.