A Novel Gene Overexpression Plasmid Library and Its Application in Mapping Genetic Networks by Systematic Dosage Suppression

Total Page:16

File Type:pdf, Size:1020Kb

A Novel Gene Overexpression Plasmid Library and Its Application in Mapping Genetic Networks by Systematic Dosage Suppression A Novel Gene Overexpression Plasmid Library and its application in Mapping Genetic Networks by Systematic Dosage Suppression by Leslie Joyce Magtanong A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy Department of Molecular Genetics University of Toronto © Copyright by Leslie Joyce Magtanong 2011 A Novel Gene Overexpression Plasmid Library and its application in Mapping Genetic Networks by Systematic Dosage Suppression Leslie Joyce Magtanong Doctor of Philosophy Department of Molecular Genetics University of Toronto 2011 Abstract Increasing gene dosage provides a powerful means of probing gene function, as it tends to cause a gain-of-function effect due to increased gene activity. In the budding yeast, Saccharomyces cerevisiae, systematic gene overexpression studies have shown that in wild-type cells, overexpression of a small subset of genes results in an overt phenotype. However, examining the effects of gene overexpression in sensitized cells containing mutations in known genes is a powerful means for identifying functionally relevant genetic interactions. When a query mutant phenotype is rescued by gene overexpression, the genetic interaction is termed dosage suppression. I comprehensively investigated dosage suppression genetic interactions in yeast using three approaches. First, using one of two novel plasmid libraries cloned by two colleagues and myself, I systematically performed dosage suppression screens and identified over 130 novel dosage suppression genetic interactions for more than 25 essential yeast genes. The plasmid libraries, called the molecular barcoded yeast ORF (MoBY-ORF) 1.0 and 2.0, are designed to streamline dosage analysis by being compatible with high-throughput genomics technologies that can monitor plasmid representation, including barcode microarrays and next-generation sequencing methods. Second, I describe a detailed analysis of the novel dosage suppression interactions, as well as of literature-curated interactions, and show that the gene pairs exhibiting dosage suppression are often functionally related and can overlap with physical as well as negative genetic interactions. Third, I performed a systematic categorization of dosage suppression genetic ii interactions in yeast and show that the majority of the dosage suppression interactions can be assigned to one of four general mechanistic classifications. With this comprehensive analysis, I conclude that systematically identifying dosage suppression genetic interactions will allow for their integration into other genetic and physical interaction networks and should provide new insight into the global wiring diagram of the cell. iii Acknowledgments I have many people to thank for all of their support and encouragement throughout the years. First and foremost, I would like to thank my parents, Vic and Anita, my sisters, Lisa and Jill, and my husband, Scott Dixon, for their unconditional support during my time in Toronto. I also want to thank the professors, postdocs, and students who have provided valuable feedback and suggestions for my various experiments, presentations, and manuscripts. In particular, I acknowledge and am grateful to my supervisory committee, Drs. Brenda Andrews, Barbara Funnell, and Howard Lipshitz, who have been incredibly supportive of my research and abilities as a doctoral student. I thank the fellow graduate students who have contributed to my research. In particular, I thank Cheuk Hei Ho, who spearheaded the development of the MoBY-ORF plasmid libraries and gave me many helpful suggestions throughout my research; a postdoc, Sarah Barker, and the various technicians and summer students, all of whom were integral to the development of the MoBY-ORF plasmid libraries; Wei Jiao and Anastasia Baryshnikova, who did invaluable computational work for this project; and Sondra Bahr, a talented technician who made a significant contribution to the dosage suppression studies. Finally, I would like to thank my supervisor, Dr. Charlie Boone, whose intelligence and support for me will always be remembered. This work would not have been possible without assistance provided by members of the scientific community. In particular, I thank Andrew Smith, and Drs. Larry Heisler, Marinella Gibella, and Corey Nislow, who provided access to and assistance with their microarray facilities. I also thank the Natural Sciences and Engineering Research Council (NSERC), the Canadian Institutes of Health Research (CIHR), and the University of Toronto for financial support. iv Table of Contents Page Abstract ii Acknowledgments iv List of Tables ix List of Figures x List of Appendices xi List of Electronic Tables xii Chapter One: Introduction 1 1.1 General Introduction 2 1.2 Genetic Interactions 2 1.2.1 Negative Genetic Interactions 4 1.2.1.1 Complex Haploinsufficiency 4 1.2.2 Positive Genetic Interactions 5 1.2.3 Synthetic Dosage Effects: Lethality and Suppression 7 1.3 Investigating Genetic Interactions in S. cerevisiae in a Systematic 7 Manner 1.3.1 Development of Genome-wide Strain Collections 8 1.3.1.1 Loss-of-Function Strains: The Deletion Strain Collection 8 1.3.1.1.1 Barcoded strains and barcode microarrays 8 1.3.1.2 Essential Gene Strain Collections 13 1.3.1.2.1 tetO promoter collection 13 1.3.1.2.2 URA3-marked temperature-sensitive allele collection 15 1.3.1.2.3 DAmP allele collection 15 1.3.2 Gene overexpression 16 1.3.2.1 The Yeast Two-Hybrid S. cerevisiae ORF Array 17 1.3.2.2 The PCUP1-GST Library 21 1.3.2.3 The PGAL1/10-GST Library 21 1.3.2.4 The Movable ORF Library 22 1.3.2.5 The FLEXGene ORF Collection 23 1.3.2.6 The Yeast Genome Tiling Collection 23 v 1.3.2.7 Summary and Comparison of Various Existing Overexpression 24 Libraries 1.4 Systematic Identification of Genetic Interactions in Yeast 25 1.4.1 Synthetic Genetic Array (SGA) Analysis 25 1.4.1.1 Application of SGA to SDL analysis 27 1.4.1.2 Application of SGA to genetic mapping 27 1.4.1.3 Application of SGA to array-based high-content screening 27 1.4.2 Diploid-based Synthetic Lethal Analysis on Microarrays (dSLAM) 29 1.4.3 Genetic Interaction Mapping (GIM) 30 1.4.4 Summary of Genetic Interaction Mapping Strategies 30 1.5 Next-Generation Sequencing 31 1.6 Summary and Rationale 32 Chapter Two: The MoBY-ORF 1.0 Yeast Plasmid Library 34 2.1 Introduction 35 2.2 Results 35 2.2.1 Construction of a library of molecular barcoded yeast ORFs 35 2.2.2 Verification of constructed clones by sequencing 36 2.2.3 Assessment of clone function using temperature-sensitive mutants 39 2.2.4 Complementation cloning to identify drug-resistant mutants and 39 compound mode-of-action 2.3 Summary 39 2.4 Methods 40 2.4.1 Yeast Strains 40 2.4.2 Growth Media 40 2.4.3 Clone Construction and Analysis 40 2.4.4 Sequence Confirmation of the MoBY-ORF Collection Barcodes and 42 3’ ORF Junctions 2.4.5 Functional Complementation of Essential Genes 42 vi Chapter Three: Mapping Genetic Networks by Systematic Dosage 43 Suppression 3.1 Introduction 44 3.2 Results 46 3.2.1 Construction of the MoBY-ORF 2.0 plasmid library 46 3.2.2. Dosage suppression analysis of temperature-sensitive conditional 46 mutants Methods used to identify dosage suppressors 46 Description of results 53 3.2.3 An integrated dosage suppression genetic interaction network 57 Network overview 58 Identification of a genetic link between PKA signaling and the 58 kinetochore 3.2.4 Distribution of dosage suppressors across cellular processes 61 3.2.5. Overlap of dosage suppression interactions with protein-protein and 64 negative genetic network edges 3.2.6 Mechanistic categorization of dosage suppression interactions 64 Dosage suppression decision tree for categorizing dosage 64 suppression interactions Description of categories 64 3.3 Discussion 73 3.4 Methods 76 3.4.1 Growth media 76 3.4.2 Clone construction and analysis 76 3.4.3 Plasmid pool preparation 77 3.4.4 Cloning of dosage suppressors with the 2µ MoBY-ORF library 77 3.4.5 Yeast barcode microarray hybridization and data analysis 80 3.4.6 Empirical determination of raw barcode microarray intensity cutoff 80 for identification of candidate dosage suppressors 3.4.7 Assessing fitness of barcoded yeast strains by Illumina/Solexa 81 sequencing vii 3.4.8 Confirmation of candidate dosage suppressors and test for reciprocal 82 suppression 3.4.9 Overlap of dosage suppression genetic interactions with other types of 83 interactions 3.4.10 Analysis of functional relatedness 83 3.4.11 Identifying gene clusters in the integrated dosage suppression network 83 Chapter Four: Conclusions and Future Directions 84 4.1 General Overview 85 4.2 The MoBY-ORF gene overexpression libraries: present and future 85 applications 4.3 Dosage suppression genetic interaction networks: illuminating a new 88 facet of genetics 4.4 Understanding the mechanistic basis of dosage suppression 90 4.5 Concluding thoughts 92 Chapter Five: References 94 Chapter Six: Appendices 112 viii List of Tables Page 1.1 Overview of existing essential gene strain collections for S. cerevisiae 14 1.2 Overview of gene overexpression plasmid libraries for S. cerevisiae 18 3.2.1 Overlap of dosage suppression interactions with other types of 65 interactions 3.2.2 Distribution of dosage suppression gene pairs annotated in the 66 Saccharomyces Genome Database 3.2.3 Gene pairs tested for reciprocal suppression 71 3.2.4 Yeast strains used in this study 78 ix List of Figures Page 1.1 The barcoded kanMX cassette of the S. cerevisiae deletion collection 9 1.2 Barcode microarray method in yeast 11 1.3 Plasmid libraries in S. cerevisiae 19 2.2.1 Plasmid map of p5472 37 2.2.2 Construction of the MoBY-ORF library by homologous 38 recombination in yeast 3.2.1 Schematic of a plasmid in the MoBY-ORF 2.0 plasmid library 47 3.2.2 Plasmid map of p5476 48 3.2.3 MAGIC with the MoBY-ORF 1.0 plasmid library 49 3.2.4 Using the MoBY-ORF 2.0 library to identify candidate dosage 51 suppressors by barcode microarray 3.2.5 Empirical determination of raw barcode microarray intensity cutoff 54 for identification of candidate dosage suppressors 3.2.6 Dosage suppression genetic interaction network for S.
Recommended publications
  • Involvement of DPP9 in Gene Fusions in Serous Ovarian Carcinoma
    Smebye et al. BMC Cancer (2017) 17:642 DOI 10.1186/s12885-017-3625-6 RESEARCH ARTICLE Open Access Involvement of DPP9 in gene fusions in serous ovarian carcinoma Marianne Lislerud Smebye1,2, Antonio Agostini1,2, Bjarne Johannessen2,3, Jim Thorsen1,2, Ben Davidson4,5, Claes Göran Tropé6, Sverre Heim1,2,5, Rolf Inge Skotheim2,3 and Francesca Micci1,2* Abstract Background: A fusion gene is a hybrid gene consisting of parts from two previously independent genes. Chromosomal rearrangements leading to gene breakage are frequent in high-grade serous ovarian carcinomas and have been reported as a common mechanism for inactivating tumor suppressor genes. However, no fusion genes have been repeatedly reported to be recurrent driver events in ovarian carcinogenesis. We combined genomic and transcriptomic information to identify novel fusion gene candidates and aberrantly expressed genes in ovarian carcinomas. Methods: Examined were 19 previously karyotyped ovarian carcinomas (18 of the serous histotype and one undifferentiated). First, karyotypic aberrations were compared to fusion gene candidates identified by RNA sequencing (RNA-seq). In addition, we used exon-level gene expression microarrays as a screening tool to identify aberrantly expressed genes possibly involved in gene fusion events, and compared the findings to the RNA-seq data. Results: We found a DPP9-PPP6R3 fusion transcript in one tumor showing a matching genomic 11;19-translocation. Another tumor had a rearrangement of DPP9 with PLIN3. Both rearrangements were associated with diminished expression of the 3′ end of DPP9 corresponding to the breakpoints identified by RNA-seq. For the exon-level expression analysis, candidate fusion partner genes were ranked according to deviating expression compared to the median of the sample set.
    [Show full text]
  • Library Construction and Screening
    Library construction and screening • A gene library is a collection of different DNA sequences from an organism, • which has beenAlso called genomic libraries or gene banks. • cloned into a vector for ease of purification, storage and analysis. Uses of gene libraries • To obtain the sequences of genes for analysis, amplification, cloning, and expression. • Once the sequence is known probes, primers, etc. can be synthesized for further diagnostic work using, for example, hybridization reactions, blots and PCR. • Knowledge of a gene sequence also offers the possibility of gene therapy. • Also, gene expression can be used to synthesize a product in particular host cells, e.g. synthesis of human gene products in prokaryotic cells. two types of gene library depending upon the source of the DNA used. 1.genomic library. 2.cDNA library Types of GENE library: • genomic library contains DNA fragments representing the entire genome of an organism. • cDNA library contains only complementary DNA molecules synthesized from mRNA molecules in a cell. Genomic Library : • Made from nuclear DNA of an organism or species. • DNA is cut into clonable size pieces as randomly possible using restriction endonuclease • Genomic libraries contain whole genomic fragments including gene exons and introns, gene promoters, intragenic DNA,origins of replication, etc Construction of Genomic Libraries 1. Isolation of genomic DNA and vector. 2.Cleavage of Genomic DNA and vector by Restriction Endonucleases. 3.Ligation of fragmented DNA with the vector. 4.Transformation of
    [Show full text]
  • Construction of Small-Insert Genomic DNA Libraries Highly Enriched
    Proc. Natl. Acad. Sci. USA Vol. 89, pp. 3419-3423, April 1992 Genetics Construction of small-insert genomic DNA libraries highly enriched for microsatellite repeat sequences (marker-selected libraries/CA repeats/sequence-tagged sites/genetic mapping/dog genome) ELAINE A. OSTRANDER*tt, PAM M. JONG*t, JASPER RINE*t, AND GEOFFREY DUYKt§ *Department of Molecular and Cellular Biology, 401 Barker Hall, University of California, Berkeley, CA 94720; tHuman Genome Center, Lawrence Berkeley Laboratory, 1 Cyclotron Road, 74-157, Berkeley, CA 94720; and §Department of Genetics, Howard Hughes Medical Institute, Harvard Medical School, 25 Shattuck Street, Boston, MA 02115 Communicated by Philip Leder, January 8, 1992 ABSTRACT We describe an efficient method for the con- Generation of a high-density map of markers for an entire struction of small-insert genomic libraries enriched for highly genome or a single chromosome requires the isolation and polymorphic, simple sequence repeats. With this approach, characterization of hundreds of markers such as microsatel- libraries in which 40-50% of the members contain (CA). lite repeats (10, 11). Two simple yet tedious approaches have repeats are produced, representing an =50-fold enrichment generally been used for this task. One approach is to screen over conventional small-insert genomic DNA libraries. Briefly, a large-insert genomic library with an end-labeled (CA),, or a genomic library with an average insert size of less than 500 (TG),, oligonucleotide (n > 15). Clones that hybridize to the base pairs was constructed in a phagemid vector. Ampliflcation probe are purified and divided into subclones, which are of this library in a dut ung strain ofEscherchia coli allowed the screened by hybridization for a fragment containing the recovery of the library as closed circular single-stranded DNA repeat.
    [Show full text]
  • Multiplexed Microbial Library Preparation Using Smrtbell
    Technical Overview: Multiplexed Microbial Library Preparation Using SMRTbell Express Template Prep Kit 2.0 Sequel System ICS v8.0 / Sequel Chemistry 3.0 / SMRT Link v9.0 Sequel II System ICS v9.0 / Sequel II Chemistry 2.0 / SMRT Link v9.0 Sequel IIe System ICS v10.0 / Sequel II Chemistry 2.0 / SMRT Link v10.0 For Research Use Only. Not for use in diagnostic procedures. © Copyright 2021 by Pacific Biosciences of California, Inc. All rights reserved. PN 101-742-600 Ver 2021-02-01-A (February 2021) Multiplexed Microbial Library Preparation Using SMRTbell Express Template Prep Kit 2.0 1. Multiplexed Microbial Sample Preparation Workflow Overview 2. Multiplexed Microbial Sample Preparation Workflow Details 3. Multiplexed Microbial Sequencing Workflow Details 4. Multiplexed Microbial Data Analysis Workflow Details 5. Multiplexed Microbial Library Example Performance Data 6. Technical Documentation & Applications Support Resources 7. Appendix: General Recommendations for High-Molecular Weight gDNA QC and Handling for Multiplexed Microbial SMRTbell Library Construction MULTIPLEXED MICROBIAL SEQUENCING: HOW TO GET STARTED Application-Specific Application-Specific Application Consumable Library Construction, Best Practices Guide Procedure & Checklist Bundle Purchasing Guide Sequencing & Analysis gDNA QC & Shearing ≥1 µg Input gDNA / Microbial Sample 10 kb – 15 kb Target DNA Shear Size Library Construction Multiplex Up To 48 Microbial Samples with the Sequel II and IIe Systems using Barcoded Overhang Adapters (BOA) SMRT Sequencing Use the Sequel,
    [Show full text]
  • Mai Muudatuntuu Ti on Man Mini
    MAIMUUDATUNTUU US009809854B2 TI ON MAN MINI (12 ) United States Patent ( 10 ) Patent No. : US 9 ,809 ,854 B2 Crow et al. (45 ) Date of Patent : Nov . 7 , 2017 Whitehead et al. (2005 ) Variation in tissue - specific gene expression ( 54 ) BIOMARKERS FOR DISEASE ACTIVITY among natural populations. Genome Biology, 6 :R13 . * AND CLINICAL MANIFESTATIONS Villanueva et al. ( 2011 ) Netting Neutrophils Induce Endothelial SYSTEMIC LUPUS ERYTHEMATOSUS Damage , Infiltrate Tissues, and Expose Immunostimulatory Mol ecules in Systemic Lupus Erythematosus . The Journal of Immunol @(71 ) Applicant: NEW YORK SOCIETY FOR THE ogy , 187 : 538 - 552 . * RUPTURED AND CRIPPLED Bijl et al. (2001 ) Fas expression on peripheral blood lymphocytes in MAINTAINING THE HOSPITAL , systemic lupus erythematosus ( SLE ) : relation to lymphocyte acti vation and disease activity . Lupus, 10 :866 - 872 . * New York , NY (US ) Crow et al . (2003 ) Microarray analysis of gene expression in lupus. Arthritis Research and Therapy , 5 :279 - 287 . * @(72 ) Inventors : Mary K . Crow , New York , NY (US ) ; Baechler et al . ( 2003 ) Interferon - inducible gene expression signa Mikhail Olferiev , Mount Kisco , NY ture in peripheral blood cells of patients with severe lupus . PNAS , (US ) 100 ( 5 ) : 2610 - 2615. * GeneCards database entry for IFIT3 ( obtained from < http : / /www . ( 73 ) Assignee : NEW YORK SOCIETY FOR THE genecards. org /cgi - bin / carddisp .pl ? gene = IFIT3 > on May 26 , 2016 , RUPTURED AND CRIPPLED 15 pages ) . * Navarra et al. (2011 ) Efficacy and safety of belimumab in patients MAINTAINING THE HOSPITAL with active systemic lupus erythematosus : a randomised , placebo FOR SPECIAL SURGERY , New controlled , phase 3 trial . The Lancet , 377 :721 - 731. * York , NY (US ) Abramson et al . ( 1983 ) Arthritis Rheum .
    [Show full text]
  • The Human Genome Project
    TO KNOW OURSELVES ❖ THE U.S. DEPARTMENT OF ENERGY AND THE HUMAN GENOME PROJECT JULY 1996 TO KNOW OURSELVES ❖ THE U.S. DEPARTMENT OF ENERGY AND THE HUMAN GENOME PROJECT JULY 1996 Contents FOREWORD . 2 THE GENOME PROJECT—WHY THE DOE? . 4 A bold but logical step INTRODUCING THE HUMAN GENOME . 6 The recipe for life Some definitions . 6 A plan of action . 8 EXPLORING THE GENOMIC LANDSCAPE . 10 Mapping the terrain Two giant steps: Chromosomes 16 and 19 . 12 Getting down to details: Sequencing the genome . 16 Shotguns and transposons . 20 How good is good enough? . 26 Sidebar: Tools of the Trade . 17 Sidebar: The Mighty Mouse . 24 BEYOND BIOLOGY . 27 Instrumentation and informatics Smaller is better—And other developments . 27 Dealing with the data . 30 ETHICAL, LEGAL, AND SOCIAL IMPLICATIONS . 32 An essential dimension of genome research Foreword T THE END OF THE ROAD in Little has been rapid, and it is now generally agreed Cottonwood Canyon, near Salt that this international project will produce Lake City, Alta is a place of the complete sequence of the human genome near-mythic renown among by the year 2005. A skiers. In time it may well And what is more important, the value assume similar status among molecular of the project also appears beyond doubt. geneticists. In December 1984, a conference Genome research is revolutionizing biology there, co-sponsored by the U.S. Department and biotechnology, and providing a vital of Energy, pondered a single question: Does thrust to the increasingly broad scope of the modern DNA research offer a way of detect- biological sciences.
    [Show full text]
  • Cdna Libraries and Expression Libraries
    Solutions for Practice Problems for Recombinant DNA, Session 4: cDNA Libraries and Expression Libraries Question 1 In a hypothetical scenario you wake up one morning to your roommate exclaiming about her sudden hair growth. She has been supplementing her diet with a strange new fungus purchased at the local farmer’s market. You take samples of the fungus to your lab and you find that this fungus does indeed make a protein (the harE protein) that stimulates hair growth. You construct a fungal genomic DNA library in E. Coli with the hope of cloning the harE gene. If you succeed you will be a billionaire! You obtain DNA from the fungus, digest it with a restriction enzyme, and clone it into a vector. a) What features must be present on your plasmid that will allow you to use this as a cloning vector to make fungal genomic DNA library. Your vector would certainly need to have a unique restriction enzyme site, a selectable marker such as the ampicillin resistance gene, and a bacterial origin of replication. Other features may be required depending upon how you plan to use this library. b) You clone your digested genomic DNA into this vector. The E. coli (bacteria) cells that you will transform to create your library will have what phenotype prior to transformation? Prior to transformation, the E. coli cells that you will transform will be sensitive to antibiotic. This allows you to select for cells that obtained a plasmid. c) How do you distinguish bacterial cells that carry a vector from those that do not? Cells that obtained a vector will have obtained the selectable marker (one example is the ampicillin resistance gene).
    [Show full text]
  • Bacterial Artificial Chromosomes (Bacs) Became the Most Broadly Used Resource for Several Reasons
    Bacterial Artificial Chromosomes STC Production on Human BACs Archive Provided for Historical Purposes Home STC Project History 1995 Meeting Articles Contacts Links HGP Sequences HGP Research Several types of DNA library resources were sponsored by the DOE before and during the Human Genome Program (HGP). These included both prokaryotic and eukaryotic vector systems, and clone libraries representing single chromosomes. Bacterial Artificial Chromosomes (BACs) became the most broadly used resource for several reasons. The large size was a good match for capabilities of high throughput sequencing centers. As contrasted to some earlier resources, chimerism (having gene segments from multiple chromosome sites combined in one clone) is substantially if not completely absent. With some interesting exceptions , the BACS are stable in their bacterial hosts. In support of the functional analysis of genes, the BACs are very useful for making transgenic animals with segments of human DNAs. A brief history of BAC development is available in a preface to a 2003 issue of Methods in Molecular Biology , wherein details of BAC related protocols reside. One particular BAC project was crucial to the timely completion of human genome sequencing. (See history .) In a 1996 initiative, the DOE Office of Biological and Environmental Research sponsored the production of sequence tag connectors (STCs) for the BACs being used in human genome sequencing. (STCs are sequence reads at the ends of cloned DNA segments; they mark the boundaries of the cloned DNA.) This publicly available resource has served both the international public collaboration and Celera Genomics Inc . in the generation of the human genome sequence. The BACs representing a genome can together serve as a scaffold on which much shorter DNA sequence assemblies can be located.
    [Show full text]
  • Inferring Biological Networks from Genome-Wide Transcriptional And
    INFERRING BIOLOGICAL NETWORKS FROM GENOME-WIDE TRANSCRIPTIONAL AND FITNESS DATA By WAZEER MOHAMMAD VARSALLY A thesis submitted to The University of Birmingham for the degree of Doctor of Philosophy College of Life and Environmental Sciences School of Biosciences The University of Birmingham July 2013 I University of Birmingham Research Archive e-theses repository This unpublished thesis/dissertation is copyright of the author and/or third parties. The intellectual property rights of the author or third parties in respect of this work are as defined by The Copyright Designs and Patents Act 1988 or as modified by any successor legislation. Any use made of information contained in this thesis/dissertation must be in accordance with that legislation and must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the permission of the copyright holder. ABSTRACT In the last 15 years, the increased use of high throughput biology techniques such as genome-wide gene expression profiling, fitness profiling and protein interactomics has led to the generation of an extraordinary amount of data. The abundance of such diverse data has proven to be an essential foundation for understanding the complexities of molecular mechanisms and underlying pathways within a biological system. One approach of extrapolating biological information from this wealth of data has been through the use of reverse engineering methods to infer biological networks. This thesis demonstrates the capabilities and applications of such methodologies in identifying functionally enriched network modules in the yeast species Saccharomyces cerevisiae and Schizosaccharomyces pombe. This study marks the first time a mutual information based network inference approach has been applied to a set of specific genome-wide expression and fitness compendia, as well as the integration of these multi- level compendia.
    [Show full text]
  • Human Chromosome-Specific Cdna Libraries: New Tools for Gene Identification and Genome Annotation
    Downloaded from genome.cshlp.org on September 25, 2021 - Published by Cold Spring Harbor Laboratory Press RESEARCH Human Chromosome-specific cDNA Libraries: New Tools for Gene Identification and Genome Annotation Richard G. Del Mastro, 1'2 Luping Wang, ~'2 Andrew D. Simmons, Teresa D. Gallardo, 1 Gregory A. Clines, ~ Jennifer A. Ashley, 1 Cynthia J. Hilliard, 3 John J. Wasmuth, 3 John D. McPherson, 3 and Michael Lovett ~'4 1Department of Biochemistry and the McDermott Center for Human Growth and Development, The University of Texas Southwestern Medical Center, Dallas, Texas 75235-8591; 3Department of Biological Chemistry and the Human Genome Center, University of California, Irvine, California 9271 7 To date, only a small percentage of human genes have been cloned and mapped. To facilitate more rapid gene mapping and disease gene isolation, chromosome S-specific cDNA libraries have been constructed from five sources. DNA sequencing and regional mapping of 205 unique cDNAs indicates that 25 are from known chromosome S genes and 138 are from new chromosome S genes (a frequency of 79.5%}. Sequence complexity estimates indicate that each library contains -20% of the -SO00 genes that are believed to reside on chromosome 5. This study more than doubles the number of genes mapped to chromosome S and describes an important new tool for disease gene isolation. A detailed map of expressed sequences within the pressed Sequence Tags (eSTs)] (Adams et al. 1991, human genome would provide an indispensable 1992, 1993a,b; Khan et al. 1991; Wilcox et al. resource for isolating disease genes, and would 1991; Okubo et al.
    [Show full text]
  • Gene Library
    Course: M.Sc. Biotechnology Paper: BIOT4009: Genetic Engineering and Gene Therapy 1 UNIT – IV Gene library BRIJESH PANDEY DEPARTMENT OF BIOTECHNOLOGY MAHATMA GANDHI CENTRAL UNIVERSITY, BIHAR Gene library 2 Library is collection of clones….. Collection of clones representing Total transcripts- Total Genome- Part of Genome- CDNA library/ Genomic DNA Sub Genomic EST Library/ library DNA library Expressed library Genomic DNA Library 3 Collection of clones representing total genome Present in population of identical vectors Vectors contain clonable fragments of genomic DNA Vectors are self-replicating Vectors containing insert DNA are maintained in host cells like E. coli and S. cerevisiae Genomic DNA Library construction method 4 Spread on lawn of host bacteria and count the titre Joining the vector and insert DNA fragments Store, distribute and use using ligase Library may also be constructed in high capacity vectors like BAC/ YAC/ PAC Genomic DNA Library contd. 5 Since size of genome of organism varies widely Number of clones required in library to represent total genome varies It depends upon Type of and frequency of restriction endonuclease Average size of fragments Total size of genome e.g. Human genome size=2.8 x 10 6 Kb Average fragment / clone size= 20 kb Number of fragments required to represent total genome =1.4 x 10 5 6 The number of independent recombinants required in the library must be greater than n, because sampling variation will lead to the inclusion of some sequences several times and the exclusion of other sequences in a library of just n recombinants. Clarke and Carbon (1976) P= probability of including any DNA sequence in a random library of N independent recombinants: To achieve a 95% probability ( P = 0.95) of including any particular sequence in a random human genomic DNA library of 20 kb fragment size Number of clones required would be From: Principles of gene manipulation by Primrose et al 6 th ed.
    [Show full text]
  • Variation in Protein Coding Genes Identifies Information Flow
    bioRxiv preprint doi: https://doi.org/10.1101/679456; this version posted June 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Animal complexity and information flow 1 1 2 3 4 5 Variation in protein coding genes identifies information flow as a contributor to 6 animal complexity 7 8 Jack Dean, Daniela Lopes Cardoso and Colin Sharpe* 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 Institute of Biological and Biomedical Sciences 25 School of Biological Science 26 University of Portsmouth, 27 Portsmouth, UK 28 PO16 7YH 29 30 * Author for correspondence 31 [email protected] 32 33 Orcid numbers: 34 DLC: 0000-0003-2683-1745 35 CS: 0000-0002-5022-0840 36 37 38 39 40 41 42 43 44 45 46 47 48 49 Abstract bioRxiv preprint doi: https://doi.org/10.1101/679456; this version posted June 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Animal complexity and information flow 2 1 Across the metazoans there is a trend towards greater organismal complexity. How 2 complexity is generated, however, is uncertain. Since C.elegans and humans have 3 approximately the same number of genes, the explanation will depend on how genes are 4 used, rather than their absolute number.
    [Show full text]