Functional Sites in Structure and Sequence - Protein Active Sites and Mirna Target Recognition

Total Page:16

File Type:pdf, Size:1020Kb

Functional Sites in Structure and Sequence - Protein Active Sites and Mirna Target Recognition Functional Sites in Structure and Sequence - Protein Active Sites and miRNA Target Recognition - I n a u g u r a l - D i s s e r t a t i o n zur Erlangung des Doktorgrades der Mathematisch-Naturwissenschaftlichen Fakultät der Universität zu Köln vorgelegt von Alexander Stark aus Bietigheim 2004 2 Berichterstatter: Prof. Dr. Schomburg Prof. Dr. Waffenschmidt Tag der mündlichen Prüfung: 14. Juni 2004 3 Table of Contents Zusammenfassung __________________________________________________________ 9 Abstract __________________________________________________________________ 10 1 Introduction __________________________________________________________ 11 1.1 Protein Structure, Function and Evolution ___________________________________ 11 1.2 Protein Sequence Comparison _____________________________________________ 14 1.2.1 Pairwise Sequence Alignments and Sequence Searches ________________________________ 14 1.2.2 Multiple Sequence Alignments ___________________________________________________ 15 1.2.3 Sequence Profiles and Hidden Markov Models _______________________________________ 16 1.2.4 Limits of Sequence Comparison __________________________________________________ 18 1.2.4.1 Thresholds for Inference of Homology ________________________________________ 18 1.2.4.2 Thresholds for Reliable Inference of Function___________________________________ 18 1.2.5 Sequential Motifs______________________________________________________________ 20 1.3 Structure Prediction _____________________________________________________ 20 1.3.1 Homology Modelling___________________________________________________________ 20 1.3.2 Threading or Fold Recognition ___________________________________________________ 21 1.3.3 Ab initio Methods _____________________________________________________________ 22 1.3.4 CASP_______________________________________________________________________ 23 1.4 Assigning Function from Protein Structure __________________________________ 23 1.4.1 Structural Alignment ___________________________________________________________ 24 1.4.2 Active Site Identification and Comparison __________________________________________ 26 1.4.2.1 Identification of Active Sites using Conservation ________________________________ 26 1.4.2.2 Comparison of Active Site 3D Patterns ________________________________________ 27 1.5 Statistics for Sequence and Structure Comparison_____________________________ 30 1.6 Scope: PINTS–Patterns in Non-homologous Tertiary Structures _________________ 33 1.7 MicroRNAs: A Novel Class of Genes________________________________________ 34 1.7.1 miRNAs regulate post-transcriptional gene expression _________________________________ 34 1.7.2 General Importance of miRNAs __________________________________________________ 35 1.7.3 miRNA Biogenesis and Function _________________________________________________ 35 1.7.4 Assigning Function to miRNAs___________________________________________________ 38 1.7.4.1 Genetic Approaches_______________________________________________________ 39 1.7.4.2 Computational Approaches _________________________________________________ 39 1.7.5 Scope: A Screen for miRNA Targets in Drosophila ___________________________________ 40 4 2 Materials and Methods__________________________________________________ 41 2.1 General Equipment______________________________________________________ 41 2.2 Programs ______________________________________________________________ 41 2.3 The PINTS Program _____________________________________________________ 41 2.3.1 Representing Structure Data: Points _______________________________________________ 42 2.3.2 Data Input ___________________________________________________________________ 42 2.3.3 The Search Algorithm __________________________________________________________ 43 2.3.4 The RMSD Score and Statistical Significance________________________________________ 44 2.3.5 Output formats________________________________________________________________ 44 2.3.6 Other Parameters or Command Line Options ________________________________________ 45 2.4 PINTS Databases________________________________________________________ 45 2.4.1 PINTS Databases of Functional Patterns ____________________________________________ 45 2.4.2 Non-Redundant Databases of Protein Structures ______________________________________ 46 2.5 The Development of a Statistical Model _____________________________________ 46 2.5.1 Background database___________________________________________________________ 46 2.5.2 Parameter Determination ________________________________________________________ 46 2.5.3 Cumulative Distribution of P-values _______________________________________________ 47 2.5.4 Search with the Trypsin Catalytic Triad_____________________________________________ 47 2.5.5 Comparing Proteins to Pattern Databases ___________________________________________ 47 2.5.6 Over-represented Patterns _______________________________________________________ 47 2.6 Analysis of Structural Genomics Proteins____________________________________ 48 2.6.1 Structural similarity thresholds ___________________________________________________ 48 2.7 Analysis of Archaeal FBPA IA_____________________________________________ 49 2.7.1 Structural Alignments __________________________________________________________ 49 2.7.2 Comparison of FBPA Active-Site _________________________________________________ 49 2.8 miRNA Target Prediction_________________________________________________ 49 2.8.1 Accession numbers ____________________________________________________________ 49 2.8.2 Conserved 3' UTR-database______________________________________________________ 50 2.8.3 miRNA-Screen _______________________________________________________________ 51 2.8.4 Statistics ____________________________________________________________________ 51 3 Results and Discussion__________________________________________________ 52 3.1 Statistical Model for Local Structural Patterns _______________________________ 53 3.1.1 Rationale for a Statistical Model of RMSD __________________________________________ 53 3.1.2 Model assuming independence of atoms ____________________________________________ 55 3.1.3 Accounting for dependency of covalently linked atoms_________________________________ 57 5 3.1.4 Final P-value for Local Structural Pattern Comparison _________________________________ 60 3.1.5 Comparing Patterns to Databases of Proteins_________________________________________ 61 3.1.6 Comparing Proteins to Databases of Patterns_________________________________________ 62 3.1.7 Comparing Entire Protein Structures _______________________________________________ 63 3.1.8 Deviations from predicted E-Values _______________________________________________ 64 3.1.9 Concluding Remarks Regarding the Statistical Model__________________________________ 67 3.2 Assigning Function to Protein Structures – Examples __________________________ 68 3.2.1 Overall Performance of Functional Site Comparison___________________________________ 69 3.2.2 Confirmation of Superfamily, or Resolution of Ambiguity ______________________________ 70 3.2.3 Sites Found by Similarities Between Different Folds __________________________________ 71 3.2.4 Discussion ___________________________________________________________________ 73 3.3 Structural Analysis of the Archaeal Class I FBP-Aldolase_______________________ 75 3.3.1 Overall Structure of FBPA IA ____________________________________________________ 75 3.3.2 Comparison to the classical FBPA I _______________________________________________ 76 3.3.3 TIM and the aldolase superfamilies are homologous ___________________________________ 79 3.4 The PINTS Server_______________________________________________________ 81 3.4.1 Searches_____________________________________________________________________ 81 3.4.2 Output ______________________________________________________________________ 82 3.4.3 Settings _____________________________________________________________________ 83 3.4.4 PINTS-Weekly _______________________________________________________________ 83 3.4.5 Access Statistics ______________________________________________________________ 84 3.5 miRNA Target Prediction_________________________________________________ 85 3.5.1 Conserved 3’ UTR Database _____________________________________________________ 85 3.5.2 Screening strategy _____________________________________________________________ 87 3.5.3 Tests with previously validated targets _____________________________________________ 89 3.5.4 miR-7 regulates Notch targets ____________________________________________________ 90 3.5.5 miR-2 regulates pro-apoptotic genes _______________________________________________ 94 3.5.6 Statistical Evaluation of Target Predictions __________________________________________ 94 3.5.7 Additional validation by cross genome comparison____________________________________ 95 3.5.8 miR-277 is a putative metabolic switch _____________________________________________ 97 3.5.9 Features shared by validated targets________________________________________________ 97 3.5.10 Discussion ___________________________________________________________________ 98 3.6 Other Projects _________________________________________________________ 103 3.6.1 Estrogen receptor: a case study __________________________________________________ 103 3.6.2 The miRNA Oncogene Bantam __________________________________________________ 104 3.6.3 CASP5 – Assigning Structures to Sequences________________________________________ 105 3.6.4 The Relationship Between Sequence
Recommended publications
  • Contents and Society
    8 EMBL August 2001 &cetera Newsletter of the European Molecular Biology Laboratory published by the Office of Information and Public Affairs EMBL appoints heads for Hinxton and Monterotondo MBLhas appointed new heads of the EBI in Hinxton and the A priority for both scientists will be to boost research activities at EMouse Biology Programme in Monterotondo. Nadia their units. Past funding limitations have prevented the EBI from Rosenthal, Associate Professor at Harvard and consulting editor developing a full research programme, although there have been at the New England Journal of Medicine, has taken charge of the a few very active groups since the site was opened. New funds Mouse Biology Programme, assuming the post vacated by Klaus from the EMBL member states will permit the addition of a full Rajewsky in June. Janet Thornton, Professor at the University complement of research teams, just as Cameron and his col- College of London and Fellow of the Royal Society, will direct leagues have been extremely successful in obtaining major fund- activities at the EBI, especially research. She succeeds Michael ing from the EC and the Wellcome Trust to bolster the EBI data- Ashburner, who has served as Co-Head of the Institute alongside bases and related services. Graham Cameron since the EBI was launched five years ago. Monterotondo, which opened in 1999, has been waiting for the "The new five-year scientific plan at EMBL stresses functional g reen light from EMBL's Council to reach critical mass. genomics and foresees substantial expansion of the EBI and the Rosenthal’s appointment comes at a time when a partner on the Mouse Biology Programme at Monterotondo," says Director Monterotondo campus, the European Mutant Mouse Archive General Fotis C.
    [Show full text]
  • EMBO Facts & Figures
    excellence in life sciences Reykjavik Helsinki Oslo Stockholm Tallinn EMBO facts & figures & EMBO facts Copenhagen Dublin Amsterdam Berlin Warsaw London Brussels Prague Luxembourg Paris Vienna Bratislava Budapest Bern Ljubljana Zagreb Rome Madrid Ankara Lisbon Athens Jerusalem EMBO facts & figures HIGHLIGHTS CONTACT EMBO & EMBC EMBO Long-Term Fellowships Five Advanced Fellows are selected (page ). Long-Term and Short-Term Fellowships are awarded. The Fellows’ EMBO Young Investigators Meeting is held in Heidelberg in June . EMBO Installation Grants New EMBO Members & EMBO elects new members (page ), selects Young EMBO Women in Science Young Investigators Investigators (page ) and eight Installation Grantees Gerlind Wallon EMBO Scientific Publications (page ). Programme Manager Bernd Pulverer S Maria Leptin Deputy Director Head A EMBO Science Policy Issues report on quotas in academia to assure gender balance. R EMBO Director + + A Conducts workshops on emerging biotechnologies and on H T cognitive genomics. Gives invited talks at US National Academy E IC of Sciences, International Summit on Human Genome Editing, I H 5 D MAN 201 O N Washington, DC.; World Congress on Research Integrity, Rio de A M Janeiro; International Scienti c Advisory Board for the Centre for Eilish Craddock IT 2 015 Mammalian Synthetic Biology, Edinburgh. Personal Assistant to EMBO Fellowships EMBO Scientific Publications EMBO Gold Medal Sarah Teichmann and Ido Amit receive the EMBO Gold the EMBO Director David del Álamo Thomas Lemberger Medal (page ). + Programme Manager Deputy Head EMBO Global Activities India and Singapore sign agreements to become EMBC Associate + + Member States. EMBO Courses & Workshops More than , participants from countries attend 6th scienti c events (page ); participants attend EMBO Laboratory Management Courses (page ); rst online course EMBO Courses & Workshops recorded in collaboration with iBiology.
    [Show full text]
  • Peakmatcher: Matching Peaks Across Genome Assemblies
    PeakMatcher: Matching Peaks Across Genome Assemblies Ronald J. Nowling Christopher R. Beal Scott Emrich Milwaukee School of Engineering Marquette University University of Tennessee–Knoxville Milwaukee, Wisconsin Milwaukee, Wisconsin Knoxville, Tennessee [email protected] [email protected] [email protected] Susanta K. Behura Marc S. Halfon Molly Duman-Scheel University of Missouri SUNY at Buffalo Indiana University School of Medicine Columbia, Missouri Buffalo, New York South Bend, Indiana [email protected] [email protected] [email protected] ABSTRACT The second data set was STARR-Seq (Self-Transcribing Active When reference genome assemblies are updated, the peaks from Regulatory Region Sequencing) of Drosophila melanogaster DNA DNA enrichment assays such as ChIP-Seq and FAIRE-Seq need in S2 culture cells [1]. We called STARR peaks against two versions to be called again using the new genome assembly. PeakMatcher (dm3 and r5.53) of the D. melanogaster genome [6]. PeakMatcher is an open-source package that aids in validation by matching matched 77.4% (precision) of the 4,195 dm3 peaks with 94.8% (recall) peaks across two genome assemblies using the alignment of reads of the 3,114 r5.53 peaks. or within the same genome. PeakMatcher calculates recall and PeakMatcher and associated documentation are available on precision while also outputting lists of peak-to-peak matches. GitHub (https://github.com/rnowling/peak-matcher) under the open- PeakMatcher uses read alignments to match peaks across genome source Apache Software License v2. PeakMatcher was written in assemblies. PeakMatcher finds all read aligned to one genome that Python 3 using the intervaltree library.
    [Show full text]
  • Functional Assessment of Human Enhancer Activities Using Whole-Genome STARR- Sequencing Yuwen Liu1,2†, Shan Yu1,2†, Vineet K
    Liu et al. Genome Biology (2017) 18:219 DOI 10.1186/s13059-017-1345-5 RESEARCH Open Access Functional assessment of human enhancer activities using whole-genome STARR- sequencing Yuwen Liu1,2†, Shan Yu1,2†, Vineet K. Dhiman1,2†, Tonya Brunetti1,2, Heather Eckart2 and Kevin P. White1,2,3* Abstract Background: Genome-wide quantification of enhancer activity in the human genome has proven to be a challenging problem. Recent efforts have led to the development of powerful tools for enhancer quantification. However, because of genome size and complexity, these tools have yet to be applied to the whole human genome. Results: In the current study, we use a human prostate cancer cell line, LNCaP as a model to perform whole human genome STARR-seq (WHG-STARR-seq) to reliably obtain an assessment of enhancer activity. This approach builds upon previously developed STARR-seq in the fly genome and CapSTARR-seq techniques in targeted human genomic regions. With an improved library preparation strategy, our approach greatly increases the library complexity per unit of starting material, which makes it feasible and cost-effective to explore the landscape of regulatory activity in the much larger human genome. In addition to our ability to identify active, accessible enhancers located in open chromatin regions, we can also detect sequences with the potential for enhancer activity that are located in inaccessible, closed chromatin regions. When treated with the histone deacetylase inhibitor, Trichostatin A, genes nearby this latter class of enhancers are up-regulated, demonstrating the potential for endogenous functionality of these regulatory elements. Conclusion: WHG-STARR-seq provides an improved approach to current pipelines for analysis of high complexity genomes to gain a better understanding of the intricacies of transcriptional regulation.
    [Show full text]
  • Wednesday 26 OCTOBER Thursday 27 OCTOBER
    Wednesday 26 OCTOBER Thursday 27 OCTOBER 11:00 Maria Leptin Welcome address 08:25 Bus from Villa Toskana Hotel 08:30 Bus from ISG Hotel SESSION 1 Suzanne Eaton CHAIR Max Planck Institute of Molecular Cell Biology & Genetics, Dresden, DE 08:30 – 09:00 COFFEE BREAK EMBL OPERON FOYER 11:10 Helder Maiato Cracking the code of mitosis University of Porto, PT SESSION 5 Michael Way CHAIR The Francis Crick Institute, London, GB 11:30 Eugene Myers A forward-looking vision for microscopy and image analysis 09:00 Buzz Baum Cell division: its origins and mechanics Max Planck Institute of Molecular Cell Biology & Genetics, Dresden, DE University College London, GB 11:50 Adam Antebi Convergent mechanisms of longevity 09:20 Pekka Lappalainen Assembly of mechanosensitive actomyosin bundles in migrating cells Max Planck Institute for Biology of Ageing, Cologne, DE University of Helsinki, FI 12:10 Uwe Sauer Coordination of metabolism 09:40 Carsten Janke The tubulin code – generating signals to control microtubule function ETH Zurich, CH Institut Curie, Orsay, FR 12:30 Ehud Gazit Metabolite amyloids: a new paradigm for inborn error of metabolism 10:00 Carl-Philipp Heisenberg Cell and tissue mechanics in zebrafish gastrulation disorders Institute of Science and Technology Austria, Klosterneuburg, AT Tel Aviv University, IL 10:20 Erik Sahai Modelling lethal cancer phenotypes 12:50 – 13:20 LUNCH EMBL OPERON FOYER The Francis Crick Institute, London, GB SESSION 2 Javier Paz-Ares CHAIR Universidad Autónoma de Madrid, ES 10:40 – 11:00 COFFEE BREAK EMBL OPERON FOYER
    [Show full text]
  • Positional Specificity of Different Transcription Factor Classes Within Enhancers
    Positional specificity of different transcription factor classes within enhancers Sharon R. Grossmana,b,c, Jesse Engreitza, John P. Raya, Tung H. Nguyena, Nir Hacohena,d, and Eric S. Landera,b,e,1 aBroad Institute of MIT and Harvard, Cambridge, MA 02142; bDepartment of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139; cProgram in Health Sciences and Technology, Harvard Medical School, Boston, MA 02215; dCancer Research, Massachusetts General Hospital, Boston, MA 02114; and eDepartment of Systems Biology, Harvard Medical School, Boston, MA 02215 Contributed by Eric S. Lander, June 19, 2018 (sent for review March 26, 2018; reviewed by Gioacchino Natoli and Alexander Stark) Gene expression is controlled by sequence-specific transcription type-restricted enhancers (active in <50% of the cell types) and factors (TFs), which bind to regulatory sequences in DNA. TF ubiquitous enhancers (active in >90% of the cell types) (SI Ap- binding occurs in nucleosome-depleted regions of DNA (NDRs), pendix, Fig. S1C). which generally encompass regions with lengths similar to those We next sought to infer functional TF-binding sites within the protected by nucleosomes. However, less is known about where active regulatory elements. In a recent study (5), we found that within these regions specific TFs tend to be found. Here, we char- TF binding is strongly correlated with the quantitative DNA acterize the positional bias of inferred binding sites for 103 TFs accessibility of a region. Furthermore, the TF motifs associated within ∼500,000 NDRs across 47 cell types. We find that distinct with enhancer activity in reporter assays in a cell type corre- classes of TFs display different binding preferences: Some tend to sponded closely to those that are most enriched in the genomic have binding sites toward the edges, some toward the center, and sequences of active regulatory elements in that cell type (5).
    [Show full text]
  • Final Programme
    Programme Saturday 25 August 2018 15:00 - 18:00 Arrival and Registration ATC Registration Desk 16:30 - 17:30 Pre-Conference Workshop ATC Courtyard Seminar Room 18:00 - 18:15 Welcome and Opening Remarks ATC Auditorium 18:15 - 20:00 Session 1 Chair: Eileen Furlong, EMBL Heidelberg, Germany ATC Auditorium 18:15 - 18:35 FIREnano and ChromEMT: Visualizing the structural basis 1 of gene activation and silencing in the nucleus Clodagh O'Shea Salk Institute for Biological Studies, United States of America 18:35 - 18:55 Gene Expression Within the Context of a Topologically Associating Domain 2 Douglas Higgs MRC Weatherall Institute of Molecular Medicine, University of Oxford , United Kingdom 18:55 - 19:10 RNA-mediated regulation of chromatin complexes 3 Roberto Bonasio University of Pennsylvania, United States of America 19:10 - 19:25 Decoding the chromatin proteome of a single genomic 4 locus by DNA sequencing Fred van Leeuwen Netherlands Cancer Institute, The Netherlands 19:25 - 19:45 Epigenetic Regulation of HIV Transcription 5 Melanie Ott Gladstone Institutes, University of California, San Francisco, United States of America Page 5 EMBL Conference: Transcription and Chromatin 19:45 - 20:05 A sequence anomaly of the MLL promoter imposes a 6 transcription factor addiction in leukemia Christopher Vakoc Cold Spring Harbor Laboratory, United States of America 20:05 - 21:30 Dinner EMBL Canteen 21:30 - 23:00 Welcome Reception ATC Auditorium Foyer and ATC Rooftop Lounge Page 6 Programme Sunday 26 August 2018 09:00 - 12:40 Session 2 Co-Chairs: Peter
    [Show full text]
  • Multiplex Enhancer-Reporter Assays Uncover Unsophisticated TP53 Enhancer Logic
    Downloaded from genome.cshlp.org on October 6, 2021 - Published by Cold Spring Harbor Laboratory Press Research Multiplex enhancer-reporter assays uncover unsophisticated TP53 enhancer logic Annelien Verfaillie,1 Dmitry Svetlichnyy,1 Hana Imrichova,1 Kristofer Davie,1 Mark Fiers,2 Zeynep Kalender Atak,1 Gert Hulselmans,1 Valerie Christiaens,1 and Stein Aerts1 1Laboratory of Computational Biology, Center for Human Genetics, University of Leuven, 3000 Leuven, Belgium; 2VIB Center for the Biology of Disease, 3000 Leuven, Belgium Transcription factors regulate their target genes by binding to regulatory regions in the genome. Although the binding preferences of TP53 are known, it remains unclear what distinguishes functional enhancers from nonfunctional binding. In addition, the genome is scattered with recognition sequences that remain unoccupied. Using two complementary tech- niques of multiplex enhancer-reporter assays, we discovered that functional enhancers could be discriminated from non- functional binding events by the occurrence of a single TP53 canonical motif. By combining machine learning with a meta-analysis of TP53 ChIP-seq data sets, we identified a core set of more than 1000 responsive enhancers in the human genome. This TP53 cistrome is invariably used between cell types and experimental conditions, whereas differences among experiments can be attributed to indirect nonfunctional binding events. Our data suggest that TP53 enhancers represent a class of unsophisticated cell-autonomous enhancers containing a single TP53 binding site, distinct from complex develop- mental enhancers that integrate signals from multiple transcription factors. [Supplemental material is available for this article.] Enhancers are essential regulatory elements that are bound by respective TF (Wang et al.
    [Show full text]
  • Annual Report 20 15
    ANNUAL REPORT 2015 HUMAN FRONTIER SCIENCE PROGRAM The Human Frontier Science Program is unique, supporting international collaboration to undertake innovative, risky, basic research at the frontier of the life sciences. Special emphasis is given to the support and training of independent young investigators, beginning at the postdoctoral level. The Program is implemented by an international organization, supported financially by Australia, Canada, France, Germany, India, Italy, Japan, the Republic of Korea, New Zealand, Norway, Singapore, Switzerland, the United Kingdom, the United States of America, and the European Union. Since 1990, over 7000 awards have been made to researchers from more than 70 countries. Of these, 26 HFSP awardees have gone on to receive the Nobel Prize. APRIL 2015 - MARCH 2016 ANNUAL REPORT — 3 — The following documents are available on the HFSP website www.hfsp.org: Joint Communiqués (Tokyo 1992, Washington 1997, Berlin 2002, Bern 2004, Ottawa 2007, Canberra 2010, Brussels 2013, London 2016): http://www.hfsp.org/about-us/governance/intergovernmental-conference Statutes of the International Human Frontier Science Program Organization : http://www.hfsp.org/about-us/governance/statutes Guidelines for the participation of new members in HFSPO : http://www.hfsp.org/about-us/new-membership General reviews of the HFSP (1996, 2001, 2006-2007, 2010): http://www.hfsp.org/about-us/reviews-hfsp Updated and previous lists of awards, including titles and abstracts: http://www.hfsp.org/awardees — 4 — Table of contents INTRODUCTION
    [Show full text]
  • Not Mir-Ly Muscular: Micrornas and Muscle Development
    Downloaded from genesdev.cshlp.org on September 30, 2021 - Published by Cold Spring Harbor Laboratory Press PERSPECTIVE Not miR-ly muscular: microRNAs and muscle development Julius Brennecke, Alexander Stark, and Stephen M. Cohen1 European Molecular Biology Laboratory, 69117 Heidelberg, Germany Precision in spatial and temporal control of gene expres- Giraldez et al. 2005; He et al. 2005; Johnson et al. 2005; sion is important for formation of tissues and organs. Lu et al. 2005). However, in most of these studies, the Roughly 5% of the known gene complement in animals actual role of the miRNA in these processes is fairly encodes transcriptional activators and repressors, whose unclear. A few of these functions have been attributed on combinatorial action controls the expression of target the basis of a miRNA loss-of-function mutant pheno- genes, triggering cell-fate decisions and differentiation type. Others are inferred from the effects miRNA over- during development. With the discovery of microRNAs, expression or misexpression. Given their numerous tar- a new means of regulating gene expression has been in- gets and spatially limited expression patterns, ectopic troduced (for review, see Lai 2003; Ambros 2004; Bartel expression of miRNAs can be expected to cause pheno- 2004). These ∼22-nucleotide-long RNAs are now thought types, due to down-regulation of potential target genes in to comprise 1%–5% of the known genes in organisms the tissue where the miRNAs are not normally ex- ranging from nematodes and flies to mammals (Bartel pressed. How well do these reflect the endogenous func- 2004; Berezikov et al. 2005; Xie et al.
    [Show full text]
  • Regulatory DNA in A. Thaliana Can Tolerate High Levels of Sequence
    bioRxiv preprint doi: https://doi.org/10.1101/104323; this version posted January 30, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Regulatory DNA in A. thaliana can tolerate ​ ​ high levels of sequence divergence ​ 1 1 1 1 1 Alexandre CM ,​ Urton JR ,​ Jean-Baptiste K ,​ Dorrity MW ,​ Cuperus JC ,​ Sullivan ​ ​ ​ ​ ​ 2 3 3 4 4 4 1 AM ,​ Bemm F ,​ Jolic D ,​ Arsovski AA ,​ Thompson A ,​ Nemhauser JL ,​ Fields S ,​ ​ ​ ​ ​ ​ ​ ​ 3 1,# 1 Weigel D ,​ Bubb KL ,​ Queitsch C ​ ​ ​ 1 ​ Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA ​ 2 ​ Altius Institute for Biomedical Sciences, Seattle, WA, 98121, USA ​ 3 ​ Department of Molecular Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany ​ 4 ​ Department of Biology, University of Washington, Seattle, WA, 98195, USA ​ # ​ Correspondence: [email protected] ​ bioRxiv preprint doi: https://doi.org/10.1101/104323; this version posted January 30, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. ABSTRACT Variation in regulatory DNA is thought to drive evolution. Cross-species comparisons of regulatory DNA have provided evidence for both weak purifying selection and substantial turnover in regulatory regions. However, disruption of transcription factor binding sites can affect the expression of neighboring genes.
    [Show full text]
  • Proceedings of the Eighteenth International Conference on Machine Learning., 282 – 289
    Abstracts of papers, posters and talks presented at the 2008 Joint RECOMB Satellite Conference on REGULATORYREGULATORY GENOMICS GENOMICS - SYSTEMS BIOLOGY - DREAM3 Oct 29-Nov 2, 2008 MIT / Broad Institute / CSAIL BMP follicle cells signaling EGFR signaling floor cells roof cells Organized by Manolis Kellis, MIT Andrea Califano, Columbia Gustavo Stolovitzky, IBM Abstracts of papers, posters and talks presented at the 2008 Joint RECOMB Satellite Conference on REGULATORYREGULATORY GENOMICS GENOMICS - SYSTEMS BIOLOGY - DREAM3 Oct 29-Nov 2, 2008 MIT / Broad Institute / CSAIL Organized by Manolis Kellis, MIT Andrea Califano, Columbia Gustavo Stolovitzky, IBM Conference Chairs: Manolis Kellis .................................................................................. Associate Professor, MIT Andrea Califano ..................................................................... Professor, Columbia University Gustavo Stolovitzky....................................................................Systems Biology Group, IBM In partnership with: Genome Research ..............................................................................editor: Hillary Sussman Nature Molecular Systems Biology ............................................... editor: Thomas Lemberger Journal of Computational Biology ...............................................................editor: Sorin Istrail Organizing committee: Eleazar Eskin Trey Ideker Eran Segal Nir Friedman Douglas Lauffenburger Ron Shamir Leroy Hood Satoru Miyano Program Committee: Regulatory Genomics:
    [Show full text]