The Intelligent Systems in Bioinformatics Conference 2001 (ISMB2001) 21St–25Th July, Tivoli Gardens, Copenhagen

Total Page:16

File Type:pdf, Size:1020Kb

The Intelligent Systems in Bioinformatics Conference 2001 (ISMB2001) 21St–25Th July, Tivoli Gardens, Copenhagen Comparative and Functional Genomics Comp Funct Genom 2001; 2: 330–337. DOI: 10.1002/cfg.108 Feature Meeting Review: The Intelligent Systems in Bioinformatics Conference 2001 (ISMB2001) 21st–25th July, Tivoli Gardens, Copenhagen K. Cara Woodwark* Biomolecular Sciences, UMIST, Manchester, M60 1QD, UK *Correspondence to: K. Cara Woodwark, Biomolecular Sciences, UMIST, PO Box 88, Manchester, M60 1QD, UK. Keywords: bioinformatics; conference; microarray; promoter; promoter prediction; gene E-mail: [email protected] prediction; RNA This year’s ISMB conference, an annual event explained that, given the right circumstances, any organised by the International Society for Compu- protein will form fibrils similar to those found in tational Biology (http://www.iscb.org) was the big- BSE or Alzheimer’s. gest ever, with over 1400 delegates. The venue was Proteins seem to fold and unfold all the time, also a first, situated, as it was, in Copenhagen’s which can cause problems for structure determina- Tivoli Gardens funfair. However there was still time tion in protein crystallography and NMR, as for the many satellite meetings that flanked the proteins made under different circumstances often conference including the Bio Pathways conference, have different structures. In the densely packed the Bioinformatics Open Source Conference and the environment of the cell, folding and unfolding may Bio-Ontologies Conference. form part of a switch mechanism, or chaperones Soren Brunak and Anders Krogh opened the may be involved to help a protein to fold into a conference. They remarked that it was 30 years particular structure. since the Needleman-Wunsch algorithm was writ- Aggregations or amyloid structures are respon- ten, but that things have not changed much, from sible for many diseases e.g. Alzheimers, New that time, in that the basic biological ideas are still variant CJD (related to BSE), Type II diabetes etc. driving research. They also mentioned that there Apparently, by the age of 60 we will all develop had been 180 papers submitted to the conference all some sort of aggregate, but hopefully they should of which had to be refereed and graded before be disease free (asymptomatic). 16 diseases caused choosing the 38 speakers. by amyloid structures have now been identified (20 Only the keynote talks are covered in depth here if diseases such as Parkinsons are included). For as all the other talks are covered in a special example, two point mutations in Lysozyme allow it supplement to the journal Bioinformatics. http:// to form disease-causing fibrils, composed of many bioinformatics.oupjournals.org/ parallel beta sheets. The conference began with talks on Protein The major breakthrough came when one of Structure and Modelling. However, rather than Chris’ students was working on PI3 Kinase NMR, talks about protein structure prediction, these talks when he went for a long weekend (160 hours). On were based much more on the biology of how his return, the trace had disappeared almost to protein structure information can help our under- nothing. So they looked to see what had happened standing of the evolution and function of proteins. to the protein and found that it had formed fibrils. Chris Dobson (Cambridge University) opened the This was a complete surprise as it wasn’t a disease conference with an excellent talk on Protein Fold- causing protein and so was not expected to form ing, Molecular Evolution, and Human Disease.He fibrils. After examining the fibrils they discovered Copyright # 2001 John Wiley & Sons, Ltd. Meeting Review 331 that they were hollow pipes formed by 4 groups of 2 becoming massive balls of fibrils as if we were to beta sheets wrapped around each other in a helical live long enough that is how we would end up! formation. These might be useful as nanotubes! All of this was discovered because someone had a Chris believes that the ability to form fibrils is long weekend! a character of all proteins, for example, even myo- The rest of the section was an interesting mix of globin, in its less soluble form, produced amyloid different aspects of protein structure. Gordana Apic fibres. In fact all proteins they have tried have (MRC Laboratory of Molecular Medicine) gave us produced fibrils, given the correct circumstances, an Insight into Domain Combinations. Potentially and any polypeptide chain if not chaperoned, or there are 180 000 pairwise combinations of SCOP controlled, could form fibrils. domains, but only 1,000 of these are found in There appears to be an initial time limiting step, 20 000 multidomain proteins from 40 species. as, like crystallisation, fibril formation needs a Indeed, 60% of domains have only one known nucleation, or seeding, step. This explains the combination partner. The domain order is highly rapid onset of diseases such as BSE after the first conserved within protein families. Stephen Mo¨ller symptoms are noticed, as after the initial contam- (EBI) spoke about predicting not only G protein ination with ‘‘seed’’ proteins there is a slow coupled receptors, but also their specificity, using ‘‘incubation’’ period until enough plaques are ‘‘SPEXS’’ (http://ep.ebi.ac.uk/, http://www.ebi.ac.uk/ formed to cause symptoms. After these first signs ycroning/coupling.html). Gianluca Pollastri (UC the growth of the fibrils takes place rapidly, Irvine & Bologna) used bi-directional neural network especially as the intermediate form of the fibril is architectures and evolutionary information to predict the most ‘‘contagious’’. Initial aggregates rather interaction positions between proteins, as structure than the fibrils are the real seeds. At this stage they tends to be more conserved than sequence (http:// are toxic and may lead to apoptosis, thus ridding promoter.ics.uci.edu/BRNN-PRED/). Tobias Mu¨ller the body of a diseased cell, although this is not (Deutsches Krebsforschungszentrum) also combined always a good thing, as even more ‘‘seed’’ forming structure prediction with sequence searching, but this fibrils may be released, to be taken up by other time used transmembrane domain specific matrices in cells. order to facilitate the search for homologous trans- The reason that age seems to be a factor in many membrane proteins (http://www.dkfz.de/tbi/people/ of these amyloid diseases is that over the years there tmueller). Michael Lappe (EBI) used a combination is an increased risk that something will go wrong of structure, in the form of fold information, and with the folding of a protein, thus forming a protein interaction data to predict function, nucleation seed. However, some proteins are more although the method is still very much in develop- likely to aggregate than others, due to sequence and ment (http://www.ebi.ac.uk/ylappe/FoldPred). cellular circumstance. For example, some mutated Chris Sander (Whitehead Institute) gave the proteins aggregate faster than others from a single keynote talk for the Sequence Motifs, Alignments point mutation. If, for example, the mutation is in and Families Section. He introduced his talk on the C terminus, then it is more likely to cause Structural Genomics, by pointing out what was to protein aggregation, even though it may not have be the overwhelming take-home message of the much affect on the folding. The position and conference – that what we (Bioinformaticians as ‘‘biology’’ of a protein is also important, as for well as Biologists) are trying to do is answer example, in vitro, myoglobin will aggregate more Biological questions. He hoped that soon we quickly than prions, yet there is no myoglobin would be able to model the ‘e-cell’ and then the aggregate disease. Therefore, whether a protein ‘e-organ’; model the perturbation of the system forms aggregates also depends on its position and caused by drugs; decipher neurobiology, and com- interactions within the cell. Selection against aggre- bine it all into systems biology where not just the gation may have increased the occurrence of whole organism, but whole ecosystems could be chaperones, as protein mixes do not form fibrils. modelled. Heterozygotes, with two alleles of a protein, may According to Chris Sander, structural genomics is also be at an advantage as mixed fibrils are also less only a part of what is needed. Eventually he would likely, so even sexual reproduction plays a role. like to be able to see the structure of every Chris believes that the whole of evolution and biological molecule, from which we would be able biology is one huge strategy to keep organisms from to fully understand the function of that molecule. Copyright # 2001 John Wiley & Sons, Ltd. Comp Funct Genom 2001; 2: 330–337. 332 Meeting Review However, protein structure is difficult and time (Epigenomics AG) differentiated between acute consuming to obtain. Sander suggests that rather lymphoblastic leukaemia (ALL) and acute myeloid than obtaining the structure of every molecule, we leukaemia (AML) by looking at the methylation need only determine the structure of one molecule state of CpG dinucleotides in CpG islands, which per protein family, or maybe even superfamily, are responsible for expression regulation. Their using homology to predict the structure of the other algorithm enabled the most discriminant sites to family members. be chosen. Giulio Pavesi (University of Milan) Even at the level of 30% sequence identity, there explained the huge numbers involved in promoter are 4000 sequences for all the families of the model prediction. For example, if a pattern of m letters is organisms in Pfam. However, not all families are in all sequences studied and may have mutations at represented in Pfam, so the actual number may be any position, then there are 4m possibilities. While closer to 4000r4. Producing this number of promoter regions are very small, e.g. m=6, this is a representative structures will take as much coordi- tractable number, 4096 possibilities, but with a nation as sequencing the human genome and to signal length of 20, then it is in to the trillions ensure availability to everyone, something that Bill (109,951,1627,776 to be exact).
Recommended publications
  • Algorithms for Computational Biology 8Th International Conference, Alcob 2021 Missoula, MT, USA, June 7–11, 2021 Proceedings
    Lecture Notes in Bioinformatics 12715 Subseries of Lecture Notes in Computer Science Series Editors Sorin Istrail Brown University, Providence, RI, USA Pavel Pevzner University of California, San Diego, CA, USA Michael Waterman University of Southern California, Los Angeles, CA, USA Editorial Board Members Søren Brunak Technical University of Denmark, Kongens Lyngby, Denmark Mikhail S. Gelfand IITP, Research and Training Center on Bioinformatics, Moscow, Russia Thomas Lengauer Max Planck Institute for Informatics, Saarbrücken, Germany Satoru Miyano University of Tokyo, Tokyo, Japan Eugene Myers Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany Marie-France Sagot Université Lyon 1, Villeurbanne, France David Sankoff University of Ottawa, Ottawa, Canada Ron Shamir Tel Aviv University, Ramat Aviv, Tel Aviv, Israel Terry Speed Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC, Australia Martin Vingron Max Planck Institute for Molecular Genetics, Berlin, Germany W. Eric Wong University of Texas at Dallas, Richardson, TX, USA More information about this subseries at http://www.springer.com/series/5381 Carlos Martín-Vide • Miguel A. Vega-Rodríguez • Travis Wheeler (Eds.) Algorithms for Computational Biology 8th International Conference, AlCoB 2021 Missoula, MT, USA, June 7–11, 2021 Proceedings 123 Editors Carlos Martín-Vide Miguel A. Vega-Rodríguez Rovira i Virgili University University of Extremadura Tarragona, Spain Cáceres, Spain Travis Wheeler University of Montana Missoula, MT, USA ISSN 0302-9743 ISSN 1611-3349 (electronic) Lecture Notes in Bioinformatics ISBN 978-3-030-74431-1 ISBN 978-3-030-74432-8 (eBook) https://doi.org/10.1007/978-3-030-74432-8 LNCS Sublibrary: SL8 – Bioinformatics © Springer Nature Switzerland AG 2021 This work is subject to copyright.
    [Show full text]
  • Are Profile Hidden Markov Models Identifiable?
    Are Profile Hidden Markov Models Identifiable? Srilakshmi Pattabiraman Tandy Warnow Department of Electrical and Computer Engineering Department of Computer Science University of Illinois at Urbana-Champaign University of Illinois at Urbana-Champaign Urbana, Illinois Urbana, Illinois [email protected] [email protected] ABSTRACT 1 INTRODUCTION Profile Hidden Markov Models (HMMs) are graphical models that Profile Hidden Markov Models (HMMs) are arguably themost can be used to produce finite length sequences from a distribution. common statistical models in bioinformatics. Originally introduced In fact, although they were only introduced for bioinformatics 25 by Haussler and colleagues in [10, 12], and then expanded later years ago (by Haussler et al., Hawaii International Conference on in many subsequent texts [4–6, 9, 11, 21, 25], profile HMMs are Systems Science 1993), they are arguably the most commonly used now used in many analytical steps in biological sequence analysis statistical model in bioinformatics, with multiple applications, in- [15, 17–19, 22]. cluding protein structure and function prediction, classifications Profile Hidden Markov models are graphical models with match of novel proteins into existing protein families and superfamilies, states, insertion states, and deletion states; and the match and in- metagenomics, and multiple sequence alignment. The standard use sertion states emit letters from an underlying alphabet Σ (i.e., Σ of profile HMMs in bioinformatics has two steps: first a profile may be the 20 amino acids, the four nucleotides, or some other HMM is built for a collection of molecular sequences (which may set of symbols). In the standard form presented in [4] (widely in not be in a multiple sequence alignment), and then the profile HMM use in bioinformatics applications), each profile Hidden Markov is used in some subsequent analysis of new molecular sequences.
    [Show full text]
  • EMBO Facts & Figures
    excellence in life sciences Reykjavik Helsinki Oslo Stockholm Tallinn EMBO facts & figures & EMBO facts Copenhagen Dublin Amsterdam Berlin Warsaw London Brussels Prague Luxembourg Paris Vienna Bratislava Budapest Bern Ljubljana Zagreb Rome Madrid Ankara Lisbon Athens Jerusalem EMBO facts & figures HIGHLIGHTS CONTACT EMBO & EMBC EMBO Long-Term Fellowships Five Advanced Fellows are selected (page ). Long-Term and Short-Term Fellowships are awarded. The Fellows’ EMBO Young Investigators Meeting is held in Heidelberg in June . EMBO Installation Grants New EMBO Members & EMBO elects new members (page ), selects Young EMBO Women in Science Young Investigators Investigators (page ) and eight Installation Grantees Gerlind Wallon EMBO Scientific Publications (page ). Programme Manager Bernd Pulverer S Maria Leptin Deputy Director Head A EMBO Science Policy Issues report on quotas in academia to assure gender balance. R EMBO Director + + A Conducts workshops on emerging biotechnologies and on H T cognitive genomics. Gives invited talks at US National Academy E IC of Sciences, International Summit on Human Genome Editing, I H 5 D MAN 201 O N Washington, DC.; World Congress on Research Integrity, Rio de A M Janeiro; International Scienti c Advisory Board for the Centre for Eilish Craddock IT 2 015 Mammalian Synthetic Biology, Edinburgh. Personal Assistant to EMBO Fellowships EMBO Scientific Publications EMBO Gold Medal Sarah Teichmann and Ido Amit receive the EMBO Gold the EMBO Director David del Álamo Thomas Lemberger Medal (page ). + Programme Manager Deputy Head EMBO Global Activities India and Singapore sign agreements to become EMBC Associate + + Member States. EMBO Courses & Workshops More than , participants from countries attend 6th scienti c events (page ); participants attend EMBO Laboratory Management Courses (page ); rst online course EMBO Courses & Workshops recorded in collaboration with iBiology.
    [Show full text]
  • Computational Biology and Bioinformatics
    Vol. 30 ISMB 2014, pages i1–i2 BIOINFORMATICS EDITORIAL doi:10.1093/bioinformatics/btu304 Editorial This special issue of Bioinformatics serves as the proceedings of The conference used a two-tier review system, a continuation the 22nd annual meeting of Intelligent Systems for Molecular and refinement of a process begun with ISMB 2013 in an effort Biology (ISMB), which took place in Boston, MA, July 11–15, to better ensure thorough and fair reviewing. Under the revised 2014 (http://www.iscb.org/ismbeccb2014). The official confer- process, each of the 191 submissions was first reviewed by at least ence of the International Society for Computational Biology three expert referees, with a subset receiving between four and (http://www.iscb.org/), ISMB, was accompanied by 12 Special eight reviews, as needed. These formal reviews were frequently Interest Group meetings of one or two days each, two satellite supplemented by online discussion among reviewers and Area meetings, a High School Teachers Workshop and two half-day Chairs to resolve points of dispute and reach a consensus on tutorials. Since its inception, ISMB has grown to be the largest each paper. Among the 191 submissions, 29 were conditionally international conference in computational biology and bioinfor- accepted for publication directly from the first round review Downloaded from matics. It is expected to be the premiere forum in the field for based on an assessment of the reviewers that the paper was presenting new research results, disseminating methods and tech- clearly above par for the conference. A subset of 16 papers niques and facilitating discussions among leading researchers, were viewed as potentially in the top tier but raised significant practitioners and students in the field.
    [Show full text]
  • Assigning Folds to the Proteins Encoded by the Genome of Mycoplasma Genitalium (Protein Fold Recognition͞computer Analysis of Genome Sequences)
    Proc. Natl. Acad. Sci. USA Vol. 94, pp. 11929–11934, October 1997 Biophysics Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium (protein fold recognitionycomputer analysis of genome sequences) DANIEL FISCHER* AND DAVID EISENBERG University of California, Los Angeles–Department of Energy Laboratory of Structural Biology and Molecular Medicine, Molecular Biology Institute, University of California, Los Angeles, Box 951570, Los Angeles, CA 90095-1570 Contributed by David Eisenberg, August 8, 1997 ABSTRACT A crucial step in exploiting the information genitalium (MG) (10), as a test of the capabilities of our inherent in genome sequences is to assign to each protein automatic fold recognition server and as a case study to sequence its three-dimensional fold and biological function. identify the difficulties facing automated fold assignment. Here we describe fold assignment for the proteins encoded by the small genome of Mycoplasma genitalium. The assignment MATERIALS AND METHODS was carried out by our computer server (http:yywww.doe- mbi.ucla.eduypeopleyfrsvryfrsvr.html), which assigns folds to The MG Sequences. The 468 MG sequences were obtained amino acid sequences by comparing sequence-derived predic- from The Institute for Genome Research (TIGR) through its tions with known structures. Of the total of 468 protein ORFs, Web address: http:yywww.tigr.orgytdbymdbymgdbymgd- 103 (22%) can be assigned a known protein fold with high b.html. Three types of annotation (based on searches in the confidence, as cross-validated with tests on known structures. sequence database) accompany each TIGR sequence (10): (i) Of these sequences, 75 (16%) show enough sequence similarity functional assignment—a clear sequence similarity with a to proteins of known structure that they can also be detected protein of known function from another organism was found by traditional sequence–sequence comparison methods.
    [Show full text]
  • Practical Structure-Sequence Alignment of Pseudoknotted Rnas Wei Wang
    Practical structure-sequence alignment of pseudoknotted RNAs Wei Wang To cite this version: Wei Wang. Practical structure-sequence alignment of pseudoknotted RNAs. Bioinformatics [q- bio.QM]. Université Paris Saclay (COmUE), 2017. English. NNT : 2017SACLS563. tel-01697889 HAL Id: tel-01697889 https://tel.archives-ouvertes.fr/tel-01697889 Submitted on 31 Jan 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. 1 NNT : 2017SACLS563 Thèse de doctorat de l’Université Paris-Saclay préparée à L’Université Paris-Sud Ecole doctorale n◦580 (STIC) Sciences et Technologies de l’Information et de la Communication Spécialité de doctorat : Informatique par M. Wei WANG Alignement pratique de structure-séquence d’ARN avec pseudonœuds Thèse présentée et soutenue à Orsay, le 18 Décembre 2017. Composition du Jury : Mme Hélène TOUZET Directrice de Recherche (Présidente) CNRS, Université Lille 1 M. Guillaume FERTIN Professeur (Rapporteur) Université de Nantes M. Jan GORODKIN Professeur (Rapporteur) University of Copenhagen Mme Johanne COHEN Directrice de Recherche (Examinatrice)
    [Show full text]
  • Twinscan: a Software Package for Homology-Based Gene Prediction
    Washington University in St. Louis Washington University Open Scholarship All Computer Science and Engineering Research Computer Science and Engineering Report Number: WUCSE-2003-8 2003-02-14 Twinscan: A Software Package for Homology-Based Gene Prediction Paul Flicek A complete mapping from genome to proteome would constitute a foundation for genome- based biology and provide targets for pharmaceutical and therapeutic intervention. This is one reason gene structure prediction has been a major subfield of computational biology for vo er 20 years. Many of the widely used gene prediction systems were developed in the 1990s and are unable to take advantage of the revolution in comparative genomics brought on by the sequencing of the entire genomes of an increasing numbers of vertebrates. Twinscan is a new system for high-throughput gene-structure prediction that exploits the patterns of conservation observed in alignments... Read complete abstract on page 2. Follow this and additional works at: https://openscholarship.wustl.edu/cse_research Part of the Computer Engineering Commons, and the Computer Sciences Commons Recommended Citation Flicek, Paul, "Twinscan: A Software Package for Homology-Based Gene Prediction" Report Number: WUCSE-2003-8 (2003). All Computer Science and Engineering Research. https://openscholarship.wustl.edu/cse_research/1126 Department of Computer Science & Engineering - Washington University in St. Louis Campus Box 1045 - St. Louis, MO - 63130 - ph: (314) 935-6160. This technical report is available at Washington University Open Scholarship: https://openscholarship.wustl.edu/ cse_research/1126 Twinscan: A Software Package for Homology-Based Gene Prediction Paul Flicek Complete Abstract: A complete mapping from genome to proteome would constitute a foundation for genome-based biology and provide targets for pharmaceutical and therapeutic intervention.
    [Show full text]
  • UC San Diego UC San Diego Electronic Theses and Dissertations
    UC San Diego UC San Diego Electronic Theses and Dissertations Title Analysis and applications of conserved sequence patterns in proteins Permalink https://escholarship.org/uc/item/5kh9z3fr Author Ie, Tze Way Eugene Publication Date 2007 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California UNIVERSITY OF CALIFORNIA, SAN DIEGO Analysis and Applications of Conserved Sequence Patterns in Proteins A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Computer Science by Tze Way Eugene Ie Committee in charge: Professor Yoav Freund, Chair Professor Sanjoy Dasgupta Professor Charles Elkan Professor Terry Gaasterland Professor Philip Papadopoulos Professor Pavel Pevzner 2007 Copyright Tze Way Eugene Ie, 2007 All rights reserved. The dissertation of Tze Way Eugene Ie is ap- proved, and it is acceptable in quality and form for publication on microfilm: Chair University of California, San Diego 2007 iii DEDICATION This dissertation is dedicated in memory of my beloved father, Ie It Sim (1951–1997). iv TABLE OF CONTENTS Signature Page . iii Dedication . iv Table of Contents . v List of Figures . viii List of Tables . x Acknowledgements . xi Vita, Publications, and Fields of Study . xiii Abstract of the Dissertation . xv 1 Introduction . 1 1.1 Protein Homology Search . 1 1.2 Sequence Comparison Methods . 2 1.3 Statistical Analysis of Protein Motifs . 4 1.4 Motif Finding using Random Projections . 5 1.5 Microbial Gene Finding without Assembly . 7 2 Multi-Class Protein Classification using Adaptive Codes . 9 2.1 Profile-based Protein Classifiers . 14 2.2 Embedding Base Classifiers in Code Space .
    [Show full text]
  • Understanding Regulation of Mrna by RNA Binding Proteins Alexander
    Understanding Regulation of mRNA by RNA Binding Proteins MA SSACHUSETTS INSTITUTE by OF TECHNOLOGY Alexander De Jong Robertson B.S., Stanford University (2008) LIBRARIES Submitted to the Graduate Program in Computational and Systems Biology in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Computational and Systems Biology at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY February 2014 o Massachusetts Institute of Technology 2014. All rights reserved. A A u th o r .... v ..... ... ................................................ Graduate Program in Computational and Systems Biology December 19th, 2013 C ertified by .............................................. Christopher B. Burge Professor Thesis Supervisor A ccepted by ........ ..... ............................. Christopher B. Burge Computational and Systems Biology Ph.D. Program Director 2 Understanding Regulation of mRNA by RNA Binding Proteins by Alexander De Jong Robertson Submitted to the Graduate Program in Computational and Systems Biology on December 19th, 2013, in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Computational and Systems Biology Abstract Posttranscriptional regulation of mRNA by RNA-binding proteins plays key roles in regulating the transcriptome over the course of development, between tissues and in disease states. The specific interactions between mRNA and protein are controlled by the proteins' inherent affinities for different RNA sequences as well as other fea- tures such as translation and RNA structure which affect the accessibility of mRNA. The stabilities of mRNA transcripts are regulated by nonsense-mediated mRNA de- cay (NMD), a quality control degradation pathway. In this thesis, I present a novel method for high throughput characterization of the binding affinities of proteins for mRNA sequences and an integrative analysis of NMD using deep sequencing data.
    [Show full text]
  • ABSTRACT DATA DRIVEN APPROACHES to IDENTIFY DETERMINANTS of HEART DISEASES and CANCER RESISTANCE Avinash Das Sahu, Doctor Of
    ABSTRACT Title of dissertation: DATA DRIVEN APPROACHES TO IDENTIFY DETERMINANTS OF HEART DISEASES AND CANCER RESISTANCE Avinash Das Sahu, Doctor of Philosophy, 2016 Dissertation directed by: Professor Sridhar Hannenhalli Department of Computer Science Cancer and cardio-vascular diseases are the leading causes of death world-wide. Caused by systemic genetic and molecular disruptions in cells, these disorders are the manifestation of profound disturbance of normal cellular homeostasis. People suffering or at high risk for these disorders need early diagnosis and personalized therapeutic intervention. Successful implementation of such clinical measures can significantly improve global health. However, development of effective therapies is hindered by the challenges in identifying genetic and molecular determinants of the onset of diseases; and in cases where therapies already exist, the main challenge is to identify molecular determinants that drive resistance to the therapies. Due to the progress in sequencing technologies, the access to a large genome-wide biolog- ical data is now extended far beyond few experimental labs to the global research community. The unprecedented availability of the data has revolutionized the ca- pabilities of computational researchers, enabling them to collaboratively address the long standing problems from many different perspectives. Likewise, this thesis tackles the two main public health related challenges using data driven approaches. Numerous association studies have been proposed to identify genomic variants that determine disease. However, their clinical utility remains limited due to their inability to distinguish causal variants from associated variants. In the presented thesis, we first propose a simple scheme that improves association studies in su- pervised fashion and has shown its applicability in identifying genomic regulatory variants associated with hypertension.
    [Show full text]
  • Research News
    Computing Research News COMPUTING RESEARCH ASSOCIATION, CELEBRATING 40 YEARS OF SERVICE TO THE COMPUTING RESEARCH COMMUNITY JUNE 2013 Vol. 25 / No. 6 Announcements 2 Coalition for National Science Funding 2 CRA Announces Outstanding Undergraduate Researcher Award Winners 3 Computing Research in Action 5 CERP Infographic 6 NSF Funding Opportunity 6 CRA Recognizes Participants 7 CRA Board Members 16 CRA Board Officers 16 CRA Staff 16 Professional Opportunities 17 COMPUTING RESEARCH NEWS, JUNE 2013 Vol. 25 / No. 6 Announcements 2012 Taulbee Report Updated May 15, 2013 Corrected Table F6 Click here to download updated version CRA Releases Latest Research Issue Report New Technology-based Models for Postsecondary Learning: Conceptual Frameworks and Research Agendas The report details the findings of a National Science Foundation-Sponsored Computing Research Association Workshop held at MIT on January 9-11, 2013. From the report: “Advances in technology and in knowledge about expertise, learning, and assessment have the potential to reshape the many forms of education and training past matriculation from high school. In the next decade, higher education, military and workplace training, and professional development must all transform to exploit the opportunities of a new era, leveraging emerging technology-based models that can make learning more efficient and possibly improve student support, all at lower cost for a broader range of learners.” The report is now available as a pdf at http://cra.org/resources/research-issues/. Slides from the presentation at NSF on April 19, 2013 are also available. Investments in STEM Research and Education: Fueling American Innovation On May 7, at the Rayburn House Office Building in Brett Bode from the National Center for Supercomputing Washington, DC, the Coalition for National Science Funding Applications at University of Illinois Urbana-Champaign were (CNSF) held its 19th annual exhibition and reception, on hand to talk about the “Blue Waters” project.
    [Show full text]
  • I S C B N E W S L E T T
    ISCB NEWSLETTER FOCUS ISSUE {contents} President’s Letter 2 Member Involvement Encouraged Register for ISMB 2002 3 Registration and Tutorial Update Host ISMB 2004 or 2005 3 David Baker 4 2002 Overton Prize Recipient Overton Endowment 4 ISMB 2002 Committees 4 ISMB 2002 Opportunities 5 Sponsor and Exhibitor Benefits Best Paper Award by SGI 5 ISMB 2002 SIGs 6 New Program for 2002 ISMB Goes Down Under 7 Planning Underway for 2003 Hot Jobs! Top Companies! 8 ISMB 2002 Job Fair ISCB Board Nominations 8 Bioinformatics Pioneers 9 ISMB 2002 Keynote Speakers Invited Editorial 10 Anna Tramontano: Bioinformatics in Europe Software Recommendations11 ISCB Software Statement volume 5. issue 2. summer 2002 Community Development 12 ISCB’s Regional Affiliates Program ISCB Staff Introduction 12 Fellowship Recipients 13 Awardees at RECOMB 2002 Events and Opportunities 14 Bioinformatics events world wide INTERNATIONAL SOCIETY FOR COMPUTATIONAL BIOLOGY A NOTE FROM ISCB PRESIDENT This newsletter is packed with information on development and dissemination of bioinfor- the ISMB2002 conference. With over 200 matics. Issues arise from recommendations paper submissions and over 500 poster submis- made by the Society’s committees, Board of sions, the conference promises to be a scientific Directors, and membership at large. Important feast. On behalf of the ISCB’s Directors, staff, issues are defined as motions and are discussed EXECUTIVE COMMITTEE and membership, I would like to thank the by the Board of Directors on a bi-monthly Philip E. Bourne, Ph.D., President organizing committee, local organizing com- teleconference. Motions that pass are enacted Michael Gribskov, Ph.D., mittee, and program committee for their hard by the Executive Committee which also serves Vice President work preparing for the conference.
    [Show full text]