Meeting Review: the Intelligent Systems in Bioinformatics Conference 2001 (ISMB2001) 21St–25Th July, Tivoli Gardens, Copenhagen

Total Page:16

File Type:pdf, Size:1020Kb

Meeting Review: the Intelligent Systems in Bioinformatics Conference 2001 (ISMB2001) 21St–25Th July, Tivoli Gardens, Copenhagen Comparative and Functional Genomics Comp Funct Genom 2001; 2: 330–337. DOI: 10.1002/cfg.108 Feature Meeting Review: The Intelligent Systems in Bioinformatics Conference 2001 (ISMB2001) 21st–25th July, Tivoli Gardens, Copenhagen K. Cara Woodwark* Biomolecular Sciences, UMIST, Manchester, M60 1QD, UK *Correspondence to: K. Cara Woodwark, Biomolecular Sciences, UMIST, PO Box 88, Manchester, M60 1QD, UK. Keywords: bioinformatics; conference; microarray; promoter; promoter prediction; gene E-mail: [email protected] prediction; RNA This year’s ISMB conference, an annual event explained that, given the right circumstances, any organised by the International Society for Compu- protein will form fibrils similar to those found in tational Biology (http://www.iscb.org) was the big- BSE or Alzheimer’s. gest ever, with over 1400 delegates. The venue was Proteins seem to fold and unfold all the time, also a first, situated, as it was, in Copenhagen’s which can cause problems for structure determina- Tivoli Gardens funfair. However there was still time tion in protein crystallography and NMR, as for the many satellite meetings that flanked the proteins made under different circumstances often conference including the Bio Pathways conference, have different structures. In the densely packed the Bioinformatics Open Source Conference and the environment of the cell, folding and unfolding may Bio-Ontologies Conference. form part of a switch mechanism, or chaperones Soren Brunak and Anders Krogh opened the may be involved to help a protein to fold into a conference. They remarked that it was 30 years particular structure. since the Needleman-Wunsch algorithm was writ- Aggregations or amyloid structures are respon- ten, but that things have not changed much, from sible for many diseases e.g. Alzheimers, New that time, in that the basic biological ideas are still variant CJD (related to BSE), Type II diabetes etc. driving research. They also mentioned that there Apparently, by the age of 60 we will all develop had been 180 papers submitted to the conference all some sort of aggregate, but hopefully they should of which had to be refereed and graded before be disease free (asymptomatic). 16 diseases caused choosing the 38 speakers. by amyloid structures have now been identified (20 Only the keynote talks are covered in depth here if diseases such as Parkinsons are included). For as all the other talks are covered in a special example, two point mutations in Lysozyme allow it supplement to the journal Bioinformatics. http:// to form disease-causing fibrils, composed of many bioinformatics.oupjournals.org/ parallel beta sheets. The conference began with talks on Protein The major breakthrough came when one of Structure and Modelling. However, rather than Chris’ students was working on PI3 Kinase NMR, talks about protein structure prediction, these talks when he went for a long weekend (160 hours). On were based much more on the biology of how his return, the trace had disappeared almost to protein structure information can help our under- nothing. So they looked to see what had happened standing of the evolution and function of proteins. to the protein and found that it had formed fibrils. Chris Dobson (Cambridge University) opened the This was a complete surprise as it wasn’t a disease conference with an excellent talk on Protein Fold- causing protein and so was not expected to form ing, Molecular Evolution, and Human Disease.He fibrils. After examining the fibrils they discovered Copyright # 2001 John Wiley & Sons, Ltd. Meeting Review 331 that they were hollow pipes formed by 4 groups of 2 becoming massive balls of fibrils as if we were to beta sheets wrapped around each other in a helical live long enough that is how we would end up! formation. These might be useful as nanotubes! All of this was discovered because someone had a Chris believes that the ability to form fibrils is long weekend! a character of all proteins, for example, even myo- The rest of the section was an interesting mix of globin, in its less soluble form, produced amyloid different aspects of protein structure. Gordana Apic fibres. In fact all proteins they have tried have (MRC Laboratory of Molecular Medicine) gave us produced fibrils, given the correct circumstances, an Insight into Domain Combinations. Potentially and any polypeptide chain if not chaperoned, or there are 180 000 pairwise combinations of SCOP controlled, could form fibrils. domains, but only 1,000 of these are found in There appears to be an initial time limiting step, 20 000 multidomain proteins from 40 species. as, like crystallisation, fibril formation needs a Indeed, 60% of domains have only one known nucleation, or seeding, step. This explains the combination partner. The domain order is highly rapid onset of diseases such as BSE after the first conserved within protein families. Stephen Mo¨ller symptoms are noticed, as after the initial contam- (EBI) spoke about predicting not only G protein ination with ‘‘seed’’ proteins there is a slow coupled receptors, but also their specificity, using ‘‘incubation’’ period until enough plaques are ‘‘SPEXS’’ (http://ep.ebi.ac.uk/, http://www.ebi.ac.uk/ formed to cause symptoms. After these first signs ycroning/coupling.html). Gianluca Pollastri (UC the growth of the fibrils takes place rapidly, Irvine & Bologna) used bi-directional neural network especially as the intermediate form of the fibril is architectures and evolutionary information to predict the most ‘‘contagious’’. Initial aggregates rather interaction positions between proteins, as structure than the fibrils are the real seeds. At this stage they tends to be more conserved than sequence (http:// are toxic and may lead to apoptosis, thus ridding promoter.ics.uci.edu/BRNN-PRED/). Tobias Mu¨ller the body of a diseased cell, although this is not (Deutsches Krebsforschungszentrum) also combined always a good thing, as even more ‘‘seed’’ forming structure prediction with sequence searching, but this fibrils may be released, to be taken up by other time used transmembrane domain specific matrices in cells. order to facilitate the search for homologous trans- The reason that age seems to be a factor in many membrane proteins (http://www.dkfz.de/tbi/people/ of these amyloid diseases is that over the years there tmueller). Michael Lappe (EBI) used a combination is an increased risk that something will go wrong of structure, in the form of fold information, and with the folding of a protein, thus forming a protein interaction data to predict function, nucleation seed. However, some proteins are more although the method is still very much in develop- likely to aggregate than others, due to sequence and ment (http://www.ebi.ac.uk/ylappe/FoldPred). cellular circumstance. For example, some mutated Chris Sander (Whitehead Institute) gave the proteins aggregate faster than others from a single keynote talk for the Sequence Motifs, Alignments point mutation. If, for example, the mutation is in and Families Section. He introduced his talk on the C terminus, then it is more likely to cause Structural Genomics, by pointing out what was to protein aggregation, even though it may not have be the overwhelming take-home message of the much affect on the folding. The position and conference – that what we (Bioinformaticians as ‘‘biology’’ of a protein is also important, as for well as Biologists) are trying to do is answer example, in vitro, myoglobin will aggregate more Biological questions. He hoped that soon we quickly than prions, yet there is no myoglobin would be able to model the ‘e-cell’ and then the aggregate disease. Therefore, whether a protein ‘e-organ’; model the perturbation of the system forms aggregates also depends on its position and caused by drugs; decipher neurobiology, and com- interactions within the cell. Selection against aggre- bine it all into systems biology where not just the gation may have increased the occurrence of whole organism, but whole ecosystems could be chaperones, as protein mixes do not form fibrils. modelled. Heterozygotes, with two alleles of a protein, may According to Chris Sander, structural genomics is also be at an advantage as mixed fibrils are also less only a part of what is needed. Eventually he would likely, so even sexual reproduction plays a role. like to be able to see the structure of every Chris believes that the whole of evolution and biological molecule, from which we would be able biology is one huge strategy to keep organisms from to fully understand the function of that molecule. Copyright # 2001 John Wiley & Sons, Ltd. Comp Funct Genom 2001; 2: 330–337. 332 Meeting Review However, protein structure is difficult and time (Epigenomics AG) differentiated between acute consuming to obtain. Sander suggests that rather lymphoblastic leukaemia (ALL) and acute myeloid than obtaining the structure of every molecule, we leukaemia (AML) by looking at the methylation need only determine the structure of one molecule state of CpG dinucleotides in CpG islands, which per protein family, or maybe even superfamily, are responsible for expression regulation. Their using homology to predict the structure of the other algorithm enabled the most discriminant sites to family members. be chosen. Giulio Pavesi (University of Milan) Even at the level of 30% sequence identity, there explained the huge numbers involved in promoter are 4000 sequences for all the families of the model prediction. For example, if a pattern of m letters is organisms in Pfam. However, not all families are in all sequences studied and may have mutations at represented in Pfam, so the actual number may be any position, then there are 4m possibilities. While closer to 4000r4. Producing this number of promoter regions are very small, e.g. m=6, this is a representative structures will take as much coordi- tractable number, 4096 possibilities, but with a nation as sequencing the human genome and to signal length of 20, then it is in to the trillions ensure availability to everyone, something that Bill (109,951,1627,776 to be exact).
Recommended publications
  • Algorithms for Computational Biology 8Th International Conference, Alcob 2021 Missoula, MT, USA, June 7–11, 2021 Proceedings
    Lecture Notes in Bioinformatics 12715 Subseries of Lecture Notes in Computer Science Series Editors Sorin Istrail Brown University, Providence, RI, USA Pavel Pevzner University of California, San Diego, CA, USA Michael Waterman University of Southern California, Los Angeles, CA, USA Editorial Board Members Søren Brunak Technical University of Denmark, Kongens Lyngby, Denmark Mikhail S. Gelfand IITP, Research and Training Center on Bioinformatics, Moscow, Russia Thomas Lengauer Max Planck Institute for Informatics, Saarbrücken, Germany Satoru Miyano University of Tokyo, Tokyo, Japan Eugene Myers Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany Marie-France Sagot Université Lyon 1, Villeurbanne, France David Sankoff University of Ottawa, Ottawa, Canada Ron Shamir Tel Aviv University, Ramat Aviv, Tel Aviv, Israel Terry Speed Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC, Australia Martin Vingron Max Planck Institute for Molecular Genetics, Berlin, Germany W. Eric Wong University of Texas at Dallas, Richardson, TX, USA More information about this subseries at http://www.springer.com/series/5381 Carlos Martín-Vide • Miguel A. Vega-Rodríguez • Travis Wheeler (Eds.) Algorithms for Computational Biology 8th International Conference, AlCoB 2021 Missoula, MT, USA, June 7–11, 2021 Proceedings 123 Editors Carlos Martín-Vide Miguel A. Vega-Rodríguez Rovira i Virgili University University of Extremadura Tarragona, Spain Cáceres, Spain Travis Wheeler University of Montana Missoula, MT, USA ISSN 0302-9743 ISSN 1611-3349 (electronic) Lecture Notes in Bioinformatics ISBN 978-3-030-74431-1 ISBN 978-3-030-74432-8 (eBook) https://doi.org/10.1007/978-3-030-74432-8 LNCS Sublibrary: SL8 – Bioinformatics © Springer Nature Switzerland AG 2021 This work is subject to copyright.
    [Show full text]
  • Ontology-Based Methods for Analyzing Life Science Data
    Habilitation a` Diriger des Recherches pr´esent´ee par Olivier Dameron Ontology-based methods for analyzing life science data Soutenue publiquement le 11 janvier 2016 devant le jury compos´ede Anita Burgun Professeur, Universit´eRen´eDescartes Paris Examinatrice Marie-Dominique Devignes Charg´eede recherches CNRS, LORIA Nancy Examinatrice Michel Dumontier Associate professor, Stanford University USA Rapporteur Christine Froidevaux Professeur, Universit´eParis Sud Rapporteure Fabien Gandon Directeur de recherches, Inria Sophia-Antipolis Rapporteur Anne Siegel Directrice de recherches CNRS, IRISA Rennes Examinatrice Alexandre Termier Professeur, Universit´ede Rennes 1 Examinateur 2 Contents 1 Introduction 9 1.1 Context ......................................... 10 1.2 Challenges . 11 1.3 Summary of the contributions . 14 1.4 Organization of the manuscript . 18 2 Reasoning based on hierarchies 21 2.1 Principle......................................... 21 2.1.1 RDF for describing data . 21 2.1.2 RDFS for describing types . 24 2.1.3 RDFS entailments . 26 2.1.4 Typical uses of RDFS entailments in life science . 26 2.1.5 Synthesis . 30 2.2 Case study: integrating diseases and pathways . 31 2.2.1 Context . 31 2.2.2 Objective . 32 2.2.3 Linking pathways and diseases using GO, KO and SNOMED-CT . 32 2.2.4 Querying associated diseases and pathways . 33 2.3 Methodology: Web services composition . 39 2.3.1 Context . 39 2.3.2 Objective . 40 2.3.3 Semantic compatibility of services parameters . 40 2.3.4 Algorithm for pairing services parameters . 40 2.4 Application: ontology-based query expansion with GO2PUB . 43 2.4.1 Context . 43 2.4.2 Objective .
    [Show full text]
  • Ploidetect Enables Pan-Cancer Analysis of the Causes and Impacts of Chromosomal Instability
    bioRxiv preprint doi: https://doi.org/10.1101/2021.08.06.455329; this version posted August 8, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Ploidetect enables pan-cancer analysis of the causes and impacts of chromosomal instability Luka Culibrk1,2, Jasleen K. Grewal1,2, Erin D. Pleasance1, Laura Williamson1, Karen Mungall1, Janessa Laskin3, Marco A. Marra1,4, and Steven J.M. Jones1,4, 1Canada’s Michael Smith Genome Sciences Center at BC Cancer, Vancouver, British Columbia, Canada 2Bioinformatics training program, University of British Columbia, Vancouver, British Columbia, Canada 3Department of Medical Oncology, BC Cancer, Vancouver, British Columbia, Canada 4Department of Medical Genetics, Faculty of Medicine, Vancouver, British Columbia, Canada Cancers routinely exhibit chromosomal instability, resulting in tumors mutate, these variants are considerably more difficult the accumulation of changes in the abundance of genomic ma- to detect accurately compared to other types of mutations terial, known as copy number variants (CNVs). Unfortunately, and consequently they may represent an under-explored the detection of these variants in cancer genomes is difficult. We facet of tumor biology. 20 developed Ploidetect, a software package that effectively iden- While small mutations can be determined through base tifies CNVs within whole-genome sequenced tumors. Ploidetect changes embedded within aligned sequence reads, CNVs was more sensitive to CNVs in cancer related genes within ad- are variations in DNA quantity and are typically determined vanced, pre-treated metastatic cancers than other tools, while also segmenting the most contiguously.
    [Show full text]
  • BIOINFORMATICS APPLICATIONS NOTE Pages 380-381
    Vol. 14 no. 4 1998 BIOINFORMATICS APPLICATIONS NOTE Pages 380-381 MView: a web-compatible database search or multiple alignment viewer NigelP. Brown, ChristopheLeroy and Chris Sander European Bioinformatics Institute (EMBLĆEBI), Wellcome Genome Campus, CambridgeCB10 1SD, UK Received on December 10, 1997; revised and accepted on January 15, 1998 Abstract may be hyperlinked to the SRS system (Etzold et al., 1996), a text field, a field of scoring information from searches, and Summary: MView is a tool for converting the results of a a field reporting the per cent identity of each sequence with sequence database search into the form of a coloured multiple respect to a preferred sequence in the alignment, usually the alignment of hits stacked against the query. Alternatively, an query in the case of a search. existing multiple alignment can be processed. In either case, Multiple alignments require minimal parsing and are the output is simply HTML, so the result is platform independent and does not require a separate application or subjected only to formatting stages. Search hits are first applet to be loaded. stacked against the ungapped query sequence and require Availability: Free from http://www.sander.ebi.ac.uk/mview/ special processing. Ungapped search (e.g. BLAST) hit subject to copyright restrictions. fragments are assembled into a single string by overlaying Contact: [email protected] them preferentially by score onto a template string, while gapped search (e.g. FASTA) hits have columns corresponding Often when running FASTA (Pearson, 1990) or BLAST to query gaps excised. Consequently, the stacked alignment is (Altschul et al., 1990), it is desired to visualize the database a patchwork of reconstituted sequences that nevertheless is hits stacked against the query sequence.
    [Show full text]
  • Are Profile Hidden Markov Models Identifiable?
    Are Profile Hidden Markov Models Identifiable? Srilakshmi Pattabiraman Tandy Warnow Department of Electrical and Computer Engineering Department of Computer Science University of Illinois at Urbana-Champaign University of Illinois at Urbana-Champaign Urbana, Illinois Urbana, Illinois [email protected] [email protected] ABSTRACT 1 INTRODUCTION Profile Hidden Markov Models (HMMs) are graphical models that Profile Hidden Markov Models (HMMs) are arguably themost can be used to produce finite length sequences from a distribution. common statistical models in bioinformatics. Originally introduced In fact, although they were only introduced for bioinformatics 25 by Haussler and colleagues in [10, 12], and then expanded later years ago (by Haussler et al., Hawaii International Conference on in many subsequent texts [4–6, 9, 11, 21, 25], profile HMMs are Systems Science 1993), they are arguably the most commonly used now used in many analytical steps in biological sequence analysis statistical model in bioinformatics, with multiple applications, in- [15, 17–19, 22]. cluding protein structure and function prediction, classifications Profile Hidden Markov models are graphical models with match of novel proteins into existing protein families and superfamilies, states, insertion states, and deletion states; and the match and in- metagenomics, and multiple sequence alignment. The standard use sertion states emit letters from an underlying alphabet Σ (i.e., Σ of profile HMMs in bioinformatics has two steps: first a profile may be the 20 amino acids, the four nucleotides, or some other HMM is built for a collection of molecular sequences (which may set of symbols). In the standard form presented in [4] (widely in not be in a multiple sequence alignment), and then the profile HMM use in bioinformatics applications), each profile Hidden Markov is used in some subsequent analysis of new molecular sequences.
    [Show full text]
  • EMBO Facts & Figures
    excellence in life sciences Reykjavik Helsinki Oslo Stockholm Tallinn EMBO facts & figures & EMBO facts Copenhagen Dublin Amsterdam Berlin Warsaw London Brussels Prague Luxembourg Paris Vienna Bratislava Budapest Bern Ljubljana Zagreb Rome Madrid Ankara Lisbon Athens Jerusalem EMBO facts & figures HIGHLIGHTS CONTACT EMBO & EMBC EMBO Long-Term Fellowships Five Advanced Fellows are selected (page ). Long-Term and Short-Term Fellowships are awarded. The Fellows’ EMBO Young Investigators Meeting is held in Heidelberg in June . EMBO Installation Grants New EMBO Members & EMBO elects new members (page ), selects Young EMBO Women in Science Young Investigators Investigators (page ) and eight Installation Grantees Gerlind Wallon EMBO Scientific Publications (page ). Programme Manager Bernd Pulverer S Maria Leptin Deputy Director Head A EMBO Science Policy Issues report on quotas in academia to assure gender balance. R EMBO Director + + A Conducts workshops on emerging biotechnologies and on H T cognitive genomics. Gives invited talks at US National Academy E IC of Sciences, International Summit on Human Genome Editing, I H 5 D MAN 201 O N Washington, DC.; World Congress on Research Integrity, Rio de A M Janeiro; International Scienti c Advisory Board for the Centre for Eilish Craddock IT 2 015 Mammalian Synthetic Biology, Edinburgh. Personal Assistant to EMBO Fellowships EMBO Scientific Publications EMBO Gold Medal Sarah Teichmann and Ido Amit receive the EMBO Gold the EMBO Director David del Álamo Thomas Lemberger Medal (page ). + Programme Manager Deputy Head EMBO Global Activities India and Singapore sign agreements to become EMBC Associate + + Member States. EMBO Courses & Workshops More than , participants from countries attend 6th scienti c events (page ); participants attend EMBO Laboratory Management Courses (page ); rst online course EMBO Courses & Workshops recorded in collaboration with iBiology.
    [Show full text]
  • Computational Biology and Bioinformatics
    Vol. 30 ISMB 2014, pages i1–i2 BIOINFORMATICS EDITORIAL doi:10.1093/bioinformatics/btu304 Editorial This special issue of Bioinformatics serves as the proceedings of The conference used a two-tier review system, a continuation the 22nd annual meeting of Intelligent Systems for Molecular and refinement of a process begun with ISMB 2013 in an effort Biology (ISMB), which took place in Boston, MA, July 11–15, to better ensure thorough and fair reviewing. Under the revised 2014 (http://www.iscb.org/ismbeccb2014). The official confer- process, each of the 191 submissions was first reviewed by at least ence of the International Society for Computational Biology three expert referees, with a subset receiving between four and (http://www.iscb.org/), ISMB, was accompanied by 12 Special eight reviews, as needed. These formal reviews were frequently Interest Group meetings of one or two days each, two satellite supplemented by online discussion among reviewers and Area meetings, a High School Teachers Workshop and two half-day Chairs to resolve points of dispute and reach a consensus on tutorials. Since its inception, ISMB has grown to be the largest each paper. Among the 191 submissions, 29 were conditionally international conference in computational biology and bioinfor- accepted for publication directly from the first round review Downloaded from matics. It is expected to be the premiere forum in the field for based on an assessment of the reviewers that the paper was presenting new research results, disseminating methods and tech- clearly above par for the conference. A subset of 16 papers niques and facilitating discussions among leading researchers, were viewed as potentially in the top tier but raised significant practitioners and students in the field.
    [Show full text]
  • Assigning Folds to the Proteins Encoded by the Genome of Mycoplasma Genitalium (Protein Fold Recognition͞computer Analysis of Genome Sequences)
    Proc. Natl. Acad. Sci. USA Vol. 94, pp. 11929–11934, October 1997 Biophysics Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium (protein fold recognitionycomputer analysis of genome sequences) DANIEL FISCHER* AND DAVID EISENBERG University of California, Los Angeles–Department of Energy Laboratory of Structural Biology and Molecular Medicine, Molecular Biology Institute, University of California, Los Angeles, Box 951570, Los Angeles, CA 90095-1570 Contributed by David Eisenberg, August 8, 1997 ABSTRACT A crucial step in exploiting the information genitalium (MG) (10), as a test of the capabilities of our inherent in genome sequences is to assign to each protein automatic fold recognition server and as a case study to sequence its three-dimensional fold and biological function. identify the difficulties facing automated fold assignment. Here we describe fold assignment for the proteins encoded by the small genome of Mycoplasma genitalium. The assignment MATERIALS AND METHODS was carried out by our computer server (http:yywww.doe- mbi.ucla.eduypeopleyfrsvryfrsvr.html), which assigns folds to The MG Sequences. The 468 MG sequences were obtained amino acid sequences by comparing sequence-derived predic- from The Institute for Genome Research (TIGR) through its tions with known structures. Of the total of 468 protein ORFs, Web address: http:yywww.tigr.orgytdbymdbymgdbymgd- 103 (22%) can be assigned a known protein fold with high b.html. Three types of annotation (based on searches in the confidence, as cross-validated with tests on known structures. sequence database) accompany each TIGR sequence (10): (i) Of these sequences, 75 (16%) show enough sequence similarity functional assignment—a clear sequence similarity with a to proteins of known structure that they can also be detected protein of known function from another organism was found by traditional sequence–sequence comparison methods.
    [Show full text]
  • Are You an Invited Speaker? a Bibliometric Analysis of Elite Groups for Scholarly Events in Bioinformatics
    Are You an Invited Speaker? A Bibliometric Analysis of Elite Groups for Scholarly Events in Bioinformatics Senator Jeong, Sungin Lee, and Hong-Gee Kim Biomedical Knowledge Engineering Laboratory, Seoul National University, 28–22 YeonGeon Dong, Jongno Gu, Seoul 110–749, Korea. E-mail: {senator, sunginlee, hgkim}@snu.ac.kr Participating in scholarly events (e.g., conferences, work- evaluation, but it would be hard to claim that they have pro- shops, etc.) as an elite-group member such as an orga- vided comprehensive lists of evaluation measurements. This nizing committee chair or member, program committee article aims not to provide such lists but to add to the current chair or member, session chair, invited speaker, or award winner is beneficial to a researcher’s career develop- practices an alternative metric that complements existing per- ment.The objective of this study is to investigate whether formance measures to give a more comprehensive picture of elite-group membership for scholarly events is represen- scholars’ performance. tative of scholars’ prominence, and which elite group is By one definition (Jeong, 2008), a scholarly event is the most prestigious. We collected data about 15 global “a sequentially and spatially organized collection of schol- (excluding regional) bioinformatics scholarly events held in 2007. We sampled (via stratified random sampling) ars’ interactions with the intention of delivering and shar- participants from elite groups in each event. Then, bib- ing knowledge, exchanging research ideas, and performing liometric indicators (total citations and h index) of seven related activities.” As such, scholarly events are communica- elite groups and a non-elite group, consisting of authors tion channels from which our new evaluation tool can draw who submitted at least one paper to an event but were its supporting evidence.
    [Show full text]
  • Practical Structure-Sequence Alignment of Pseudoknotted Rnas Wei Wang
    Practical structure-sequence alignment of pseudoknotted RNAs Wei Wang To cite this version: Wei Wang. Practical structure-sequence alignment of pseudoknotted RNAs. Bioinformatics [q- bio.QM]. Université Paris Saclay (COmUE), 2017. English. NNT : 2017SACLS563. tel-01697889 HAL Id: tel-01697889 https://tel.archives-ouvertes.fr/tel-01697889 Submitted on 31 Jan 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. 1 NNT : 2017SACLS563 Thèse de doctorat de l’Université Paris-Saclay préparée à L’Université Paris-Sud Ecole doctorale n◦580 (STIC) Sciences et Technologies de l’Information et de la Communication Spécialité de doctorat : Informatique par M. Wei WANG Alignement pratique de structure-séquence d’ARN avec pseudonœuds Thèse présentée et soutenue à Orsay, le 18 Décembre 2017. Composition du Jury : Mme Hélène TOUZET Directrice de Recherche (Présidente) CNRS, Université Lille 1 M. Guillaume FERTIN Professeur (Rapporteur) Université de Nantes M. Jan GORODKIN Professeur (Rapporteur) University of Copenhagen Mme Johanne COHEN Directrice de Recherche (Examinatrice)
    [Show full text]
  • Twinscan: a Software Package for Homology-Based Gene Prediction
    Washington University in St. Louis Washington University Open Scholarship All Computer Science and Engineering Research Computer Science and Engineering Report Number: WUCSE-2003-8 2003-02-14 Twinscan: A Software Package for Homology-Based Gene Prediction Paul Flicek A complete mapping from genome to proteome would constitute a foundation for genome- based biology and provide targets for pharmaceutical and therapeutic intervention. This is one reason gene structure prediction has been a major subfield of computational biology for vo er 20 years. Many of the widely used gene prediction systems were developed in the 1990s and are unable to take advantage of the revolution in comparative genomics brought on by the sequencing of the entire genomes of an increasing numbers of vertebrates. Twinscan is a new system for high-throughput gene-structure prediction that exploits the patterns of conservation observed in alignments... Read complete abstract on page 2. Follow this and additional works at: https://openscholarship.wustl.edu/cse_research Part of the Computer Engineering Commons, and the Computer Sciences Commons Recommended Citation Flicek, Paul, "Twinscan: A Software Package for Homology-Based Gene Prediction" Report Number: WUCSE-2003-8 (2003). All Computer Science and Engineering Research. https://openscholarship.wustl.edu/cse_research/1126 Department of Computer Science & Engineering - Washington University in St. Louis Campus Box 1045 - St. Louis, MO - 63130 - ph: (314) 935-6160. This technical report is available at Washington University Open Scholarship: https://openscholarship.wustl.edu/ cse_research/1126 Twinscan: A Software Package for Homology-Based Gene Prediction Paul Flicek Complete Abstract: A complete mapping from genome to proteome would constitute a foundation for genome-based biology and provide targets for pharmaceutical and therapeutic intervention.
    [Show full text]
  • Full List of PCAWG Consortium Working Groups and Writing
    Supplementary information to: Genomics: data sharing needs an international code of conduct To accompany a Comment published in Nature 578, 31–33 (2020) https://www.nature.com/articles/d41586-020-00082-9 By Mark Phillips, Fruzsina Molnár-Gábor, Jan O. Korbel, Adrian Thorogood, Yann Joly, Don Chalmers, David Townend & Bartha M. Knoppers for the PCAWG Consortium. SUPPLEMENTARY INFORMATION | NATURE | 1 The ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium Working Groups PCAWG Steering committee 1,2 3,4,5,6 7,8 9,10 Peter J Campbell# ,​ Gad Getz# ,​ Jan O Korbel# ,​ Lincoln D Stein# ​ and Joshua M ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ Stuart#11,12 ​ ​ PCAWG Head of project management Jennifer L Jennings13 ​ PCAWG Executive committee 14 15 16 17 18 Sultan T Al-Sedairy ,​ Axel Aretz ,​ Cindy Bell ,​ Miguel Betancourt ,​ Christiane Buchholz ,​ 19 ​ ​ 20 ​ 21 ​ 22 23 ​ Fabien Calvo ,​ Christine Chomienne ,​ Michael Dunn ,​ Stuart Edmonds ,​ Eric Green ,​ Shailja 24 ​ 23 ​ 25 ​ 13 ​ 26 ​ Gupta ,​ Carolyn M Hutter ,​ Karine Jegalian ,​ Jennifer L Jennings ,​ Nic Jones ,​ Hyung-Lae 27 ​ 28,29,30 ​ 31​ 32 ​ ​ 32 26 Kim ,​ Youyong Lu ,​ Hitoshi Nakagama ,​ Gerd Nettekoven ,​ Laura Planko ,​ David Scott ,​ ​ 3​ 3,34 35 ​ 9,10 ​ 1 ​ ​ 35 Tatsuhiro Shibata ,​ Kiyo Shimizu ,​ Lincoln D Stein# ,​ Michael R Stratton ,​ Takashi Yugawa ,​ ​ 36,37 ​ ​ 24 ​ ​ 38 ​ 39 ​ Giampaolo Tortora ,​ K VijayRaghavan ,​ Huanming Yang ​ and Jean C Zenklusen ​ ​ ​ ​ PCAWG Ethics and Legal Working Group 40 41 41 42 41 Don Chalmers# ,​ Yann Joly ,​ Bartha M Knoppers# ,​ Fruzsina
    [Show full text]