Strategic Plan 2011-2016

Total Page:16

File Type:pdf, Size:1020Kb

Strategic Plan 2011-2016 Strategic Plan 2011-2016 Wellcome Trust Sanger Institute Strategic Plan 2011-2016 Mission The Wellcome Trust Sanger Institute uses genome sequences to advance understanding of the biology of humans and pathogens in order to improve human health. -i- Wellcome Trust Sanger Institute Strategic Plan 2011-2016 - ii - Wellcome Trust Sanger Institute Strategic Plan 2011-2016 CONTENTS Foreword ....................................................................................................................................1 Overview .....................................................................................................................................2 1. History and philosophy ............................................................................................................ 5 2. Organisation of the science ..................................................................................................... 5 3. Developments in the scientific portfolio ................................................................................... 7 4. Summary of the Scientific Programmes 2011 – 2016 .............................................................. 8 4.1 Cancer Genetics and Genomics ................................................................................ 8 4.2 Human Genetics ...................................................................................................... 10 4.3 Pathogen Variation .................................................................................................. 13 4.4 Malaria .................................................................................................................... 15 4.5 Mouse and Zebrafish Genetics ................................................................................ 16 4.6 Genome Informatics ................................................................................................ 18 5. Research into diseases of developing countries ................................................................... 19 6. Core platforms for data generation and analysis ................................................................... 20 7. Strategic issues across all Programmes ............................................................................... 22 7.1 Strategy for DNA sequencing .................................................................................. 22 7.2 Strategy for data handling and sharing .................................................................... 25 8. Strategy for translation .......................................................................................................... 26 9. Developing our people .......................................................................................................... 27 10. Developing our organisation ............................................................................................... 30 11. Premises ............................................................................................................................. 31 12. Strategic relations ............................................................................................................... 31 13. Spreading the word ............................................................................................................. 33 14. Societal aspects of the Personal Genome........................................................................... 34 15. Resources........................................................................................................................... 35 Supporting documents ....................................................................................................... S-1/47 -iii- Wellcome Trust Sanger Institute Strategic Plan 2011-2016 Foreword We present below the Strategic Plan for the Wellcome Trust Sanger Institute for 2011-2016. The centrepiece of the Plan is the portfolio of scientific experiments we propose to conduct over the next quinquennium. However, the document also addresses training of young scientists and the ways in which we will inform and interact with the public about our work. Our science is conducted by people in a complex physical and societal environment. The Plan therefore further reflects on our scientific infrastructure and strategy for its development, the organisation and governance of the Institute, our interactions with other scientific organisations, the development of our people, our buildings, and the resources we are requesting in order to implement the Plan. It has been written subsequent to external review of our proposals and a Site Visit by a review panel at Sanger in May 2010, and incorporates the advice and comments received. -1- Wellcome Trust Sanger Institute Strategic Plan 2011-2016 Overview The Sanger Institute’s primary scientific theme in the 2011-2016 quinquennium will be the study of differences in DNA sequence between individual genomes, understanding the consequences of this variation for the biology of humans and other organisms and developing approaches by which this understanding can be used to improve human health. Our mission and strategy are entirely congruent with those of the Wellcome Trust and we aim to contribute as much as we can from our scientific corner to achieving the Trust’s goals. Why study variation in DNA sequence? Because individual members of a species, whether humans, mice, fish, worms, parasites, bacteria, or viruses differ from one another in their anatomy, physiology, metabolism and behaviour, and differences in their DNA sequences make a substantial contribution to this variation in phenotype. The overarching aim of our studies is, therefore, to characterise differences between individual genomes and identify the phenotypic changes which are the consequence of genomic variants. Our research will entail exploration of variation in DNA sequence between human genomes, between the genomes of microorganisms that cause infectious disease (pathogens) and between the genomes of individual animals from two model organism species, mouse and zebrafish. We will extensively explore naturally occurring DNA variation and will complement it with experimental genomic differences that we ourselves introduce. Our studies of DNA variation in humans will be wide ranging. They aim to identify DNA variants that cause phenotypic differences which fall within the normal spectrum, for example in hair colour or height, and also variants that result in overt disease. A spectrum of human diseases, rare and common, prevalent in developed and developing countries will be investigated including obesity, inflammatory disease, epilepsy, severity of response to malarial infection and disorders of development. Our studies in humans will also encompass somatic DNA variation, differences between the genomes of individual cells in the body, which underlies the development of all cancers. Naturally occurring variation will also be studied in many disease-associated microorganisms, allowing us to elucidate the genomic variants that are associated with important phenotypic features for human health such as microbial virulence, resistance to antibiotics and escape from vaccination. Because DNA variation distinguishes one member of a species from another and tells us how closely they are related, studying DNA sequence differences also allows us to track patterns and changes in the distribution of organisms. For example, it reveals how human populations have migrated in the past and the environmental or lifestyle factors implicated in selecting some humans over others, such as infectious disease and food scarcity. In a similar manner, it will allow us to track the spread of disease-causing microbes, monitor their routes of geographical dissemination and understand the waxing and waning of epidemics. Indeed, we will also be able to track cancer cells using somatically acquired genomic differences, even when they are undetectable by other methods. Importantly, investigation of natural variation at Sanger is complemented by our ability to introduce artificial changes into genomes. We will knock out genes in many microorganisms allowing us to identify those that are absolutely required for the organism to survive and - 2 - Wellcome Trust Sanger Institute Strategic Plan 2011-2016 therefore which potentially recommend themselves as drug targets. Our studies of experimentally induced variation in zebrafish, mice and embryonic stem cells will systematically and comprehensively explore the changes in phenotype that result when genes are deleted, one by one. We will also use engineered mutations in mice to cause cancers and thus understand in greater depth the genes that are implicated in tumour formation. Of course, these experiments will also provide the animals or cells themselves, which can then be used to investigate further the disturbance in biology that is caused by the DNA variant and which ultimately results in the phenotype. The results emanating from all these experiments will be made freely available in data repositories. In many cases we will, in addition, organise and annotate the data in intelligible and easily usable databases, transforming the ability of other research scientists to use the information. Physical resources such as animals, cells and DNA clones will also be made available and are already widely used by many research communities around the world. In some instances, the discoveries we make will lead directly to changes in medical practice which will have impact on the
Recommended publications
  • ANNUAL REVIEW 1 October 2005–30 September
    WELLCOME TRUST ANNUAL REVIEW 1 October 2005–30 September 2006 ANNUAL REVIEW 2006 The Wellcome Trust is the largest charity in the UK and the second largest medical research charity in the world. It funds innovative biomedical research, in the UK and internationally, spending around £500 million each year to support the brightest scientists with the best ideas. The Wellcome Trust supports public debate about biomedical research and its impact on health and wellbeing. www.wellcome.ac.uk THE WELLCOME TRUST The Wellcome Trust is the largest charity in the UK and the second largest medical research charity in the world. 123 CONTENTS BOARD OF GOVERNORS 2 Director’s statement William Castell 4 Advancing knowledge Chairman 16 Using knowledge Martin Bobrow Deputy Chairman 24 Engaging society Adrian Bird 30 Developing people Leszek Borysiewicz 36 Facilitating research Patricia Hodgson 40 Developing our organisation Richard Hynes 41 Wellcome Trust 2005/06 Ronald Plasterk 42 Financial summary 2005/06 Alastair Ross Goobey 44 Funding developments 2005/06 Peter Smith 46 Streams funding 2005/06 Jean Thomas 48 Technology Transfer Edward Walker-Arnott 49 Wellcome Trust Genome Campus As at January 2007 50 Public Engagement 51 Library and information resources 52 Advisory committees Images 1 Surface of the gut. 3 Zebrafish. 5 Cells in a developing This Annual Review covers the 2 Young children in 4 A scene from Y fruit fly. Wellcome Trust’s financial year, from Kenya. Touring’s Every Breath. 6 Data management at the Sanger Institute. 1 October 2005 to 30 September 2006. CONTENTS 1 45 6 EXECUTIVE BOARD MAKING A DIFFERENCE Developing people: To foster a Mark Walport The Wellcome Trust’s mission is research community and individual Director to foster and promote research with researchers who can contribute to the advancement and use of knowledge Ted Bianco the aim of improving human and Director of Technology Transfer animal health.
    [Show full text]
  • Ensembl Genomes: Extending Ensembl Across the Taxonomic Space P
    Published online 1 November 2009 Nucleic Acids Research, 2010, Vol. 38, Database issue D563–D569 doi:10.1093/nar/gkp871 Ensembl Genomes: Extending Ensembl across the taxonomic space P. J. Kersey*, D. Lawson, E. Birney, P. S. Derwent, M. Haimel, J. Herrero, S. Keenan, A. Kerhornou, G. Koscielny, A. Ka¨ ha¨ ri, R. J. Kinsella, E. Kulesha, U. Maheswari, K. Megy, M. Nuhn, G. Proctor, D. Staines, F. Valentin, A. J. Vilella and A. Yates EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK Received August 14, 2009; Revised September 28, 2009; Accepted September 29, 2009 ABSTRACT nucleotide archives; numerous other genomes exist in states of partial assembly and annotation; thousands of Ensembl Genomes (http://www.ensemblgenomes viral genomes sequences have also been generated. .org) is a new portal offering integrated access to Moreover, the increasing use of high-throughput genome-scale data from non-vertebrate species sequencing technologies is rapidly reducing the cost of of scientific interest, developed using the Ensembl genome sequencing, leading to an accelerating rate of genome annotation and visualisation platform. data production. This not only makes it likely that in Ensembl Genomes consists of five sub-portals (for the near future, the genomes of all species of scientific bacteria, protists, fungi, plants and invertebrate interest will be sequenced; but also the genomes of many metazoa) designed to complement the availability individuals, with the possibility of providing accurate and of vertebrate genomes in Ensembl. Many of the sophisticated annotation through the similarly low-cost databases supporting the portal have been built in application of functional assays.
    [Show full text]
  • Abstracts Genome 10K & Genome Science 29 Aug - 1 Sept 2017 Norwich Research Park, Norwich, Uk
    Genome 10K c ABSTRACTS GENOME 10K & GENOME SCIENCE 29 AUG - 1 SEPT 2017 NORWICH RESEARCH PARK, NORWICH, UK Genome 10K c 48 KEYNOTE SPEAKERS ............................................................................................................................... 1 Dr Adam Phillippy: Towards the gapless assembly of complete vertebrate genomes .................... 1 Prof Kathy Belov: Saving the Tasmanian devil from extinction ......................................................... 1 Prof Peter Holland: Homeobox genes and animal evolution: from duplication to divergence ........ 2 Dr Hilary Burton: Genomics in healthcare: the challenges of complexity .......................................... 2 INVITED SPEAKERS ................................................................................................................................. 3 Vertebrate Genomics ........................................................................................................................... 3 Alex Cagan: Comparative genomics of animal domestication .......................................................... 3 Plant Genomics .................................................................................................................................... 4 Ksenia Krasileva: Evolution of plant Immune receptors ..................................................................... 4 Andrea Harper: Using Associative Transcriptomics to predict tolerance to ash dieback disease in European ash trees ............................................................................................................
    [Show full text]
  • The ELIXIR Core Data Resources: ​Fundamental Infrastructure for The
    Supplementary Data: The ELIXIR Core Data Resources: fundamental infrastructure ​ for the life sciences The “Supporting Material” referred to within this Supplementary Data can be found in the Supporting.Material.CDR.infrastructure file, DOI: 10.5281/zenodo.2625247 (https://zenodo.org/record/2625247). ​ ​ Figure 1. Scale of the Core Data Resources Table S1. Data from which Figure 1 is derived: Year 2013 2014 2015 2016 2017 Data entries 765881651 997794559 1726529931 1853429002 2715599247 Monthly user/IP addresses 1700660 2109586 2413724 2502617 2867265 FTEs 270 292.65 295.65 289.7 311.2 Figure 1 includes data from the following Core Data Resources: ArrayExpress, BRENDA, CATH, ChEBI, ChEMBL, EGA, ENA, Ensembl, Ensembl Genomes, EuropePMC, HPA, IntAct /MINT , InterPro, PDBe, PRIDE, SILVA, STRING, UniProt ● Note that Ensembl’s compute infrastructure physically relocated in 2016, so “Users/IP address” data are not available for that year. In this case, the 2015 numbers were rolled forward to 2016. ● Note that STRING makes only minor releases in 2014 and 2016, in that the interactions are re-computed, but the number of “Data entries” remains unchanged. The major releases that change the number of “Data entries” happened in 2013 and 2015. So, for “Data entries” , the number for 2013 was rolled forward to 2014, and the number for 2015 was rolled forward to 2016. The ELIXIR Core Data Resources: fundamental infrastructure for the life sciences ​ 1 Figure 2: Usage of Core Data Resources in research The following steps were taken: 1. API calls were run on open access full text articles in Europe PMC to identify articles that ​ ​ mention Core Data Resource by name or include specific data record accession numbers.
    [Show full text]
  • Prospects & Overviews Orthology Prediction Methods: a Quality Assessment Using Curated Protein Families
    Prospects & Overviews Methods, Models & Techniques Orthology prediction methods: A quality assessment using curated protein families Kalliopi Trachana1), Tomas A. Larsson1)2), Sean Powell1), Wei-Hua Chen1), Tobias Doerks1), Jean Muller3)4) and Peer Bork1)5)Ã The increasing number of sequenced genomes has Introduction prompted the development of several automated orthology prediction methods. Tests to evaluate the accuracy of pre- The analysis of fully sequenced genomes offers valuable insights into the function and evolution of biological systems dictions and to explore biases caused by biological and [1]. The annotation of newly sequenced genomes, comparative technical factors are therefore required. We used 70 man- and functional genomics, and phylogenomics depend on ually curated families to analyze the performance of five reliable descriptions of the evolutionary relationships of public methods in Metazoa. We analyzed the strengths protein families. All the members within a protein family and weaknesses of the methods and quantified the impact are homologous and can be further separated into orthologs, of biological and technical challenges. From the latter part which are genes derived through speciation from a single ancestral sequence, and paralogs, which are genes resulting of the analysis, genome annotation emerged as the largest from duplication events before and after speciation (out- and single influencer, affecting up to 30% of the performance. in-paralogy, respectively) [2, 3]. The large number of fully Generally, most methods did well in assigning orthologous sequenced genomes and the fundamental role of orthology group but they failed to assign the exact number of genes in modern biology have led to the development of a plethora of forhalfofthegroups.Thepublicly available benchmark set methods (e.g.
    [Show full text]
  • C. Elegans Whole Genome Sequencing Reveals Mutational Signatures Related to Carcinogens and DNA Repair Deficiency
    Downloaded from genome.cshlp.org on September 28, 2021 - Published by Cold Spring Harbor Laboratory Press C. elegans whole genome sequencing reveals mutational signatures related to carcinogens and DNA repair deficiency Authors: Bettina Meier * (1); Susanna L Cooke * (2); Joerg Weiss (1); Aymeric P Bailly (1,3); Ludmil B Alexandrov (2); John Marshall (2); Keiran Raine (2); Mark Maddison (2); Elizabeth Anderson (2); Michael R Stratton (2); Anton Gartner * (1); Peter J Campbell * (2,4,5). * These authors contributed equally to this project. Institutions: (1) Centre for Gene Regulation and Expression, University of Dundee, Dundee, UK. (2) Cancer Genome Project, Wellcome Trust Sanger Institute, Hinxton, UK. (3) CRBM/CNRS UMR5237, University of Montpellier, Montpellier, France. (4) Department of Haematology, University of Cambridge, Cambridge, UK. (5) Department of Haematology, Addenbrooke’s Hospital, Cambridge, UK. Address for correspondence: Dr Peter J Campbell, Dr Anton Gartner, Cancer Genome Project, Centre for Gene Regulation and Expression, Wellcome Trust Sanger Institute, The University of Dundee, Hinxton CB10 1SA, Dow Street, Cambridgeshire, Dundee DD1 5EH UK. UK. Tel: +44 (0) 1223 494745 Phone: +44 (0) 1382 385809 Fax: +44 (0) 1223 494809 E-mail: [email protected] E-mail: [email protected] Running title: mutation profiling in C. elegans Keywords: mutation pattern, genetic and environmental factors, C. elegans, cisplatin, aflatoxin B1, whole-genome sequencing. Downloaded from genome.cshlp.org on September 28, 2021 - Published by Cold Spring Harbor Laboratory Press ABSTRACT Mutation is associated with developmental and hereditary disorders, ageing and cancer. While we understand some mutational processes operative in human disease, most remain mysterious.
    [Show full text]
  • Abstracts In
    ECCB 2014 Accepted Posters with Abstracts G: Bioinformatics of health and disease G01: Emile Rugamika Chimusa, Jacquiline Wangui Mugo and Nicola Mulder. Leveraging ancestry along the genome of admixed individuals to resolve missing heritability in disease scoring statistics Abstract: Human genetics has been haunted by the mystery of “missing heritability” of common traits. Although studies have discovered several variants associated with common diseases and traits, these variants typically appear to explain only a minority of the heritability. Resolving missing heritability, the difference between phenotypic variance explained by associated SNPs and estimates of narrow-sense heritability (h2), will inform strategies for disease mapping and prediction of complex traits. Among biased estimates of h2 due to epistatic interactions and rare variants not captured by genotyping arrays have been cited to be the most can be the most explanations for missing heritability. Here, we present an approach for estimating heritability of traits based on sharing local ancestry segments between pairs of unrelated individuals in an admixed population. From simulation data and real data, we demonstrated that our approach outperformed current approaches for estimating heritability of traits and holds values in admixture mapping for deconvoluting genes underlying ethnic differences in complex diseases risk. G02: Sylvain Mareschal, Pierre-Julien Viailly, Philippe Bertrand, Fabienne Desmots-Loyer, Elodie Bohers, Catherine Maingonnat, Karen Leroy, Thierry Fest and Fabrice Jardin. Next- Generation Sequencing applied to tailor targeted therapies in lymphoma: the RELYSE project Abstract: Non-Hodgkin Lymphomas (NHL) are lymphoid cell malignancies accounting for about 4% of all cancers, with an incidence rate of 12 cases per 100,000 and per year in Europe.
    [Show full text]
  • Deep Profiling of Protease Substrate Specificity Enabled by Dual Random and Scanned Human Proteome Substrate Phage Libraries
    Deep profiling of protease substrate specificity enabled by dual random and scanned human proteome substrate phage libraries Jie Zhoua, Shantao Lib, Kevin K. Leunga, Brian O’Donovanc, James Y. Zoub,d, Joseph L. DeRisic,d, and James A. Wellsa,d,e,1 aDepartment of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158; bDepartment of Biomedical Data Science, Stanford University, Stanford, CA 94305; cDepartment of Biochemistry and Biophysics, University of California, San Francisco, CA 94158; dChan Zuckerberg Biohub, San Francisco, CA 94158; and eDepartment of Cellular and Molecular Pharmacology, University of California, San Francisco, CA 94158 Edited by Benjamin F. Cravatt, Scripps Research Institute, La Jolla, CA, and approved August 19, 2020 (received for review May 11, 2020) Proteolysis is a major posttranslational regulator of biology inside lysate and miss low abundance proteins and those simply not and outside of cells. Broad identification of optimal cleavage sites expressed in cell lines tested that typically express only half their and natural substrates of proteases is critical for drug discovery genomes (13). and to understand protease biology. Here, we present a method To potentially screen a larger and more diverse sequence that employs two genetically encoded substrate phage display space, investigators have developed genetically encoded substrate libraries coupled with next generation sequencing (SPD-NGS) that phage (14, 15) or yeast display libraries (16, 17). Degenerate DNA allows up to 10,000-fold deeper sequence coverage of the typical six- sequences (up to 107) encoding random peptides were fused to a to eight-residue protease cleavage sites compared to state-of-the-art phage or yeast coat protein gene for a catch-and-release strategy synthetic peptide libraries or proteomics.
    [Show full text]
  • Methods in and Applications of the Sequencing of Short Non-Coding Rnas" (2013)
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2013 Methods in and Applications of the Sequencing of Short Non- Coding RNAs Paul Ryvkin University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Bioinformatics Commons, Genetics Commons, and the Molecular Biology Commons Recommended Citation Ryvkin, Paul, "Methods in and Applications of the Sequencing of Short Non-Coding RNAs" (2013). Publicly Accessible Penn Dissertations. 922. https://repository.upenn.edu/edissertations/922 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/922 For more information, please contact [email protected]. Methods in and Applications of the Sequencing of Short Non-Coding RNAs Abstract Short non-coding RNAs are important for all domains of life. With the advent of modern molecular biology their applicability to medicine has become apparent in settings ranging from diagonistic biomarkers to therapeutics and fields angingr from oncology to neurology. In addition, a critical, recent technological development is high-throughput sequencing of nucleic acids. The convergence of modern biotechnology with developments in RNA biology presents opportunities in both basic research and medical settings. Here I present two novel methods for leveraging high-throughput sequencing in the study of short non- coding RNAs, as well as a study in which they are applied to Alzheimer's Disease (AD). The computational methods presented here include High-throughput Annotation of Modified Ribonucleotides (HAMR), which enables researchers to detect post-transcriptional covalent modifications ot RNAs in a high-throughput manner. In addition, I describe Classification of RNAs by Analysis of Length (CoRAL), a computational method that allows researchers to characterize the pathways responsible for short non-coding RNA biogenesis.
    [Show full text]
  • CRISPR and Beyond Perturbations at Scale to Understand Genomes 2-4
    CRISPR and Beyond Perturbations at Scale to Understand Genomes 2-4 September 2019 Wellcome Genome Campus, UK Lectures to be held in the Francis Crick Auditorium Lunch and dinner to be held in the Hall Restaurant Poster sessions to be held in the Conference Centre Spoken presentations - If you are an invited speaker, or your abstract has been selected for a spoken presentation, please give an electronic version of your talk to the AV technician. Poster presentations – If your abstract has been selected for a poster, please display this in the Conference Centre on arrival. Conference programme Monday, 2 September 2019 11:45-12:50 Registration 12:50-13:00 Welcome and introductions Leopold Parts, Wellcome Sanger Institute UK 13.00-14.00 Keynote lecture Allan Bradley Kymab, UK 14:00-15:30 Session 1: Understanding impact of coding variations Chair: Leopold Parts, Wellcome Sanger Institute UK 14:00 New approaches for addressing the effect of genetic variation at scale Douglas Fowler University of Washington, USA 14:30 Tools for systematic proactive functional testing of human missense variants Frederick Roth University of Toronto 15:00 The mutational landscape of a prion-like domain Benedetta Bolognesi Institute for Bioengineering of Catalonia, Spain 15:15 Functional determination of all possible disease-associated variants in a region of CARD11 using saturation genome editing Richard James University of Washington, USA 15:30-16:00 Afternoon tea 16:00-17:30 Session 2: Measuring consequences of non-coding variation Chair: Lea Starita, University
    [Show full text]
  • Exploiting High Throughput DNA Sequencing Data for Genomic Analysis
    Exploiting high throughput DNA sequencing data for genomic analysis Markus Hsi-Yang Fritz Darwin College A dissertation submitted to the University of Cambridge for the degree of Doctor of Philosophy 14th October 2011 Markus Hsi-Yang Fritz EMBL-European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton, Cambridge, CB10 1SD United Kingdom email: [email protected] This dissertation is the result of my own work and includes nothing which is the outcome of work done in collaboration except where specifically indicated in the text. No part of this work has been submitted or is currently being submitted for any other qualification. This document does not exceed the word limit of 60,000 words1 as defined by the Biology Degree Committee. Markus Hsi-Yang Fritz 14th October 2011 1 excluding bibliography, figures, appendices etc. To an exceptional scientist — my Dad. Exploiting high throughput DNA sequencing data for genomic analysis Summary Markus Hsi-Yang Fritz Darwin College The last few years have witnessed a drastic increase in genomic data. This has been facilitated by the shift away from the Sanger sequencing tech- nique to an array of high-throughput methods — so-called next-generation sequencing technologies. This enormous growth of available DNA data has been a tremendous boon to large-scale genomics studies and has rapidly advanced fields such as environmental genomics, ancient DNA research, population genomics and disease association. On the other hand, however, researchers and sequence archives are now facing an enormous data deluge. Critically, the rate of sequencing data accumulation is now outstripping advances in hard drive capacity, network bandwidth and processing power.
    [Show full text]
  • Advances in Understanding Cancer Genomes Through Second-Generation Sequencing
    Advances in understanding cancer genomes through second-generation sequencing Matthew Meyerson, Stacey Gabriel and Gad Getz Second-generation A major near-term medical impact of the genome comprehensive genome-based diagnosis of cancer is sequencing technology revolution will be the elucidation of mecha- becoming increasingly crucial for therapeutic decisions. Used in this Review to refer to nisms of cancer pathogenesis, leading to improvements During the past decades, there have been major sequencing methods that have in the diagnosis of cancer and the selection of cancer advances in experimental and informatic methods emerged since 2005 that second-generation sequencing parallelize the sequencing treatment. Thanks to for genome characterization based on DNA and RNA 1–5 process and produce millions technologies , recently it has become feasible to microarrays and on capillary-based DNA sequenc- of typically short sequence sequence the expressed genes (‘transcriptomes’)6,7, ing (‘first-generation sequencing’, also known as Sanger reads (50–400 bases) from known exons (‘exomes’)8,9, and complete genomes10–15 sequencing). These technologies provided the ability amplified DNA clones. of cancer samples. to analyse exonic mutations and copy number altera- It is also often known as next-generation sequencing. These technological advances are important for tions and have led to the discovery of many important advancing our understanding of malignant neoplasms alterations in the cancer genome20. because cancer is fundamentally a disease of the genome. However, there are particular challenges for the A wide range of genomic alterations — including point detection and diagnosis of cancer genome alterations. mutations, copy number changes and rearrangements For example, some genomic alterations in cancer are — can lead to the development of cancer.
    [Show full text]