Single-Cell Genome Sequencing: Current State of the Science

Total Page:16

File Type:pdf, Size:1020Kb

Single-Cell Genome Sequencing: Current State of the Science REVIEWS SINGLE-CELL OMICS Single-cell genome sequencing: current state of the science Charles Gawad1, Winston Koh2,3 and Stephen R. Quake2,3 Abstract | The field of single-cell genomics is advancing rapidly and is generating many new insights into complex biological systems, ranging from the diversity of microbial ecosystems to the genomics of human cancer. In this Review, we provide an overview of the current state of the field of single-cell genome sequencing. First, we focus on the technical challenges of making measurements that start from a single molecule of DNA, and then explore how some of these recent methodological advancements have enabled the discovery of unexpected new biology. Areas highlighted include the application of single-cell genomics to interrogate microbial dark matter and to evaluate the pathogenic roles of genetic mosaicism in multicellular organisms, with a focus on cancer. We then attempt to predict advances we expect to see in the next few years. Genetic mosaicism Cell theory provided an entirely new framework for In this Review, we describe the current state of the Occurs when there are at least understanding biology and disease by asserting that field, including approaches for cell isolation, whole-­ two genotypes in different cells cells are the basic unit of life1. The subsequent discov- genome amplification (WGA), DNA sequencing con- of the same organism. ery that DNA is the heritable programme that encodes siderations and sequence data analysis, and highlight Whole-genome the proteins that carry out cellular functions led to how recent progress is addressing some of the technical amplification the development of the fields of modern genetics and challenges associated with these approaches. We then The use of biochemical genomics2. Although bulk approaches for studying discuss how those advancements have begun to fulfil methods to produce multiple genetic variation have identified thousands of new some of the ambitious aspirations for the field in appli- copies of the entire genome. unicellular species and determined genetic aetiologies cations such as identifying new features of microbial for thousands of human diseases, most of that work ecosystems and characterizing human intercellular has been done at the level of the ecosystem or organ- genetic heterogeneity, in particular in cancer. ism3,4. However, we now know that diversity within an ecosystem of unicellular species is far greater than Technological challenges we can accurately measure by studying a mixed group Acquiring high-quality single-cell sequencing data has of organisms, and that the genomes within the cells of four primary technical challenges: efficient physical iso- an individual multicellular organism are not always lation of individual cells; amplification of the genome of the same. that single cell to acquire sufficient material for down- 1Departments of Oncology Single-cell genomics aims to provide new per- stream analyses; querying the genome in a cost-effective and Computational Biology, spectives to our understanding of genetics by bring- manner to identify variation that can test the hypotheses St. Jude Children’s Research ing the study of genomes to the cellular level (FIG. 1). of the study; and interpreting the data within the con- Hospital, Memphis, These tools are opening up new frontiers by dissect- text of biases and errors that are introduced during the Tennessee 38105, USA. 2Departments of ing the contributions of individual cells to the biology first three steps. To maximize the quality of single-cell Bioengineering and Applied of ecosystems and organisms. For example, it is now data and to ensure that the signal is separable from Physics, Stanford University, possible to use single-cell genomics to identify and technical noise, each of these variables requires careful Stanford, California 94304, assemble the genomes of unculturable microorgan- consideration when designing single-cell studies. USA. 5 genetic mosaicism 3Howard Hughes Medical isms , evalu­ate the roles of in normal 6 Institute, Stanford University, physiology and disease , and determine the contribu- Cell isolation. The first step in isolating individual California 94304, USA. tions of intra-tumour genetic heterogeneity in cancer cells from primary samples is to produce a suspension 7 Correspondence to S.R.Q. development or treatment response . However, this of viable single cells. This is not trivial when working [email protected] field rests on the ability to study a single DNA mol- with complex solid tissues, which require mechanical or doi:10.1038/nrg.2015.16 ecule from individually isolated cells, a process that is enzymatic dissociation that keeps the cells viable while Published online 25 Jan 2016 technically challenging. not biasing for specific subpopulations. In addition, NATURE REVIEWS | GENETICS VOLUME 17 | MARCH 2016 | 175 © 2016 Macmillan Publishers Limited. All rights reserved REVIEWS Example Output Unicellular Multicellular Output Example applications microorganisms eukaryotes applications List of variant frequencies Species Variant Allele frequency Cataloguing detected Sampling A W% variants and A Metagenomic their allele of entire B X% B frequencies in population C Y% C Tissue tissue samples D Z% D Variant Species A B C D Segregating detected Detection 1 variants to of rare A Mini- Single determine clonal microorganisms B metagenomic cell Cell 2 structures, E 3 identifying rare variants Genomic complexity of sample Phased genomic variants Assembly Genome Assembly of A C B D of new assemblies Single Single low-complexity microbial microorganism chromosome genomic regions genomes Chromosome 1a Chromosome 1b Figure 1 | Opportunities enabled by single-cell sequencing strategies. Single-cell approaches provide higher-resolution views of the genomic content of samples by reducing the complexity of the genomicNature Reviewssignal through | Genetics the physical separation of cells or chromosomes. Shown in the centre of the figure are schematics of the different levels of cellular complexity analysed by approaches in microorganisms and multicellular eukaryotes. Extending outwards are simplified diagrams of data outputs from these approaches and examples of applications, for unicellular microorganisms (extending to the left) and multicellular eukaryotes (extending to the right). For microbial genomics, decreasing the number of organisms enables the detection of rare microorganisms in a sample. Single-cell sequencing allows for the assembly of the entire genomes of new microorganisms. Single-cell sequencing of multicellular organisms can reveal rare genetic variants and provide information on the co‑occurrence of mutations and evolutionary history of samples. Single chromosome sequencing allows for the phasing of variants across a genome. diseased tissues can have different dissociation kinet- samples, depending on the environmental source ics when compared with their normal counterparts, as additional sample preparation and FACS setting con- well as varied dissociation between samples of the same siderations can also be required17. Finally, automated disease. Standard digestion protocols for commonly micromanipulation methods that use droplets or micro- studied tissues, as well as vigorous approaches for opti- mechanical valves in microfluidic devices are entering mizing the dissociation of rare or diseased tissues, are mainstream use18–20. Regardless of the method used, it areas that require further development. Laser-capture is also important to accurately confirm that a single cell microdissection8 provides a low-throughput way of has been physically isolated so that spurious biological isolating DNA from single cells in their native spatial conclusions are not made after evaluating chambers that context, but the quality of sequencing data derived from are empty or contain multiple cells. In the ideal case microdissected single cells has been relatively poor. this can be accomplished by obtaining microscopy data Finally, microfluidic and bead-based methods have from each chamber or well containing a single cell. been developed to specifically enrich for single circu- Various single-cell isolation technologies have lating tumour cells (CTCs), as reviewed in detail else- recently been reviewed, where the trade-offs in accuracy, where9. Environmental microbial samples also require throughput, reproducibility and ease of use were high- efficient lysis of bacteria, with requirements that can be lighted9,21,22. Most of the studies using these technolo- highly variable between species10. gies have been done to illustrate feasibility using a small Once in suspension, several approaches have been number of cells. Addressing many of the fundamental developed to isolate single cells. These include meth- biological questions that are uniquely approachable ods that require manual manipulation, such as serial with single-cell genomics will require the interrogation dilution11, micropipetting12, microwell dilution13 and of thousands of cells, making it more likely that tech- ­optical tweezers14. In addition, several protocols have nologies that are scalable through parallelization, such been developed to isolate intact cells or nuclei using as microfluidic-based approaches, are adopted for the Optical tweezers fluorescence-activated­ cell sorting (FACS)15. Nuclear long term. In addition, identifying scalable methods for Devices that use a laser to manipulate submicron isolation has the advantage of enabling single-­cell single-cell isolation
Recommended publications
  • Single Cell Whole Genome Amplification On-Chip for Forensic Purposes
    Single cell whole genome amplification on-chip for forensic purposes Loes Steller 11814268 MA Forensic Science Literature thesis (5 EC) 14-12-2019 Supervisor: dr. B.B. Bruijns & dr. J.C. Knotter Examiner: prof. dr. A.D. Kloosterman Word count: 9348 (excl references and abstract) Abstract DNA analysis via short tandem repeats (STRs) is a valuable tool for human identification of biological crime traces. HoWever, crime traces With a limited amount of DNA are often not sufficient for STR analysis. In this revieW, a novel approach using single cell Whole genome amplification (WGA) on a microfluidic chip is proposed, With the aim to improve the success rate of DNA analysis in forensic investigations. WGA functions as an extra amplification step to increase the amount of DNA that is available for STR-PCR, Which could improve the success rate of obtaining a DNA profile and alloW mixture deconvolution already at the amplification stage. Via performing single cell WGA on-chip in enclosed nanoscale volumes, time, cost and chance of contamination could potentially be reduced compared to tube-based WGA. Moreover, the portable nature of microfluidic chips might enable direct analysis at the crime scene. This revieW aims to provide an overvieW of available single cell WGA methods, including kits for polymerase chain reaction-based, multiple displacement amplification-based and hybrid methods for single cell WGA. The performance of each method analyzed With STR analysis will be discussed, in order to determine Whether these methods could be useful in forensic investigations. Furthermore, available microfluidic chips for single cell WGA reported in literature Will be described.
    [Show full text]
  • Single-Cell Research an Overview of Recent Single-Cell Research Publications Featuring Illumina® Technology
    Single-Cell Research An Overview of Recent Single-Cell Research Publications Featuring Illumina® Technology TABLE OF CONTENTS 5 Introduction Multiple Annealing and Looping–Based Amplification Cycles 7 Applications Genomic DNA and mRNA Sequencing Cancer 68 Epigenomics Methods Metagenomics Single-Cell Assay for Transposase- Stem Cells Accessible Chromatin Using Developmental Biology Sequencing Immunology Single-Cell Bisulfite Sequencing/ Neurobiology Single-Cell Whole-Genome Bisulfite Sequencing Drug Discovery Single-Cell Methylome & Transcriptome Reproductive Health Sequencing Microbial Ecology and Evolution Single-Cell Reduced-Representation Plant Biology Bisulfite Sequencing Forensics Single-Cell Chromatin Allele-Specific Gene Expression Immunoprecipitation Sequencing Chromatin Conformation Capture 50 Sample Preparation Sequencing Droplet-Based Chromatin 54 Data Analysis Immunoprecipitation Sequencing 60 DNA Methods 78 RNA Methods Multiple-Strand Displacement Designed Primer–Based RNA Amplification Sequencing Genome & Transcriptome Sequencing Single-Cell Universal Poly(A)- Independent RNA Sequencing For Research Use Only. Not for use in diagnostic procedures. An overview of recent publications featuring Illumina tecnology 3 Quartz-Seq Smart-Seq Smart-Seq2 Single-Cell Methylome & Transcriptome Sequencing Genome & Transcriptome Sequencing Genomic DNA and mRNA Sequencing T Cell–Receptor Chain Pairing Unique Molecular Identifiers Cell Expression by Linear Amplification Sequencing Flow Cell–Surface Reverse-Transcription Sequencing Single-Cell Tagged Reverse- Transcription Sequencing Fixed and Recovered Intact Single-Cell RNA Sequencing Cell Labeling via Photobleaching Indexing Droplets Drop-Seq CytoSeq Single-Cell RNA Barcoding and Sequencing High-Throughput Single-Cell Labeling For Research Use Only. Not for use in diagnostic procedures. 4 Single-cell Research INTRODUCTION Living tissues are composed of a variety of cell types. Each cell type has a distinct 1. Tanay A and Regev A.
    [Show full text]
  • (MDA) and Multiple Annealing and Looping-Based Amplification Cycles (MALBAC) in Single-Cell Sequencing
    RESEARCH ARTICLE Comparison of Multiple Displacement Amplification (MDA) and Multiple Annealing and Looping-Based Amplification Cycles (MALBAC) in Single-Cell Sequencing Minfeng Chen1,2., Pengfei Song3., Dan Zou4, Xuesong Hu2, Shancen Zhao2,6*, OPEN ACCESS Shengjie Gao2,5*, Fei Ling1* Citation: Chen M, Song P, Zou D, Hu X, Zhao S, et al. (2014) Comparison of Multiple Displacement 1. School of Bioscience and Bioengineering, South China University of Technology, Guangzhou, 510006, Amplification (MDA) and Multiple Annealing and China, 2. BGI-Shenzhen, Shenzhen, 518083, China, 3. The fourth people’s hospital of Shenzhen (Futian Looping-Based Amplification Cycles (MALBAC) in hospital), Shenzhen, 518033, China, 4. School of Computer, National University of Defense Technology, Single-Cell Sequencing. PLoS ONE 9(12): e114520. Changsha, 410073, China, 5. State Key Laboratory of Agrobiotechnology and School of Life Sciences, The doi:10.1371/journal.pone.0114520 Chinese University of Hong Kong, Hong Kong, China, 6. Department of Molecular Medicine, Aarhus University Hospital, Aarhus, Denmark Editor: Deyou Zheng, Albert Einsten College of Medicine, United States of America *[email protected] (FL); [email protected] (SG); [email protected] (SZ) Received: June 13, 2014 . These authors contributed equally to this work. Accepted: November 9, 2014 Published: December 8, 2014 Copyright: ß 2014 Chen et al. This is an open- access article distributed under the terms of the Abstract Creative Commons Attribution License, which permits unrestricted use, distribution, and repro- Single-cell sequencing promotes our understanding of the heterogeneity of cellular duction in any medium, provided the original author and source are credited. populations, including the haplotypes and genomic variability among different generation of cells.
    [Show full text]
  • MDA in Capillary for Whole Genome Amplification
    MDA in Capillary for Whole Genome Amplification Junji Li a, Na Lu a, Xulian Shi a,b,c, Yi Qiao a, Liang Chen a, Mengqin Duan a, Yong Hou b,c, Qinyu Ge a, Yuhan Tao a, Jing Tu a,*, Zuhong Lu a,* a State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, 210096, China b BGI-Shenzhen, Shenzhen 518083, China c China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China * Correspondence: [email protected], [email protected]. Abstract: Whole genome amplification (WGA) plays an important role in sample preparation of low-input templates for high-throughput sequencing. Multiple displacement amplification (MDA), a popular isothermal WGA methods, suffers a major hurdle of highly uneven amplification. Optimizations have been made in the past by separating the reagents into numbers of tiny chambers or droplets in microfluidic devices, which significantly improves the amplification uniformity of MDA. However, skill barrier still exists for biological researchers to handle chip fabrication and droplet manipulation. Here, we present a novel MDA protocol, in-capillary MDA (icMDA), which significantly simplifies the manipulation and improves the uniformity of amplification by dispersing reagents in a long quasi-1D capillary tubing. We demonstrated that icMDA is able to accurately detect SNVs with higher efficiency and sensitivity. Moreover, this straightforward method employs neither customized instruments nor complicated operations, making it a ready-to-use approach for most laboratories. Keywords: Whole genome amplification; Multiple displacement amplification; In-capillary reaction; High-throughput sequencing Introduction Over the past 10 years, whole genome amplification (WGA), which is a bridge from low initial genomic DNA to high-throughput sequencing library (Shapiro et al.
    [Show full text]
  • DNA SEQUENCING METHODS COLLECTION an Overview of Recent DNA-Seq Publications Featuring Illumina® Technology TABLE of CONTENTS
    DNA SEQUENCING METHODS COLLECTION An overview of recent DNA-seq publications featuring Illumina® technology TABLE OF CONTENTS 06 Introduction 07 Sequence Rearrangements 08 RAD and PE RAD-Seq: Restriction-Site Associated DNA Sequencing 11 ddRADseq: Double Digest Restriction-Site Associated DNA Marker Generation 13 2b-RAD: RAD With Type IIB Restriction Endonucleases 14 SLAF-seq: Specific Locus Amplified Fragment Sequencing 16 hyRAD: Hybridization RAD for Degraded DNA 17 Rapture: Restriction-Site Associated DNA Capture 18 Digenome-seq: Cas9-Digested Whole-Genome Sequencing 19 CAP-seq: CXXC Affinity Purification Sequencing 20 CPT-seq: Contiguity-Preserving Transposition Sequencing 21 RC-Seq: Retrotransposon Capture Sequencing 22 Tn-Seq: Transposon Sequencing INSeq: Insertion Sequencing 24 TC-Seq: Translocation-Capture Sequencing 25 Rep-Seq: Repertoire Sequencing Ig-seq: DNA Sequencing of Immunoglobulin Genes MAF: Molecular Amplification fingerprinting 26 EC-seq: Excision Circle Sequencing 27 Bubble-Seq: Libraries of Restriction Fragments that Contain Replication Initiation Sites (Bubbles) 28 NSCR: Nascent Strand Capture and Release 29 Repli-Seq: Nascent DNA Replication Strand Sequencing 30 NS-Seq: Nascent Strand Sequencing 31 DNA break mapping 32 Map DNA Single-Strand Breaks (SSB-Seq) 33 BLESS: Breaks labeling and Enrichment on Streptavidin and Sequencing 34 DSB-Seq: Map DNA Double-Strand Breaks 35 Break-seq: Double-Stranded Break Labeling 36 GUIDE-seq: Genome-Wide, Unbiased Identification of DSBs Enabled by Sequencing 38 HTGTS: High-Throughput
    [Show full text]
  • Sequencing and Proteomics M ETHODS in M OLECULAR B IOLOGY
    Methods in Molecular Biology 1979 Valentina Proserpio Editor Single Cell Methods Sequencing and Proteomics M ETHODS IN M OLECULAR B IOLOGY Series Editor John M. Walker School of Life and Medical Sciences University of Hertfordshire Hatfield, Hertfordshire, AL10 9AB, UK For further volumes: http://www.springer.com/series/7651 Single Cell Methods Sequencing and Proteomics Edited by Valentina Proserpio Department of Life Sciences and System Biology, University of Turin, Italian Institute for Genomic Medicine, IIGM Turin, Torino, Italy Editor Valentina Proserpio Department of Life Sciences and System Biology University of Turin, Italian Institute for Genomic Medicine, IIGM Turin Torino, Italy ISSN 1064-3745 ISSN 1940-6029 (electronic) Methods in Molecular Biology ISBN 978-1-4939-9239-3 ISBN 978-1-4939-9240-9 (eBook) https://doi.org/10.1007/978-1-4939-9240-9 © Springer Science+Business Media, LLC, part of Springer Nature 2019 This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors, and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication.
    [Show full text]
  • Single-Cell Sequencing of the Small and AT-Skewed Genome of Malaria
    Liu et al. Genome Medicine (2021) 13:75 https://doi.org/10.1186/s13073-021-00889-9 METHOD Open Access Single-cell sequencing of the small and AT- skewed genome of malaria parasites Shiwei Liu1, Adam C. Huckaby1, Audrey C. Brown1, Christopher C. Moore2, Ian Burbulis3,4, Michael J. McConnell3,5,6 and Jennifer L. Güler1,2* Abstract Single-cell genomics is a rapidly advancing field; however, most techniques are designed for mammalian cells. We present a single-cell sequencing pipeline for an intracellular parasite, Plasmodium falciparum, with a small genome of extreme base content. Through optimization of a quasi-linear amplification method, we target the parasite genome over contaminants and generate coverage levels allowing detection of minor genetic variants. This work, as well as efforts that build on these findings, will enable detection of parasite heterogeneity contributing to P. falciparum adaptation. Furthermore, this study provides a framework for optimizing single-cell amplification and variant analysis in challenging genomes. Keywords: Whole-genome amplification, AT-skewed genome, Malaria, Single-cell sequencing, MALBAC, Copy number variation, Single-nucleotide polymorphism Background deletion of a genomic region) contribute to antimalarial Malaria is a life-threatening disease caused by protozoan resistance in P. falciparum [4–12]. It is important to as- Plasmodium parasites. P. falciparum causes the greatest sess genetic diversity within parasite populations to bet- number of human malaria deaths [1]. The clinical symp- ter understand the mechanisms of rapid adaption to toms of malaria occur when parasites invade human antimalarial drugs and other selective forces. These stud- erythrocytes and undergo rounds of asexual reproduction ies are often complicated by multi-clonal infections and by maturing from early-stage into late-stage forms and limited parasite material from clinical isolates.
    [Show full text]
  • Improved Clinical Outcomes of Preimplantation Genetic Testing For
    Niu et al. BMC Pregnancy and Childbirth (2020) 20:388 https://doi.org/10.1186/s12884-020-03082-9 RESEARCH ARTICLE Open Access Improved clinical outcomes of preimplantation genetic testing for aneuploidy using MALBAC-NGS compared with MDA-SNP array Wenbin Niu1,2†, Linlin Wang1,2†, Jiawei Xu1,2, Ying Li1,2, Hao Shi1,2, Gang Li1,2, Haixia Jin1,2, Wenyan Song1,2, Fang Wang1,2 and Yingpu Sun1,2* Abstract Background: To assess whether preimplantation genetic testing for aneuploidy with next generation sequencing (NGS) outweighs single nucleotide polymorphism (SNP) array in improving clinical outcomes. Methods: A retrospective analysis of the clinical outcomes of patients who underwent PGT-A treatment in a single center from January 2013 to December 2017.A total of 1418 couples who underwent PGT-A treatment were enrolled, of which 805 couples used NGS for PGT-A, while the remaining 613 couples used SNP array for PGT-A. Clinical pregnancy rate, miscarriage rate and healthy baby rate were compared between the MALBAC-NGS-PGT-A and MDA-SNP-PGT-A groups. Results: After testing karyotypes of 5771 biopsied blastocysts, 32.2% (1861/5771) were identified as chromosomally normal, while 67.8% were chromosomally abnormal. In terms of clinical outcomes, women in the MALBAC-NGS- PGT-A group had a significantly higher clinical pregnancy rate (50.5% vs 41.7%, p = 0.002) and healthy baby rate (39.6% vs 31.4%, p = 0.003), and a lower miscarriage rate (15.5% vs 22.8%, p = 0.036). Conclusion: This is the largest study reporting the extensive application of NGS-based PGT-A, whilst comparing the clinical outcomes of MALBAC-NGS-PGT-A and MDA-SNP-PGT-A.
    [Show full text]