Massively Parallel Sequencing for Monitoring Genetic Consistency and Quality Control of Live Viral Vaccines

Total Page:16

File Type:pdf, Size:1020Kb

Massively Parallel Sequencing for Monitoring Genetic Consistency and Quality Control of Live Viral Vaccines Massively parallel sequencing for monitoring genetic consistency and quality control of live viral vaccines Alexander Neverov and Konstantin Chumakov1 Center for Biologics Evaluation and Research, Food and Drug Administration, Rockville, MD 20852 Edited* by Robert H. Purcell, National Institutes of Health, Bethesda, MD, and approved October 6, 2010 (received for review August 24, 2010) Intrinsic genetic instability of RNA viruses may lead to the accumu- uation. MAPREC is currently recommended by the World lation of revertants during manufacture of live viral vaccines, Health Organization (WHO) for screening of batches of OPV requiring rigorous quality control to ensure vaccine safety. Each before they can be released for use in humans (11, 18). lot of oral poliovirus vaccine (OPV) is tested for neurovirulence in Despite high sensitivity and accuracy of MAPREC, it can be animals and also for the presence of neurovirulent revertants. used to monitor only a limited number of genomic loci known to Mutant analysis by PCR and restriction enzyme cleavage (MAPREC) contain the markers of attenuation and misses mutations at other is used to measure the frequency of neurovirulent mutations at the sites, which can also contribute to loss of the attenuated phe- 5′ untranslated region (UTR) of the viral genome that correlate with notype. For most viruses, determinants of attenuation are un- the level of neurovirulence determined by the monkey neuroviru- known, which limits the utility of this approach for other live viral lence test. However, MAPREC can only monitor mutations at a few vaccines. Nevertheless, ensuring molecular consistency of vac- genomic loci and miss mutations at other sites that could adversely cine batches by requiring that no mutation accumulates beyond affect vaccine quality. Here we propose to use massively parallel the level present in past batches with good clinical record could sequencing (MPS) for sensitive detection and quantification of all help maintain vaccine safety (19). Therefore methods enabling mutations in the entire genome of attenuated viruses. Analysis of detection of unstable genomic loci in attenuated strains could be vaccine samples and reference preparations demonstrated a perfect used for the monitoring of genetic stability. Such studies are agreement with MAPREC results. Quantitative MPS analysis of usually conducted by serial passaging of attenuated strains in validated reference preparations tested by MAPREC produced iden- vivo and in vitro, followed by screening for the presence of small tical results, suggesting that the method could take advantage of numbers of mutants that may have accumulated in viral pop- MICROBIOLOGY the existing reference materials and be used as a replacement for ulations (16, 17, 20, 21). Traditional sequencing approaches can the MAPREC procedure in lot release of OPV. Patterns of mutations detect mutants only if their content exceeds 20–30% and are present at a low level in vaccine preparations were characteristic of therefore of limited utility. Other approaches based on analysis seed viruses used for their manufacture and could be used for iden- of electrophoretic mobility in gels (22, 23) do not always allow fi ti cation of individual batches. This approach may represent the ulti- precise location of mutations. Matrix-assisted laser desorption/ mate tool for monitoring genetic consistency of live viral vaccines. ioniation-time of flight (MALDI-TOF) mass spectrometry (24) and hybridization with microarrays of short oligonucleotides (25, fi mutant quanti cation | quasispecies | sequence heterogeneity | vaccine 26) are more sensitive, but are relatively laborious and may re- safety | neurovirulence quire follow-up by direct sequencing. In the past few years, a set of new technologies for massively igh mutation rates inherent to replication of RNA viruses parallel sequencing (MPS), which are also referred to as high- Hcreate a wide variety of mutants that are present in virus throughput, next generation, or ultradeep sequencing, have populations, which are often referred to as quasispecies (1). The emerged. They allow rapid generation of extremely large amounts diffuse, “cloud-like” nature of viral populations allows them to of sequence information (27) and are primarily used to sequence rapidly adapt to changing replicative environments by selecting genomes of higher organisms and for metagenomic studies of preexisting variants with better fitness (2–4). Many important virus populations of various bacteria and viruses (28, 29). These new properties cannot be explained by a mere consensus sequence, technologies are also very promising for analysis of viral quasis- but require knowledge about the microvariants present in viral pecies and have been used to study HIV (30, 31), severe acute stocks. One such property is virulence, which is critically relevant to respiratory syndrome (SARS) (32), hepatitis B (33), and other vaccine development and manufacture. RNA viruses (29). MPS was also used for detection of adventitious Attenuated Sabin strains used in manufacture of oral poliovirus agents in vaccines (34). Here we demonstrate that MPS can be vaccine (OPV) (5–7) can regain neurovirulence during growth used to identify very small amounts of mutant viruses in prepara- in vaccine recipients and propagation in cell cultures (8). It tions of live viral vaccines as well as for their accurate quantifica- occurs because of the accumulation of mutants with higher neu- tion. The ability to quantify potentially harmful mutations in rovirulence (9, 10), which include changes in the 5′ untranslated vaccine batches makes this method suitable for quality control region (UTR) (10). Because of this genetic instability, every batch and ensuring manufacturing consistency of live viral vaccines. of OPV must be tested for neurovirulence in monkeys (11) or transgenic mice susceptible to poliomyelitis (12, 13). Previously, we Results developed a highly sensitive molecular method, mutant analysis In the first experiment, pyrosequencing (technology developed by PCR and restriction enzyme cleavage (MAPREC), which en- by 454 Life Sciences, a Roche company, further referred to as abled us to quantify the 5′ UTR revertants in monovalent batches of OPV and demonstrate that their content in vaccine lots directly correlates with the results of the monkey neurovirulence test Author contributions: A.N. and K.C. designed research; A.N. and K.C. performed research; (MNVT) (14). The content of 472-C revertants in type 3 OPV is A.N. and K.C. analyzed data; and A.N. and K.C. wrote the paper. usually below 0.8% in batches of vaccine that passed the MNVT, The authors declare no conflict of interest. whereas batches that failed the test usually contain more than *This Direct Submission article had a prearranged editor. 1% of 472-C revertants (15). Similar methods were developed to Freely available online through the PNAS open access option. quantify mutants with 480-A in Sabin 1 and 481-G in Sabin 2 1To whom correspondence should be addressed. E-mail: [email protected]. strains (16, 17) that were also shown to be determinants of atten- gov. www.pnas.org/cgi/doi/10.1073/pnas.1012537107 PNAS Early Edition | 1of6 Downloaded by guest on September 24, 2021 454 Life Sciences) was used to analyze full-length PCR ampli- Table 1. Frequencies of mutations identified by pyrosequencing cons prepared from two samples of type 3 OPV: one that passed in plasmid DNA, PCR product, and viruses the monkey neurovirulence test and another that failed. Both Mutation frequencies were made from passage level 3 of the Sabin original seed stock (SO+3). The lot that passed the MNVT was manufactured in Total Insertions primary monkey kidney cells, whereas the failed lot was grown in nucleotides Substitutions, and WI-38 human diploid cells. Fig. 1A shows that the number of Sample read, 106 % deletions, % times each nucleotide was determined in positive and cDNA strands was about the same, but varied between 2,000 and 10,000 Plasmid 28.1 0.050 0.424 depending on the genomic location with the total for both PCR product 40.1 0.104 0.860 strands being between 6,000 and 15,000. Both samples contained Plaque 18.2 0.121 0.719 a number of mutants, the most prominent of which was the one HeLa p1 20.7 0.143 0.694 HeLa p7 36.6 0.155 0.858 with C2493U mutation, which changes threonine to isoleucine at amino acid 6 of the capsid protein VP1 (35, 36). The mutation Vero p5 19.0 0.197 1.232 does not affect neurovirulence, but the virus with 2493-U rapidly accumulates upon growth in cultured cells (15). Importantly, in agreement with our previous MAPREC results, there was a dif- 0.05%, whereas there were 0.104% mutations in PCR product ference in the content of 472-C mutants, which correlated with and 0.121% in the rederived virus. Although there was no ap- the MNVT results. The vaccine lot that passed the test contained parent bias for the type of mutations detected in plasmid 0.35% 472-C, whereas the lot that failed it contained 2.4%. DNA (Fig. 2), in all other samples prepared by PCR amplifica- There were a number of other mutations reaching several tion there were significantly more transitions (purine–purine and percent of the population in addition to a low level of less than pyrimidine–pyrimidine changes) than transversions (purine– 0.5% present at all nucleotide positions. This background level pyrimidine and pyrimidine–purine changes). After passaging the may represent true heterogeneity due to the quasispecies nature virus in HeLa cells
Recommended publications
  • Quasispecies Theory in Virology
    JOURNAL OF VIROLOGY, Jan. 2002, p. 463–465 Vol. 76, No. 1 0022-538X/02/$04.00ϩ0 DOI: 10.1128/JVI.76.1.463–465.2002 Copyright © 2002, American Society for Microbiology. All Rights Reserved. Quasispecies Theory in Virology Holmes and Moya claim that quasispecies is an unnecessary ular sort. Darwinian principles in connection with quasispecies and misleading description of RNA virus evolution, that virol- have been explicitly invoked by theoreticians and experimen- ogists refer to quasispecies inappropriately, and that there is talists alike (11, 13, 16, 35). little evidence of quasispecies in RNA virus evolution. They What is the evidence of quasispecies dynamics in RNA virus wish to look for other ideas in evolutionary biology and to set populations, and why is quasispecies theory exerting an influ- down an agenda for future research. I argue here that real virus ence in virology? The initial experiment with phage Q␤ which quasispecies often differ from the theoretical quasispecies as provided the first experimental support for a quasispecies dy- initially formulated and that this difference does not invalidate namics in an RNA virus (14, 17) has now been carried out with quasispecies as a suitable theoretical framework to understand biological and molecular clones of representatives of the major viruses at the population level. groups of human, animal, and plant RNA viruses, including In the initial theoretical formulation to describe error-prone immunodeficiency viruses and hepatitis C virus, both in cell replication of simple RNA (or RNA-like) molecules, quasispe- culture and in vivo (11). Support for quasispecies has also cies were defined as stationary (equilibrium) mutant distribu- come from studies on replication of RNA molecules in vitro tions of infinite size, centered around one or several master (4).
    [Show full text]
  • Metagenomics Approaches for the Detection and Surveillance of Emerging and Recurrent Plant Pathogens
    microorganisms Review Metagenomics Approaches for the Detection and Surveillance of Emerging and Recurrent Plant Pathogens Edoardo Piombo 1,2 , Ahmed Abdelfattah 3,4 , Samir Droby 5, Michael Wisniewski 6,7, Davide Spadaro 1,8,* and Leonardo Schena 9 1 Department of Agricultural, Forest and Food Sciences (DISAFA), University of Torino, 10095 Grugliasco, Italy; [email protected] 2 Department of Forest Mycology and Plant Pathology, Uppsala Biocenter, Swedish University of Agricultural Sciences, P.O. Box 7026, 75007 Uppsala, Sweden 3 Institute of Environmental Biotechnology, Graz University of Technology, Petersgasse 12, 8010 Graz, Austria; [email protected] 4 Department of Ecology, Environment and Plant Sciences, University of Stockholm, Svante Arrhenius väg 20A, 11418 Stockholm, Sweden 5 Department of Postharvest Science, Agricultural Research Organization (ARO), The Volcani Center, Rishon LeZion 7505101, Israel; [email protected] 6 U.S. Department of Agriculture—Agricultural Research Service (USDA-ARS), Kearneysville, WV 25430, USA; [email protected] 7 Department of Biological Sciences, Virginia Technical University, Blacksburg, VA 24061, USA 8 AGROINNOVA—Centre of Competence for the Innovation in the Agroenvironmental Sector, University of Torino, 10095 Grugliasco, Italy 9 Department of Agriculture, Università Mediterranea, 89122 Reggio Calabria, Italy; [email protected] * Correspondence: [email protected]; Tel.: +39-0116708942 Abstract: Globalization has a dramatic effect on the trade and movement of seeds, fruits and vegeta- bles, with a corresponding increase in economic losses caused by the introduction of transboundary Citation: Piombo, E.; Abdelfattah, A.; plant pathogens. Current diagnostic techniques provide a useful and precise tool to enact surveillance Droby, S.; Wisniewski, M.; Spadaro, protocols regarding specific organisms, but this approach is strictly targeted, while metabarcoding D.; Schena, L.
    [Show full text]
  • Viral Quasispecies Reconstruction Using Next Generation Sequencing Reads
    Georgia State University ScholarWorks @ Georgia State University Computer Science Dissertations Department of Computer Science Summer 8-12-2013 Viral Quasispecies Reconstruction Using Next Generation Sequencing Reads Bassam A. Tork Georgia State University Follow this and additional works at: https://scholarworks.gsu.edu/cs_diss Recommended Citation Tork, Bassam A., "Viral Quasispecies Reconstruction Using Next Generation Sequencing Reads." Dissertation, Georgia State University, 2013. https://scholarworks.gsu.edu/cs_diss/77 This Dissertation is brought to you for free and open access by the Department of Computer Science at ScholarWorks @ Georgia State University. It has been accepted for inclusion in Computer Science Dissertations by an authorized administrator of ScholarWorks @ Georgia State University. For more information, please contact [email protected]. VIRAL QUASISPECIES RECONSTRUCTION USING NEXT GENERATION SEQUENCING READS by BASSAM TORK Under the Direction of Dr. Alexander Zelikovsky ABSTRACT The genomic diversity of viral quasispecies is a subject of great interest, especially for chronic infections. Characterization of viral diversity can be addressed by high-throughput sequencing technology (454 Life Sciences, Illumina, SOLiD, Ion Torrent, etc.). Standard assembly software was originally designed for single genome assembly and cannot be used to assemble and estimate the frequency of closely related quasispecies sequences. This work focuses on parsimonious and maximum likelihood models for assembling viral quasispecies
    [Show full text]
  • Viruses at the Edge of Adaptation
    Virology 270, 251–253 (2000) doi:10.1006/viro.2000.0320, available online at http://www.idealibrary.com on MINIREVIEW Viruses at the Edge of Adaptation Esteban Domingo1 Centro de Biologı´a Molecular “Severo Ochoa,” (CSIC-UAM) Universidad Auto´noma de Madrid, Cantoblanco, 28049 Madrid, Spain Received February 25, 2000; accepted March 14, 2000 How vulnerable is the line that separates adaptation from extinction? Viruses, in particular RNA viruses, are well known for their high rates of genetic variation and their potential to adapt to environmental modifications (Drake and Holland, 1999; Domingo et al., 2000). Yet, fitness variations—both increases and decreases—can be spectacularly rapid, and the simple genetic stratagem of forcing virus multiplication to go through repeated genetic bottlenecks can induce fitness losses, at times near viral extinction. New information has been recently obtained on the two sides of the survival line: the edge of adaptation and the edge of extinction. © 2000 Academic Press SUBTLE ADAPTIVE STRATEGIES present in the population. Furthermore, this particular variant was systematically selected, while, when absent RNA virus quasispecies are highly effective in the from memory, its selection was rare (Ruiz-Jarabo et al., exploration of new genomic sequences (Eigen and 2000). In agreement with the concept that memory ge- Biebricher, 1988). It has long been known that mutant nomes are part of the mutant spectrum of the quasispe- swarms (the mutant spectra that populate viral quasispe- cies, memory was erased when the populations were cies) contain potentially useful variants [those with in- subjected to genetic bottlenecks (Fig. 1). Because of the creased resistance to antiviral agents, altered interferon- existence of a molecular memory, a viral quasispecies inducing capacity, or antibody-escape and cytotoxic T may be capable of reacting swiftly to a selective con- lymphocyte (CTL)-escape mutants, among others].
    [Show full text]
  • High-Throughput Pyrosequencing (To Use GS 20; Whole Genome Sequencer)
    2007 International Meeting of the Microbiological Society of Korea >>> W1-1 High-throughput Pyrosequencing (to use GS 20; whole genome sequencer) Youseak Go Macrogen Corporation Genomic Analysis Division The Genome sequencer FLX system, developed by 454 Life Science Corporation, is an ultra-high-throughput automatic DNA sequencing system capable carrying out and monitoring sequencing reactions in a massively parallel fashion. Hundreds of thousands of simultaneous reactions, in the “well” of a PicoTiterPlate device. The basic chemistry utilize the release of pyrophosphate (PPi) that occurs with each nucleotide addition during DNA-directed DNA synthesis to generate an amount of light commensurate with the amount of PPi released; this light is captured by a charge-coupled device camera and converted into a digital signal. The raw data resulting from a sequencing run consists of a series of digital images captured by the camera, where the images are a representation of the surface of the PicoTiterPlate device over which the sequencing reaction are taking place; and each image corresponding to one reagent flow over that surface, as defined by the Run script. If the sample DNA fragment present in a given PicoTiterPlate well is extended during a nucleotide flow, light is emitted from the well and captured on the image corresponding to that flow. Furthermore, the amount of light emitted is proportional to the number of nucleotide extended. Knowledge of the nucleotide flowed while each image is being captured, of location on the PicoTiterPlate device where light is being emitted during each flow allows the software to identify PicoTiterPlate wells that contain a DNA library fragment and determine the sequence of the DNA fragments present in each well.
    [Show full text]
  • Evolutionary Virology at 40
    | PERSPECTIVES Evolutionary Virology at 40 Jemma L. Geoghegan* and Edward C. Holmes†,‡,§,**,1 *Department of Biological Sciences, Macquarie University, Sydney, New South Wales 2109, Australia and †Marie Bashir Institute for Infectious Diseases and Biosecurity, ‡Charles Perkins Centre, §School of Life and Environmental Sciences, and **Sydney Medical School, The University of Sydney, New South Wales 2006, Australia ORCID IDs: 0000-0003-0970-0153 (J.L.G.); 0000-0001-9596-3552 (E.C.H.) ABSTRACT RNA viruses are diverse, abundant, and rapidly evolving. Genetic data have been generated from virus populations since the late 1970s and used to understand their evolution, emergence, and spread, culminating in the generation and analysis of many thousands of viral genome sequences. Despite this wealth of data, evolutionary genetics has played a surprisingly small role in our understanding of virus evolution. Instead, studies of RNA virus evolution have been dominated by two very different perspectives, the experimental and the comparative, that have largely been conducted independently and sometimes antagonistically. Here, we review the insights that these two approaches have provided over the last 40 years. We show that experimental approaches using in vitro and in vivo laboratory models are largely focused on short-term intrahost evolutionary mechanisms, and may not always be relevant to natural systems. In contrast, the comparative approach relies on the phylogenetic analysis of natural virus populations, usually considering data collected over multiple cycles of virus–host transmission, but is divorced from the causative evolutionary processes. To truly understand RNA virus evolution it is necessary to meld experimental and comparative approaches within a single evolutionary genetic framework, and to link viral evolution at the intrahost scale with that which occurs over both epidemiological and geological timescales.
    [Show full text]
  • Pyrosequencing Technology for Short DNA Sequencing and Whole
    生物物理 47(2),129-132(2007) 理論/実験 技術 Pyrosequencing Technology for Short DNA Sequencing and Whole Genome Sequencing Baback Gharizadeh1, Mehran Ghaderi2 and Pål Nyrén3 1Stanford Genome Technology Center, Stanford University 2Clinical Pathology/Cytology, Karolinska University Hospital 3Department of Biotechnology, Royal Institute of Technology 1.Introduction 2.Pyrosequencing chemistry Recent impressive advances in DNA sequencing technolo- Pyrosequencing technique is based on sequencing-by-syn- gies have accelerated the detailed analysis of genomes from thesis principle8), 9) and employs a series of four enzymes to ac- many organisms. We have been observing reports of complete curately detect short nucleic acid sequences during DNA syn- or draft versions of the genome sequence of several well-stud- thesis. In Pyrosequencing10), the sequencing primer is ied, multicellular organisms. Human biology and medicine hybridized to a single-stranded DNA biotin-labeled template are in the midst of a revolution by Human Genome Project1) (post-PCR alkali treated) and mixed with the enzymes; DNA as the main catalyst. polymerase, ATP sulfurylase, luciferase and apyrase, and the The chain termination sequencing method, also known as substrates adenosine 5′ phosphosulfate (APS) and luciferin. Sanger sequencing, developed by Frederick Sanger and col- Fig. 1 illustrates schematically the chemistry of Pyrose- leagues2), has been the most widely used sequencing method quencing method. Cycles of four deoxynucleotide triphos- since its advent in 1977 and still is in use after more than 29 phates (dNTPs) are separately added to the reaction mixture years since its advent. The remarkable advances in chemistry iteratively. After each nucleotide dispensation, DNA poly- and automation to the Sanger sequencing method has made it merase catalyzes the incorporation of complementary dNTP to a simple and elegant technique, central to almost all past into the template strand.
    [Show full text]
  • Error Threshold in RNA Quasispecies Models with Complementation
    ARTICLE IN PRESS Journal of Theoretical Biology 265 (2010) 278–286 Contents lists available at ScienceDirect Journal of Theoretical Biology journal homepage: www.elsevier.com/locate/yjtbi Error threshold in RNA quasispecies models with complementation Josep Sardanye´s a,Ã, Santiago F. Elena a,b a Instituto de Biologı´a Molecular y Celular de Plantas, Consejo Superior de Investigaciones Cientı´ficas-UPV, Ingeniero Fausto Elio s/n, 46022 Valencia, Spain b Santa Fe Institute, 1399 Hyde Park Road, Santa Fe NM 87501, USA article info abstract Article history: A general assumption of quasispecies models of replicons dynamics is that the fitness of a genotype is Received 22 December 2009 entirely determined by its sequence. However, a more biologically plausible situation is that fitness Received in revised form depends on the proteins that catalyze metabolic reactions, including replication. In a stirred population 26 April 2010 of replicons, such as viruses replicating and accumulating within the same cell, the association between Accepted 14 May 2010 a given genome and the proteins it encodes is not tight as it can be replicated by proteins translated Available online 21 May 2010 from other genomes. We have investigated how this complementation phenomenon affects the error Keywords: threshold in simple quasispecies mean field models. We first studied a model in which the master and Complementation the mutant genomes code for wild-type and mutant replicases, respectively. We assume that the Defective interfering particles mutant replicase has a reduced activity and that the wild-type replicase does not have increased affinity Error threshold for the master genome.
    [Show full text]
  • Profiling SARS-Cov-2 Mutation Fingerprints That Range from the Viral Pangenome to Individual Infection Quasispecies Billy T
    Lau et al. Genome Medicine (2021) 13:62 https://doi.org/10.1186/s13073-021-00882-2 RESEARCH Open Access Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies Billy T. Lau1,2†, Dmitri Pavlichin1†, Anna C. Hooker1†, Alison Almeda1, Giwon Shin1, Jiamin Chen1, Malaya K. Sahoo3, Chun Hong Huang3, Benjamin A. Pinsky3,4, Ho Joon Lee1* and Hanlee P. Ji1,2* Abstract Background: The genome of SARS-CoV-2 is susceptible to mutations during viral replication due to the errors generated by RNA-dependent RNA polymerases. These mutations enable the SARS-CoV-2 to evolve into new strains. Viral quasispecies emerge from de novo mutations that occur in individual patients. In combination, these sets of viral mutations provide distinct genetic fingerprints that reveal the patterns of transmission and have utility in contact tracing. Methods: Leveraging thousands of sequenced SARS-CoV-2 genomes, we performed a viral pangenome analysis to identify conserved genomic sequences. We used a rapid and highly efficient computational approach that relies on k-mers, short tracts of sequence, instead of conventional sequence alignment. Using this method, we annotated viral mutation signatures that were associated with specific strains. Based on these highly conserved viral sequences, we developed a rapid and highly scalable targeted sequencing assay to identify mutations, detect quasispecies variants, and identify mutation signatures from patients. These results were compared to the pangenome genetic fingerprints. Results: We built a k-mer index for thousands of SARS-CoV-2 genomes and identified conserved genomics regions and landscape of mutations across thousands of virus genomes.
    [Show full text]
  • Multiplexed Microsatellite Recovery Using Massively Parallel Sequencing
    Molecular Ecology Resources (2011) 11, 1060–1067 doi: 10.1111/j.1755-0998.2011.03033.x Multiplexed microsatellite recovery using massively parallel sequencing T. N. JENNINGS,* B. J. KNAUS,* T. D. MULLINS,† S. M. HAIG† and R. C. CRONN* *Pacific Northwest Research Station, USDA Forest Service, 3200 SW Jefferson Way, Corvallis, OR 97331, USA, †Forest and Rangeland Ecosystem Science Center, US Geological Survey, 3200 SW Jefferson Way, Corvallis, OR 97331, USA Abstract Conservation and management of natural populations requires accurate and inexpensive genotyping methods. Traditional microsatellite, or simple sequence repeat (SSR), marker analysis remains a popular genotyping method because of the com­ paratively low cost of marker development, ease of analysis and high power of genotype discrimination. With the availabil­ ity of massively parallel sequencing (MPS), it is now possible to sequence microsatellite-enriched genomic libraries in multiplex pools. To test this approach, we prepared seven microsatellite-enriched, barcoded genomic libraries from diverse taxa (two conifer trees, five birds) and sequenced these on one lane of the Illumina Genome Analyzer using paired-end 80-bp reads. In this experiment, we screened 6.1 million sequences and identified 356 958 unique microreads that contained di- or trinucleotide microsatellites. Examination of four species shows that our conversion rate from raw sequences to polymorphic markers compares favourably to Sanger- and 454-based methods. The advantage of multiplexed MPS is that the staggering capacity of modern microread sequencing is spread across many libraries; this reduces sample preparation and sequencing costs to less than $400 (USD) per species. This price is sufficiently low that microsatellite libraries could be prepared and sequenced for all 1373 organisms listed as ‘threatened’ and ‘endangered’ in the United States for under $0.5 M (USD).
    [Show full text]
  • Review Christopher P
    4618 Electrophoresis 2008, 29, 4618–4626 Daniel G. Hert1 Review Christopher P. Fredlake1 1,2Ã Annelise E. Barron Advantages and limitations of next- 1Department of Chemical and Biological Engineering, generation sequencing technologies: Northwestern University, Evanston IL, USA A comparison of electrophoresis and 2Department of Bioengineering, Stanford University, Stanford, non-electrophoresis methods CA, USA The reference human genome provides an adequate basis for biological researchers to study the relationship between genotype and the associated phenotypes, but a large push Received July 14, 2008 is underway to sequence many more genomes to determine the role of various specifi- Revised August 29, 2008 Accepted August 29, 2008 cities among different individuals that control these relationships and to enable the use of human genome data for personalized and preventative healthcare. The current elec- trophoretic methodology for sequencing an entire mammalian genome, which includes standard molecular biology techniques for genomic sample preparation and the separation of DNA fragments using capillary array electrophoresis, remains far too expensive ($5 million) to make genome sequencing ubiquitous. The National Human Genome Research Institute has put forth goals to reduce the cost of human genome sequencing to $100 000 in the short term and $1000 in the long term to spur the innovative development of technologies that will permit the routine sequencing of human genomes for use as a diagnostic tool for disease. Since the announcement of these goals, several companies have developed and released new, non-electrophoresis- based sequencing instruments that enable massive throughput in the gathering of genomic information. In this review, we discuss the advantages and limitations of these new, massively parallel sequencers and compare them with the currently developing next generation of electrophoresis-based genetic analysis platforms, specifically microchip electrophoresis devices, in the context of three distinct types of genetic analysis.
    [Show full text]
  • Pyrosequencing- Principles and Applications
    Review Article ISSN 2250-0480 Vol 2/Issue 2/Apr-Jun 2012 PYROSEQUENCING- PRINCIPLES AND APPLICATIONS MD. FAKRUDDIN1*, ABHIJIT CHOWDHURY1, MD. NUR HOSSAIN1, KHANJADA SHAHNEWAJ BIN MANNAN2, REAZ MOHAMMAD MAZUMDAR3 1Institute of Food Science and Technology (IFST), Bangladesh Council of Scientific and Industrial Research (BCSIR), Dhaka, B angladesh. 2Center for Food & Waterborne Disease (CFWD), icddr'b, Dhaka, Bangladesh. 3BCSIR Laboratories Chittagong, Bangladesh Council of Scientific and Industrial Research (BCSIR), Bangladesh. ABSTRACT Pyrosequencing is the first alternative to the conventional Sanger method for de novo DNA sequencing. Pyrosequencing is a DNA sequencing technology based on the sequencing-by-synthesis principle. It employs a series of four enzymes to accurately detect nucleic acid sequences during the synthesis. Pyrosequencing has the potential advantages of accuracy, flexibility, parallel processing, and can be easily automated. Furthermore, the technique dispenses with the need for labeled primers, labeled nucleotides, and gel-electrophoresis. Pyrosequencing has opened up new possibilities for performing sequence-based DNA analysis. The method has been proven highly suitable for single nucleotide polymorphism analysis and sequencing of short stretches of DNA. Pyrosequencing has been successful for both confirmatory sequencing and de novo sequencing. By increasing the read length to higher scores and by shortening the sequence reaction time per base calling, pyrosequencing may take over many broad areas of DNA sequencing applications as the trend is directed to analysis of fewer amounts of specimens and large-scale settings, with higher throughput and lower cost. This article considers key features regarding different aspects of pyrosequencing technology, including the general principles, enzyme properties, sequencing modes, instrumentation, limitations, potential and future applications.
    [Show full text]