Data Formats Document Table of Contents

Total Page:16

File Type:pdf, Size:1020Kb

Data Formats Document Table of Contents Data File Formats File format v1.5 Software v1.10 Copyright © 2010 Complete Genomics Incorporated. All rights reserved. cPAL and DNB are trademarks of Complete Genomics, Inc. in the US and certain other countries. All other trademarks are the property of their respective owners. Disclaimer of Warranties. COMPLETE GENOMICS, INC. PROVIDES THESE DATA IN GOOD FAITH TO THE RECIPIENT “AS IS.” COMPLETE GENOMICS, INC. MAKES NO REPRESENTATION OR WARRANTY, EXPRESS OR IMPLIED, INCLUDING WITHOUT LIMITATION ANY IMPLIED WARRANTY OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE OR USE, OR ANY OTHER STATUTORY WARRANTY. COMPLETE GENOMICS, INC. ASSUMES NO LEGAL LIABILITY OR RESPONSIBILITY FOR ANY PURPOSE FOR WHICH THE DATA ARE USED. Any permitted redistribution of the data should carry the Disclaimer of Warranties provided above. Data file formats are expected to evolve over time. Backward compatibility of any new file format is not guaranteed. Complete Genomics data is for Research Use Only and not for use in the treatment or diagnosis of any human subject. Information, descriptions and specifications in this publication are subject to change without notice. Data Formats Document Table of Contents Table of Contents Preface ........................................................................................................................................................................................... 6 Conventions .................................................................................................................................................................................................. 6 Analysis Tools .............................................................................................................................................................................................. 6 References ..................................................................................................................................................................................................... 6 Introduction ................................................................................................................................................................................ 8 Sequencing Approach ............................................................................................................................................................................... 8 Mapping Reads and Calling Variations ............................................................................................................................................. 8 Read Data Format....................................................................................................................................................................................... 8 Data Delivery ................................................................................................................................................................................................ 9 Data File Formats and Conventions ................................................................................................................................. 10 Data File Structure ................................................................................................................................................................................... 10 Header Format........................................................................................................................................................................................... 10 Sequence Coordinate System .............................................................................................................................................................. 14 Data File Content and Organization ................................................................................................................................ 15 ASM Results ................................................................................................................................................................................................ 15 Small Variations and Annotations ............................................................................................................................................... 16 Variations File .................................................................................................................................................................................. 17 Annotated Variants within Genes ........................................................................................................................................... 21 Annotated Variants within Non-coding RNAs ................................................................................................................... 27 Count of Variations by Gene ...................................................................................................................................................... 29 Variations at Known dbSNP Loci ............................................................................................................................................. 31 Variations and Annotations Summary .................................................................................................................................. 34 Copy Number Variation .................................................................................................................................................................... 37 Copy Number Segmentation ...................................................................................................................................................... 38 Detailed Ploidy and Coverage Information ......................................................................................................................... 41 Genomic Copy Number Analysis of Tumor Samples ........................................................................................................... 44 Tumor CNV Segments ................................................................................................................................................................... 44 Detailed Tumor Coverage Level Information .................................................................................................................... 46 Depth of Coverage Report File .................................................................................................................................................. 49 Structural Variation ............................................................................................................................................................................ 51 Detected Junctions and Associated Annotations .............................................................................................................. 53 High-confidence Junctions and Associated Annotations .............................................................................................. 57 Alignments of DNBs in Junction Cluster ............................................................................................................................... 57 Evidence Junctions and Annotations ..................................................................................................................................... 60 Assemblies Underlying Called Variants ..................................................................................................................................... 63 Alignment CIGAR Format ............................................................................................................................................................ 63 Results from Assembled Intervals .......................................................................................................................................... 65 Individual Reads Aligned to Assembled Sequences ........................................................................................................ 67 Correlation of Evidence between Assemblies ................................................................................................................... 69 Coverage and Reference Scores .................................................................................................................................................... 71 Quality and Characteristics of Sequenced Genome .............................................................................................................. 73 Coverage Distribution Report File .......................................................................................................................................... 73 Coverage-by-GC-Content Report File .................................................................................................................................... 75 Indel Net Length Report File ..................................................................................................................................................... 77 Indel Net Length in Coding Region Report File ................................................................................................................. 78 © Complete Genomics, Inc. ii Data Formats Document List of Tables Substitution Net Length File Report File.............................................................................................................................
Recommended publications
  • Supplemental Note Hominoid Fission of Chromosome 14/15 and Role Of
    Supplemental Note Supplemental Note Hominoid fission of chromosome 14/15 and role of segmental duplications Giuliana Giannuzzi, Michele Pazienza, John Huddleston, Francesca Antonacci, Maika Malig, Laura Vives, Evan E. Eichler and Mario Ventura 1. Analysis of the macaque contig spanning the hominoid 14/15 fission site We grouped contig clones based on their FISH pattern on human and macaque chromosomes (Groups F1, F2, F3, G1, G2, G3, and G4). Group F2 clones (Hsa15b- and Hsa15c-positive) showed a single signal on macaque 7q and three signal clusters on human chromosome 15: at the orthologous 15q26, as expected, as well as at 15q11–14 and 15q24–25, which correspond to the actual and ancestral pericentromeric regions, respectively (Ventura et al. 2003). Indeed, this locus contains an LCR15 copy (Pujana et al. 2001) in both the macaque and human genomes. Most BAC clones were one-end anchored in the human genome (chr15:100,028–100,071 kb). In macaque this region experienced a 64 kb duplicative insertion from chromosome 17 (orthologous to human chromosome 13) (Figure 2), with the human configuration (absence of insertion) likely being the ancestral state because it is identical in orangutan and marmoset. Four Hsa15b- positive clones mapped on macaque and human chromosome 19, but neither human nor macaque assemblies report the STS Hsa15b duplicated at this locus. The presence of assembly gaps in macaque may explain why the STS is not annotated in this region. We aligned the sequence of macaque CH250-70H12 (AC187495.2) versus its human orthologous sequence (hg18 chr15:100,039k-100,170k) and found a 12 kb human expansion through tandem duplication of a ~100 bp unit corresponding to a portion of exon 20 of DNM1 (Figure S3).
    [Show full text]
  • Genetic Variation Across the Human Olfactory Receptor Repertoire Alters Odor Perception
    bioRxiv preprint doi: https://doi.org/10.1101/212431; this version posted November 1, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Genetic variation across the human olfactory receptor repertoire alters odor perception Casey Trimmer1,*, Andreas Keller2, Nicolle R. Murphy1, Lindsey L. Snyder1, Jason R. Willer3, Maira Nagai4,5, Nicholas Katsanis3, Leslie B. Vosshall2,6,7, Hiroaki Matsunami4,8, and Joel D. Mainland1,9 1Monell Chemical Senses Center, Philadelphia, Pennsylvania, USA 2Laboratory of Neurogenetics and Behavior, The Rockefeller University, New York, New York, USA 3Center for Human Disease Modeling, Duke University Medical Center, Durham, North Carolina, USA 4Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina, USA 5Department of Biochemistry, University of Sao Paulo, Sao Paulo, Brazil 6Howard Hughes Medical Institute, New York, New York, USA 7Kavli Neural Systems Institute, New York, New York, USA 8Department of Neurobiology and Duke Institute for Brain Sciences, Duke University Medical Center, Durham, North Carolina, USA 9Department of Neuroscience, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, USA *[email protected] ABSTRACT The human olfactory receptor repertoire is characterized by an abundance of genetic variation that affects receptor response, but the perceptual effects of this variation are unclear. To address this issue, we sequenced the OR repertoire in 332 individuals and examined the relationship between genetic variation and 276 olfactory phenotypes, including the perceived intensity and pleasantness of 68 odorants at two concentrations, detection thresholds of three odorants, and general olfactory acuity.
    [Show full text]
  • Assembly and Annotation of an Ashkenazi Human Reference Genome
    bioRxiv preprint doi: https://doi.org/10.1101/2020.03.18.997395; this version posted March 18, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Assembly and Annotation of an Ashkenazi Human Reference Genome Alaina Shumate1,2,† Aleksey V. Zimin1,2,† Rachel M. Sherman1,3 Daniela Puiu1,3 Justin M. Wagner4 Nathan D. Olson4 Mihaela Pertea1,2 Marc L. Salit5 Justin M. Zook4 Steven L. Salzberg1,2,3,6* 1Center for Computational Biology, Johns Hopkins University, Baltimore, MD 2Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 3Department of Computer Science, Johns Hopkins University, Baltimore, MD 4National Institute of Standards and Technology, Gaithersburg, MD 5Joint Initiative for Metrology in Biology, Stanford University, Stanford, CA 6Department of Biostatistics, Johns Hopkins University, Baltimore, MD †These authors contributed equally to this work. *Corresponding author. Email: [email protected] Abstract Here we describe the assembly and annotation of the genome of an Ashkenazi individual and the creation of a new, population-specific human reference genome. This genome is more contiguous and more complete than GRCh38, the latest version of the human reference genome, and is annotated with highly similar gene content. The Ashkenazi reference genome, Ash1, contains 2,973,118,650 nucleotides as compared to 2,937,639,212 in GRCh38. Annotation identified 20,157 protein-coding genes, of which 19,563 are >99% identical to their counterparts on GRCh38. Most of the remaining genes have small differences.
    [Show full text]
  • Chr Start End Size Gene Exon 1 69482 69600 118 OR4F5 1 1 877520
    #chr start end size gene exon 1 69482 69600 118 OR4F5 1 1 877520 877636 116 SAMD11 8 1 877807 877873 66 SAMD11 9 1 877934 878066 132 SAMD11 10 1 878067 878068 1 SAMD11 10 1 878070 878080 10 SAMD11 10 1 896670 896724 54 KLHL17 2 1 896726 896728 2 KLHL17 2 1 935267 935268 1 HES4 1 1 935271 935357 86 HES4 1 1 955548 955694 146 AGRN 1 1 955720 955758 38 AGRN 1 1 984242 984422 180 AGRN 24 1 984611 984629 18 AGRN 25 1 989928 989936 8 AGRN 35 1 1132946 1133034 88 TTLL10 13 1 1149358 1149397 39 TNFRSF4 1 1 1167654 1167713 59 B3GALT6 1 1 1167733 1167853 120 B3GALT6 1 1 1167878 1167908 30 B3GALT6 1 1 1181889 1182075 186 FAM132A 1 1 1200204 1200215 11 UBE2J2 2 1 1219462 1219466 4 SCNN1D 5 1 1223355 1223358 3 SCNN1D 12 1 1232009 1232018 9 ACAP3 15 1 1244308 1244320 12 PUSL1 2 1 1244321 1244327 6 PUSL1 2 1 1247601 1247603 2 CPSF3L 15 1 1290483 1290485 2 MXRA8 5 1 1290487 1290488 1 MXRA8 5 1 1290492 1290516 24 MXRA8 5 1 1290619 1290631 12 MXRA8 4 1 1291003 1291136 133 MXRA8 3 1 1292056 1292089 33 MXRA8 2 1 1334399 1334405 6 CCNL2 1 1 1355427 1355489 62 LOC441869 2 1 1355659 1355917 258 LOC441869 2 1 1361505 1361521 16 TMEM88B 1 1 1361523 1361525 2 TMEM88B 1 1 1361527 1361528 1 TMEM88B 1 1 1361635 1361636 1 TMEM88B 1 1 1361642 1361726 84 TMEM88B 1 1 1361757 1361761 4 TMEM88B 1 1 1362931 1362956 25 TMEM88B 2 1 1374780 1374838 58 VWA1 3 1 1374841 1374842 1 VWA1 3 1 1374934 1375100 166 VWA1 3 1 1389738 1389747 9 ATAD3C 4 1 1389751 1389816 65 ATAD3C 4 1 1389830 1389849 19 ATAD3C 4 1 1390835 1390837 2 ATAD3C 5 1 1407260 1407331 71 ATAD3B 1 1 1407340 1407474
    [Show full text]
  • A Bedr Way of Genomic Interval Processing Syed Haider1†, Daryl Waggott1†, Emilie Lalonde1,2, Clement Fung1, Fei-Fei Liu2,3 and Paul C
    Haider et al. Source Code for Biology and Medicine (2016) 11:14 DOI 10.1186/s13029-016-0059-5 SOFTWARE Open Access A bedr way of genomic interval processing Syed Haider1†, Daryl Waggott1†, Emilie Lalonde1,2, Clement Fung1, Fei-Fei Liu2,3 and Paul C. Boutros1,2* Abstract Background: Next-generation sequencing is making it critical to robustly and rapidly handle genomic ranges within standard pipelines. Standard use-cases include annotating sequence ranges with gene or other genomic annotation, merging multiple experiments together and subsequently quantifying and visualizing the overlap. The most widely-used tools for these tasks work at the command-line (e.g. BEDTools) and the small number of available R packages are either slow or have distinct semantics and features from command-line interfaces. Results: To provide a robust R-based interface to standard command-line tools for genomic coordinate manipulation, we created bedr. This open-source R package can use either BEDTools or BEDOPS as a back-end and performs data- manipulation extremely quickly, creating R data structures that can be readily interfaced with existing computational pipelines. It includes data-visualization capabilities and a number of data-access functions that interface with standard databases like UCSC and COSMIC. Conclusions: bedr package provides an open source solution to enable genomic interval data manipulation and restructuring in R programming language which is commonly used in bioinformatics, and therefore would be useful to bioinformaticians and genomic researchers. Keywords: Genomic intervals, BED format, Sequence algebra, Data integration Background R is the de facto standard for statistical analysis and With the advent of high-throughput sequencing tech- visualization in computational biology [3] for both ex- nologies, data scientists are facing immense challenges ploratory prototyping and rigorous production pipelines.
    [Show full text]
  • Gnomad Lof Supplement
    1 gnomAD supplement gnomAD supplement 1 Data processing 4 Alignment and read processing 4 Variant Calling 4 Coverage information 5 Data processing 5 Sample QC 7 Hard filters 7 Supplementary Table 1 | Sample counts before and after hard and release filters 8 Supplementary Table 2 | Counts by data type and hard filter 9 Platform imputation for exomes 9 Supplementary Table 3 | Exome platform assignments 10 Supplementary Table 4 | Confusion matrix for exome samples with Known platform labels 11 Relatedness filters 11 Supplementary Table 5 | Pair counts by degree of relatedness 12 Supplementary Table 6 | Sample counts by relatedness status 13 Population and subpopulation inference 13 Supplementary Figure 1 | Continental ancestry principal components. 14 Supplementary Table 7 | Population and subpopulation counts 16 Population- and platform-specific filters 16 Supplementary Table 8 | Summary of outliers per population and platform grouping 17 Finalizing samples in the gnomAD v2.1 release 18 Supplementary Table 9 | Sample counts by filtering stage 18 Supplementary Table 10 | Sample counts for genomes and exomes in gnomAD subsets 19 Variant QC 20 Hard filters 20 Random Forest model 20 Features 21 Supplementary Table 11 | Features used in final random forest model 21 Training 22 Supplementary Table 12 | Random forest training examples 22 Evaluation and threshold selection 22 Final variant counts 24 Supplementary Table 13 | Variant counts by filtering status 25 Comparison of whole-exome and whole-genome coverage in coding regions 25 Variant annotation 30 Frequency and context annotation 30 2 Functional annotation 31 Supplementary Table 14 | Variants observed by category in 125,748 exomes 32 Supplementary Figure 5 | Percent observed by methylation.
    [Show full text]
  • Us 2018 / 0305689 A1
    US 20180305689A1 ( 19 ) United States (12 ) Patent Application Publication ( 10) Pub . No. : US 2018 /0305689 A1 Sætrom et al. ( 43 ) Pub . Date: Oct. 25 , 2018 ( 54 ) SARNA COMPOSITIONS AND METHODS OF plication No . 62 /150 , 895 , filed on Apr. 22 , 2015 , USE provisional application No . 62/ 150 ,904 , filed on Apr. 22 , 2015 , provisional application No. 62 / 150 , 908 , (71 ) Applicant: MINA THERAPEUTICS LIMITED , filed on Apr. 22 , 2015 , provisional application No. LONDON (GB ) 62 / 150 , 900 , filed on Apr. 22 , 2015 . (72 ) Inventors : Pål Sætrom , Trondheim (NO ) ; Endre Publication Classification Bakken Stovner , Trondheim (NO ) (51 ) Int . CI. C12N 15 / 113 (2006 .01 ) (21 ) Appl. No. : 15 /568 , 046 (52 ) U . S . CI. (22 ) PCT Filed : Apr. 21 , 2016 CPC .. .. .. C12N 15 / 113 ( 2013 .01 ) ; C12N 2310 / 34 ( 2013. 01 ) ; C12N 2310 /14 (2013 . 01 ) ; C12N ( 86 ) PCT No .: PCT/ GB2016 /051116 2310 / 11 (2013 .01 ) $ 371 ( c ) ( 1 ) , ( 2 ) Date : Oct . 20 , 2017 (57 ) ABSTRACT The invention relates to oligonucleotides , e . g . , saRNAS Related U . S . Application Data useful in upregulating the expression of a target gene and (60 ) Provisional application No . 62 / 150 ,892 , filed on Apr. therapeutic compositions comprising such oligonucleotides . 22 , 2015 , provisional application No . 62 / 150 ,893 , Methods of using the oligonucleotides and the therapeutic filed on Apr. 22 , 2015 , provisional application No . compositions are also provided . 62 / 150 ,897 , filed on Apr. 22 , 2015 , provisional ap Specification includes a Sequence Listing . SARNA sense strand (Fessenger 3 ' SARNA antisense strand (Guide ) Mathew, Si Target antisense RNA transcript, e . g . NAT Target Coding strand Gene Transcription start site ( T55 ) TY{ { ? ? Targeted Target transcript , e .
    [Show full text]
  • Data Formats Document Table of Contents
    Data File Formats File format v1.4 Software v1.9.0 Copyright © 2010 Complete Genomics Incorporated. All rights reserved. cPAL and DNB are trademarks of Complete Genomics, Inc. in the US and certain other countries. All other trademarks are the property of their respective owners. Disclaimer of Warranties. COMPLETE GENOMICS, INC. PROVIDES THESE DATA IN GOOD FAITH TO THE RECIPIENT “AS IS.” COMPLETE GENOMICS, INC. MAKES NO REPRESENTATION OR WARRANTY, EXPRESS OR IMPLIED, INCLUDING WITHOUT LIMITATION ANY IMPLIED WARRANTY OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE OR USE, OR ANY OTHER STATUTORY WARRANTY. COMPLETE GENOMICS, INC. ASSUMES NO LEGAL LIABILITY OR RESPONSIBILITY FOR ANY PURPOSE FOR WHICH THE DATA ARE USED. Any permitted redistribution of the data should carry the Disclaimer of Warranties provided above. Data file formats are expected to evolve over time. Backward compatibility of any new file format is not guaranteed. Complete Genomics data is for Research Use Only and not for use in the treatment or diagnosis of any human subject. Information, descriptions and specifications in this publication are subject to change without notice. Data Formats Document Table of Contents Table of Contents Preface ........................................................................................................................................................................................... 4 Conventions .................................................................................................................................................................................................
    [Show full text]
  • High-Resolution DNA Analysis of Human Embryonic Stem Cell Lines Reveals Culture-Induced Copy Number Changes and Loss of Heterozygosity
    RESOURCE High-resolution DNA analysis of human embryonic stem cell lines reveals culture-induced copy number changes and loss of heterozygosity Elisa Närvä1, Reija Autio1,2, Nelly Rahkonen1, Lingjia Kong2, Neil Harrison3, Danny Kitsberg4, Lodovica Borghese5, Joseph Itskovitz-Eldor6, Omid Rasool1, Petr Dvorak7, Outi Hovatta8, Timo Otonkoski9,10, Timo Tuuri9, Wei Cui11, Oliver Brüstle5, Duncan Baker12, Edna Maltby12, Harry D Moore13, Nissim Benvenisty14, Peter W Andrews3, Olli Yli-Harja2,15 & Riitta Lahesmaa1 Prolonged culture of human embryonic stem cells (hESCs) can lead to adaptation and the acquisition of chromosomal abnormalities, underscoring the need for rigorous genetic analysis of these cells. Here we report the highest-resolution study of hESCs to date using an Affymetrix SNP 6.0 array containing 906,600 probes for single nucleotide polymorphisms (SNPs) and 946,000 probes for copy number variations (CNVs). Analysis of 17 different hESC lines maintained in different laboratories identified 843 CNVs of 50 kb–3 Mb in size. We identified, on average, 24% of the loss of heterozygosity (LOH) sites and 66% of the CNVs changed in culture between early and late passages of the same lines. Thirty percent of the genes detected within CNV sites had altered expression compared to samples with normal copy number states, of which >44% were functionally linked to cancer. Furthermore, LOH of the q arm of chromosome 16, which has not been observed previously in hESCs, was detected. Pluripotent hESCs are studied for potential applications in regenera- human genome variability. Specific recurrent CNVs are common in tive medicine because of their unique capacity to self-renew and to tumors10,11; particular tumor types have characteristic copy number differentiate into any cell type.
    [Show full text]
  • Explorations in Olfactory Receptor Structure and Function by Jianghai
    Explorations in Olfactory Receptor Structure and Function by Jianghai Ho Department of Neurobiology Duke University Date:_______________________ Approved: ___________________________ Hiroaki Matsunami, Supervisor ___________________________ Jorg Grandl, Chair ___________________________ Marc Caron ___________________________ Sid Simon ___________________________ [Committee Member Name] Dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Department of Neurobiology in the Graduate School of Duke University 2014 ABSTRACT Explorations in Olfactory Receptor Structure and Function by Jianghai Ho Department of Neurobiology Duke University Date:_______________________ Approved: ___________________________ Hiroaki Matsunami, Supervisor ___________________________ Jorg Grandl, Chair ___________________________ Marc Caron ___________________________ Sid Simon ___________________________ [Committee Member Name] An abstract of a dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Department of Neurobiology in the Graduate School of Duke University 2014 Copyright by Jianghai Ho 2014 Abstract Olfaction is one of the most primitive of our senses, and the olfactory receptors that mediate this very important chemical sense comprise the largest family of genes in the mammalian genome. It is therefore surprising that we understand so little of how olfactory receptors work. In particular we have a poor idea of what chemicals are detected by most of the olfactory receptors in the genome, and for those receptors which we have paired with ligands, we know relatively little about how the structure of these ligands can either activate or inhibit the activation of these receptors. Furthermore the large repertoire of olfactory receptors, which belong to the G protein coupled receptor (GPCR) superfamily, can serve as a model to contribute to our broader understanding of GPCR-ligand binding, especially since GPCRs are important pharmaceutical targets.
    [Show full text]
  • {'Ess': 0, 'Noness': 3} True 70007 AAGATCTAAAACTTTACACTAG []
    uag2uaa_1_13_window gene gene_ogee gene_essentiality site_index cbe_targets abe_targets 0 1 OR4F5 {'ess': 0, 'noness': 3} True 70007 AAGATCTAAAACTTTACACTAG [] 1 1 PERM1 unavailable False 976171 TGGCTCCTAGGAGCTGGGGCTG [] 2 1 RNF223 unavailable False 1071816 TGGCCCTAATTATCAGTCAGAG [] 3 1 TTLL10 {'ess': 0, 'noness': 7} True 1185524 CCTTAGCTAGGGAGAGTCCGGG [] 4 1 TTLL10 {'ess': 0, 'noness': 7} True 1197846 TGGCTGCCCCTAGGGCCTCGCG[] 5 1 TNFRSF18 {'ess': 0, 'noness': 7} True 1203590 TCCTAAGACCCCACCCCATCAG [] 6 1 ACAP3 {'ess': 0, 'noness': 7} True 1293563 CCGGCCCTAGCTCTCTTCCAGG [] 7 1 CPTP unavailable False 1327762 CCCGCCCCTAGGGCAGGTCCAG[] 8 1 MXRA8 {'ess': 0, 'noness': 7} True 1353603 CCTATTTGCAGTTCTCCTTCCG [] 9 1 MRPL20 {'ess': 6, 'noness': 2} False 1402568 TGGCCTAGGGTAACCCTTTAAG [] 10 1 ATAD3B {'ess': 0, 'noness': 7} True 1495816 CCAGTGCCTACAACAGGGGGTG [] 11 1 MMP23B {'ess': 0, 'noness': 7} True 1634328 TCACTACACTTTCCCTTTCTTG [] 12 1 NADK {'ess': 4, 'noness': 5} False 1752903 -1 -1 13 1 CALML6 {'ess': 1, 'noness': 7} True 1917192 CTGCACCTACTGGATCAGCTTG [] 14 1 PRKCZ {'ess': 1, 'noness': 7} True 2157294 GGGACGCCCTAGTAGCTGGCTG [] 15 1 SKI {'ess': 2, 'noness': 6} False 2306764 GGAATCTACGGCTCCAGCTCCG [] 16 1 MORN1 {'ess': 0, 'noness': 7} True 2355447 CGTCTGATCCTAGAAGAGTGAG [] 17 1 RER1 {'ess': 1, 'noness': 5} False 2403123 TCCCGCTTCTAGCTGGCGAAGG [] 18 1 PRXL2B unavailable False 2589478 AGCACACCTAGGGCAGCAGACG [] 19 1 MMEL1 {'ess': 1, 'noness': 7} True 2590989 TTGGCTACCACACGCGGCATCG [] 20 1 ARHGEF16 {'ess': 0, 'noness': 7} True 3480586 GCTACACGTCCGTCTCCACCCG
    [Show full text]
  • Standard Sequencing Service Data File Formats File Format V2.4 Software V2.4 December 2012
    Standard Sequencing Service Data File Formats File format v2.4 Software v2.4 December 2012 CGA Tools, cPAL, and DNB are trademarks of Complete Genomics, Inc. in the US and certain other countries. All other trademarks are the property of their respective owners. Disclaimer of Warranties. COMPLETE GENOMICS, INC. PROVIDES THESE DATA IN GOOD FAITH TO THE RECIPIENT “AS IS.” COMPLETE GENOMICS, INC. MAKES NO REPRESENTATION OR WARRANTY, EXPRESS OR IMPLIED, INCLUDING WITHOUT LIMITATION ANY IMPLIED WARRANTY OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE OR USE, OR ANY OTHER STATUTORY WARRANTY. COMPLETE GENOMICS, INC. ASSUMES NO LEGAL LIABILITY OR RESPONSIBILITY FOR ANY PURPOSE FOR WHICH THE DATA ARE USED. Any permitted redistribution of the data should carry the Disclaimer of Warranties provided above. Data file formats are expected to evolve over time. Backward compatibility of any new file format is not guaranteed. Complete Genomics data is for Research Use Only and not for use in the treatment or diagnosis of any human subject. Information, descriptions and specifications in this publication are subject to change without notice. Copyright © 2011-2012 Complete Genomics Incorporated. All rights reserved. RM_DFFSS_2.4-01 Table of Contents Table of Contents Preface ...........................................................................................................................................................................................1 Conventions .................................................................................................................................................................................................
    [Show full text]