Project Deliverable

Total Page:16

File Type:pdf, Size:1020Kb

Project Deliverable Ref. Ares(2013)2985750 - 03/09/2013 Project number: 317871 Project acronym: BIOBANKCLOUD Project title: Scalable, Secure Storage and Analysis of Biobank Data Project website URL: http://www.biobankcloud.com/ Project Coordinator Name and Organisation: Jim Dowling, KTH E-mail: [email protected] WORK PACKAGE 8 : Dissemination Work Package Leader Name and Organisation: JAN-ERIC LITTON, Karolinska Institute (KI) E-mail: [email protected] PROJECT DELIVERABLE D8.1: Dissemination plan Deliverable Due date (and month since project start): 2013-08-31, m9 Deliverable Version: v0.2 BiobankCloud D8.1 page 1/22 317871 Document history Version Date Changes By Reviewed Lora Dimitrova Karin Zimmermann 0.1 2013-08-14 First draft Michael Hummel Jörgen Brandt Ulf Leser Lora Dimitrova 0.2 2013-08-26 Final Roxana Merino Martinez Michael Hummel Jan-Eric Litton BiobankCloud D8.1 page 2/22 317871 Abstract The goal of this deliverable is to elaborate a plan for the dissemination of the BiobankCloud. For this purpose two different approaches will be applied: active as well as passive dissemination. Different academic and industrial partners will be actively contacted, while internet tools, various meetings/workshops and publications in major scientific journals will be used for passive dissemination of the BiobankCloud. In D8.1 we describe these possibilities for dissemination of the knowledge about the BiobankCloud in detail. BiobankCloud D8.1 page 3/22 317871 Table of Contents Dissemination............................................................................................................. 5 1. Active Dissemination ........................................................................................... 6 1.1 Companies ............................................................................................... 6 1.1.1 Next Generation Sequencing Equipment Suppliers ............................... 6 1.1.2 Bioinformatics application developers ................................................... 9 1.2 Academic Users ..................................................................................... 10 1.2.1 Large-scale Next Generation Sequencing Users ................................. 10 1.2.2 Small-scale Next Generation Sequencing Users ................................. 13 1.2.3 Scientific Consortia ............................................................................. 13 2. Passive Dissemination ...................................................................................... 17 2.1 Internet Presence of the BiobankCloud .................................................. 17 2.2 Conferences and Meetings ..................................................................... 18 2.2.1 Biobanking Conferences and Meetings ............................................... 18 2.2.2 Next Generation Sequencing Conferences and Meetings ................... 19 2.3 Workshops ............................................................................................. 21 2.4 Publications in Major Journals ................................................................ 21 3. Conclusions ...................................................................................................... 22 4. References ........................................................................................................ 22 BiobankCloud D8.1 page 4/22 317871 Dissemination For the dissemination of the knowledge about the BiobankCloud we will use two different approaches: active and passive dissemination. Active dissemination means that scientists, institutes or companies will be contacted by us and we will present the advantages of the BiobankCloud for storage/analysis of their next generation sequencing (NGS) data to them. Passive dissemination means, that we will introduce the BiobankCloud to the scientific community through the internet, meetings/workshops as well as publications and will develop various approaches for being found and contacted by future users. In Diagram 1 a schematic overview of the different possibilities for dissemination of the BiobankCloud is shown. Diagram 1: Overview of the Different Dissemination Strategies. BiobankCloud D8.1 page 5/22 317871 1. Active Dissemination There are two groups of users, who will be actively contacted: academic and industrial users. In the following section these two groups are described in detail. 1.1 Companies Two groups of potential industrial users will be actively contacted: NGS equipment suppliers and bioinformatics application developers. 1.1.1 Next Generation Sequencing Equipment Suppliers Several companies worldwide provide NGS machines, and most of them have developed solutions for storage and processing of the generated NGS data. In this section the NGS platforms will be shortly described and the way, in which companies manage NGS data will be explained using the example of Illumina. Illumina, Inc. (San Diego, CA, USA) Illumina (https://basespace.illumina.com/home) has developed the technology called sequencing by synthesis, which is implemented in four different platforms: the HiSeq series, HiScan SQ, MiSeq and the Genome Analyzer IIx system. The runs carried out with these systems have a maximal output of 8,5–600 Gb of data and the run duration goes from 4 h–11 days. Many diverse applications can be performed with these platforms like DNA sequencing (whole genome/whole exome sequencing, targeted resequencing, de novo sequencing, amplicon sequencing, single nucleotide polymorphism (SNP) discovery, copy number variations (CNV) identification and analysis of chromosomal abnormalities), RNA sequencing (transcriptome analysis, small RNA discovery), ChIP-Seq and methylation analysis. For analysis, archiving and sharing of the generated NGS data Illumina has developed the BaseSpace cloud platform. The platform is built on Amazon’s Web Services, who are leading in cloud-based infrastructure. All sequencer from the MiSeq and HiSeq series have the BaseSpace platform integrated and when starting a run the sequencing data can be directly sent in real time to BaseSpace, thus saving time for data transfer. In the cloud the data can be securely stored and shared with collaborators. The storage of the first TB of data is free forever. Additionally BiobankCloud D8.1 page 6/22 317871 BaseSpace allows the transfer of ownerships- in this way e.g. NGS core facilities can provide sequencing data to their customers very quickly, without the need of transferring the data. For analysis of the NGS data many BaseSpace apps are available. While alignment and variant processing of the data is free, the integrated apps have different prices. 454 Life Sciences / Roche Applied Science (Penzberg, Germany) 454 Life Sciences/Roche (http://www.454.com/) has introduced the first commercially available next generation sequencer (Genome Sequencer 20) in 2005. They continued developing NGS based on the pyrosequencing of DNA captured on beads during the next years and now, two NGS systems are available on the market. The GS Junior System generates shorter reads (400 bp average length, longest reads up to 600 bp) and has a run time of 10 h, while the GS FLX+ System run takes 23 h, but generates reads up to 1.000 bp (mode read length 700 bp). For analysis of the NGS data the GS Data Analysis Software has been developed. The platforms are used for whole genome sequencing, targeted resequencing, metagenomics, epigenomic and methylation analysis as well as RNA-Seq. Applied Biosystems / Life Technologies, Inc. (Carlsbad, CA, USA) Applied Biosystems (ABI; http://de-de.invitrogen.com/site/de/de/home/brands/ Applied-Biosystems.html) has launched three 5500 Series Genetic Analysis Systems: 5500 System, 5500xl System with 1.0 µm microbeads and the 5500xl System with 0.75 µm nanobeads. With these platforms genome, exome, transcriptome and other analyses can be carried out in parallel within the same run with the highest accuracy. The volume of the generated data is 7–more than 20 Gb/day and the runs last 2 days (exome sequencing)–7 days (human genome sequencing). For analysis of the generated data ABI has developed the LifeScope™Genomic Analysis Software. Ion Torrent / Life Technologies, Inc. (Carlsbad, CA, USA) Based on the semiconductor sequencing technology Ion Torrent (http://de- de.invitrogen.com/site/de/de/home/brands/Ion-Torrent.html) has developed two BiobankCloud D8.1 page 7/22 317871 systems (ION PROTON™SEQUENCER and ION PGM™SEQUENCER) for NGS. The ION PROTON™SEQUENCER is used for exome, whole transcriptome and human genome sequencing, while the ION PGM™SEQUENCER is suitable for small-genome, gene set and ChIP-Seq sequencing as well as for gene expression analysis. Data analysis, annotation and archiving are performed with the Ion Reporter™ Software. Pacific Biosciences of California, Inc. (Menlo Park, CA, USA) Pacific Biosciences (http://www.pacificbiosciences.com/) has developed a single molecule DNA sequencing system, which allows sequencing of DNA molecules in real-time, without prior amplification and in 30-120 minutes. This technology produces the longest read lengths (average length of 3.000-5.000 bp, longest reads > 20.000 bp) compared with other sequencing platforms and the data generated have high accuracy and sensitivity. Another advantage of the PacBio RS II system is the possibility to analyze DNA base modifications like methylated adenine, which can’t be performed with the other platforms. The NGS machine is supplied with different software tools for processing of the generated data (SMRT Analysis software).
Recommended publications
  • Next-Generation Sequencing-Based Method Shows Increased Mutation Detection Sensitivity in an Indian Retinoblastoma Cohort
    Molecular Vision 2016; 22:1036-1047 <http://www.molvis.org/molvis/v22/1036> © 2016 Molecular Vision Received 23 May 2016 | Accepted 14 August 2016 | Published 16 August 2016 Next-generation sequencing-based method shows increased mutation detection sensitivity in an Indian retinoblastoma cohort Jaya Singh,1 Avshesh Mishra,1 Arunachalam Jayamuruga Pandian,2 Ashwin C. Mallipatna,3 Vikas Khetan,4 S. Sripriya,2 Suman Kapoor,1 Smita Agarwal,1 Satish Sankaran,1 Shanmukh Katragadda,1 Vamsi Veeramachaneni,1 Ramesh Hariharan,1,5 Kalyanasundaram Subramanian,1 Ashraf U. Mannan1 (The first two authors contributed equally to this work.) 1Strand Center for Genomics and Personalized Medicine, Strand Life Sciences, Bangalore, India; 2Sankara Nethralaya ONGC Department of Genetics and Molecular Biology, Vision Research Foundation, Chennai, India; 3Department of Pediatric Ophthalmology and Strabismology, Narayana Nethralaya, Bangalore, India; 4Department of Vitreo Retina, Sankara Nethralaya, Chennai, India; 5Department of Computer Science and Automation, Indian Institute of Science, Bangalore, India Purpose: Retinoblastoma (Rb) is the most common primary intraocular cancer of childhood and one of the major causes of blindness in children. India has the highest number of patients with Rb in the world. Mutations in the RB1 gene are the primary cause of Rb, and heterogeneous mutations are distributed throughout the entire length of the gene. Therefore, genetic testing requires screening of the entire gene, which by conventional sequencing is time consuming and expensive. Methods: In this study, we screened the RB1 gene in the DNA isolated from blood or saliva samples of 50 unrelated patients with Rb using the TruSight Cancer panel. Next-generation sequencing (NGS) was done on the Illumina MiSeq platform.
    [Show full text]
  • Sureselectxt RNA Direct Protocol Provides Simultaneous Transcriptome Enrichment and Ribosomal Depletion of FFPE RNA
    SureSelectXT RNA Direct Protocol Provides Simultaneous Transcriptome Enrichment and Ribosomal Depletion of FFPE RNA Application Note Authors Abstract Jennifer Carter Jones, Alex Siebold, The ability to extract RNA and prepare RNA sequencing (RNA-Seq) libraries and Anne Bergstrom Lucas from Formalin Fixed Paraffin Embedded (FFPE) tissues allows researchers to identify and validate new biomarkers of disease onset, progression, and therapy resistance. However, the typically poor quality of RNA derived from FFPE samples has previously limited use of this tissue source as a resource for transcriptome profiling and research by next generation sequencing (NGS)1. Recently, when performing targeted enrichment of cDNA libraries, we have found that with protocol modifications of the Agilent Strand-Specific RNA Library Prep kit we can generate RNA-Seq data with minimal ribosomal contamination and good sequencing coverage. In this study, we used a set of matched fresh frozen (FF) and FFPE-derived RNA from tumor/normal samples to demonstrate that high-quality data can be derived from FFPE samples using a protocol that does not require upfront ribosomal depletion or poly(A) selection. When using the SureSelectXT RNA Direct protocol and the reagents from the SureSelectXT RNA Direct Kit, we found that transcripts were up regulated and down regulated to similar degrees with similar confidence levels in both the FF and FFPE samples, demonstrating the utility for meaningful gene expression studies with RNA stored in FFPE blocks. Introduction alignment statistics are similar generate much lower RIN values and In much of the world, preserving between the FFPE and FF samples, we have found that these low RIN tissue by formalin fixation followed and a comparison of the tumor values are not the best indicator by paraffin embedding of the versus normal gene expression of success with this new protocol.
    [Show full text]
  • Genespring in Genomics Research a Standard in Biological Interpretation
    Agilent Genomics Software Future Directions Michael Rosenberg, PhD Director, Genomics Software Agilent: A Focused Measurement Company Serving Diverse End Markets Electronic Measurement Bio-Analytical Measurement 2008 Revenue: $3.6 Billion 2008 Revenue: $2.2 Billion General Purpose Communications Chemical Analysis Life Sciences 37% 25% 20% 18% Other General Computers & Forensics Industry Semiconductors Wireless Mfg. Other Comms Environ- Academic & Government mental Food 4% 14% Aerospace Broadband & Defense Wireless R&D R&D/Mfg Petrochemical Pharma & Biotech Page 2 June 4, 2010 Agilent Genomics Portfolio DNA RNA Target aCGH/ CH ChIP Splice miRNA CNV 3 Enrich Variants GE Sys. Transcription Targeted sample prep mRNA Copy number Methylation mRNA Micro RNAs Factors isoforms Make high sensitivity Identify the presence Study chromosomal Map methylation Measure protein/DNA Enrich for genomic Identify the splice measurements of of microRNAs and aberrations and patterns across and interactions and to regions or transcripts forms of specific gene transcription and measure the effect of measure gene copy study effects on better characterize for high throughput genes and study their correlate results with knockouts and number transcription transcription, DNA sequencing effect on protein other genomic data correlate this activity replication and repair. translation. with gene transcription. Agilent Bioinformatics Software A comprehensive suite of applications Transcriptome Genome GeneSpring AGW miRNA, qPCR, Exon, ChIP, Methyl, CGH Copy#, LOH, GWAS RNA
    [Show full text]
  • Integrated Analysis of RNA-Seq and Chip-Seq Data Using Strand NGS To
    Application Note Integrated analysis of RNA-Seq and ChIP-Seq data using Strand NGS to understand the Regulation of Cardiogenesis Mohammed Toufiq, Sumeet Deshmukh, Srikanthi Ramachandrula, Sunil C. Cherukuri Strand Life Sciences Pvt Ltd Overview built an architecture of pathway and NLP 1% of all human births in the west are affected networks. Pathway Analysis showed NKX2-5 with congenital heart disease and this to play a pivotal role in cardiogenesis and any constitutes a major burden on public health change in its levels leads to multiplying effects organizations. Numerous next-generation on heart development pathway. NLP was used sequencing studies utilize multiple to build a network with plausible interaction of technologies to understand and answer genes mined from interaction DB and a complex biological questions. Here, using proprietary database constructed by Strand Strand NGS bioinformatics software version using all the PubMed abstracts. Further, NLP 3.0 and above, we provide an illustrative showed that most proteins that interact with example on how to integrate data from NKX2-5 are either enhancers or growth factors different sources to identify differential which might explain the mechanism of expression profiles and infer their transcription heightened impact on heart development. We factor binding sites (using RNA-Seq and ChIP- intend to showcase Strand NGS as a go to tool Seq data) in a combined analysis to define the for analyzing and interpreting multi-omics data regulation of Cardiogenesis. Transcription with intuitive workflows and user friendly factors like NKX2-5 and MEIS1 have been features. shown to play a critical role in vertebrate heart development.
    [Show full text]
  • Giri Narasimhan
    11/29/2016 Giri Narasimhan CURRICULUM VITAE GIRI NARASIMHAN ADDRESS: Professor, School of Computing and Information Science, PHONE: (305) 348-3748 ECS 254A, Florida International University, FAX: (305) 348-6142 Miami, FL 33199. WEBPAGES: http://www.cis.fiu.edu/~giri; http://biorg.cis.fiu.edu/ (Research Group) E-MAIL: [email protected] q EDUCATION DEGREE DISCIPLINE INSTITUTION YEAR B. Tech. Electrical Engineering Indian Institute of Technology, Bombay, India 1982 Ph. D. Computer Science University of Wisconsin - Madison 1989 q EXPERIENCE RANK/POSITION DEPARTMENT/DIVISION INSTITUTION PERIOD Professor School of Computer Science Florida International University From Fall 2004 Associate Dean, College of Engineering & Research and Florida International University 2009-2015 Computing Graduate Studies Visiting Scholar Next Generation Sequencing Strand Life Sciences Jan-Apr 2009 Visiting Professor Microbiology & Molecular Genetics Harvard Medical School Fall 2006 Visiting Researcher IMAGEN-NICTA National ICT Australia (NICTA) Feb 2006 Associate Professor School of Computer Science Florida International University 2001-2004 Professor Mathematical Sciences Department University of Memphis 2001 Associate Professor Mathematical Sciences Department University of Memphis 1995-2001 Visiting Professor Computer Science Department University of Copenhagen,Denmark May-July 2000 Visiting Professor Computer Science Department Lund University, Sweden May-June 1999 Visiting Professor Inst. for Advanced Comp. Studies University of Maryland,College Park Nov-Dec 1997
    [Show full text]
  • Elucidating the Transcriptional Program of Feline Injection-Site Sarcoma Using a Cross- Species Mrna-Sequencing Approach Qi Wei1†, Stephen A
    Wei et al. BMC Cancer (2019) 19:311 https://doi.org/10.1186/s12885-019-5501-z RESEARCH ARTICLE Open Access Elucidating the transcriptional program of feline injection-site sarcoma using a cross- species mRNA-sequencing approach Qi Wei1†, Stephen A. Ramsey1*† , Maureen K. Larson2, Noah E. Berlow3, Donasian Ochola4, Christopher Shiprack1, Amita Kashyap1, Bernard Séguin4, Charles Keller3 and Christiane V. Löhr1* Abstract Background: Feline injection-site sarcoma (FISS), an aggressive iatrogenic subcutaneous malignancy, is challenging to manage clinically and little is known about the molecular basis of its pathogenesis. Tumor transcriptome profiling has proved valuable for gaining insights into the molecular basis of cancers and for identifying new therapeutic targets. Here, we report the first study of the FISS transcriptome and the first cross-species comparison of the FISS transcriptome with those of anatomically similar soft-tissue sarcomas in dogs and humans. Methods: Using high-throughput short-read paired-end sequencing, we comparatively profiled FISS tumors vs. normal tissue samples as well as cultured FISS-derived cell lines vs. skin-derived fibroblasts. We analyzed the mRNA- seq data to compare cancer/normal gene expression level, identify biological processes and molecular pathways that are associated with the pathogenesis of FISS, and identify multimegabase genomic regions of potential somatic copy number alteration (SCNA) in FISS. We additionally conducted cross-species analyses to compare the transcriptome of FISS to those of
    [Show full text]
  • Giri Narasimhan
    6/18/2013 Giri Narasimhan CURRICULUM VITAE GIRI NARASIMHAN ADDRESS: • Professor, School of Computing and Information Science, PHONE: (305) 348-3748 ECS 254A, Florida International University, FAX: (305) 348-6142 Miami, FL 33199. • Associate Dean, Research and Graduate Studies College of Engineering and Computing 10555 W. Flagler Street, EC 2443 WEBPAGES: http://www.cis.fiu.edu/~giri; http://biorg.cis.fiu.edu/ (Research Group) E-MAIL: [email protected] q EDUCATION DEGREE DISCIPLINE INSTITUTION YEAR B. Tech. Electrical Engineering Indian Institute of Technology, Bombay, India 1982 Ph. D. Computer Science University of Wisconsin - Madison 1989 q EXPERIENCE RANK/POSITION DEPARTMENT/DIVISION INSTITUTION PERIOD Professor School of Computer Science Florida International University From Fall 2004 Associate Dean, College of Engineering & From Summer Research and Florida International University Computing 2009 Graduate Studies Visiting Scholar Next Generation Sequencing Strand Life Sciences Jan-Apr 2009 Visiting Professor Microbiology & Molecular Genetics Harvard Medical School Fall 2006 Visiting Researcher IMAGEN-NICTA National ICT Australia (NICTA) Feb 2006 Associate Professor School of Computer Science Florida International University 2001-2004 Professor Mathematical Sciences Department University of Memphis 2001 Associate Professor Mathematical Sciences Department University of Memphis 1995-2001 Visiting Professor Computer Science Department University of Copenhagen,Denmark May-July 2000 Visiting Professor Computer Science Department Lund University,
    [Show full text]