Supporting Information

Total Page:16

File Type:pdf, Size:1020Kb

Supporting Information Supporting Information Boothby et al. 10.1073/pnas.1510461112 SI Text tardigrades were allowed to settle and were left overnight at Repetitive Sequences in the H. dujardini Genome. A total of 4,440 room temperature. (10.72%) of the PacBio contigs, making up 7.91% of the as- sembled sequence, do not have direct homologous matches in the DNA Extraction. Isolated specimens were transferred to 1.5-mL × Illumina assembly. These contigs may represent highly repetitive microcentrifuge tubes and centrifuged at 16,837 g for 2 min to sections of the genome that are difficult to assemble with short pellet animals. Excess liquid was removed, and tubes with pel- Illumina reads, or they may be misassemblies of the PacBio data leted specimens were dipped in liquid nitrogen and simulta- neously homogenized with plastic pestles and then left briefly to due to the inherently high rate of sequencing error (10–15%). To thaw. Freeze/thaw douncing was repeated five times to homog- try to resolve this question, we aligned the Illumina reads used to enize samples. To isolate genomic DNA, homogenized samples create our assembly to all assembled PacBio contigs using Bowtie 2. were processed with Qiagen’s DNeasy Blood and Tissue Kit Alignment of our Illumina assembly to our PacBio assembly showed ’ ∼ (catalog no. 69506) according to the manufacturer s instructions. that the average coverage of the differential PacBio contigs is 66% Following isolation, genomic DNA was ethanol-precipitated. of the average coverage across all PacBio contigs, indicating that these sequences are represented in the raw Illumina data but are Assessment of EST and Core Eukaryotic Genes Mapping Approach likely repetitive, and therefore very difficult to assemble accurately. Representation in the Genome. All available H. dujardini ESTs Tandem repeats (39) (at least 2 bp × 10) were found within were obtained from the GenBank. The BLAST-like alignment 20 bp of the ends of 22.85% of scaffolds. Combined with the larger tool (BLAT) was used to find matches between H. dujardini scale repetitive sequence suggested by our PacBio sequencing, ESTs and our sequencing datasets. Resulting BLAT files were these results indicate that these elements are widespread and a used with baa.pl (40). significant hindrance to genome assembly. Core eukaryotic genes mapping approach analysis was per- formed as previously described (10). Distribution of Foreign DNA Among Scaffolds. Although, on average, the distribution of foreign genes is approximately the same through Assessment of Redundancy Within the Genome. BLAT analysis using our assembly, looking at individual scaffolds, one can see that there is baa.pl was performed as described above, comparing all predicted a bias for smaller scaffolds having either 100% HGT genes or 0% protein sequences from the H. dujardini genome annotation with HGT genes (Fig. S2E). This bias is due to smaller scaffolds having assembled scaffolds. fewer genes on them, and this bias decreases with increasing min- imal gene per scaffold number (Fig. S2F). So, when comparing Genomic Library Preparation and Sequencing. scaffolds with at least one gene with scaffolds with at least 10 genes, Short insert mate pair libraries. Genomic DNA libraries were con- one can see a trend toward a more ubiquitous distribution of for- structed by Cofactor Genomics according to the following pro- eign genes as minimal gene count is increased. cedure. Briefly, genomic DNA was sheared to the desired size using a Covaris S2 instrument. Three libraries were constructed with the SI Experimental Procedures following insert sizes; 300 bp, 500 bp, and 800 bp. Following shearing, Genome and Transcriptome Sequences. This Whole Genome Shotgun DNA was end-repaired and A-tailed to prepare for adaptor ligation. project has been deposited at DDBJ/EMBL/GenBank under the Indexed adaptors were ligated to sample DNA, and the adaptor- accession LMYF00000000. The version described in this paper is ligated DNA was then size-selected on a 2% SizeSelect E-Gel version LMYF01000000. (Invitrogen) and amplified by PCR. Library quality was assessed by measuring nanomolar concentration and the fragment size in Tardigrade Culture and Isolation. Tardigrade cultures were main- base pairs. Sequencing was performed on an Illumina HiSeq 2000 tained using spring water and Chlorococcum algae (Sciento) as platform with a 2 × 100 paired-end read configuration. previously described (9). For large-scale culture, H. dujardini Moleculo long-read library preparation. Isolated DNA was submitted “ ” cultures were grown at room temperature in 10-L plastic carboys to Illumina for preparation and sequencing using the long-read filled to 5 L with deionized water. Carboys were inoculated with DNA sequencing service. Resulting sequences were assembled Chlorococcum algae, which served as a food source for the water into long reads by Illumina. bears. To separate tardigrades from their algal food source for PacBio library preparation and sequencing. Quantity and quality were DNA and RNA isolations, carboys were shaken to suspend algal assessed using an Invitrogen Qubit 2.0 fluorometer (Invitrogen Life sediment evenly and this solution was decanted into 150-mm Technologies) and an Experion instrument (Bio-Rad). DNA was purified using Agencourt AMPure beads (Beckman Coulter). DNA glass Petri dishes. Petri dishes were placed near a bright window, (∼10 μg) was further prepared for sequencing using a PacBio 10-kb but out of direct sunlight. It has previously been observed that Preparation Kit (Pacific Biosciences) following the manufacturer’s H. dujardini move away from sunlight. After the suspended protocol. Five single molecule real-time (SMRT) cells were run using sediment had settled on the bottom of the dish, a glass pipette the P6-C4 chemistry and a 1 × 240 movie at the University of North was used to clear algae from the edge of the dish furthest from Carolina at Chapel Hill High-Throughput Sequencing Facility. the window. The dish was monitored over a week until visible accumulation of tardigrades could be seen in this cleared area. Genomic Sequence Assembly. Assembly was performed with the Using a glass pipette, tardigrades were collected and transferred Celera assembler, version 8.1 (35). Long reads were first con- to a 50-mL beaker filled with deionized water. Tardigrades were verted from a Fastq to Celera Assembler format with the com- allowed to settle to the bottom of the beaker for 2 h, and the mand “fastqToCA -libraryname Hd -technology moleculo -reads majority of the water was then removed from the beaker without LongRead.fastq > lr.frg.” The assembler was then run with the disturbing the tardigrades. The beaker was again filled with de- following parameters [largely following the method of McCoy ionized water to wash the tardigrades. This washing procedure et al. (36)]: unitigger = bogart merSize = 31 ovlMinLen = was repeated an additional two times. Following the final wash, 2,000 utgGraphErrorRate = 0.03 utgMergeErrorRate = 0.045 Boothby et al. www.pnas.org/cgi/content/short/1510461112 1of9 obtErrorRate = 0.03 obtErrorLimit = 4.5 ovlMerSize = 31 used to indicate likely HGT for a gene (14). All BLAST searches ovlErrorRate = 0.03 batThreads = 24 cnsConcurrency = 12 were carried out using the BLOSUM62 matrix. frgCorrConcurrency = 12 frgCorrThreads = 12 mbtConcurrency = 12 merOverlapperThreads = 12 merylThreads = 12 ovlConcurrency = Gene Tree Construction and Analysis. Alignments for phylogenetic 12 ovlCorrConcurrency = 12 ovlThreads = 12. The resulting contigs analysis were compiled by blasting predicted protein sequences of and unplaced unitigs were then \scaffolded using the 300-bp, prospective horizontally transferred genes against the NCBI nr 500-bp, and 800-bp mate pair libraries with SSPACE, version database. One hundred cases were chosen at random (SI Ex- 2.0 (41), and the command-line parameters: -l = libfile.txt -s = perimental Procedures, Bridged PCR). The top five hits from Longreads.fasta -b = Hd -x = 1-z= 0-k= 4-a= 0.7 -n = 15 -T = metazoa, fungi, Embryophyta (multicellular plants), single-celled 24 -p = 1-o= 20 -t = 0-m= 32 -r = 0.9. eukaryotes, Archaea, and Eubacteria were used for alignment, except when probable homologs were not present in a lineage. Genomic Annotation. Annotations for the H. dujardini genome Sequences were aligned with MUSCLE (53) and edited by eye. assembly were generated using the automated genome annota- Alignments were trimmed using the bioinformatics program tion pipeline MAKER (16, 37, 38), which aligns and filters EST gBlocks (54) and by eye. Both maximum likelihood and Bayesian and protein homology evidence, identifies repeats, produces ab approaches were used for phylogenetic analyses, implementing initio gene predictions, infers 5′ and 3′ UTRs, and integrates the Le-Gascuel (LG) model (55), with an estimated proportion these data to produce final downstream gene models, along with of invariable sites and an estimated gamma shape parameter. quality control statistics. PhyML (56) was used to determine maximum likelihood tree to- Inputs for MAKER included the H. dujardini genome as- pologies, with 500 bootstrap replicates to evaluate branch supports. sembly, H. dujardini ESTs, a tardigrade-specific repeat library The Bayesian analysis was performed in MRBAYES (57), which made with RepeatModeler (42) based on the reference genome ran for between 600,000 and 3,600,000 generations, depending on assembly, and a protein database combining the UniProt/Swiss-Prot when convergence occurred, and
Recommended publications
  • May Help Explain Expression of Multiple Genes in Alzheimer's
    Journal of Systems and Integrative Neuroscience Research Article ISSN: 2059-9781 Micro RNA’s (miRNA’s) may help explain expression of multiple genes in Alzheimer’s Frontal Cortex James P Bennett* and Paula M Keeney Neurodegeneration Therapeutics, Inc., 3050A Berkmar Drive, Charlottesville, VA 22901, USA Abstract MicroRNA’s (miRNA’s) are non-coding RNA’s that can regulate gene expression. miRNA’s are small (~22 nt) and can bind by complementation to mRNA’s resulting in direct degradation and/or interference with translation. We used paired-end RNAseq and small RNAseq to examine expression of mRNA’s and miRNA’s, respectively, in total RNA’s isolated from frontal cortex of Alzheimer’s disease (AD) and control (CTL) subjects. mRNA’s were aligned to the hg38 human genome using Tophat2/Bowtie2 and relative expression levels determined by Cufflinks as FPKM (fragments per kilobase of exon per million mapped reads). miRNA’s were aligned to the hg19 human genome and quantitated as number of mature known miRNA reads with miRDeep*. Using a false-discovery rate (FDR) of 10% (FDR ≤ 0.10) applied to 11,794 genes with FPKM > 2.0, we identified 55 genes upregulated in AD and 191 genes downregulated in AD. The top 145 miRNA’s with the highest mean expression were mostly under-expressed in AD (132/145 miRNAs had AD/CTL < 1.0)). AD samples had increased coefficients of variation for miRNA and mRNA expression, implying greater heterogeneity. miRTAR analysis showed that 32 (58%) of the mRNA genes upregulated in AD could be controlled by one or more of 60 miRNA’s under-expressed >1.5-fold in AD (AD/CTL < 0.67).
    [Show full text]
  • Stool-Derived Eukaryotic RNA Biomarkers for Detection of High-Risk Adenomas
    bioRxiv preprint doi: https://doi.org/10.1101/534412; this version posted January 29, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Stool-derived eukaryotic RNA biomarkers for detection of high-risk adenomas Erica Barnell1,2*, Yiming Kang2,3*, Andrew Barnell2, Katie Campbell1,2, Kimberly R. Kruse2, Elizabeth M. Wurtzler2, Malachi Griffith1,4,5,6, Aadel A. Chaudhuri3,6,7+, Obi L. Griffith1,4,5,6+ 1 McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA 2 Geneoscopy LLC, Division of Gastroenterology and Hepatology, St. Louis, MO, USA 3 Department of Computer Science and Engineering, Washington University, St. Louis, MO, USA 4 Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA 5 Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO, USA 6 Siteman Cancer Center, Washington University School of Medicine, St. Louis, MO, USA 7 Department of Radiation Oncology, Washington University School of Medicine, St. Louis, MO, USA +Corresponding author *These authors contributed equally to this work. Abbreviations AUC area under the curve LRA low-risk adenoma ANOVA analysis of variance MRA medium-risk adenoma CRC colorectal cancer NPV negative predictive value DE differentially expressed PPV positive predictive value FC fold-change ROC receiver operating characteristic FIT fecal immunochemical test seRNA stool-derived eukaryotic RNA HRA high-risk adenoma Abstract Background and aims: Colorectal cancer (CRC) is the second leading cause of cancer related deaths in the United States.
    [Show full text]
  • Targeted Sequencing Informs the Evaluation of Normal Karyotype Cytopenic Patients for Low-Grade Myelodysplastic Syndrome
    Letters to the Editor 2422 chronic lymphocytic leukemia are mostly derived from independent clones. 13 Binder M, Muller F, Jackst A, Lechenne B, Pantic M, Bacher U et al. B-cell receptor Haematologica 2014; 99: 329–338. epitope recognition correlates with the clinical course of chronic lymphocytic 9 Sanchez ML, Almeida J, Gonzalez D, Gonzalez M, Garcia-Marcos MA, leukemia. Cancer 2011; 117: 1891–1900. Balanzategui A et al. Incidence and clinicobiologic characteristics of leukemic 14 Seiler T, Woelfle M, Yancopoulos S, Catera R, Li W, Hatzi K et al. Characteri- B-cell chronic lymphoproliferative disorders with more than one B-cell clone. zation of structurally defined epitopes recognized by monoclonal antibodies pro- Blood 2003; 102: 2994–3002. duced by chronic lymphocytic leukemia B cells. Blood 2009; 114:3615–3624. 10 Klinger M, Zheng J, Elenitoba-Johnson KS, Perkins SL, Faham M, Bahler DW. Next- generation IgVH sequencing CLL-like monoclonal B-cell lymphocytosis reveals frequent oligoclonality and ongoing hypermutation. Leukemia 2015; 30: This work is licensed under a Creative Commons Attribution- 1055–1061. NonCommercial-NoDerivs 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons 11 Tiller T, Meffre E, Yurasov S, Tsuiji M, Nussenzweig MC, Wardemann H. license, unless indicated otherwise in the credit line; if the material is not included under Efficient generation of monoclonal antibodies from single human B cells by single the Creative Commons license, users will need to obtain permission from the license cell RT-PCR and expression vector cloning. J Immunol Methods 2008; 329: holder to reproduce the material.
    [Show full text]
  • Application of RAD-Seq in Evolutionary Genomics of Non-Model Organisms
    Application of RAD-seq in Evolutionary Genomics of Non-Model Organisms By Boryana S. Koseva Submitted to the graduate degree program in Ecology and Evolutionary Biology and the Graduate Faculty of the University of Kansas in partial fulfillment of the requirements for the degree of Doctor of Philosophy. Chair: John K. Kelly Stuart J. Macdonald Justin P. Blumenstiel Jamie R. Walters Mark T. Holder Date Defended: 28 April 2017 ii The dissertation committee for Boryana Koseva certifies that this is the approved version of the following dissertation: Application of RAD-seq in Evolutionary Genomics of Non-Model Organisms Chair: John K. Kelly Date Approved: 11 May 2017 iii Abstract Next generation sequencing (NGS) technologies are revolutionizing how we study genetics and evolution in the modern world. Data is generated at such a fast pace that scientists are struggling to keep up with the innovations in methodology and analytical tools. Genomes are being sequenced at an unprecedented rate, and scientists in fields that until recently found no use in learning molecular techniques are venturing into the world of high-throughput sequencing. Almost 10 years ago, a research group developed Restriction-site Associated DNA Sequencing (RAD- seq), a method that targets polymorphisms in close proximity to restriction cut sites in hundreds of samples simultaneously. The beauty of RAD-seq lies in that it is highly customizable and it does not require a reference genome, or intimate prior knowledge of the genetics of the study organism one would like to use. The most exciting part about new RAD-seq methods being developed is that their accessibility has opened the door to many non-model organisms to be used in new areas of research.
    [Show full text]
  • IS-Seq: a Novel High Throughput Survey of in Vivo IS6110 Transposition in Multiple Mycobacterium Tuberculosis Genomes
    BMC Genomics This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text (HTML) versions will be made available soon. IS-seq: a novel high throughput survey of in vivo IS6110 transposition in multiple Mycobacterium tuberculosis genomes BMC Genomics 2012, 13:249 doi:10.1186/1471-2164-13-249 Alejandro Reyes ([email protected]) Andrea Sandoval ([email protected]) Andrés Cubillos-Ruiz ([email protected]) Katerine E Varley ([email protected]) Iván Hernández-Neuta ([email protected]) Sofía Samper ([email protected]) Carlos Martín ([email protected]) Maria Jesús Garcia ([email protected]) Viviana Ritacco ([email protected]) Lucelly López ([email protected]) Jaime Robledo ([email protected]) María Mercedes Zambrano MMZ ([email protected]) Robi D Mitra ([email protected]) Patricia Del Portillo ([email protected]) ISSN 1471-2164 Article type Research article Submission date 3 November 2011 Acceptance date 30 May 2012 Publication date 15 June 2012 Article URL http://www.biomedcentral.com/1471-2164/13/249 Like all articles in BMC journals, this peer-reviewed article was published immediately upon acceptance. It can be downloaded, printed and distributed freely for any purposes (see copyright notice below). Articles in BMC journals are listed in PubMed and archived at PubMed Central. For information about publishing your research in BMC journals or any BioMed Central journal, go to © 2012 Reyes et al. ; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
    [Show full text]
  • Identification of Purple Sea Urchin Telomerase RNA Using a Next-Generation Sequencing Based Approach
    Downloaded from rnajournal.cshlp.org on October 5, 2021 - Published by Cold Spring Harbor Laboratory Press METHOD Identification of purple sea urchin telomerase RNA using a next-generation sequencing based approach YANG LI,1,6 JOSHUA D. PODLEVSKY,2,6 MANJA MARZ,3,6 XIAODONG QI,1 STEVE HOFFMANN,4,5 PETER F. STADLER,5 and JULIAN J.-L. CHEN1,7 1Department of Chemistry & Biochemistry and 2School of Life Sciences, Arizona State University, Tempe, Arizona 85287, USA 3Department of Bioinformatics, Friedrich Schiller University of Jena, D-07743 Jena, Germany 4LIFE Center and 5Interdisciplinary Center for Bioinformatics, University of Leipzig, D-04107 Leipzig, Germany ABSTRACT Telomerase is a ribonucleoprotein (RNP) enzyme essential for telomere maintenance and chromosome stability. While the catalytic telomerase reverse transcriptase (TERT) protein is well conserved across eukaryotes, telomerase RNA (TR) is extensively divergent in size, sequence, and structure. This diversity prohibits TR identification from many important organisms. Here we report a novel approach for TR discovery that combines in vitro TR enrichment from total RNA, next-generation sequencing, and a computational screening pipeline. With this approach, we have successfully identified TR from Strongylocentrotus purpuratus (purple sea urchin) from the phylum Echinodermata. Reconstitution of activity in vitro confirmed that this RNA is an integral component of sea urchin telomerase. Comparative phylogenetic analysis against vertebrate TR sequences revealed that the purple sea urchin TR contains vertebrate-like template-pseudoknot and H/ACA domains. While lacking a vertebrate- like CR4/5 domain, sea urchin TR has a unique central domain critical for telomerase activity. This is the first TR identified from the previously unexplored invertebrate clade and provides the first glimpse of TR evolution in the deuterostome lineage.
    [Show full text]
  • Bioinformatics Workshopfor
    HHMI bioinformatics workshop for Student/Scientist Partnerships Program & Abstracts June 14 - 16, 2012 ii Contents Agenda 1 Abstracts 5 Mark Adams 6 The Genomics Scholars Program: Encouraging a research path for under- represented minorities in community colleges Charles F. Aquadro 7 The Cornell Genetic Ancestry Project: Engaging Undergrads Through Participation Lois Banta 8 Metagenomic Analysis of Microbial Diversity in Winogradsky Columns Vincent Buonaccorsi 9 The Genome Consortium on Active Teaching using Next Generation Sequencing Steven G. Cresawn 10 Mycobacteriophages Genomics and Bioinformatics As a Positive Feedback Loop E.A. Dinsdale 11 Microbes, Metagenomes and Marine Mammals: Enabling the Next Generation of Students to Enter the Genomic Era Sam Donovan 12 Teaching and Learning Scientific Data Literacy Skills David J. Dooling 13 Making Sequence Analysis Accessible, or Even Invisible Todd T. Eckdahl 14 GCAT SynBio: Building a Community of Faculty Conducting Synthetic Biology Research with Undergraduate Student Sean Eddy 15 Thoughts on bioinformatics education Sarah C. R. Elgin 16 GEP: The Genomics Education Partnership Alex Hartemink 17 Course: Introduction to computational genomics Ian Korf 18 Teaching new bioinformatics programmers Robert M. Kuhn 19 Using the UCSC Genome Browser to Teach Fundamental Genetic Concepts -- Student-Produced Video Library Carolyn Lawrence 20 Structural Annotation of the Maize Genome: Tools for Community Contributions iii Contents Wilson Leung 21 GEP: The Web Framework Suzanna E. Lewis 22 WebApollo: A Web-Based Sequence Annotation Editor For Distributed Community Annotation Jennifer Mansfield 23 Research in an undergraduate biology curriculum: investigating the functional genomics of chemosensation in the tobacco hornworm Manduca sexta Juan C. Martínez-Cruzado 24 Caribbean-Focused Genomics Research Projects with Tailored Courses for Developing Human and Computational Infrastructure in Puerto Rico David Micklos 25 DNA Subway: An Intuitive Interface to Introduce Genome Informatics William R.
    [Show full text]
  • Company Overview and Highlights
    COMPANY OVERVIEW AND HIGHLIGHTS Confidential 1 Who is Cofactor? According to the National Cancer Institute (NCI), 50% of the world’s population CANCER will have cancer, and 25% will die from it. Cofactor Genomics is on a mission to change those statistics. Precision Medicine ensures the right patient receives the right treatment at the right time. Nowhere is precision medicine more active than in the field of cancer immunotherapy, with more than 2,400 drugs and 2,400+ combination therapies in clinical trials. Cancer-immunotherapy Even with so many breakthroughs, we still see a poor response; in some cases, Drug Clinical Trials less than 20% of patients respond to these therapies. The reason is the industry’s reliance on imperfect, single-analyte biomarkers, often using antiquated methods to match patients to therapies. Our team of 20-year veterans in genomics, knowing the inherent limitations of DNA and proteins, instead are using the $1.7B power of RNA to build the technologies, data assets, distribution and regulatory Annual TAM Potential frameworks to enable more predictive multidimensional biomarkers. Through these efforts, Cofactor has established themselves as the “Leaders in RNA”, and are demonstrating that RNA-based technologies are key to realizing the promise of precision medicine. Driven by clinical and market data, we are prioritizing a list of key cancer subtypes that address the highest unmet clinical need, with a $1.7B annual TAM to be captured as we demonstrate utility and gain adoption. 2 Confidential InvestorSummary_Handout_v_3_0_190913 Powered by Health Expression Models Cofactor has invested over $10M over the past decade to build the world’s largest database of Health Expression Models.
    [Show full text]
  • CV + Publications
    4100 Forest Park #217 Dave Messina. Saint Louis, MO 63108 314.974.6377 http://davemessina.com [email protected] Education 2012 PhD, Computational Biology, Stockholm University 2004 MA, Genetics, Washington University School of Medicine 1996 BA Biology, University of Illinois at Chicago Research Experience Director of Analysis @ Cofactor Genomics (2012-present) • Leads Cofactor’s computational research & development program, involving software and infrastructure • Oversees bioinformatics analyses for all clients, from small academic labs to large pharma companies • Represents the company in both pre- and post-sales customer contacts PhD candidate @ Stockholm Bioinformatics Centre, Stockholm University (2007-2012) • Thesis project: developing methods to identify novel protein-coding genes from metagenomic samples • Created DASher, a Java viewer for integrating distributed protein annotations (dasher.sbc.su.se) • Co-designed and developed the API for SeqXML and OrthoXML, formats for comparative orthology (seqxml.org) Core Developer @ BioPerl (2006-present) • Co-wrote the Deobfuscator, a highly-used documentation tool (bioperl.org/wiki/Deobfuscator) • Maintain, debug, and improve the codebase for a large open-source project (~700,000 lines of code) • Provide frontline support to user community • Plan and guide the future direction of the project Senior Programmer Analyst @ Genome Sequencing Center, Washington University (2004-2007) • Created semblance, an open-source program for fully-automated quality-control analysis and comparison of
    [Show full text]
  • Missouri's Advantages for the Biomedical Industry
    Missouri’s advantages for the biomedical industry Five reasons to locate a biomedical company in Missouri: 1. Thriving industry: Missouri has the right business environment for biomedical companies. Missouri ranks 12th for employment in biopharmaceuticals according to the U.S. Cluster Mapping Project, which was conducted through Harvard Business School. Go to page 3. 2. Abundant resources: Companies are supported by research centers, incubator facilities and hospitals—and an abundance of funding sources—to bring their discoveries from the laboratory to the market. Washington University in St. Louis ranked 7th for NIH funding in 2014 and was ranked the 6th best medical school for research in 2015 by U.S. News and World Report. Go to page 8. 3. Quality workforce: Missouri has the talent to sustain bioscience companies. Missouri’s bioscience industry in Missouri is comprised of nearly 3,500 companies employing nearly 50,000 Missourians (U.S. Bureau of Labor Statistics, 2014). Go to page 15. 4. World-class education: Missouri’s top-ranked educational institutions draw unparalleled talent. 40 Missouri colleges and universities offer bachelor’s degrees or higher in biological or biomedical sciences, including the University of Missouri, which is one of only a few universities in the U.S. with schools of medicine, agriculture and veterinary medicine on one campus. Go to page 16. 5. Solid infrastructure: Missouri’s top-ranked transportation network is a dependable delivery system for goods and services. With its centralized Midwest location, Missouri lies at the heart of a vast transportation network with direct connections to domestic and foreign markets.
    [Show full text]
  • An Island-Based Approach for RNA-SEQ Differential Expression Analysis
    University of Louisville ThinkIR: The University of Louisville's Institutional Repository Electronic Theses and Dissertations 5-2015 An island-based approach for RNA-SEQ differential expression analysis. Abdallah Eteleeb University of Louisville Follow this and additional works at: https://ir.library.louisville.edu/etd Part of the Computer Engineering Commons Recommended Citation Eteleeb, Abdallah, "An island-based approach for RNA-SEQ differential expression analysis." (2015). Electronic Theses and Dissertations. Paper 2072. https://doi.org/10.18297/etd/2072 This Doctoral Dissertation is brought to you for free and open access by ThinkIR: The University of Louisville's Institutional Repository. It has been accepted for inclusion in Electronic Theses and Dissertations by an authorized administrator of ThinkIR: The University of Louisville's Institutional Repository. This title appears here courtesy of the author, who has retained all other copyrights. For more information, please contact [email protected]. AN ISLAND-BASED APPROACH FOR RNA-SEQ DIFFERENTIAL EXPRESSION ANALYSIS By Abdallah Eteleeb B.S., University of Al-Jabal Al-Gharbi, Libya, 1996 M.S., HAN University of Applied Science, The Netherlands, 2005 A Dissertation Submitted to the Faculty of the J.B. Speed School of Engineering of the University of Louisville in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy in Computer Science and Engineering Department of Computer Engineering and Computer Science University of Louisville Louisville, Kentucky May 2015 c Copyright 2015 by Abdallah Eteleeb All Rights Reserved AN ISLAND-BASED APPROACH FOR RNA-SEQ DIFFERENTIAL EXPRESSION ANALYSIS By Abdallah Eteleeb B.S., University of Al-Jabal Al-Gharbi, Libya, 1996 M.S., HAN University of Applied Science, The Netherlands, 2005 A Dissertation Approved On April 6, 2015 Date by the following Dissertation Committee: Eric C.
    [Show full text]
  • Molecular Genetic Studies of Senescence in Anthurium
    MOLECULAR GENETIC STUDIES OF SENESCENCE IN ANTHURIUM A DISSERTATION SUBMITTED TO THE GRADUATE DIVISION OF THE UNIVERSITY OF HAWAI‘I AT MĀNOA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY IN MOLECULAR BIOSCIENCES AND BIOENGINEERING AUGUST 2012 By Pierriden Azucena Perez Dissertation Committee: David Christopher, Chairperson Anne Alvarez Richard Criley John Hu Gernot Presting Keywords: Anthurium senescence, Agrobacterium-mediated transformation ABSTRACT Senescence is a complex physiological process and has become an attractive area of research in plant molecular biology. The autoregulated production of cytokinin in plants transformed with the PrSAG12-IPT gene construct significantly delayed leaf senescence, and created plants that lived longer, produced more flowers with improved vase life, and an overall increased productivity. The promoter region of an arabidopsis cysteine protease served as the senescence-activated switch for the cytokinin gene IPT, and the discovery of a homolog in anthurium (ANTH17) made possible the cloning and isolation of its promoter. The sequence contained motifs and cis-elements characteristic of senescence response, and transformation of arabidopsis with PrANTH17-IPT showed similar traits with those transformed with PrSAG12-IPT. Agrobacterium-mediated transformation of anthurium with the senescence-activated gene constructs proved challenging, and stable transformation of plants was confirmed by screening for the reporter gene GFP using molecular methods. An effort to establish a protoplast transient expression system in anthurium was initiated in order to study protein subcellular signaling and localization, and is still in the process of optimization. Transcriptomic analysis of senescing leaf and spathe identified proteins involved in tissue-specific development, and provided an enormous collection of over 17,000 gene sequences that are differentially expressed.
    [Show full text]