Opportunities in Functional Genomics: a Primer on Lab and Computational Aspects Author(S): Steven B

Opportunities in Functional Genomics: A Primer on Lab and Computational Aspects Author(s): Steven B. Roberts and Mackenzie R. Gavery Source: Journal of Shellfish Research, 37(4):747-754. Published By: National Shellfisheries Association https://doi.org/10.2983/035.037.0406 URL: http://www.bioone.org/doi/full/10.2983/035.037.0406 BioOne (www.bioone.org) is a nonprofit, online aggregation of core research in the biological, ecological, and environmental sciences. BioOne provides a sustainable online platform for over 170 journals and books published by nonprofit societies, associations, museums, institutions, and presses. Your use of this PDF, the BioOne Web site, and all posted and associated content indicates your acceptance of BioOne’s Terms of Use, available at www.bioone.org/page/terms_of_use. Usage of BioOne content is strictly limited to personal, educational, and non-commercial use. Commercial inquiries or rights and permissions requests should be directed to the individual publisher as copyright holder. BioOne sees sustainable scholarly publishing as an inherently collaborative enterprise connecting authors, nonprofit publishers, academic institutions, research libraries, and research funders in the common goal of maximizing access to critical research. Journal of Shellfish Research, Vol. 37, No. 4, 747–754, 2018. OPPORTUNITIES IN FUNCTIONAL GENOMICS: A PRIMER ON LAB AND COMPUTATIONAL ASPECTS STEVEN B. ROBERTS* AND MACKENZIE R. GAVERY School of Aquatic and Fishery Sciences, University of Washington, 1122 NE Boat Street, Seattle, WA 98105 ABSTRACT As computational and sequencing technologies continue to flourish, the barrier for those interested in complementing traditional ecological and physiological studies with functional genomics is easier to overcome. Here, an overview of transcriptome sequencing and DNA methylation analyses in shellfish is provided, primarily for those with fundamental interests and training in a different domain. The approaches covered here can provide valuable information on how organisms respond to their environment and also be used to evaluate evolutionary relationships. First, biological and technological background is provided, highlighting studies in shellfish that have applied these approaches. This is followed by practical methods and tools for conducting this work at the laboratory bench and in front of a computer. In an effort to provide educational resources and the ability to update computational and analytical resources as they become available, a supplementary repository for this work has been created and is available at https://github.com/sr320/fun-gen. KEY WORDS: genomics, bioinformatics, gene expression, DNA methylation, DNA sequencing BACKGROUND latter can easily form a basis for hypotheses that can be tested in later experiments using genomic and nongenomic approaches. When considering marine invertebrates whether in an eco- There are a myriad of processes that can influence transcrip- logical, aquaculture, or culinary perspective, one does not often tional activity, including epigenetic mechanisms. Epigenetics is consider the expression of genes and associated regulatory a burgeoning field in nonmodel organisms that refers to mechanisms; however, for some disciplines this could be a valu- mechanisms that can alter expression in a heritable manner able direction to expand and complement scientific endeavors. without altering the underlying nucleotides themselves. Unlike Even for those interested in molecular biology, there is often the nucleotides themselves, epigenetic mechanisms, including a steep learning curve from a clear fundamental understanding DNA methylation, histone modifications, and noncoding to practical application. This is particularly the case for RNAs, are sensitive and responsive to environmental signals. functional genomics, a field that is increasingly shedding light As such, epigenetic mechanisms have been proposed to be an on numerous aspects of shellfish biology. In terms of ecology, important link between the phenotype and environment, and areas that could be studied using functional genomics ap- could transform how we consider adaptation depending on the proaches include adaptation, population structure, and species degree of heritability. interaction. From the physiological perspective, the focus of One could encapsulate functional genomics into the foun- this review, this would include understanding how fundamental dation of what controls organismal physiology, including the biology works (e.g., growth, reproduction, and immune re- response to environmental stress. An additional component of sponse). Combined, this information could be effectively ap- genomics is how alteration of nucleotides can alter a phenotype. plied in numerous ways including shellfish aquaculture, We will not explore aspects of genetic variation here, although resource management, and ecological modeling. Here, a short there are several reviews covering the topic in relation to introduction to gene expression and DNA methylation in shellfish (Degremont et al. 2015, Robledo et al. 2017). There shellfish is provided, as well as practical advice on how to get are also several comprehensive reviews on the application of started. genomics in shellfish (Suarez-Ulloa et al. 2013, Gomez-Chiarri This work focuses on ‘‘functional genomics’’ which are et al. 2015a, Sanmartın et al. 2016). approaches associated with the ‘‘expression’’ of the genome in some manner. Briefly, there are portions of the genome (genes) CURRENT STATE OF RESOURCES AND TECHNOLOGY that can be transcribed, often then translated to proteins to carry out fundamental processes. Specifically, there is a focus on Ultimately, having a completely sequenced genome of an messenger RNA (mRNA), which is the RNA that is transcribed organism can greatly increase the ability to study the expression from the genome where introns are excluded and a polyadeny- and variation; however, the number of species where the lated tail is appended. There are numerous scientific questions genomes have been fully assembled to this level of complete that can be addressed by examining mRNA expression pat- chromosomes is limited. One notable example where this has terns, including providing a snapshot into environmental been accomplished is Crassostrea virginica (GenBank accession response, identifying early signals of phenotypes, and charac- GCA_002022765, Gomez-Chiarri et al. 2015b). Given the terizing mechanisms of disease resistance. Gene expression complex nature of genome arrangement in many marine in- analysis can be used to address specific hypotheses, but also vertebrates, genome assembly is not a simple task and requires can be a powerful tool for discovery and descriptive science. The significant resources in both computing hardware and bioinformatic expertise. The technology is advancing at such a rate *Corresponding author. E-mail: [email protected] that it would be challenging to comment on ideal sequencing DOI: 10.2983/035.037.0406 and associated bioinformatic approaches for genome assembly. 747 748 ROBERTS AND GAVERY Generally, sequencing the genome of a shellfish species will Epigenetic mechanisms, such as DNA methylation, are key often involve identifying an individual with limited genome to our understanding of how environment impacts phenotype, complexity (e.g., inbred), generating sequencing libraries con- as they can contribute to gene regulatory activity and be stituted of variable fragment lengths, and relying on software to influenced by the environment. DNA methylation refers to assemble the many pieces of the genome together in the correct the enzymatic addition of a methyl group to a cytosine residue order and orientation. If genomes are available for your species in DNA, which occurs almost exclusively at CpG dinucleotides of interest, this could certainly improve downstream analysis (i.e., a cytosine located 5# of a guanine) in animals. The and increase options for interpretation. enzymatic machinery supporting DNA methylation includes Some of the first mollusc genomes to be sequenced were the a family of DNA methyltransferases (DNMT) including the California sea hare, Aplysia californica, and the owl limpet, maintenance methyltransferase DNMT1 and the de novo Lottia gigantea. The L. gigantea genome is 348 Mb, has a sub- methyltransferases DNMT3A/3B. In plant and mammalian stantial number of repetitive elements (21% of the assembled studies, it has been shown that DNA methylation is sensitive to genome), and about 23,800 genes (Simakov et al. 2013). The external factors including maternal behavior (Weaver et al. 2004), Pacific oyster, Crassostrea gigas, was the first bivalve genome nutrition (Dolinoy et al. 2007), and photoperiod (Azzi et al. 2014). published with 11,969 scaffolds (Zhang et al. 2012). A total of Most of the studies on DNA methylation in shellfish have 28,027 genes were predicted. In early 2017, a number of new focused on Crassostrea gigas, as the available genome signifi- bivalve genomes were published including Patinopecten yes- cantly improves effectiveness of data analysis. One primary soensis (Wang et al. 2017) and genomes of a deep-sea vent/seep means to identify DNA methylation in any given cell or tissue is mussel (Bathymodiolus platifrons) and a shallow-water mussel to use bisulfite treatment of DNA, which converts unmethy- (Modiolus philippinarum) (Sun et al. 2017). Another popular lated cytosines to uracil, whereas methylated cytosines are bivalve, the eastern oyster (Crassostrea virginica), has also been

Opportunities in Functional Genomics: a Primer on Lab and Computational Aspects Author(S): Steven B

Chapter 14: Functional Genomics Learning Objectives

Masterpath: Network Analysis of Functional Genomics Screening Data

The Economic Impact and Functional Applications of Human Genetics and Genomics

Human Functional Genomics Project Begins Unraveling Links Between

High-Throughput Automated Microfluidic Sample Preparation for Accurate Microbial Genomics

IBM Functional Genomics Platform, a Cloud-Based Platform for Studying Microbial Life at Scale

After the Draft Sequence, What Next for the Human Genome Mapping Project Resource Centre?

Toward a Protein–Protein Interaction Map of the Budding Yeast: A

A Massively Parallel Barcoded Sequencing Pipeline Enables Generation of the First Orfeome and Interactome Map for Rice

Arabidopsis Thaliana Functional Genomics Project Annual Report 2008

Comparative Genomics for Reliable Protein-Function Prediction from Genomic Data

Classical Genetics 3. the Beginnings of Genomic Biol