Bioinformatics Resources for Microrna Discovery Alyssa C

Bioinformatics Resources for Microrna Discovery Alyssa C

Kennesaw State University DigitalCommons@Kennesaw State University Faculty Publications 1-18-2016 Bioinformatics Resources for MicroRNA Discovery Alyssa C. Moore Jonathan S. Winkjer Tsai-Tien Tseng Kennesaw State University, [email protected] Follow this and additional works at: https://digitalcommons.kennesaw.edu/facpubs Part of the Molecular Biology Commons Recommended Citation Moore, Alyssa C.; Winkjer, Jonathan S.; and Tseng, Tsai-Tien, "Bioinformatics Resources for MicroRNA Discovery" (2016). Faculty Publications. 3586. https://digitalcommons.kennesaw.edu/facpubs/3586 This Article is brought to you for free and open access by DigitalCommons@Kennesaw State University. It has been accepted for inclusion in Faculty Publications by an authorized administrator of DigitalCommons@Kennesaw State University. For more information, please contact [email protected]. Bioinformatics Resources for MicroRNA Discovery Alyssa C. Moore, Jonathan S. Winkjer and Tsai-Tien Tseng Department of Molecular and Cellular Biology, Kennesaw State University, Kennesaw, GA, USA. Supplementary Issue: Gene and Protein Expression Profiling in Disease ABSTR ACT: Biomarker identification is often associated with the diagnosis and evaluation of various diseases. Recently, the role of microRNA (miRNA) has been implicated in the development of diseases, particularly cancer. With the advent of next­generation sequencing, the amount of data on miRNA has increased tremendously in the last decade, requiring new bioinformatics approaches for processing and storing new information. New strategies have been developed in mining these sequencing datasets to allow better understanding toward the actions of miRNAs. As a result, many databases have also been established to disseminate these findings. This review focuses on several curated databases of miRNAs and their targets from both predicted and validated sources. KEYWORDS: bioinformatics, microRNA, database SUPPLEMENT: Gene and Protein Expression Profiling in Disease COmpetING Interests: Authors disclose no potential conflicts of interest. CItatION: Moore et al. Bioinformatics Resources for MicroRNA Discovery. Biomarker COrrespOndence: [email protected] Insights 2015:10(S4) 53–58 doi: 10.4137/BMI.S29513. COpyrIGHT: © the authors, publisher and licensee Libertas Academica Limited. This is TYPE: Review an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 License. ReceIVed: July 21, 2015. RESUBMItted: November 22, 2015. Accepted FOR publIcatION: November 24, 2015. Paper subject to independent expert blind peer review. All editorial decisions made by independent academic editor. Upon submission manuscript was subject to anti- AcademIC edITOR: Karen Pulford, Editor in Chief plagiarism scanning. Prior to publication all authors have given signed confirmation of Peer REVIEW: Five peer reviewers contributed to the peer review report. Reviewers’ agreement to article publication and compliance with all applicable ethical and legal reports totaled 1,082 words, excluding any confidential comments to the academic editor. requirements, including the accuracy of author and contributor information, disclosure of competing interests and funding sources, compliance with ethical requirements relating FundING: Alyssa C. Moore was supported by the S-STEM scholarship from the to human and animal study participants, and compliance with any copyright requirements National Science Foundation. This material is based upon work supported by the of third parties. This journal is a member of the Committee on Publication Ethics (COPE). National Science Foundation under Grant No. 1259954. The authors confirm that the funder had no influence over the study design, content of the article, or selection of Published by Libertas Academica. Learn more about this journal. this journal. Introduction emphasis on the role of microRNA (miRNA) as a biomarker In 1956, Crick stated the central dogma of molecular bio­ in the diagnosis and possible treatment for cancer.3,4 miRNAs logy describing the flow of information from DNA to are small single­stranded noncoding RNAs that control gene RNA to protein.1 Although the process of information expression at the posttranscriptional level. miRNAs act as transmission was oversimplified, the central dogma hinted posttranscription regulators of mRNA by binding to a specific at the wealth of information that can be extracted from miRNA­binding site on the 3′­untranslated region (3′­UTR) every biological sequence. The mining of information from of mRNA.5 They are often regarded as both predictive and nucleotide and protein sequences prompted the develop­ prognostic biomarkers.6 Sequence­level polymorphisms in ment of bioinfor matics, the science that interfaces biology miRNA or their target sites can have strong downstream and computer science to answer biological questions on a effects in phenotype. These polymorphisms have been impli­ molecular level. Sequence­based discovery allows the elu­ cated in a number of diseases, ranging from cancer, diabetes, cidation of the relationships between structure, function, Parkinson’s, and Alzheimer’s. For example, miRNAs have and evolution. Discovering the relationships between our been considered as a serum biomarker for cancer diagnosis genetic sequences and the various genetic actions, includ­ and prognosis, particularly in B­cell lymphoma as noted by ing the causes of diseases, is one of the main goals of Lawrie et al.7 bioinformatics. With the advent of next­generation sequencing (NGS), The development of biomarker identifications is often the identification and quantitation of miRNA as bio­ associated with the diagnosis and evaluation of various dis­ markers are becoming more precise. Many experiments of eases. Many biomarkers are macromolecules of nucleic acids, miRNA quantitation were the results of whole transcriptome carbohydrates, and proteins in nature. The initial isolation of sequencing, often referred to as RNA­seq.8 The hallmark nucleic acid­based biomarkers requires the need for genomics feature of NGS is the ability to elucidate millions of strands as opposed to proteomics, which is needed to isolate protein­ of nucleotides simultaneously, which results in an unprece­ based biomarkers. These raw-omic outputs are often subjected dented amount of coverage for any genome. While NGS is of to further analyses with bioinformatics techniques that focus great interest to many readers, the technical detail is beyond on particular aspects of the dataset, specifically, in this dis­ the scope and the allotted space of this review. For users cussion, biomarkers.2 Recently, there has been an increased interested in NGS technology, review articles by Mardis, BIOMARKER INSIGHTS 2015:10(S4) 53 Moore et al Mutz et al, and Koboldt et al provide a thorough coverage on miRBase its usage and application.9–12 For readers interested in NGS miRBase (www.mirbase.org) combines the knowledge of and classical methods of miRNA discovery, Eminaga et al, miRNA and NGS to create a repository aimed at assigning Tam et al, and Git et al provide an excellent overview for stable and consistent names to novel miRNAs.22 While it can the processes.13–15 be accessed via its web interface, bulk download via file trans­ NGS is also known as massive parallel sequencing or fer protocol is also available. Established in 2002, miRBase deep sequencing due to its potential outputs. Consequently, was originally called the miRNA Registry, which allowed the amount of data generated has also been unprecedented. submissions of novel miRNAs to be named in a consistent and This requires the establishment of corresponding protocols organized fashion.23 Its first release contained 218 miRNA in processing miRNA data from RNA­seq experiments. For loci from five species. As of June 2014, after continuous bioinformatics to contribute to the analysis of these RNA­ growth, release 21 contains 28,645 entries representing hair­ seq datasets, protocols need to be created for finding the pin precursor miRNAs that expressed 35,828 mature miRNA most relevant miRNA species. While the main goal of this products in 223 species. miRBase can be used for searching review is to focus on various repositories of miRNAs and and browsing both hairpin and mature sequences. their interactions, it is worthy of note that efforts of compu­ Since the inception of miRBase, the annotation strat­ tational approaches, such as miRClassify,16 are also accele­ egy was developed and continually improved to organize all rating the overall annotation process of miRNAs. In addition, the information associated with miRNA species. Its goal was TargetScan,17 miRanda,18 and PicTar19 are the leading pro­ to officialize identifiers as quickly as possible for publication grams in the field, as reflected by the number of citations. in articles. For example, the prefix in dme­mir­100 desig­ For other computational approaches, it is recommended that nates the organism and is followed by sequentially assigned readers should review articles by Zou et al, Wang et al, and numbers. Recently, for sequences derived from the 5′ and 3′ Wei et al.16,20,21 arms of the hairpin precursor, names are assigned as dme­ As one of the most important goals in bioinfor­ miR­100­5p and dme­miR­100­3p, respectively, to specify matics, the proper storage and organization of data will the mature sequences. This standardized scheme also includes lead to easy retrieval and dissemination of information. a strategy where homologous miRNA loci are

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    7 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us