Current Biology 7–8 (2016) 39–52

Contents lists available at ScienceDirect

Current Plant Biology

jo urnal homepage: www.elsevier.com/locate/cpb

Databases and bioinformatics tools for research

Priyanka Garg, Pankaj Jaiswal

Department of Botany and Plant Pathology, Oregon State University, 2082 Cordley Hall, Corvallis, OR 97331, USA

a r t i c l e i n f o a b s t r a c t

Keywords: Rice is one of the most important agricultural crop in the world and widely studied model plant. The

Biological database

completion of whole genome sequence of rice ( sativa) and high-throughput experimental plat-

Rice

forms have led to the generation of the tremendous amount of data, and development of the specialized

Gene expression

databases and bioinformatics tools for data processing, efficient organization, analysis, and visualiza-

Biocuration

tion. In this article, we discuss a collection of biological databases that host genomics data on sequence,

Ontology

Pathways gene expression, genetic variation, gene-interactomes, and pathways, and facilitate data analysis and

visualization.

© 2016 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license

(http://creativecommons.org/licenses/by/4.0/).

1. Introduction organisms including rice. In contrast community-specific databases

cater to the need of a specific research community such as plant

Over the last decades, an increasing amount of genome-scale databases, e.g. PlantGDB [8], ENSEMBL [9], Gramene [10],

experimental data sets became available and several online and PLEXdb [11], Gene Expression Atlas [12], Planteome [13], etc. The

open source, biology databases have emerged. For instance, cur- species-specific databases for the model and non-model organisms,

rently, ∼1685 publicly available, online databases are listed at NAR for example, RAPdb [14], Beijing Genomics Institute-Rice Informa-

online Molecular Biology Database Collection [1]. These databases tion System [15] and Rice SNP-Seek Database [16,17] provide in

can be categorized on the basis of data type, data curation methods, depth coverage of the data sets and are more specifically tuned to

the scope of data coverage and accessibility of the database. Many the need of a specialized small community of researchers Table 2.

such publicly funded resources host data (raw, annotated, ana- Furthermore, each database can be assigned to one or more cate-

lyzed) for various species including crops, model and non-model gories on the basis of their content, for example, gene expression

plants, whereas, others are dedicated to a group of species from a databases, molecular interaction databases, genome annotation,

taxonomic clade and may contain a certain type of data. In addition, nucleotide or protein databases, smallRNA databases, genomic

an array of tools and web applications are available that facilitate variation, phenome and pathway databases.

formatting, analysis and visualization of various types of genomic Rice is an important crop and serves as a model for monocotyle-

data. don family. Recent advances in rice genome biology have generated

Data coverage decides the target user community for a database. tremendous amount of data including fully sequenced high qual-

These large-scale public repositories or international archives, ity reference genomes [9,10], low coverage sequencing data from

usually developed and maintained by national and international 3010 rice accessions of the rice germplasm core collection with

projects, provide genomic data from several species. Table 1 lists an average sequencing depth of 14× [16–19], genetic variation,

some of the generic large-scale public repositories or archives, and transcriptomes, proteomes, metabolomes, etc. which necessitates

databases, for example, GenBank [2], EMBL [3], INSDC [4], and development of bioinformatics resources and databases for storage,

DDBJ [5] for sequences and annotation, PDB [6] for protein struc- processing, organization, analysis, and visualization of such data at

tures and UniProt [7] for protein information. These are long-term systems level.

sustainable repositories for archiving valuable data from several For the benefit of rice researchers, we are providing a compre-

hensive list of such generic and specialized genomic databases,

resources, web applications and analysis tools (Tables 1 and 2).

ଝ Some of the information provided here is also useful to the commu-

This article is part of a special issue entitled “Genomic resources and databases”,

nity of plant researchers who may not be engaged in rice research. It

published in the journal Current Plant Biology 7–8, 2016.

∗ is possible that we may have missed some resources and we expect

Corresponding author.

E-mail address: [email protected] (P. Jaiswal). this list to grow in future.

http://dx.doi.org/10.1016/j.cpb.2016.12.006

2214-6628/© 2016 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

40 P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52

2. Resources for genes, genomes, genetic variations project) and their consequences on the gene function and struc-

ture, descriptions of phenotypic traits and plant pathway databases

The rice genome assembly, annotation, and associated infor- developed using the BioCyc platform such as RiceCyc [27], and the

mation is mainly provided by the MSU Rice Genome Annotation Reactome platform-based Plant Reactome that provides pathways

Project [20,21], the International rice genome sequencing project’s from the reference rice and gene homology-based projections to all

(IRGSP) rice RAPdb [14,22] and the Oryza Genome Evolution (OGE) Oryza species with sequenced genome or transcriptome [28]. The

project (http://oge.gramene.org). Gramene uses annotated rice features and data available at the genome portal of Gramene and

genes and genome assemblies of O. sativa ssp. japonica cv Nippon- their collaborator Ensembl Plants [9] are similar.

bare from IRGSP [14,22], O. sativa ssp. indica cv 93-11 and several In addition to Gramene, a number of databases provide genetic

wild Oryza species sequenced by the OGE and the Internatioanl variation data (SNPs and indels) including PmiRKB [29], Rice Varia-

Oryza Map Alignment Project (I-OMAP) (http://oge.gramene.org) tion Database [30], RiceVarMap [31], SNP haplotype database [32],

[23–26]. These genomes are presented by building an integrated and Rice SNP-Seek Database [16,17]. The largest data set (∼29

web resource for rice that includes rice species-specific genome million SNPs) for rice genetic variants come from the 3000 rice

browser, whole genome alignment, synteny, genetic and physi- genome sequencing project that is now being hosted at IRIC, a Rice

cal maps with genes, gene trees, ESTs and QTL locations, genetic SNP-Seek Database together with phenotype and variety informa-

diversity data including SNPs (from the 3000 rice genome sequence tion/passport data [16,17].

Table 1

Generic genomic databases and bioinformatics tools. The description may be copied from the original source [2,5–7,9–11,29,30,33,34,38,39,49,50–52,58,63–91].

Data Types

te/pr els, els, ng i li nd

tation

ud Database Name etc.)

s

no and latest Description Species URL es on release date (incl otein) cti Others on mutants, i Expression Expression , all RNA types RNA all Genetic/genomic Genetic/genomic wild speci wild variati Genome an Genome Gene/gene products products Gene/gene SNP including proteins and and proteins including Pathway/network/intera (transcript/metabo An open source, data resource

O. sativa and

Gramene for comparative functional

other plant h ttp://www.gramene.org Y Y Y Y Y Y

Nov 2016 genomics in cereals and other

species

plant spe cies [10].

It provides, information on

metab olic pathways inferred from

EXPath O. sativa and 2

microarr ay-based transcriptomic http://expath.itps.ncku.edu.tw Y Y Y

Oct 21, 201 2 plant species

data, gene annotati on and

orthologous genes[38].

PLANEX (PLAnt Contains pub licly available O. sativa and 7

co-Expression) GeneChip data obtained from the http://planex.plantbioinformatics.org Y Y Y

plan t species

database Gene Express ion Omnibus [63].

Provides informati on on co-

expre ssion gene-networks

ATT ED-II Oryza and 8

sup ported by microarray and http://atted.jp Y Y Y

Sep 1, 2015 plant species

RNA seq uencing-based transcriptomic data. [64]. PODC (Plant

A repository of annotated gene

Omics Data O. sativa and 7

expre ssion data and omics data http://bioinf.mind.meiji.ac.jp/podc Y Y Y Y

Center) plant species

analysis too ls [65].

Mar 16, 2016

A plan t miRNA data repositories

con taining .associated

PMRD (Plant information on sequence,

O. sativa and

MicroRN A secondary structure, target

120 pl ant http://bioinformatics.cau.edu.cn/PMRD Y Y Y Y

Datab ase) genes, expression profiles of

species

Nov 17, 2014 miRNAs and their mappi ng to the

spe cies-specific genome browser

[66].

NIAS GBdb

(National Institute A database containing

O. sativa and

of Agrobiological information on simple sequence

other plant h ttp://www.gene.affrc.go.jp/databases_en.php Y Y Y

Sciences repeat (SSR) polymorphisms in

species

planttfdb plant genomes [67].

database)

A databa se for rice reverse O. sativa ssp.

geneti cs, build with flanking indica and

OryGenesDB seq uence tags of various japonica, and 2 http://orygenesdb.cirad.fr/index.html Y Y Y

mutag ens and functional other plant

genomics da ta [68] species

P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52 41

Table 1 (Continued)

All ows the user to retrieve data

related conserved structural-

O. sativa and 7

FamNe t functional domains within http://www.gene2function.de/famnet.html Y Y Y

plan t species

proteins from one or more plant

spe cies [69]

A comparative hub for annotated

plant genome and gene family

data. Provides a vi ew of the O. sativa and

Phytozome evolution ary history of every 64 other plant

h ttp://www.phytozome.net Y Y Y Oct 16,2015 plant gene at the level of and algae

sequence, gene structure, gene species

famil y and genome organization [70]. A relational database that

integrates data from rice,

POG s (Putative

Arabidop sis, and maize into

orthologous

'putative orthologous groups' O. sativa and 3

Groups 2) h ttp://pogs.uoregon.edu Y Y Y

(POGs) and allow comparisons plant species Datab ase

among orthologs and

2014

extrapolati on of annotations

among species [71].

The database con tains a

O. sativa and

GreenPhylDB catalogue of gene families

other plant h ttp://www.greenphyl.org/cgi-bin/index.cgi Y Y Y

Sep 4, 2015 covering a broad taxon omy of

species

green plants [72].

Provides informati on on

ppdb (plant transcription start sites (TSSs),

promoter core promoter structure (TATA Oryza and 4

h ttp://ppdb.agr.gifu-u.ac.jp/ppdb/cgi-bin/index.cgi Y Y Y

database) boxes, Initiators, Y Patches, GA plant species

Jun 5, 2013 and CA elements) and regulatory

element groups (REGs) [73].

Consisting o f large-scale

CSRDB (Cereal datasets of maize and rice

O. sativa and

Sma ll RNA smRNA sequences generated by http://sundarlab.ucdavis.edu/smrnas Y Y

maize

Datab ase) high-throughput pyrosequencing

[74].

Compiles transfer RNA (tRNA)

gene seque nces retrieved from O. sativa and

PlantRNA

full y annotated plant nuclear, 10 plant http://plantrna.ibmp.cnrs.fr Y Y Sep 6, 2012 plastid and mitochondrial species

genomes [75].

A Cen tral resource for annotated

proteins consisting of two

O. sativa and

UniProtKB secti ons: UniProtKB/Swiss-Prot

other organism h ttp://www.uniprot.org Y Y

Nov 2016 for manually annotated entries,

species

and UniProtKB /TrEMBL for

computer-annotated entries [7].

GenBank NIH genetic sequ ence database, O. sativa and

Updated on daily a repository of publicly available other organism http://www.ncbi.nlm.nih.gov Y Y basis DNA seq uences [2]. species

DDBJ O. sativa and

A public repo sitory of nucleotide

Updated on dail y other organism http://www.ddbj.nig.ac.jp Y Y

seq uence data [5].

bases species

Comprehen sive collection of

O. sativa and

EMBL nucleoti de sequences and

other organism h ttp://www.ebi.ac.uk/about Y Y

Dec 7, 2016 annotation from available pu blic

species

sou rces [76] . Provides Genome brow ser for

O. sativa and

Ensembl Plants several plant species various

other organism h ttp://plants.ensembl.org/index.html Y Y

Dec 2016 genomic data sets, and tools for

species analysis and visuali zation of

42 P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52

Table 1 (Continued)

genome-scale large da ta sets in

the context o f genome [9]

O. sativa ssp.

Manuall y curated genes that are japonica cv. DroughtDB involved in drought stress http://pgsb.helmholtz-muenchen.de/droughtdb Y Y Nipponbare and

response [77]. 8 plant spe cies

A collection of biotic and abiotic stress responsive genes with options to identify probable

Transcription Factor Binding Sites in their promoters. An STIFDB2 (Stress integrated biocuration and O. sativa ssp. Responsive genomic data mining approach japonica and Transcription http://caps.ncbs.res.in/stifdb2 Y Y Y

have been employed to Indica and

Factor Database)

cha racterize the data set of Arabidopsis

Oct 2012

transcription factors and

con sensus binding sites from

literature and stress-responsive

genes from the Gene Expression

Omnibus [78].

Composed of a coll ection of

databases that relate to the

GR ASSIUS

con trol of gene expression in the (Grass Reg ulatory

grasses, and their relationship O. sativa and 3

Information www.grassius.org Y Y Y

with agronomic traits. Include s plant species

Services) transcription factors, promoters,

Aug 25, 2014

co-regulators and transcription

factor-ORF clone s [79].

Integ rated database for co-

expre ssed genes and

CoP database O. sativa and 7

biological processes in plants http://webs2.kazusa.or.jp/kagiana/cop0911 Y Y

Nov 11, 2009 plant species derived from microarray da ta

[39].

First integrative resource that

O. sativa and 7

IsomiR Bank con tains the sequence and http://mcg.ustc.edu.cn/bsc/isomir Y Y

organisms

expre ssion of isomiRs [80].

Provides informati on on the

Plant MPSS

expre ssion level of genes, and O. sativa ssp.

(mass ively

potential novel transcripts indica and

parall el signature http://mpss.udel.edu Y Y

(antisense transc ripts, alternative japonica and 3

seque ncing)

sp lice isoforms, and regulatory plant species

databases

intergenic tran scripts) [81].

Provides information on co-

PlantArr ayNet expresssed genes using O. sativa and 2

h ttp://arraynet.mju.ac.kr/arraynet Y Y

Jan 10 , 2011 microarray-based transcriptomic plant species data [82].

A unified gene expression

resource for plants and plant

pathogen s. It is a genotype to

pheno type, hypothesis building Oryza and 12

PLEXdb

information wa rehouse, other plant http://www.plexdb.org Y Y

Jun 2013

leveraging highly parallel species

expre ssion data with seamless

portals to related genetic,

physical, and pathway data [11].

P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52 43

Table 1 (Continued)

BAR (Bio- Several plant

Analytical Provides interactive interfaces for

species

Resource for the exploratory visuali zation of http://bar.utoronto.ca Y Y

including O. plant biology) gene expre ssion data [83, 84] sativa. Jun 2, 2015

PmiRKB (Plant Provides four major functional

miRNA 21 O. sativa

modules-"SNPs", "Pri-miRNAs", http://bis.zju.edu.cn/pmirkb Y Y Knowled ge Base) and Arabidopsis "MiR—Tar", and "Self-reg"[29]. Jun 5, 2010 A web interface to access large sets of transcription factors of several plant species. O. sativa ssp.

Information including protein

PlnTFDB (plant indica and

seq uences, coding regions,

transcripti on japonica and http://plntfdb.bio.uni-potsdam.de/v3.0 Y Y

genomic sequences , expressed

factor da tabase) other plant seq uence tags (ESTs), domain species architecture and scientific

literature is provided for each

famil y [85].

PlantDHS (plant Integ rates histone modification,

O. sativa ssp.

DNase I RNA seq uencing, nucleosome japonica cv. hypersensitive positioning/ occupancy, http://plantdhs.org Y Y

Nipponbare and

site database) transcription factor binding sites,

2 plant spe cies

Feb 23,201 6 and genomic sequence [86].

Plant Homolog 16 plant sp.

A databa se composed of plant

Datab ase Including 10 http://phd.big.ac.cn Y Y

homologous gene s [30].

May 9 , 2015 Oryza species

Simple yet robust and extensible

PO (Plant con trolled vocabularies that O. sativa and

Ontology) acc urately reflect the biology of other plant www.plantontology.org Y Y Sep 2016 plant structures and species

developmental stages [58]

Plant-PrAS (Plan t Database of physicochemical

Protein and structural properties, and O. sativa and 5

h ttp://plant-pras.riken.jp Y Y

Annotation Suite) novel fun ctional region in plant plant species

database proteo mes [87].

PDB (Protein Worldwide archive of structural O. sativa and

Data Bank) data o f biological other organism http://www.rcsb.org/pdb Y Y Dec 6, 2016 macromolecules [6]. species MPIC

O. sativa and

(Mitochondrial Searchable information on the

23 othe r http://www.plantenergy.uwa.edu.au/applications/m

Protein Import protein import apparatus of pl ant Y Y

organism pic

Components) and non-plant mitochondria [88].

species

database

A databa se of molecular O. sativa and

PlantGDB

seq uence data from several plant other plant http://www.plantgdb.org Y

Jul 23, 2012

spe cies [89]. species

A web server for query,

visua lization, and analysis of

PlantAPA

poly(A) sites in plants , which can

(Alternative O. sativa and 3

profil e heterogeneous cleavage http://bmi.xmu.edu.cn/plantapa Y Y polyadenylation) plant species

sites and quantify expression

Aug 30, 2016

pa ttern of poly(A) sites across

di fferent conditions [90].

A resource for co mmon

referen ce ontologies for plants

Oryza and plant

Planteome and species-specific crop h ttp://www.planteome.org Y Y

species

ontologies. Also provides

ontology-ba sed annotation of rice

44 P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52

Table 1 (Continued)

genes, QTLs, phenotypes, germplasms [49].

Functional annotation of plant Oryza and 2

Mercator pipeline h ttp://mapman.gabipd.org/web/guest/app/Mercator Y Y

'omics ' data [91]. plant species

Provides li fe scientists with

powe rful computational

CyVerse (former

infrastructure to handle huge Plants, animals,

iPlant http://www.cyverse.org Y Y Y datasets and complex analysis, and microbes

Collaborative) thus enabli ng data-driven

discovery [50].

A softwa re system that allows

experimentali sts without

informati cs or programming

Galaxy h ttp://galaxyproject.org Y Y expertise to perform complex

large-scale analysis with just a

Web brow ser [51, 52]

Database of allergenic proteins

MoCh A gained by bioinformatics tools or

(Molecular evidence of IgE binding. It has

Oryza and 3 h ttp://lilab.life.sjtu.edu.cn:8080/mocha/main-7.9-

Characteristi cs collected genome, transcriptome, Y Y

plan t species 2.html

database for proteo me data of reliable All ergens) experiments and molecular

features.

A we b-based tool for accessing

the diurnal and circadian

genome-wide expre ssion results

Diurnal Plant spe cies http://diurnal.mocklerlab.org Y Y

of genes from several array

experiments conducted on

common model plants [33].

Provides pow erful tools to

O.sativa and 16

explore gen e expression across

Genevestigator organism h ttps://genevestigator.com/gv Y Y a wide variety of biolog ical

species

con texts [34].

3. Resources for gene expression datasets, kegg-bin/show organism?menu type=pathway maps&org=dosa)

gene-interactomes, pathways and ontologies and the Plant Reactome [28], host pathway databases for metabolic,

regulatory, developmental and signaling pathway analysis.

The gene expression databases play a vital role in extracting, Table 1 also includes a number of publicly available molecu-

organizing and interpreting the information regarding the expres- lar interaction databases. The databases such as CoP [39] provides

sion profile of a gene and genomes under a specific developmental information on the co-expressed genes based on transcriptome

stage or in response to a particular treatment to build a connec- analysis for 8 plant species including rice. DIPOS [40] is a database

tion between the genotype and phenotype of an organism. The of interacting proteins in , while PRIN [41] is based on

transcriptomic data can be obtained using various experimental the interologs of six model organisms, where large-scale protein-

platforms, such as real-time PCR, microarray, and RNA-seq. The protein interaction experiments have been applied. RiceNet [42],

Diurnal [33], GENEVESTIGATOR [34] and EMBL-EBI Gene Expres- provides an updated network prioritization server for Oryza sativa

sion Atlas [12] are popular resources that host gene expression data. ssp. japonica.

EMBL-EBI Gene Expression Atlas provides information on the base- The use of ontologies such as the Gene Ontology (GO) [43]

line expression of the gene or a set of genes in a given sample from and the Plant Ontology (PO) [44] are instrumental in providing

the RNA-seq based experiments as well as their differential expres- high quality and consistency in the annotation of genes and gene

sion data from both the RNA-seq and microarray experiments. This products for molecular function, role in biological processes and

resource also updates expression data frequently by aligning it location in the cellular components (all sub classes of GO). The

against the most recent version of the genome assembly and anno- PO-based annotations include observations or samples isolated

tation available from the Plant Ensembl and Gramene database. The for expression and phenotype associations to plant structure and

rice-specific expression data is also hosted by resources like IC4R the plant growth and developmental stage, the two subclasses

[30], RiceFREND [35], RiceXPro [36]. of PO. The Plant Trait Ontology (TO) and the ontology- entity-

MCDRP [37] resource hosts manually curated annotation of rice quality attribute-based phenotype annotations are being promoted

proteins based on published datasets. EXPath [38] database pro- for mutant phenotype and QTL trait annotations [45–48]. These

vides information on metabolic pathways for several plant species annotations are often provided by the individual species specific

based on the analysis of the microarray-based gene expression databases. The Planteome is a new database (www.planteome.org)

data. Other databases such as RiceCyc [27], and OryzaCyc (http:// [49] that provided the reference set of ontologies for integration in

www.plantcyc.org/databases/oryzacyc/) host metabolic pathway the genome, expression and phenome projects. It also provides a

network for rice. Whereas, KEGG rice (http://www.genome.jp/ collection of ontology-based annotation of about 85 plant species

P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52 45

including the various rice reference O. sativa subspecies japonica [54] tool provided by the Ensembl and Gramene, Pathway enrich-

and indica and the wild Oryza species. ment and comparison tools by Plant Reactome [28].

4. Web applications and bioinformatics analysis tools 5. Fundability, accessibility, and interoperability of

genomic data and resources

Web based tools provide an excellent platform to analyze

huge data sets and, thus enabling data-driven discoveries. For The primary aim of many plant genomic databases has been

example, CyVerse [50] and Galaxy [51,52] are two widely used to acquire, organize, represent, and help navigate and retrieve the

cyber infrastructure platforms for data storage, dissemination, and information of interest. In accordance with data collection tech-

high-throughput bioinformatics analysis. They provide access to niques, biological databases are classified into either Primary and

popular bioinformatics tools, access to computing infrastructure Secondary or both. Primary databases rely upon direct archiv-

and options to run user tools. CoGe [53] is another “platform ing of the experimental results as a data source such as NCBI

for performing Comparative Genomics research. It provides an Sequence Read Archive (SRA) [55], INSDC and its collaborator

open-ended network of interconnected tools to manage, ana- archives [4], Rice RAPdb [22]. The secondary databases are pop-

lyze, and visualize next-gen sequencing data” (Source: https:// ulated with processed and analyzed data of primary data-sets, e.g.

genomevolution.org/coge/). Besides the infrastructure platforms, NCBI RefSeq [56], EMBL-EBI’s Gene Expression Atlas [12]. However,

many of the databases listed in Tables 1 and 2 provide local data resources like Gramene [10,57], Plant Ontology [44,58], Planteome

analysis options. For example. the Variant Effect Predictor (VEP) [13], Ensembl Plants [9] and Plant Reactome [28] provide both the

Table 2

Rice specific databases. The description may be copied from the original source [20–22,12,14–17,30–32,35–37,41–42,48,68,92–113].

Data Types

ng on indels, , ncludi i

Database Name nteractions

/i

and latest Description Species URL release date RNA types all nd Others Expression d species etc.) d species il w Genome annotation Genetic/genomic variati Genetic/genomic proteins a proteins

Pathway/network

(transcript/metabolite/protein) Gene/gene products (including SNP, mutants

Annotation datab ase for rice

comparative genomics and O. brachyantha,

RGKbase (Rice evolution ary biology, which O. glaberrima,

Genome includes ge nome sequence O. sativa ssp.

Knowled gebase) assemblies, transcriptomic and japonica cv. http://rgkbase.big.ac.cn/RGKbase Y Y Y Y Y Y

Jun 28, 2012 epigenomic data, genetic Nipponbare,

variations, quan titative trait loci ssp. Indica 93-

(QTLs) and the relevant literature 11, and PA64s

[92]. MCDR P Gene expression database for

(Manually O. sativa ssp.

rice proteins including metabolic

Curated Da tabase indica and http://www.genomeindia.org/biocuration Y Y Y Y Y

pathways and protein interaction

of Rice Proteins) japonica from publishe d articles [37].

Nov 30, 2016

Functional genomi cs database

linking annotation, sub-cellular

Rice DB O. sativa and

location, function , expression, http://ricedb.plantenergy.uwa.edu.au Y Y Y Y

Oct 23, 201 3 Arabidopsis

regulati on, and evolutionary

information [93].

Represents transcriptiona l

activity on the rice genome at the

nucleoti de level based on the

RNA-seq data un der 140

TENOR environmen tal stresses and plant O. sativa ssp. (Transcriptome

hormone treated conditions. japonica cv. http://tenor.dna.affrc.go.jp Y Y Y Y

ENcyclopedia Of

Expression profil es, information Nipponbare

Rice)

of cis-regu latory elements in

promoter regions and co-

expre ssed transcript are provided

for each transc ript [94].

Sub-pla tform of PlantExpress for

a single-species gene expre ssion

OryzaExpress analysis in O. sativa . Consists of

O. sativa http://plantomics.mind.meiji.ac.jp/OryzaExpress Y Y Y

Apr 22, 201 4 gene expression networks and

omics an notations derived from

microarr ay data [95].

Accommodates rice reference

Rice Expression

genome with standa rdized and

Datab ase Oryza spp http://expression.ic4r.org Y Y Y

acc urate gene annotations

Aug 2016 derived from RNA-seq data [30 ].

46 P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52

Table 2 (Continued)

cDNA resource with

comprehensive information for

functiona l analysis of indica ssp.

O. sativa ssp.

and comparative genomics

RICD (Rice Indica indica Guang-

including sequ ences, protein http://www.ncgr.ac.cn/ricd Y Y Y Y

cDNA Database) lu-ai 4 and

domain ann otations, similarity Minghui 63 search results, SNPs and InDels information, and hyperlinks to gene annotation [96].

A database for rice reverse

geneti cs, build with FSTs O. sativa ssp. (flanking sequence tags) of indica and

OryGenesDB various mutagens and functional japonica, and 2 http://orygenesdb.cirad.fr/index.html Y Y Y

genomics da ta, collected from other plant

both international insertion species

co llections and the literature [68].

An integrative

genomic database for rice with an emphasis on combining

datasets in a way that maintains

Ricebase key li nks between the past and

O. sativa http://ricebase.org Y Y Y

Aug 10, 2016 current gen etic studies. Includes DNA sequence data, gene annotations, nucleotide variation

data and molecular marker

fragment size data [97]. Displays hormone-metabolome (hormonome) and transcriptome

data in a single formatted

UniVIO (uniformed) heat map.

(Uniformed Hormonome and transc riptome O. sativa ssp.

Vi ewer for data obtained from 14 organ japonica cv. http://univio.psc.riken.jp Y Y Y Y

Integrated Omics) parts of rice plants at the Nipponbare

Oct 22, 201 2 reproductive stage and seedling

sho ots of three gibberellin

signa ling mutants are included in

the database .[98].

Rice proteome databa se based

on sho tgun proteogenomics,

OryzaPG-DB con tains proteome of rice

(Oryza undi fferentiated cultured cells,

ProteoGenomic corresponding cDNA, transcript O. sativa http://oryzapg.iab.keio.ac.jp Y Y Y

Datab ase) and genome sequences, novel

Jan 30 , 2012 proteogenomics features and tupdated gene models annotation [99].

Q-TARO (QTL

Displays the co-localization of O. sativa ssp.

Annotation Rice

QTLs and distributi on of QTL indica and http://qtaro.abr.affrc.go.jp Y Y Y

Online Da tabase )

clusters on rice genome [100]. japonica

March 31, 2012

A resource for pu blicly available

MosDB

seq uences of the rice (Oryza http://pgsb.helmholtz- Updated on daily O. sativa Y Y

sativa L.) genome. Sub-platform muenchen.de/plant/rice/index.jsp

basis

of Plantdb [101].

Displays sequence and

annotation data for the rice

MSU Rice

genome . Includes genome

Genome O. sativa ssp.

brow ser, motifs/domains within

Annotation japonica cv. h ttp://rice.plantbiology.msu.edu Y Y the predicted gen es, a rice

Project Nipponbare

repeat database, identifi ed

Feb 7, 2012

related sequences in other plant

spe cies [20, 21].

P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52 47

Table 2 (Continued)

RAP-DB (Rice Genomics database provides O. sativa ssp. Annotation

gene annotations for the genome japonica cv. http://rapdb.dna.affrc.go.jp Y Y

Project DataBase)

seq uence of rice [14, 22]. Nipponbare Aug 5, 2016 Provides access to the most updated version of the Oryza genome evolution (OGE) project and the International Oryza Map Several species OGE Gramene alignment (I_OMAP) project. http://oge.gramene.org Y Y of Oryza Includes, genome assembly, annotation, synteny, gene trees, SNPs and interspecific genome alignments. A wiki-based, publicly editable and open-content platform for RiceWiki Oryza http://wiki.ic4r.org/index.php/Main_Page Y Y community curation of rice genes

[102].

A coll ection of databases for six large gene famili es in rice,

including those for

glycosyltransferases, glyc oside

hydrolases, kinases, transcription factors, transporters , and

Rice

cytochrome P450

Phylogenomic O. sativa http://ricephylogenomics.ucdavis.edu/index.shtml Y Y Y

monooxy genases viz. Rice Database

Kinase Database, Rice GT Jun 2015 database, Rice GH database,

Rice TF database, Rice

transporter databa se, Rice

Cytochrome P450 database [103].

DIPOS (database

Provides comprehensive

of interacting

information of interacting proteins O. sativa http://www.riceresearch.info Y Y

proteins in Oryza

in rice [40].

sativa)

Metab olic network of rice.

Provides metabolic pa thways,

reactions, metabolites and

RiceCyc O. sativa ssp h ttp://pathway.gramene.org/RICE/organism-

ass ociated gene entities. The Y

Dec 2016 japonica su mmary?

analysis too ls provide pathway

comparison , and gene

expre ssion analysis.

Provides metabolic an d

h ttp://www.genome.jp/kegg-

regulatory pa thways, enzymes,

KEGG rice bin/show_ organism?menu_type=pathway_maps& Y

reactions, metabolites and

org=dosa

ass ociated gene entities.

Experimentall y determined gene- http://www.ebi.ac.uk/ebisearch/search.ebi?db=inta

IntAct rice O. sativa Y gene interaction da ta ct-experiments&t=Oryza+sativa

An updated network prioritization

server for Oryza sativa ssp.

Jap onica. Gene prioritization

RiceNe t O. sativa ssp.

allow s users to predict new http://www.inetbio.org/ricenet Y Y

Dec 29,201 4 japonica

can didate genes for a phenotype

or biological pathways by prioritizi ng rice genes [42].

48 P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52

Table 2 (Continued)

Rice knowledgeb ase- expression profil es derived from RNA-seq IC4R (Information

data, genomic variations, plant

Commons for

homologs, post-translational Oryza http://ic4r.org Y Y

Rice)

modifications, literature as well

May 5 , 2015

as community-con tributed annotations [30]. A rice genome automated Rice GAAS (Rice annotation system, which Genome integrates programs for Automated O. sativa http://RiceGAAS.dna.affrc.go.jp Y Y prediction and analysis of Annotation protein-coding gene structure System) [104].

An open public repository of EMBL-EBI Gene gene expre ssion pattern data Expression Atlas under different biological O. sativa https://www.ebi.ac.uk/gxa/home Y Y Dec 2, 2016 conditions using both microarray and RNA-seq data [12]. Gene coexpression database O. sativa ssp. RiceFREND derived from Microarray data japonica cv. http://ricefrend.dna.affrc.go.jp Y Y Sep 13, 2012 [35]. Nipponbare

Repository of gene expression

profil es derived from microarray

analysis of tissues/organs RiceXPro (Rice encompassing the entire growth

O. sativa ssp.

EXpression of the rice plant under natural

japonica http://ricexpro.dna.affrc.go.jp Y Y

Profil e Database) field conditions, rice seedlings

2013 trea ted with various

phytohormones, and specific cell

type s/tissues isolated by laser

microdissection (LMD) [36 ].

ROAD (Rice A public resource for gene

O. sativa ssp.

O ligonucleotide expression and coexpression indica and http://www.ricearray.org Y Y

Array Database) analysis in rice derived from

japonica

Mar 28, 2012 microarr ay data [105].

A databa se of Arabidopsis

mutant lines overexpressing rice

full-length cDN A that contains a

RiceFOX O. sativa http://ricefox.psc.riken.jp Y Y Y

wide range of trai t information to

facilitate an alysis of gene

function [106 ].

Protein-protein interaction data of

PRIN are ba sed on the interologs

PRIN (predicted

of six model organisms where

rice interactome O. sativa http://bis.zju.edu.cn/prin Y Y

large-scale protein-protein netwo rk)

interaction experiments have

been applied [41].

Provides Genotype , Phenotype,

and Variety Information for rice

and SNP geno typing data. It

allow s quick retrieving of SNP

Rice SNP-Seek all eles for all varieties in a given O. sativa ssp.

Datab ase genome region, finding different japonica cv. http://www.oryzasnp.org/iric-portal Y Y

Feb 16, 2015 alleles from predefined varieties Nipponbare

and querying ba sic passport and

morph ological phenotypic

information abou t sequenced rice

line s [16, 17].

Rice Variation An atlas of re-sequen cing-based

Oryza varieties http://variation.ic4r.org Y Y

Datab ase rice genomic variations [30].

P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52 49

Table 2 (Continued)

Database of rice genomic

variation. Contains SNPs and RiceVarMap INDELs identified from

O. sativa http:/ricevarmap.ncpgr.cn Y Y

Oct 7, 2015 seq uencing data of two sets of

rice ge rmplasms of cultivated

spe cies [31]. O. sativa An archive for collecting, ssp. Japonica RMD (Rice managing and searching cv. Zhonghua

Mutant Database) information of the T-DNA http://rmd.ncpgr.cn Y Y 11 , Zhong hua

Mar 1, 2012 insertion mutants generated by

15 and

an enhan cer trap system [107].

Nipponbare

Represents transcription factors

with comprehe nsive expression,

cis-regulatory element and

RiceSRTFDB mutant information derived from O. sativa http://www.nipgr.res.in/RiceSRTFDB.html Y Y

Microarr ay data of a curated set of 456 Affymetrix GeneChip Rice

Genome arrays [108]. A coll ection of abiotic stress

responsive quantitative trait loci

Q licRice (QTLs) in rice and their O. sativa http://nabg.iasri.res.in:8080/qlic-rice Y Y corresponding sequenced gene loci [109].

An integrated information

resource and comparative analysis wo rkbench for rice

BGI-RIS V2

genomes including detail ed

(Beiji ng Genomics

annotation data , including O. sativa ssp.

Institute-Rice

genetic markers, Bacterial indica and h ttp://rice.genomics.org.cn/rice/index2.jsp Y

Information

Artificial Ch romosome (BAC) end japonica

System)

seq uences, gene contents,

Oct 28, 200 8

cDNAs, oligos, tiling arrays,

repetitive elements , and genomic

polymorphisms [15]. O. rufipogon , O.

longistaminata,

Subplatf orm of Oryzabase. O. sativa ssp.

Provides geno me sequence japonica cv.

OryzaGeno me information for 21 wild Oryza Nipponbare,

h ttp://viewer.shigen.info/oryzagenome Y

Jun 19 , 2015 species together with several Nongken-58,

cultivated strain reference and Aus-type

seq uences [110]. Kasalath and indica,

Guangluai-4 Contains informati on about rice

Oryza sp.

development and anatomy, rice

Oryzabase (different h ttp://www.shigen.nig.ac.jp/rice/oryzabase/top/top.j

mutan ts, and genetic resources, Y Y

2016 species of sp

especia lly for wild varieties of rice

oryza) [48 , 111]. aus, indica,

tropical

HapRice SNP haplotype database [32]. japonica and http://qtaro.abr.affrc.go.jp/index.html Y temperate japonica

50 P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52

Table 2 (Continued)

Genus Oryza

RiTE Da tabase A genus-wide collection of

and the closely-

(Rice TE transposable elements and h ttp://www.genome.arizona.edu/cgi-

related out- Y

database) Aug repeated sequences across 11 bin/rite/index.cgi

group Leersia

31, 201 5 diploid species [112].

perrieri.

A research organ ization involved

in serving, und erstanding,

sha ring, and using rice genetic

diversity, breeding and deli vering

IRRI (International

new varieties, developing and

Rice research Oryza http://irri.org Y

sha ring improved crop and

institute)

environmen tal management

practices, and facili tating the

large-scale adoption of

technologies.

The (r ice) Provides access restricted

h ttp://fiehnlab.ucdavis.edu/projects/rice_metabolo

metabolomics access to metabolomics data for Oryza sativa Y me

data rice [113].

collected data from external sources, but add their own quality data exchange, which makes it difficult to integrate them in one

annotation in addition to serving as community access to primary place. In an ideal situation, there should be a single platform for

data for which there are no known archives. all the databases on a single domain of interest, where a user can

Almost all the resources provide some form of biocuration of search all the respective databases with a single query using APIs

data and metadata required for integration and proper interpre- and ontologies and compare the results. AraPort [62] and Gramene

tation of the experiments from which data sets originate. The [10]. Are such examples. Some databases are already integrating

collection methods, annotation and statistical significance of this the links to other databases of similar data types to increase the

primary data affect the credibility and reliability of a resource. The credibility of their data, which is the first step in providing a unified

source of data is simply a physical access to the data and data platform. It maximizes the use of data available in current resources

location or a live data accessed via an application programming and may help in avoiding redundancy. It provides better visibility to

interface (APIs) and semantic web. small databases and can collectively provide a bigger picture since

Depending on the data type, such as genome assembly, gene small databases generally focus on one specific aspect and present

calls, functional annotation, gene expression, QTL and phenotypes, detailed information.

genetic markers, genetic diversity data in the form of SNPs, SSRs,

indels, genotyping and germplasm collections, biocuration and bio- Acknowledgements

logical databases greatly differ in their data curation practices.

The processes may include, fully automated, semi-automated, only

PJ greatly appreciates the funding provided by the NSF funded

manual curation or a combination of all. Therefore, accessing data

Gramene project (NSF IOS 1127112) and the Planteome project

and comparing the datasets across two or more resources poses

(NSF IOS 1340112) which supported this work and PG. PG and PJ

a major challenge. The accessibility of the data sets is further

wrote the manuscript. Funding agencies had no role in the study

compromised due to the restrictions on the data download. Due

design, data analysis, or preparation of the manuscript.

to emerging need for data integration, its reuse and re-analysis,

semantic concepts based on ontologies and application program-

References

ming interface (APIs) are being employed. Thus in a new trend,

majority of the biological databases are approaching adoption of

[1] D.J. Rigden, X.M. Fernandez-Suarez, M.Y. Galperin, The 2016 database issue

Findable, Accessible, Interoperable and Re-usable (FAIR) data prin- of Nucleic Acids Research and an updated molecular biology database

ciples [59–61]. collection, Nucleic Acids Res. 44 (D1) (2016) D1–D6.

[2] D.A. Benson, et al., GenBank, Nucleic Acids Res. 41 (Database issue) (2013)

D36–D42.

[3] B.L. Aken, et al., The Ensembl gene annotation system, Database (Oxford)

6. Conclusion

2016 (2016).

[4] G. Cochrane, et al., The international nucleotide sequence database

collaboration, Nucleic Acids Res. 44 (D1) (2016) D48–D50.

In this article, we have attempted to catalog different web data

[5] J. Mashima, et al., DNA data bank of Japan (DDBJ) progress report, Nucleic

resources available for rice. Some of them are well known and

Acids Res. 44 (D1) (2016) D51–D57.

widely used while others are new and small-scale repositories. [6] H.M. Berman, et al., The protein data bank, Nucleic Acids Res. 28 (1) (2000)

235–242.

With the increasing number of repositories, it is evident that there

[7] E. Boutet, et al., UniProtKB/Swiss-Prot, Methods Mol. Biol. 406 (2007)

is an enormous amount of data available on the web, associated 89–112.

with almost every aspect of rice research. In spite of having such a [8] Q. Dong, S.D. Schlueter, V. Brendel, PlantGDB, plant genome database and

analysis tools, Nucleic Acids Res. 32 (Database issue) (2004) D354–D359.

huge amount of diverse data, it has not been efficiently explored,

[9] D. Bolser, et al., Ensembl plants: integrating tools for visualizing, mining, and

as many researchers or prospect users in the biology are unfamil-

analyzing plant genomics data, Methods Mol. Biol. 1374 (2016) 115–140.

iar with all the possible resources to search and analyze the data. [10] M.K. Tello-Ruiz, et al., Gramene 2016: comparative plant genomics and

pathway resources, Nucleic Acids Res. 44 (D1) (2016) D1133–D1140.

Often different databases have different formats and protocols for

P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52 51

[11] S. Dash, et al., PLEXdb: gene expression resources for plants and plant [47] A.E. Thessen, et al., Emerging semantics to link phenotype and environment,

pathogens, Nucleic Acids Res. 40 (Database issue) (2012) D1194–D1201. Peer J. 3 (2015) e1470.

[12] R. Petryszak, et al., Expression Atlas update—an integrated database of gene [48] Y. Yamazaki, P. Jaiswal, Biological ontologies in rice databases. An

and protein expression in humans, animals and plants, Nucleic Acids Res. 44 introduction to the activities in Gramene and Oryzabase, Plant Cell Physiol.

(D1) (2016) D746–D752. 46 (1) (2005) 63–68.

[13] L. Cooper, et al., The Planteome Project. in International Conference on [49] L. Cooper, et al., The planteome project, in: International Conference on

Biomedical Ontology and BioCreative (ICBO 2016), Oregon State University, Biomedical Ontology and BioCreative (ICBO BioCreative 2016), Corvallis, OR,

Corvallis, OR, USA, 2016, ICBO http://icbo.cgrb.oregonstate.edu/node/305. USA, 2016, CEUR-ws.org Volume 1747 http://ceur-ws.org/Vol-1747/IT406-

[14] H. Ohyanagi, et al., The rice annotation project database (RAP-DB): hub for IP35 ICBO2016.pdf.

Oryza sativa ssp. japonica genome information, Nucleic Acids Res. 34 [50] U.K. Devisetty, et al., Bringing your tools to CyVerse discovery environment

(Database issue) (2006) D741–D744. using docker, F1000Res 5 (2016) 1442.

[15] X. He, J. Wang, Bgi-Ris V2, Methods Mol. Biol. 406 (2007) 275–299. [51] E. Afgan, et al., The Galaxy platform for accessible, reproducible and

[16] N. Alexandrov, et al., SNP-Seek database of SNPs derived from 3000 rice collaborative biomedical analyses: 2016 update, Nucleic Acids Res. 44 (W1)

genomes, Nucleic Acids Res. 43 (Database issue) (2015) D1023–D1027. (2016) W3–W10.

[17] L. Mansueto, et al., SNP-Seek II: a resource for allele mining and analysis of [52] C. Bornich, et al., Galaxy Portal: interacting with the galaxy platform

big genomic data in Oryza sativa, Curr. Plant Biol. 7–8 (2016), http://dx.doi. through mobile devices, Bioinformatics 32 (11) (2016) 1743–1745.

org/10.1016/j.cpb.2016.12.003. [53] E. Lyons, et al., Finding and comparing syntenic regions among Arabidopsis

[18] J.Y. Li, J. Wang, R.S. Zeigler, The 3,000 rice genomes project: new and the outgroups papaya, poplar, and grape: CoGe with rosids, Plant

opportunities and challenges for future rice research, Gigascience 3 (2014) 8. Physiol. 148 (4) (2008) 1772–1781.

[19] project r. g, The 3,000 rice genomes project, Gigascience 3 (2014) 7. [54] W. McLaren, et al., The ensembl variant effect predictor, Genome Biol. 17 (1)

[20] S. Ouyang, et al., The TIGR Rice Genome Annotation Resource: (2016) 122.

improvements and new features, Nucleic Acids Res. 35 (Database issue) [55] Y. Kodama, M. Shumway, R. Leinonen, The Sequence Read Archive:

(2007) D883–D887. explosive growth of sequencing data, Nucleic Acids Res. 40 (Database issue)

[21] Q. Yuan, et al., The institute for genomic research Osa1 rice genome (2012) D54–D56.

annotation database, Plant Physiol. 138 (1) (2005) 18–26. [56] K.D. Pruitt, et al., RefSeq: an update on mammalian reference sequences,

[22] H. Sakai, et al., Rice Annotation Project Database (RAP-DB): an integrative Nucleic Acids Res. 42 (Database issue) (2014) D756–D763.

and interactive database for rice genomics, Plant Cell Physiol. 54 (2) (2013) [57] M.K. Tello-Ruiz, et al., Gramene: a resource for comparative analysis of

e6. plants genomes and pathways, Methods Mol. Biol. 1374 (2016) 141–163.

[23] M. Wang, et al., The genome sequence of African rice (Oryza glaberrima) and [58] L. Cooper, et al., The plant ontology as a tool for comparative plant anatomy

evidence for independent domestication, Nat. Genet. 46 (9) (2014) 982–988. and genomic analyses, Plant Cell Physiol. 54 (2) (2013) e1.

[24] J. Chen, et al., Whole-genome sequencing of Oryza brachyantha reveals [59] A. Rodriguez-Iglesias, et al., Publishing FAIR data: an exemplar methodology

mechanisms underlying Oryza genome evolution, Nat. Commun. 4 (2013) utilizing PHI-base, Front. Plant Sci. 7 (2016) 641.

1595. [60] FAIR principles for data stewardship, Nat. Genet. 48 (4) (2016) 343.

[25] J. Jacquemin, et al., The International Oryza Map Alignment Project: [61] M.D. Wilkinson, et al., The FAIR Guiding Principles for scientific data

development of a genus-wide comparative genomics platform to help solve management and stewardship, Sci Data 3 (2016) 160018.

the 9 billion-people question, Curr. Opin. Plant Biol. 16 (2) (2013) 147–156. [62] V. Krishnakumar, et al., Araport: the Arabidopsis information portal, Nucleic

[26] Y. Zhang, et al., Genome and comparative transcriptomics of African Wild Acids Res. 43 (Database issue) (2015) D1003–D1009.

Rice Oryza longistaminata provide insights into molecular mechanism of [63] W.C. Yim, et al., PLANEX: the plant co-expression database, BMC Plant Biol.

rhizomatousness and self-incompatibility, Mol. Plant 8 (11) (2015) 13 (2013) 83.

1683–1686. [64] Y. Aoki, et al., ATTED-II in 2016: a plant coexpression database towards

[27] P. Dharmawardhana, et al., A genome scale metabolic network for rice and lineage-Specific coexpression, Plant Cell Physiol. 57 (1) (2016) e5.

accompanying analysis of tryptophan, auxin and serotonin biosynthesis [65] H. Ohyanagi, et al., Plant Omics Data Center: an integrated web repository

regulation under biotic stress, Rice (N Y) 6 (1) (2013) 15. for interspecies gene expression networks with NLP-based curation, Plant

[28] S. Naithani, et al., Plant Reactome: a resource for plant pathways and Cell Physiol. 56 (1) (2015) e9.

comparative analysis, Nucleic Acids Res. (2016). [66] Z. Zhang, et al., PMRD: plant microRNA database, Nucleic Acids Res. 38

[29] Y. Meng, et al., PmiRKB: a plant microRNA knowledge base, Nucleic Acids (Database issue) (2010) D806–D813.

Res. 39 (Database issue) (2011) D181–D187. [67] M. Takeya, et al., NIASGBdb: NIAS Genebank databases for genetic resources

[30] L. Hao, et al., Information commons for rice (IC4R), Nucleic Acids Res. 44 and plant disease information, Nucleic Acids Res. 39 (Database issue) (2011)

(D1) (2016) D1172–D1180. D1108–D1113.

[31] H. Zhao, et al., RiceVarMap: a comprehensive database of rice genomic [68] G. Droc, et al., OryGenesDB: a database for rice reverse genetics, Nucleic

variations, Nucleic Acids Res. 43 (Database issue) (2015) D1018–D1022. Acids Res. 34 (Database issue) (2006) D736–D740.

[32] J. Yonemaru, K. Ebana, M. Yano, HapRice, an SNP haplotype database and a [69] C. Ruprecht, et al., FamNet: a framework to identify multiplied modules

web tool for rice, Plant Cell Physiol. 55 (1) (2014) e9. driving pathway expansion in plants, Plant Physiol. 170 (3) (2016)

[33] T.C. Mockler, et al., The DIURNAL project: DIURNAL and circadian expression 1878–1894.

profiling, model-based pattern matching, and promoter analysis, Cold [70] D.M. Goodstein, et al., Phytozome: a comparative platform for green plant

Spring Harb. Symp. Quant. Biol. 72 (2007) 353–363. genomics, Nucleic Acids Res. 40 (Database issue) (2012) D1178–D1186.

[34] T. Hruz, et al., Genevestigator v3: a reference expression database for the [71] M. Tomcal, N. Stiffler, A. Barkan, POGs2: a web portal to facilitate

meta-analysis of transcriptomes, Adv. Bioinf. 2008 (2008) 420747. cross-species inferences about protein architecture and function in plants,

[35] Y. Sato, et al., RiceFREND: a platform for retrieving coexpressed gene PLoS One 8 (12) (2013) e82569.

networks in rice, Nucleic Acids Res. 41 (Database issue) (2013) [72] M.G. Conte, et al., GreenPhylDB: a database for plant comparative genomics,

D1214–D1221. Nucleic Acids Res. 36 (Database issue) (2008) D991–D998.

[36] Y. Sato, et al., RiceXPro version 3. 0: expanding the informatics resource for [73] A. Hieno, et al., ppdb: plant promoter database version 3.0, Nucleic Acids

rice transcriptome, Nucleic Acids Res. 41 (Database issue) (2013) Res. 42 (Database issue) (2014) D1188–D1192.

D1206–D1213. [74] C. Johnson, et al., CSRDB: a small RNA integrated database and browser

[37] P. Gour, et al., Manually curated database of rice proteins, Nucleic Acids Res. resource for cereals, Nucleic Acids Res. 35 (Database issue) (2007)

42 (Database issue) (2014) D1214–D1221. D829–D833.

[38] C.H. Chien, et al., EXPath: a database of comparative expression analysis [75] V. Cognat, et al., PlantRNA, a database for tRNAs of photosynthetic

inferring metabolic pathways for plants, BMC Genomics 16 (Suppl. 2) (2015) eukaryotes, Nucleic Acids Res. 41 (Database issue) (2013) D273–D279.

S6. [76] C. Kanz, et al., The EMBL nucleotide sequence database, Nucleic Acids Res.

[39] Y. Ogata, et al., CoP: a database for characterizing co-expressed gene 33 (Database issue) (2005) D29–33.

modules with biological information in plants, Bioinformatics 26 (9) (2010) [77] S. Alter, et al., DroughtDB: an expert-curated compilation of plant drought

1267–1268. stress genes and their homologs in nine species, Database (Oxford) 2015

[40] A. Sapkota, et al., DIPOS: database of interacting proteins in Oryza sativa, (2015) bav046.

Mol. Biosyst. 7 (9) (2011) 2615–2621. [78] M. Naika, et al., STIFDB2: an updated version of plant stress-responsive

[41] H. Gu, et al., PRIN: a predicted rice interactome network, BMC Bioinf. 12 transcription factor database with additional stress signals,

(2011) 161. stress-responsive transcription factor binding sites and stress-responsive

[42] T. Lee, et al., RiceNet v2: an improved network prioritization server for rice genes in Arabidopsis and rice, Plant Cell Physiol. 54 (2) (2013) e8.

genes, Nucleic Acids Res. 43 (W1) (2015) W122–W127. [79] A. Yilmaz, et al., GRASSIUS: a platform for comparative regulatory genomics

[43] The Gene Ontology C, Expansion of the Gene Ontology knowledgebase and across the grasses, Plant Physiol. 149 (1) (2009) 171–180.

resources, Nucleic Acids Res. (2016). [80] Y. Zhang, et al., IsomiR Bank: a research resource for tracking IsomiRs,

[44] L. Cooper, P. Jaiswal, The plant ontology: a tool for plant genomics, Methods Bioinformatics 32 (13) (2016) 2069–2071.

Mol. Biol. 1374 (2016) 89–114. [81] M. Nakano, et al., Plant MPSS databases: signature-based transcriptional

[45] A. Oellrich, et al., An ontology approach to comparative phenomics in plants, resources for analyses of mRNA and small RNA, Nucleic Acids Res. 34

Plant Methods 11 (2015) 10. (Database issue) (2006) D731–D735.

[46] A.R. Deans, et al., Finding our way through phenotypes, PLoS Biol. 13 (1)

(2015) e1002033.

52 P. Garg, P. Jaiswal / Current Plant Biology 7–8 (2016) 39–52

[82] T.H. Lee, et al., RiceArrayNet: a database for correlating gene expression [97] J.D. Edwards, A.M. Baldo, L.A. Mueller, Ricebase: a breeding and genetics

from transcriptome profiling, and its application to the analysis of platform for rice, integrating individual molecular markers, pedigrees and

coexpressed genes in rice, Plant Physiol. 151 (1) (2009) 16–33. whole-genome-based data, Database (Oxford) 2016 (2016).

[83] R.S. Austin, et al., New BAR tools for mining expression data and exploring [98] T. Kudo, et al., UniVIO: a multiple omics database with hormonome and

Cis-elements in Arabidopsis thaliana, Plant J. (2016). transcriptome data from rice, Plant Cell Physiol. 54 (2) (2013) e9.

[84] J. Waese, N.J. Provart, The Bio-Analytic Resource: data visualization and [99] M. Helmy, M. Tomita, Y. Ishihama, OryzaPG-DB: rice proteome database

analytic tools for multiple levels of plant biology, Curr. Plant Biol. 7–8 based on shotgun proteogenomics, BMC Plant Biol. 11 (2011) 63.

(2016), http://dx.doi.org/10.1016/j.cpb.2016.12.001. [100] J. Yonemaru, et al., Q-Taro: Qtl annotation rice online database, Rice 3 (2–3)

[85] D.M. Riano-Pachon, et al., PlnTFDB: an integrative plant transcription factor (2010) 194–203.

database, BMC Bioinf. 8 (2007) 42. [101] W.M. Karlowski, et al., MOsDB: an integrated information resource for rice

[86] T. Zhang, A.P. Marand, J. Jiang, PlantDHS: a database for DNase I genomics, Nucleic Acids Res. 31 (1) (2003) 190–192.

hypersensitive sites in plants, Nucleic Acids Res. 44 (D1) (2016) [102] Z. Zhang, et al., RiceWiki: a wiki-based database for community curation of

D1148–D1153. rice genes, Nucleic Acids Res. 42 (Database issue) (2014) D1222–D1228.

[87] A. Kurotani, et al., Plant-PrAS: a database of physicochemical and structural [103] K.H. Jung, et al., Phylogenomics databases for facilitating functional

properties and novel functional regions in plant proteomes, Plant Cell genomics in rice, Rice (N Y) 8 (1) (2015) 60.

Physiol. 56 (1) (2015) e11. [104] K. Sakata, et al., RiceGAAS: an automated annotation system and database

[88] M.W. Murcha, et al., MPIC: a mitochondrial protein import components for rice genome sequence, Nucleic Acids Res. 30 (1) (2002) 98–102.

database for plant and non-plant species, Plant Cell Physiol. 56 (1) (2015) [105] P. Cao, et al., The Rice Oligonucleotide Array Database: an atlas of rice gene

e10. expression, Rice (N Y) 5 (1) (2012) 17.

[89] J. Duvick, et al., PlantGDB: a resource for comparative plant genomics, [106] T. Sakurai, et al., RiceFOX: a database of Arabidopsis mutant lines

Nucleic Acids Res. 36 (Database issue) (2008) D959–D965. overexpressing rice full-length cDNA that contains a wide range of trait

[90] X. Wu, Y. Zhang, Q.Q. Li, PlantAPA: a portal for visualization and analysis of information to facilitate analysis of gene function, Plant Cell Physiol. 52 (2)

alternative polyadenylation in plants, Front. Plant Sci. 7 (2016) 889. (2011) 265–273.

[91] M. Lohse, et al., Mercator: a fast and simple web server for genome scale [107] J. Zhang, et al., RMD: a rice mutant database for functional analysis of the

functional annotation of plant sequence data, Plant Cell Environ. 37 (5) rice genome, Nucleic Acids Res. 34 (Database issue) (2006) D745–D748.

(2014) 1250–1258. [108] P. Priya, M. Jain, RiceSRTFDB: a database of rice transcription factors

[92] D. Wang, et al., The Rice Genome Knowledgebase (RGKbase): an annotation containing comprehensive expression, cis-regulatory element and mutant

database for rice comparative genomics and evolutionary biology, Nucleic information to facilitate gene function analysis, Database (Oxford) 2013

Acids Res. 41 (Database issue) (2013) D1199–D1205. (2013) bat027.

[93] R. Narsai, et al., Rice DB: an Oryza Information Portal linking annotation, [109] S. Smita, et al., QlicRice: a web interface for abiotic stress responsive QTL

subcellular location, function, expression, regulation, and evolutionary and loci interaction channels in rice, Database (Oxford) 2011 (2011) bar037.

information for rice and Arabidopsis, Plant J. 76 (6) (2013) 1057–1073. [110] H. Ohyanagi, et al., OryzaGenome: genome diversity database of wild Oryza

[94] Y. Kawahara, et al., TENOR: database for comprehensive mRNA-Seq species, Plant Cell Physiol. 57 (1) (2016) e1.

experiments in rice, Plant Cell Physiol. 57 (1) (2016) e7. [111] N. Kurata, Y. Yamazaki, Oryzabase. An integrated biological and genome

[95] K. Hamada, et al., OryzaExpress: an integrated database of gene expression information database for rice, Plant Physiol. 140 (1) (2006) 12–17.

networks and omics annotations in rice, Plant Cell Physiol. 52 (2) (2011) [112] D. Copetti, et al., RiTE database: a resource database for genus-wide rice

220–229. genomics and evolutionary biology, BMC Genomics 16 (2015) 538.

[96] T. Lu, et al., RICD: a rice indica cDNA database resource for rice functional [113] T. Kind, M. Scholz, O. Fiehn, How large is the metabolome? A critical analysis

genomics, BMC Plant Biol. 8 (2008) 118. of data exchange practices in chemistry, PLoS One 4 (5) (2009) e5440.