Published online 8 November 2018 Nucleic Acids Research, 2019, Vol. 47, Database issue D853–D858 doi: 10.1093/nar/gky1095 The UCSC Genome Browser database: 2019 update Maximilian Haeussler1,*, Ann S. Zweig1, Cath Tyner1, Matthew L. Speir1,Kate R. Rosenbloom1, Brian J. Raney1, Christopher M. Lee1, Brian T. Lee 1,AngieS.Hinrichs1, Jairo Navarro Gonzalez1, David Gibson1, Mark Diekhans1, Hiram Clawson1, Jonathan Casper1, Galt P. Barber1, David Haussler1,2, Robert M. Kuhn1 and W. James Kent1
1Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA and 2Howard Hughes Medical Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
Received September 14, 2018; Revised October 17, 2018; Editorial Decision October 18, 2018; Accepted October 19, 2018
ABSTRACT description and configuration options. A track can beset to one of the four display modes, from hidden, to a single The UCSC Genome Browser (https://genome.ucsc. line (‘dense’), thin lines (‘squish’), normal lines (‘pack’), up edu) is a graphical viewer for exploring genome an- to ‘full’ with one annotation per line (see our new video notations. For almost two decades, the Browser has https://youtu.be/jKix2B3hwnw). The most-used tracks on provided visualization tools for genetics and molec- the human genome are the various gene models, followed ular biology and continues to add new data and fea- by variants and ENCODE annotations. Documentation for tures. This year, we added a new tool that lets users each track is available by right-clicking it, clicking on its interactively arrange existing graphing tracks into name below the image or the vertical gray bar next to the new groups. Other software additions include new track. In addition, files in our public ‘makeDb/doc’ direc- formats for chromosome interactions, a ChIP-Seq tory contain the UNIX commands to build any track (see peak display for track hubs and improved support http://bit.ly/makeDocs). At the top of the documentation page, tracks can be configured. Depending on the track for HGVS. On the annotation side, we have added type, one can change the size, show only certain features, gnomAD, TCGA expression, RefSeq Functional ele- activate coding sequence codon display, etc. To share a com- ments, GTEx eQTLs, CRISPR Guides, SNPpedia and plete track selection and configuration, the user can create created a 30-way primate alignment on the human a stable link using ‘My Data > My Sessions’ and share with genome. Nine assemblies now have RefSeq-mapped collaborators or further create a ‘Public Session’ hosted on gene models. our servers. In addition to the main Genome Browser, the website INTRODUCTION contains related utilities in the ‘Tools’ menu. These util- ities may find sequences in the genome (BLAT, in-silico The University of California at Santa Cruz (UCSC) PCR), add annotations to uploaded variants (Variant An- Genome Browser (1) is a viewer for genome annota- notation Integrator), let users inspect, join and download tions, primarily those from human and mouse genomes. our annotations as text files in various table formats (Ta- The UCSC Genome Browser team has continually added ble Browser, Data Integrator) or map annotations between data and software features to the website since 2001 and genomes (LiftOver and also ‘View > In Other Genomes’). currently hosts 195 assemblies and 105 species (Menu: Other tools show data not directly located on chromo- ‘Genomes > Other Genomes’). We continue to add new somes, like VisiGene for in-situ images and our viewer for species and assemblies, but most of our user base focuses gene/protein interaction networks (‘Tools > Gene Interac- on human and mouse. As a result, most of our annotation tions’). is concentrated on these two species. All genome annotations can also be downloaded as Apart from showing the DNA sequence itself, the text files (‘Downloads > Genome Data’) or retrieved by Genome Browser is primarily used to display genome an- MySQL (‘Downloads > MySQL Access’). Under ‘Down- notations. Most of these are derived in some way from loads > Utilities’, we provide several hundred tools that al- aligning sequences to the genome, created by us or other low users to convert and analyze our genomics data files groups, and they are stored and shown as small rectan- on a UNIX-like operating system. Finally, we provide all gles with defined chromosome:start-end coordinates (i.e. Genome Browser source code along with scripts that help chr10:123 400–123 499) or graphs of annotation density. install these or a Genome Browser with selected assemblies Annotations are organized into tracks, each with a name,
*To whom correspondence should be addressed. Tel: +1 831 295 0653; Email: [email protected]