Published online 20 November 2019 Nucleic Acids Research, 2020, Vol. 48, Database issue D825–D834 doi: 10.1093/nar/gkz1025 ChiTaRS 5.0: the comprehensive database of chimeric transcripts matched with druggable fusions and 3D chromatin maps Deepak Balamurali †, Alessandro Gorohovski †, Rajesh Detroja, Vikrant Palande, Dorith Raviv-Shay and Milana Frenkel-Morgenstern*
Laboratory of Cancer Genomics and Biocomputing of Complex Diseases, The Azrieli Faculty of Medicine, Bar-Ilan University, Safed 1311502, Israel
Received September 15, 2019; Revised October 18, 2019; Editorial Decision October 18, 2019; Accepted October 26, 2019
ABSTRACT INTRODUCTION Chimeric RNA transcripts are formed when exons The transcriptome in eukaryotes is composed of single- from two genes fuse together, often due to chromo- stranded sequences of RNAs transcribed from various loca- somal translocations, transcriptional errors or trans- tions in the total genome. Although most RNA transcripts splicing effect. While these chimeric RNAs produce can be traced back to a single locus, exons from two differ- functional proteins only in certain cases, they play ent genes or from two copies of the same gene sometimes fuse together, leading to the formation of chimeric RNAs a significant role in disease phenotyping and pro- (1–22). The various causes of such fusions include tran- gression. ChiTaRS 5.0 (http://chitars.md.biu.ac.il/)is scriptional errors, trans-splicing effects and chromosomal the latest and most comprehensive chimeric tran- translocations (6,14). Thus, two unrelated genomic loci on script repository, with 111 582 annotated entries from different chromosomes may produce a chimeric transcript eight species, including 23 167 known human cancer through a genomic rearrangement event or due to trans- breakpoints. The database includes unique informa- splicing. Similarly, a read-through transcript of two adja- tion correlating chimeric breakpoints with 3D chro- cent genomic loci may produce chimeric RNAs (8–9,21). matin contact maps, generated from public datasets The possibility is high that chimeras are artefacts of tem- of chromosome conformation capture techniques plate switching by the reverse transcriptase enzyme (consid- (Hi–C). In this update, we have added curated in- ering the difficulty of performing reverse transcriptase-free formation on druggable fusion targets matched with assays) (9,16). Nonetheless, several studies have identified chimeric transcripts that have also been translated into pro- chimeric breakpoints, which are applicable to preci- teins. This establishes their authenticity and further moti- sion medicine in cancers. The introduction of a new vates scientists to curate a list of known chimeras (19–26). section that lists chimeric RNAs in various cell-lines A direct correlation has been suggested between the pres- is another salient feature. Finally, using text-mining ence of chimeras in the genome and their role in tumorige- techniques, novel chimeras in Alzheimer’s disease, nesis (18,19). The transcriptome becomes extremely more schizophrenia, dyslexia and other diseases were col- complex in the context of cancer, due to a high proportion lected in ChiTaRS. Thus, this improved version is of genomic rearrangements, nucleotide polymorphisms and an extensive catalogue of chimeras from multiple alterations of the splicing machinery. species. It extends our understanding of the evolu- One of the most renown gene fusions identified in solid tion of chimeric transcripts in eukaryotes and con- tumors is the TMPRSS2-ERG chimera, a well-documented tributes to the analysis of 3D genome conformational biomarker of prostate cancer. This chimera has been identi- fied in a high frequency of tested patient samples (28). Novel changes and the functional role of chimeras in the means, like the delivery of specific siRNAs by targeted lipo- etiopathogenesis of cancers and other complex dis- somal nanovectors and the use of peptidomimetic inhibitors eases. have been attempted, with the aim of mitigating the spread of prostate cancer (29,30). Similarly, FGFR3-TACC3 fu- sions have been identified as driving factors for cancer pro- gression in multiple tissue types like bladder, lung and brain.
*To whom correspondence should be addressed. Tel: +972 72 264 2901; Fax: +972 72 264 2901; Email: [email protected] †The authors wish it to be known that, in their opinion, the first two authors should be regarded as Joint First Authors.