Human Genes Escaping X-Inactivation Revealed by Single Cell Expression Data Kerem Wainer Katsir and Michal Linial*

Human Genes Escaping X-Inactivation Revealed by Single Cell Expression Data Kerem Wainer Katsir and Michal Linial*

Wainer Katsir and Linial BMC Genomics (2019) 20:201 https://doi.org/10.1186/s12864-019-5507-6 RESEARCHARTICLE Open Access Human genes escaping X-inactivation revealed by single cell expression data Kerem Wainer Katsir and Michal Linial* Abstract Background: In mammals, sex chromosomes pose an inherent imbalance of gene expression between sexes. In each female somatic cell, random inactivation of one of the X-chromosomes restoresthisbalance.Whilemost genes from the inactivated X-chromosome are silenced, 15–25% are known to escape X-inactivation (termed escapees). The expression levels of these genes are attributed to sex-dependent phenotypic variability. Results: We used single-cell RNA-Seq to detect escapees in somatic cells. As only one X-chromosome is inactivated in each cell, the origin of expression from the active or inactive chromosome can be determined from the variation of sequenced RNAs. We analyzed primary, healthy fibroblasts (n = 104), and clonal lymphoblasts with sequenced parental genomes (n = 25) by measuring the degree of allelic-specific expression (ASE) from heterozygous sites. We identified 24 and 49 candidate escapees, at varying degree of confidence, from the fibroblast and lymphoblast transcriptomes, respectively. We critically test the validity of escapee annotations by comparing our findings with a large collection of independent studies. We find that most genes (66%) from the unified set were previously reported as escapees. Furthermore, out of the overlooked escapees, 11 are long noncoding RNA (lncRNAs). Conclusions: X-chromosome inactivation and escaping from it are robust, permanent phenomena that are best studies at a single-cell resolution. The cumulative information from individual cells increases the potential of identifying escapees. Moreover, despite the use of a limited number of cells, clonal cells (i.e., same X- chromosomes are coordinately inhibited) with genomic phasing are valuable for detecting escapees at high confidence. Generalizing the method to uncharacterized genomic loci resulted in lncRNAs escapees which account for 20% of the listed candidates. By confirming genes as escapees and propose others as candidates from two different cell types, we contribute to the cumulative knowledge and reliability of human escapees. Keywords: X-inactivation, Allelic bias, RNA-Seq, Escapees, Single cell, Allele specific expression Background tissue [3]. This highly regulated process has been exten- Sex chromosomes pose an inherent genetic imbalance of sively studied [2–5]. gene expression between sexes. In order to ensure a bal- The initial silencing of ChrX is governed mainly by XIST anced expression in mammalian somatic tissues, one of (X-inactive specific transcript) [3, 4], a non-coding RNA the female’s X-chromosomes (ChrX) is randomly se- (ncRNA) unique to placental mammals. XIST is a master lected to undergo inactivation [1]. The random choice of regulator located at the X-inactivation center (XIC) that an inactivated X-chromosome (Xi) (i.e., paternal or ma- together with neighboring ncRNAs (e.g., FTX and JPX)ac- ternal) is completed at a very early phase of embryonic tivate the process of X-inactivation [3]. XIST is exclusively development [2]. Importantly, once this decision is made transcribed from Xi, and its RNA products act in cis by the selected inactivated chromosome is deterministically coating the chromosome within a restricted chromosomal defined for all descendant cells, and this choice is main- territory [6]. The activity of XIC genes in recruiting chro- tained throughout the organism’s life in every somatic matin remodeling complexes [3, 7, 8], results in an irre- versible heterochromatinization. The heterochromatin * Correspondence: [email protected] state underlies the steady, lifelong phenomenon of Department of Biological Chemistry, The Institute of Life Sciences, The X-inactivation [1]. Hebrew University of Jerusalem, Edmond J. Safra Campus, Givat Ram, 9190400 Jerusalem, Israel © The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. Wainer Katsir and Linial BMC Genomics (2019) 20:201 Page 2 of 17 Ample studies have indicated that silencing does not phenotypic diversity results from the varying degree of apply to all genes in the inactivated X-chromosome. Spe- X-inactivation and escapee’sexpression(e.g.,[10, 26]). cifically, genes that are located at the Pseudoautosomal The X-inactivation is an event that occurs independ- regions (PARs) are expressed from both alleles, similar ently for each cell. Thus, collecting expression data from to the majority of genes from autosomal chromosomes single cells allows monitoring explicitly a single Xi allele [9]. In addition, on the ChrX there are also genes that in each cell. In the present study, we use the ASE data escape X-inactivation (coined escapees). Investigating extracted from RNA-Seq of single cells (scRNA-Seq) for these escapee genes is important to understand the basis identifying escapees. We present an analytical protocol of ChrX evolution [10] and X-inactivation mechanism using genomic data for two sets collected from [7]. Moreover, numerous clinical and phenotypic out- scRNA-Seq experiments. One set is based on primary fi- comes are thought to be explained by the status of es- broblasts, and the other is based on GM12878 lympho- capee genes [11]. blast cell line with a fully sequenced diploid genome. Complementary methods have been adapted for iden- We report on 24 genes from fibroblasts and 49 genes tifying escapees [12, 13]. For example, the expression from lymphoblasts as candidate escapees. Finally, we levels of mRNAs were compared between males and fe- demonstrate the potential of the method to identify a males in various tissues [14–16]. Additionally, extensive large number of escapees despite a modest number of lists of escapee candidates were reported from single cells analyzed. We show that while most of our mouse-human cell hybrids, and from allelic expression identified escapees strongly agree with the current patterns in fibroblast lines carrying a fragmented knowledge, we also provide an extended list of escapes X-chromosome [17]. The correlation of chromatin struc- that were previously undetected. ture and CpG methylation patterns with genes that es- cape X-inactivation was also used. For example, loci on Results Xi with low methylation levels were proposed as indica- A framework for measuring the escape from X- tors for escapee genes and were thus used as an add- inactivation in single cells itional detection method [18, 19]. We identify escapees by analyzing gene expression from In recent studies, genomic information from individ- somatic single cells using scRNA-Seq methodology (see uals and isolated cells became useful for marking the Methods). To evaluate the sensitivity of the method, we status of X-inactivation. Specifically, RNA sequencing compare X-chromosome (ChrX) expression to other (RNA-Seq) was used to infer allelic-specific expression autosomal chromosomes. Specifically, we focused on the (ASE) from the two X-chromosomes, according to a gene-rich chromosome 17 (Chr17) as a prototype of an statistical assumption for the minor and major expressed autosomal chromosome. Chr17 was selected as it repre- alleles [20]. ASE analysis from B-lymphocytes derived sents a chromosome with a minimal number of from two ethnically remote populations identified 114 parent-specific imprinted genes [27]. The quantitative escapees based on heterologous SNPs (hSNPs) [10]. By properties of ChrX and Chr17 are listed in Fig. 1a. default, the low-expressing hSNP alleles were considered This study is based on analysis of two female origin re- as evidence for Xi expression. Recently, a large-scale sources: (i) Primary UCF1014 fibroblasts (with 104 cells, ASE-based analysis was completed based on a few indi- see Methods). This set is specified by a higher coverage viduals using single cells [16]. transcriptomic data, but lacks information on haplotype Numerous observations indicate conflicts and inconclu- phasing (Fig. 1b); (ii) A smaller dataset of clonal lympho- sive labeling of a ChrX gene as inactivated or escapee. Such blasts (n = 25) from the GM12878 cell line with fully variability reflects the inherent properties of the phased and sequenced parental diploid genomes (Fig. 1c). phenomenon with respect to tissues, individuals and devel- In both datasets, transcription at heterozygous SNPs opmental stages. Several trends characterize X-inactivation (hSNPs) is the source of information for determining and escaping from it: (i) Escapees are located at the p-arm, monoallelic or biallelic expression. Each hSNP, in every cell, which comprises evolutionary young segments that di- that is supported by expression evidence above a predeter- verged more recently from ChrY [17, 21, 22]. (ii) Human mined threshold is considered an informative SNP (iSNP) escapees account for 15–25% [13]) of all known ChrX (see Methods, Additional file 1:Text).ThesumofiSNPs genes. Notably, this

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    17 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us