Comparative Genome Analyses Reveal Sequence Features Reflecting

Total Page:16

File Type:pdf, Size:1020Kb

Comparative Genome Analyses Reveal Sequence Features Reflecting Wu et al. BMC Genomics (2018) 19:705 https://doi.org/10.1186/s12864-018-5069-z RESEARCHARTICLE Open Access Comparative genome analyses reveal sequence features reflecting distinct modes of host-adaptation between dicot and monocot powdery mildew Ying Wu1†, Xianfeng Ma1,2†, Zhiyong Pan3, Shiv D. Kale4, Yi Song1,5, Harlan King1, Qiong Zhang1, Christian Presley1, Xiuxin Deng3, Cheng-I Wei6* and Shunyuan Xiao1,7* Abstract Background: Powdery mildew (PM) is one of the most important and widespread plant diseases caused by biotrophic fungi. Notably, while monocot (grass) PM fungi exhibit high-level of host-specialization, many dicot PM fungi display a broad host range. To understand such distinct modes of host-adaptation, we sequenced the genomes of four dicot PM biotypes belonging to Golovinomyces cichoracearum or Oidium neolycopersici. Results: We compared genomes of the four dicot PM together with those of Blumeria graminis f.sp. hordei (both DH14 and RACE1 isolates), B. graminis f.sp. tritici, and Erysiphe necator infectious on barley, wheat and grapevine, respectively. We found that despite having a similar gene number (6620–6961), the PM genomes vary from 120 to 222 Mb in size. This high-level of genome size variation is indicative of highly differential transposon activities in the PM genomes. While the total number of genes in any given PM genome is only about half of that in the genomes of closely related ascomycete fungi, most (~ 93%) of the ascomycete core genes (ACGs) can be found in the PM genomes. Yet, 186 ACGs were found absent in at least two of the eight PM genomes, of which 35 are missing in some dicot PM biotypes, but present in the three monocot PM genomes, indicating remarkable, independent and perhaps ongoing gene loss in different PM lineages. Consistent with this, we found that only 4192 (3819 singleton) genes are shared by all the eight PM genomes, the remaining genes are lineage- or biotype-specific. Strikingly, whereas the three monocot PM genomes possess up to 661 genes encoding candidate secreted effector proteins (CSEPs) with families containing up to 38 members, all the five dicot PM fungi have only 116–175 genes encoding CSEPs with limited gene amplification. Conclusions: Compared to monocot (grass) PM fungi, dicot PM fungi have a much smaller effectorome. This is consistent with their contrasting modes of host-adaption: while the monocot PM fungi show a high-level of host specialization, which may reflect an advanced host-pathogen arms race, the dicot PM fungi tend to practice polyphagy, which might have lessened selective pressure for escalating an with a particular host. Keywords: Powdery mildew, Genome comparison, Host adaptation, Effectors, Gene loss, Gene conservation, Host- pathogen arms race * Correspondence: [email protected]; [email protected] †Ying Wu and Xianfeng Ma contributed equally to this work. 6College of Agriculture & Natural Resources, University of Maryland, College Park, MD 20742, USA 1Institute for Bioscience and Biotechnology Research, University of Maryland, College Park, MD 20850, USA Full list of author information is available at the end of the article © The Author(s). 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. Wu et al. BMC Genomics (2018) 19:705 Page 2 of 20 Background retrotransposon proliferation in these PM genomes, Powdery mildew is one of the most important and wide- which is believed to be consistent with the obligate bio- spread plant diseases caused by ascomycete fungi belonging trophic parasitic style of the pathogens [11]. Remarkably, to the order of Erysiphales. Over 10,000 plant species are despite a low gene content, ~ 8% of the total annotated susceptible to powdery mildew (PM) diseases. These in- genes in these two genomes encode candidate secrete clude staple crops such as wheat and barley, and important effector proteins (CSEPs) that are presumed to increase horticultural crops such as tomato, grapevine and straw- virulence on specific hosts [9, 11, 12]. More recently, berry [1, 2]. PM fungi are obligate biotrophic pathogens Menardo and colleagues sequenced the genomes of sev- that strictly require living host cells to complete their life eral other grass PM fungi and conducted comparative cycle. Like rust fungi and oomycete pathogens, PM fungi genome analysis to reconstruct the evolutionary history of differentiate appressoria that penetrate the host plant cell different B. graminis lineages. The authors found that in wall and further develop feeding structures (i.e. haustoria) most cases, different B. graminis formae speciales (ff.spp.) to extract nutrients and water from host cells [3, 4]. fungi have co-evolved with their hosts, leading to host Interestingly, while monocotPMfungidisplaynarrow specialization. Yet, there are exceptions where possible host ranges [e.g. Blumeria graminis ‘formae speciales’ host jumps or host range expansions might have occurred tritici and hordei can infect only wheat or barley, [17]. Additionally, while the majority of grass PM fungi respectively [5]], some dicot PM fungi are capable of are monophylogenetic, there may be exceptions; for ex- infecting hosts belonging to different plant families [6]. ample, there are different B. graminis isolates affecting For example, Oidium neolycopersici is capable of infect- Dactylis glomerata, and therefore B. g. f.sp. dactylidis is ing plants from 13 families including Arabidopsis and probably not a monophyletic group [17, 18]. More inter- tomato that have diverged > 100 million years ago [7, estingly, Menardo and colleagues also discovered that B.g. 8]. How obligate biotrophic fungi adapt to a single or f.sp. triticale capable of infecting the artificial hybrid crop numerous host species remains to be an important triticale originated through a hybridization of isolates of open question. the Bgt and of the B.g. f. sp. secalis [19]. These findings Because powdery mildew fungi cannot be cultured in suggest that multiple mechanisms have contributed to vitro and there is no effective method for genetic trans- the genome evolution of B. graminis ff.spp., which is formation, it is very difficult to conduct functional studies believed to reflect their co-evolutionary struggle with of PM fungi via conventional genetic manipulations. The their respective host plants. next generation sequencing (NGS) and other OMICS Previous extensive studies demonstrate that barley and technologies have been used to understand the genome wheat resistance (R) gene conferring race-specific resistance evolution of PM fungi in relation to host-adaption [9]. to isolates of Bgh or Bgt encode immune receptors belong- Such studies however have mainly been focused on PM ing to the nucleotide-binding leucine-rich repeat (NB-LRR) fungi infecting monocot plants belonging to the subfamily superfamily and that multiple R alleles exist at the barley Pooideae of Poaceae, the only taxon of monocot plants Mla locus [20, 21] and the wheat Pm3 locus [22–24]. The that can be infected by PM fungi. These grass PM fungi availability of B. graminis genome and transcriptome data are considered to have originated from a single species have accelerated the identification of the Avirulence (Avr) Blumeria graminis [10]andaforma specialis (f.sp.) is used genes that are recognized by some of the characterized R to describe a B. graminis form specialized on a particular genes. Not surprisingly, several recently identified PM Avr host species. The genome of B. graminis f.sp. hodei (Bgh) genes that are recognized by alleles of R genes at the barley specialized on barley was first sequenced, partially assem- Mla locus, or the wheat Pm3 or the Pm2 locus encode bled, annotated and analyzed [11], which was followed by canonical CSEPs [25–27]. Moreover, a recent comparative similar work with B. graminis f.sp. tritici (Bgt) specialized genomic analysis of several host-specific B. graminis line- on wheat [12]. A recent study further improved the gen- ages found evidence for a rapid turnover of CSEP genes as ome assembly of two Bgh isolates, DH14 and RACE1, to a well as lineage-specific (LS) amplification of several CSEPs near-chromosome level, which has led to the annotation through repeated gene duplication [13, 28]. This analysis of 7118 genes for DH14 and 7239 for RACE1 [13]. These along with other studies also identified typical signatures of studies revealed that the three B. graminis genomes (two positive selection on effector genes of different B. graminis Bgh and one Bgt) are large in size (120-180 Mb) yet con- lineages presumably due to host-imposed selection pressure tain surprisingly few protein-coding genes (~ 7000 genes [12, 28, 29]. Thus, genetic, genomic and molecular evidence in total) compared with closely related necrotrophic support a notion that there has been an arms race between ascomycete fungi such as Botrytis cinerea and Sclerotinia B. graminis ff.spp. and their monocot hosts especially after sclerotiorum (11,707 and 10,175 from EnsembleFungi domestication [28, 30, 31]. Database) [14–16]. Both the extraordinary genome Compared to the extensive genomic studies of monocot expansion and gene loss are associated with massive PM fungi, there have been only limited whole-genome Wu et al. BMC Genomics (2018) 19:705 Page 3 of 20 scale studies on dicot PM fungi [9]. Spanu et al. compared genes of the four dicot PM fungi to those of Bgh (both the Bgh genome sequence with those of two dicot PM bio- DH14 and RACE1) [13], Bgt and E.
Recommended publications
  • Mouse Fscn3 Conditional Knockout Project (CRISPR/Cas9)
    https://www.alphaknockout.com Mouse Fscn3 Conditional Knockout Project (CRISPR/Cas9) Objective: To create a Fscn3 conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Fscn3 gene (NCBI Reference Sequence: NM_019569 ; Ensembl: ENSMUSG00000029707 ) is located on Mouse chromosome 6. 7 exons are identified, with the ATG start codon in exon 1 and the TAG stop codon in exon 6 (Transcript: ENSMUST00000031719). Exon 2 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Fscn3 gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP24-176I9 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Exon 2 starts from about 9.71% of the coding region. The knockout of Exon 2 will result in frameshift of the gene. The size of intron 1 for 5'-loxP site insertion: 1817 bp, and the size of intron 2 for 3'-loxP site insertion: 839 bp. The size of effective cKO region: ~1197 bp. The cKO region does not have any other known gene. Page 1 of 7 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele 5' gRNA region gRNA region 3' 1 2 3 7 Targeting vector Targeted allele Constitutive KO allele (After Cre recombination) Legends Homology arm Exon of mouse Fscn3 cKO region loxP site Page 2 of 7 https://www.alphaknockout.com Overview of the Dot Plot Window size: 10 bp Forward Reverse Complement Sequence 12 Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats.
    [Show full text]
  • Protein Interaction Network of Alternatively Spliced Isoforms from Brain Links Genetic Risk Factors for Autism
    ARTICLE Received 24 Aug 2013 | Accepted 14 Mar 2014 | Published 11 Apr 2014 DOI: 10.1038/ncomms4650 OPEN Protein interaction network of alternatively spliced isoforms from brain links genetic risk factors for autism Roser Corominas1,*, Xinping Yang2,3,*, Guan Ning Lin1,*, Shuli Kang1,*, Yun Shen2,3, Lila Ghamsari2,3,w, Martin Broly2,3, Maria Rodriguez2,3, Stanley Tam2,3, Shelly A. Trigg2,3,w, Changyu Fan2,3, Song Yi2,3, Murat Tasan4, Irma Lemmens5, Xingyan Kuang6, Nan Zhao6, Dheeraj Malhotra7, Jacob J. Michaelson7,w, Vladimir Vacic8, Michael A. Calderwood2,3, Frederick P. Roth2,3,4, Jan Tavernier5, Steve Horvath9, Kourosh Salehi-Ashtiani2,3,w, Dmitry Korkin6, Jonathan Sebat7, David E. Hill2,3, Tong Hao2,3, Marc Vidal2,3 & Lilia M. Iakoucheva1 Increased risk for autism spectrum disorders (ASD) is attributed to hundreds of genetic loci. The convergence of ASD variants have been investigated using various approaches, including protein interactions extracted from the published literature. However, these datasets are frequently incomplete, carry biases and are limited to interactions of a single splicing isoform, which may not be expressed in the disease-relevant tissue. Here we introduce a new interactome mapping approach by experimentally identifying interactions between brain-expressed alternatively spliced variants of ASD risk factors. The Autism Spliceform Interaction Network reveals that almost half of the detected interactions and about 30% of the newly identified interacting partners represent contribution from splicing variants, emphasizing the importance of isoform networks. Isoform interactions greatly contribute to establishing direct physical connections between proteins from the de novo autism CNVs. Our findings demonstrate the critical role of spliceform networks for translating genetic knowledge into a better understanding of human diseases.
    [Show full text]
  • Associated Marks Are Enriched in Imprinted Gene Regions and Predict Allele-Specific Modification
    Downloaded from genome.cshlp.org on October 4, 2021 - Published by Cold Spring Harbor Laboratory Press Methods Overlapping euchromatin/heterochromatin- associated marks are enriched in imprinted gene regions and predict allele-specific modification Bo Wen,1,4 Hao Wu,2,4 Hans Bjornsson,1 Roland D. Green,3 Rafael Irizarry,2 and Andrew P. Feinberg1,5 1Department of Medicine and Center for Epigenetics, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA; 2Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland 21205, USA; 3NimbleGen Systems, Inc., Madison, Wisconsin 53711, USA Most genome-level analysis treats the two parental alleles equivalently, yet diploid genomes contain two parental genomes that are often epigenetically distinct. While single nucleotide polymorphisms (SNPs) can be used to distinguish these genomes, it would be useful to develop a generalized strategy for identifying candidate genes or regions showing allele-specific differences, independent of SNPs. We have explored this problem by looking for overlapping marks in the genome related to both euchromatin (histone H3 dimethyl lysine-4 [H3K4Me2]) and heterochromatin (DNA methylation [DNAm]). “Double hits” were defined by the intersection of H3K4Me2 and DNAm. For the top 5% of marks, defined by a sliding window, imprinted gene regions were enriched for double hits 5.4-fold. When the location information of CTCF binding sites were integrated, the “triple hits” were enriched 76-fold for known imprinted genes in the regions studied. The double hits in imprinted genes were found to occur usually at the site of alternative or antisense transcripts. In addition, four of four imprinted genes tested showing double hits also showed allele-specific methylation.
    [Show full text]
  • 4-6 Weeks Old Female C57BL/6 Mice Obtained from Jackson Labs Were Used for Cell Isolation
    Methods Mice: 4-6 weeks old female C57BL/6 mice obtained from Jackson labs were used for cell isolation. Female Foxp3-IRES-GFP reporter mice (1), backcrossed to B6/C57 background for 10 generations, were used for the isolation of naïve CD4 and naïve CD8 cells for the RNAseq experiments. The mice were housed in pathogen-free animal facility in the La Jolla Institute for Allergy and Immunology and were used according to protocols approved by the Institutional Animal Care and use Committee. Preparation of cells: Subsets of thymocytes were isolated by cell sorting as previously described (2), after cell surface staining using CD4 (GK1.5), CD8 (53-6.7), CD3ε (145- 2C11), CD24 (M1/69) (all from Biolegend). DP cells: CD4+CD8 int/hi; CD4 SP cells: CD4CD3 hi, CD24 int/lo; CD8 SP cells: CD8 int/hi CD4 CD3 hi, CD24 int/lo (Fig S2). Peripheral subsets were isolated after pooling spleen and lymph nodes. T cells were enriched by negative isolation using Dynabeads (Dynabeads untouched mouse T cells, 11413D, Invitrogen). After surface staining for CD4 (GK1.5), CD8 (53-6.7), CD62L (MEL-14), CD25 (PC61) and CD44 (IM7), naïve CD4+CD62L hiCD25-CD44lo and naïve CD8+CD62L hiCD25-CD44lo were obtained by sorting (BD FACS Aria). Additionally, for the RNAseq experiments, CD4 and CD8 naïve cells were isolated by sorting T cells from the Foxp3- IRES-GFP mice: CD4+CD62LhiCD25–CD44lo GFP(FOXP3)– and CD8+CD62LhiCD25– CD44lo GFP(FOXP3)– (antibodies were from Biolegend). In some cases, naïve CD4 cells were cultured in vitro under Th1 or Th2 polarizing conditions (3, 4).
    [Show full text]
  • Supplementary Data
    Supplementary Fig. 1 A B Responder_Xenograft_ Responder_Xenograft_ NON- NON- Lu7336, Vehicle vs Lu7466, Vehicle vs Responder_Xenograft_ Responder_Xenograft_ Sagopilone, Welch- Sagopilone, Welch- Lu7187, Vehicle vs Lu7406, Vehicle vs Test: 638 Test: 600 Sagopilone, Welch- Sagopilone, Welch- Test: 468 Test: 482 Responder_Xenograft_ NON- Lu7860, Vehicle vs Responder_Xenograft_ Sagopilone, Welch - Lu7558, Vehicle vs Test: 605 Sagopilone, Welch- Test: 333 Supplementary Fig. 2 Supplementary Fig. 3 Supplementary Figure S1. Venn diagrams comparing probe sets regulated by Sagopilone treatment (10mg/kg for 24h) between individual models (Welsh Test ellipse p-value<0.001 or 5-fold change). A Sagopilone responder models, B Sagopilone non-responder models. Supplementary Figure S2. Pathway analysis of genes regulated by Sagopilone treatment in responder xenograft models 24h after Sagopilone treatment by GeneGo Metacore; the most significant pathway map representing cell cycle/spindle assembly and chromosome separation is shown, genes upregulated by Sagopilone treatment are marked with red thermometers. Supplementary Figure S3. GeneGo Metacore pathway analysis of genes differentially expressed between Sagopilone Responder and Non-Responder models displaying –log(p-Values) of most significant pathway maps. Supplementary Tables Supplementary Table 1. Response and activity in 22 non-small-cell lung cancer (NSCLC) xenograft models after treatment with Sagopilone and other cytotoxic agents commonly used in the management of NSCLC Tumor Model Response type
    [Show full text]
  • Global Patterns of Changes in the Gene Expression Associated with Genesis of Cancer a Dissertation Submitted in Partial Fulfillm
    Global Patterns Of Changes In The Gene Expression Associated With Genesis Of Cancer A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy at George Mason University By Ganiraju Manyam Master of Science IIIT-Hyderabad, 2004 Bachelor of Engineering Bharatiar University, 2002 Director: Dr. Ancha Baranova, Associate Professor Department of Molecular & Microbiology Fall Semester 2009 George Mason University Fairfax, VA Copyright: 2009 Ganiraju Manyam All Rights Reserved ii DEDICATION To my parents Pattabhi Ramanna and Veera Venkata Satyavathi who introduced me to the joy of learning. To friends, family and colleagues who have contributed in work, thought, and support to this project. iii ACKNOWLEDGEMENTS I would like to thank my advisor, Dr. Ancha Baranova, whose tolerance, patience, guidance and encouragement helped me throughout the study. This dissertation would not have been possible without her ever ending support. She is very sincere and generous with her knowledge, availability, compassion, wisdom and feedback. I would also like to thank Dr. Vikas Chandhoke for funding my research generously during my doctoral study at George Mason University. Special thanks go to Dr. Patrick Gillevet, Dr. Alessandro Giuliani, Dr. Maria Stepanova who devoted their time to provide me with their valuable contributions and guidance to formulate this project. Thanks to the faculty of Molecular and Micro Biology (MMB) department, Dr. Jim Willett and Dr. Monique Vanhoek in embedding valuable thoughts to this dissertation by being in my dissertation committee. I would also like to thank the present and previous doctoral program directors, Dr. Daniel Cox and Dr. Geraldine Grant, for facilitating, allowing, and encouraging me to work in this project.
    [Show full text]
  • Recent Progress in Genetic and Epigenetic Research on Type 2 Diabetes
    OPEN Experimental & Molecular Medicine (2016) 48, e220; doi:10.1038/emm.2016.7 & 2016 KSBMB. All rights reserved 2092-6413/16 www.nature.com/emm REVIEW Recent progress in genetic and epigenetic research on type 2 diabetes Soo Heon Kwak1 and Kyong Soo Park1,2,3 Type 2 diabetes (T2DM) is a common complex metabolic disorder that has a strong genetic predisposition. During the past decade, progress in genetic association studies has enabled the identification of at least 75 independent genetic loci for T2DM, thus allowing a better understanding of the genetic architecture of T2DM. International collaborations and large-scale meta- analyses of genome-wide association studies have made these achievements possible. However, whether the identified common variants are causal is largely unknown. In addition, the detailed mechanism of how these genetic variants exert their effect on the pathogenesis of T2DM requires further investigation. Currently, there are ongoing large-scale sequencing studies to identify rare, functional variants for T2DM. Environmental factors also have a crucial role in the development of T2DM. These could modulate gene expression via epigenetic mechanisms, including DNA methylation, histone modification and microRNA regulation. There is evidence that epigenetic changes are important in the development of T2DM. Recent studies have identified several DNA methylation markers of T2DM from peripheral blood and pancreatic islets. In this review, we will briefly summarize the recent progress in the genetic and epigenetic research on
    [Show full text]
  • Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations Fall 2010 Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress Renuka Nayak University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Computational Biology Commons, and the Genomics Commons Recommended Citation Nayak, Renuka, "Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress" (2010). Publicly Accessible Penn Dissertations. 1559. https://repository.upenn.edu/edissertations/1559 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/1559 For more information, please contact [email protected]. Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress Abstract Genes interact in networks to orchestrate cellular processes. Here, we used coexpression networks based on natural variation in gene expression to study the functions and interactions of human genes. We asked how these networks change in response to stress. First, we studied human coexpression networks at baseline. We constructed networks by identifying correlations in expression levels of 8.9 million gene pairs in immortalized B cells from 295 individuals comprising three independent samples. The resulting networks allowed us to infer interactions between biological processes. We used the network to predict the functions of poorly-characterized human genes, and provided some experimental support. Examining genes implicated in disease, we found that IFIH1, a diabetes susceptibility gene, interacts with YES1, which affects glucose transport. Genes predisposing to the same diseases are clustered non-randomly in the network, suggesting that the network may be used to identify candidate genes that influence disease susceptibility.
    [Show full text]
  • Functional Characterization of Thousands of Type 2 Diabetes-Associated And
    bioRxiv preprint doi: https://doi.org/10.1101/2020.02.12.939348; this version posted February 13, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 Functional characterization of thousands of type 2 diabetes-associated and 2 chromatin-modulating variants under steady state and endoplasmic reticulum 3 stress 4 5 Shubham Khetan1,3, Susan Kales2, Romy Kursawe1, Alexandria Jillette1, Steven K. 6 Reilly4, Duygu Ucar1,3,5, Ryan Tewhey2,6,7*, and Michael L. Stitzel1,3,5* 7 8 1The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06032, USA 9 2The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, 04609, USA 10 3Department of Genetics and Genome Sciences, University of Connecticut, Farmington, 11 CT, 06032, USA, 12 4Broad Institute of MIT and Harvard, Cambridge, MA, USA 13 5Institute of Systems Genomics, University of Connecticut, Farmington, CT, 06032, USA 14 6Graduate School of Biomedical Sciences and Engineering, University of Maine, Orono, 15 ME, 04469, USA. 16 7Tufts University School of Medicine, Boston, MA, 02111, USA 17 18 *Corresponding authors: [email protected] and [email protected] 19 20 Keywords: Massively Parallel Reporter Assay (MPRA), human pancreatic islets, MIN6 21 beta cells, chromatin accessibility, ATAC-seq, chromatin accessibility quantitative trait 22 locus (caQTLs), genome-wide association study (GWAS), single nucleotide 23 polymorphism (SNP), Type 2 diabetes (T2D), endoplasmic reticulum (ER) stress, short 24 interspersed nuclear elements (SINE), Alu elements. bioRxiv preprint doi: https://doi.org/10.1101/2020.02.12.939348; this version posted February 13, 2020.
    [Show full text]
  • NIH Public Access Author Manuscript Nat Genet
    NIH Public Access Author Manuscript Nat Genet. Author manuscript; available in PMC 2013 February 26. NIH-PA Author ManuscriptPublished NIH-PA Author Manuscript in final edited NIH-PA Author Manuscript form as: Nat Genet. ; 44(1): 67–72. doi:10.1038/ng.1019. Meta-analysis of genome-wide association studies identifies 8 new loci for type 2 diabetes in East Asians Yoon Shin Cho1,¶,§, Chien-Hsiun Chen2,3,§, Cheng Hu4,§, Jirong Long5,§, Rick Twee Hee Ong6,§, Xueling Sim7,§, Fumihiko Takeuchi8,§, Ying Wu9,§, Min Jin Go1,§, Toshimasa Yamauchi10,§, Yi-Cheng Chang11,§, Soo Heon Kwak12,§, Ronald C.W. Ma13,§, Ken Yamamoto14,§, Linda S. Adair15, Tin Aung16,17, Qiuyin Cai5, Li-Ching Chang2, Yuan-Tsong Chen2, Yutang Gao18, Frank B. Hu19, Hyung-Lae Kim1,20, Sangsoo Kim21, Young Jin Kim1, Jeannette Jen-Mai Lee22, Nanette R. Lee23, Yun Li9,24, Jian Jun Liu25, Wei Lu26, Jiro Nakamura27, Eitaro Nakashima27,28, Daniel Peng-Keat Ng22, Wan Ting Tay16, Fuu-Jen Tsai3, Tien Yin Wong16,17,29, Mitsuhiro Yokota30, Wei Zheng5, Rong Zhang4, Congrong Wang4, Wing Yee So13, Keizo Ohnaka31, Hiroshi Ikegami32, Kazuo Hara10, Young Min Cho12, Nam H Cho33, Tien-Jyun Chang11, Yuqian Bao4, Åsa K. Hedman34, Andrew P. Morris34, Mark I. McCarthy34,35, DIAGRAM consortium36, MuTHER consortium36, Ryoichi Takayanagi31,*, Kyong Soo Park12,37,*, Weiping Jia4,*, Lee-Ming Chuang11,38,*, Juliana C.N. Chan13,*, Shiro Maeda39,*, Takashi Kadowaki10,*, Jong-Young Lee1,*, Jer-Yuarn Wu2,3,*, Yik Ying Teo22,24,40,41,42,*, E Shyong Tai43,44,45,¶,*, Xiao Ou Shu5,*, Karen L. Mohlke9,*, Norihiro Kato8,*, Bok-Ghee Han1,*, and Mark Seielstad25,46,47,¶,* 1Center for Genome Science, National Institute of Health, Osong Health Technology Administration Complex, Chungcheongbuk-do, The Republic of Korea 2Institute of Biomedical Sciences, Academia Sinica, 128 Academia Road Sec.
    [Show full text]
  • Gene Amplifications at Chromosome 7 of the Human Gastric Cancer Genome
    225-231 4/7/07 20:50 Page 225 INTERNATIONAL JOURNAL OF MOLECULAR MEDICINE 20: 225-231, 2007 225 Gene amplifications at chromosome 7 of the human gastric cancer genome SANGHWA YANG Cancer Metastasis Research Center, Yonsei University College of Medicine, 134 Shinchon-Dong, Seoul 120-752, Korea Received April 20, 2007; Accepted May 7, 2007 Abstract. Genetic aberrations at chromosome 7 are known to Introduction be related with diverse human diseases, including cancer and autism. In a number of cancer research areas involving gastric The completion of human genome sequencing and the cancer, several comparative genomic hybridization studies subsequent gene annotations, together with a rapid develop- employing metaphase chromosome or BAC clone micro- ment of high throughput screening technologies, such as DNA arrays have repeatedly identified human chromosome 7 microarrays, have made it possible to perform genome-scale as containing ‘regions of changes’ related with cancer expression profiling and comparative genomic hybridizations progression. cDNA microarray-based comparative genomic (CGHs) in various cancer models. The elucidation of gene hybridization can be used to directly identify individual target copy number variations in several cancer genomes is generating genes undergoing copy number variations. Copy number very informative results. Metaphase chromosome CGH and change analysis for 17,000 genes on a microarray format was the recent introduction of BAC and especially cDNA micro- performed with tumor and normal gastric tissues from 30 array-based CGHs (aCGH) (1) have greatly contributed to patients. A group of 90 genes undergoing copy number the identification of chromosome aberrations and of amplified increases (gene amplification) at the p11~p22 or q21~q36 and deleted genes in gastric cancer tissues and cell lines (2-9).
    [Show full text]
  • Network-Assisted Analysis of Primary Sjögren's Syndrome GWAS Data In
    Network-assisted analysis of primary Sjögren’s syndrome GWAS data in Han Chinese Kechi Fang, Kunlin Zhang, Jing Wang* Address: Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences, Beijing 100101, China. Email: Kechi Fang – [email protected]; Kunlin Zhang – [email protected]; Jing Wang – [email protected] *Corresponding author 1 Supplementary materials Page 3 – Page 5: Supplementary Figure S1. The direct network formed by the module genes from DAPPLE. Page 6: Supplementary Figure S2. Transcript expression heatmap. Page 7: Supplementary Figure S3. Transcript enrichment heatmap. Page 8: Supplementary Figure S4. Workflow of network-assisted analysis of pSS GWAS data to identify candidate genes. Page 9 – Page 734: Supplementary Table S1. A full list of PPI pairs involved in the node-weighted pSS interactome. Page 735 – Page 737: Supplementary Table S2. Detailed information about module genes and sigMHC-genes. Page 738: Supplementary Table S3. GO terms enriched by module genes. 2 NFKBIE CFLAR NFKB1 STAT4 JUN HSF1 CCDC90B SUMO2 STAT1 PAFAH1B3 NMI GTF2I 2e−04 CDKN2C LAMA4 8e−04 HDAC1 EED 0.002 WWOX PSMD7 0.008 TP53 PSMA1 HR 0.02 RPA1 0.08 UBC ARID3A PTTG1 0.2 TSC22D4 ERH NIF3L1 0.4 MAD2L1 DMRTB1 1 ERBB4 PRMT2 FXR2 MBL2 CBS UHRF2 PCNP VTA1 3 DNMT3B DNMT1 RBBP4 DNMT3A RFC3 DDB1 THRA CBX5 EED NR2F2 RAD9A HUS1 RFC4 DDB2 HDAC2 HCFC1 CDC45L PPP1CA MLLSMARCA2 PGR SP3 EZH2 CSNK2B HIST1H4C HIST1H4F HNRNPUL1 HR HIST4H4 TAF1C HIST1H4A ENSG00000206300 APEX1 TFDP1 RHOA ENSG00000206406 RPF2 E2F4 HIST1H4IHIST1H4B HIST1H4D
    [Show full text]