Sun et al. BMC Med Genomics (2021) 14:135 https://doi.org/10.1186/s12920-021-00989-w RESEARCH Open Access Identifcation of key genes in calcifc aortic valve disease via weighted gene co-expression network analysis Jin‑Yu Sun1†, Yang Hua1†, Hui Shen1, Qiang Qu1, Jun‑Yan Kan1, Xiang‑Qing Kong1, Wei Sun1* and Yue‑Yun Shen2* Abstract Background: Calcifc aortic valve disease (CAVD) is the most common subclass of valve heart disease in the elderly population and a primary cause of aortic valve stenosis. However, the underlying mechanisms remain unclear. Methods: The gene expression profles of GSE83453, GSE51472, and GSE12644 were analyzed by ‘limma’ and ‘weighted gene co‑expression network analysis (WGCNA)’ package in R to identify diferentially expressed genes (DEGs) and key modules associated with CAVD, respectively. Then, enrichment analysis was performed based on Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway, DisGeNET, and TRRUST database. Protein–protein interaction network was constructed using the overlapped genes of DEGs and key modules, and we identifed the top 5 hub genes by mixed character calculation. Results: We identifed the blue and yellow modules as the key modules. Enrichment analysis showed that leukocyte migration, extracellular matrix, and extracellular matrix structural constituent were signifcantly enriched. SPP1, TNC, SCG2, FAM20A, and CD52 were identifed as hub genes, and their expression levels in calcifed or normal aortic valve samples were illustrated, respectively. Conclusions: This study suggested that SPP1, TNC, SCG2, FAM20A, and CD52 might be hub genes associated with CAVD. Further studies are required to elucidate the underlying mechanisms and provide potential therapeutic targets. Keywords: Calcifc aortic valve disease, Weighted gene co‑expression network analysis, Diferentially expressed genes, Integrated bioinformatics analysis Introduction sclerosis occurred in 26% of these patients [2, 3]. Consid- Calcifc aortic valve disease (CAVD) is the most com- ering the prolonged life expectancy and aging of the pop- mon subclass of valve heart disease in the elderly popu- ulation, the burden of CAVD is expected to substantially lation and a primary cause of aortic valve stenosis [1]. It increase from 2.5 million in 2000 to 4.5 million in 2030 was reported that the incidence of aortic valve stenosis [4], thus conferring a high economic and health burden was 2% in patients ≥ 65 years old, whereas aortic valve worldwide [5, 6]. In CAVD, the fbro-calcifc remodeling and patho- logical thickening of the aortic valve disturb pressure *Correspondence: [email protected]; [email protected] †Jin‑Yu Sun and Yang Hua contributed equally to this work overload and hemodynamic stability. Te following pro- 1 Department of Cardiology, The First Afliated Hospital of Nanjing gressive cardiac outfow obstruction and left ventricular Medical University, Nanjing 210000, China hypertrophy are likely to result in heart failure and even 2 Department of Cardiology, Liyang People’s Hospital, Liyang 213300, China premature death within a few years [4, 7]. Accumulating © The Author(s) 2021. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/. The Creative Commons Public Domain Dedication waiver (http:// creat iveco mmons. org/ publi cdoma in/ zero/1. 0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. Sun et al. BMC Med Genomics (2021) 14:135 Page 2 of 12 studies have revealed that CAVD is a complex multi- Vienna, Austria). Te probe IDs were converted into gene stage disease with sequential and interacting processes, symbols according to the annotation fle. For multiple including endothelial dysfunction/injury, lipid deposi- probes mapping to a single gene, the average expression tion, infammation, extracellular matrix remodeling, value of all its corresponding probes was used. Impor- dystrophic calcifcation, and so forth [8–10]. However, tantly, the batch efect was corrected using the combat mechanisms underlying the development or progression function of the ‘SVA’ package, which is a widely used of CAVD remain unclear, and there lacks conservative empirical Bayes method for batch correction [19]. To treatment against CAVD. Surgical or transcatheter aortic control the false discovery rate caused by multiple testing, valve replacement is the only available treatment, while the adjusted P-value was applied. After pre-processing, pharmacotherapy beyond valve replacement is still lim- DEGs between calcifed and normal aortic valve samples ited [5, 11]. were screened with a threshold of adjusted P-value < 0.05 Weighted gene co-expression network analysis and |log2 fold‐change (FC)| ≥ 0.5 by ‘limma’ package. Fur- (WGCNA) is a bioinformatics algorithm [12], which thermore, the DEGs were visualized as a volcano plot and has been widely applied to explore the changes of tran- heatmap using ‘ggplot2’ and ‘pheatmap’ package. scriptome expression patterns in complex diseases and to identify gene modules associated with clinical fea- tures [13–15]. Compared with the standardized analy- Weighted gene co‐expression network analysis (WGCNA) sis of diferentially expressed genes (DEGs), WGCNA is WGCNA is an algorithm for constructing a co-expres- a powerful systematic analysis method to recognize the sion network, which reveals the correlation patterns higher-order correlation between genes instead of detect- across genes and provides the biologically functional ing disease-related individual genes. Our study identifed interpretations of network modules. As previously DEGs between calcifed and normal aortic valve samples described [20], we selected the top 25% most variant based on the gene expression profles of GSE83453 and genes in GSE83453 to construct a co-expression net- GSE51472, GSE12644. Moreover, WGCNA was per- work using the ‘WGCNA’ package (version 1.60). After formed on the GSE83453 dataset to screen key genes and evaluating the presence of obvious outliers by cluster modules related to CAVD. Ten, the enrichment analysis analysis, the one-step network construction function was of genes in key modules was used to explore the molecu- used to construct the co-expression network and iden- lar mechanisms underlying CAVD. Finally, we identifed tify key modules. Moreover, to identify the signifcance hub genes related to CAVD and establish a protein–pro- of each module, we summarized the module eigengene tein interaction (PPI) network. (ME) based on the frst principal component of the mod- ule expression, and the module-trait relationships were Materials and methods assessed according to the correlation between MEs and Microarray data clinical traits. Ten, we evaluated the correlation strength Te gene expression profles of GSE83453 [16], by module signifcance (MS), referring to the average GSE51472 [17], and GSE12644 [18] were acquired from absolute gene signifcance (GS) of all genes within one the Gene Expression Omnibus database. All three gene module. Notably, the GS value was determined by the expression profles were acquired from human samples. log10-transformation of the P-value in the linear regres- Dataset GSE83453 was performed based on the plat- sion between expression and clinical traits. In general, form GPL10558 (Illumina HumanHT-12 V4.0 expression the modules with the highest MS values were considered beadchip), which includes 9 calcifed aortic valve samples as the key modules. and 8 normal aortic valve samples [16]. Series GSE51472 Furthermore, we used the modulePreservation func- was performed by GPL570 (Afymetrix Human Genome tion to evaluate the preservation levels of key modules. U133 Plus 2.0 Array) and included 5 calcifed aortic Zsummary analysis combines diferent preservation sta- valve samples and 5 normal aortic valve samples [17]. tistics into one single overall measure of preservation. GSE12644, based on the GPL570 platform, included 10 According to WGCNA instruction, a higher Zsummary calcifed and 10 normal aortic valve samples (Fig. 1). value indicated the stronger the evidence that a module should be preserved: the module with Zsummary value < 2 Data pre‑processing and DEG screening indicated ‘no evidence’, 2 < ­Zsummary value < 10 indicated ‘weak evidence’, and Z value > 10 indicated strong We performed log2-transformation, background correc- summary tion, and quantile normalization on the expression pro- evidence. Additionally, to confrm the clustering ability fles of GSE83453, GSE51472, and GSE12644 using the of the key modules, we also performed principal compo- ‘linear models for microarray data (limma)’ package in R nent analysis (PCA) on the gene expression profle of the 3.6.1 software (R Foundation for Statistical Computing,
Details
-
File Typepdf
-
Upload Time-
-
Content LanguagesEnglish
-
Upload UserAnonymous/Not logged-in
-
File Pages12 Page
-
File Size-