A Single Genetic Locus Associated with Pediatric Fractures: a Genome-Wide Association Study on 3,230 Patients

Total Page:16

File Type:pdf, Size:1020Kb

A Single Genetic Locus Associated with Pediatric Fractures: a Genome-Wide Association Study on 3,230 Patients 1716 EXPERIMENTAL AND THERAPEUTIC MEDICINE 20: 1716-1724, 2020 A single genetic locus associated with pediatric fractures: A genome-wide association study on 3,230 patients ROOPE PARVIAINEN1, SINI SKARP2,3, LINDA KORHONEN1, WILLY SERLO1, MINNA MÄNNIKKÖ2 ��� JUHA-JAAKKO SINIKUMPU1 1D�p���m��� �� C������� ��� A����������, O��� C�������� F������� ��� Sp���� I�j��y S���y, R������� U��� ��� P���������, P�������� N�������y, P�������� Surgery, C���� P�y������y, D��m������y, C������� G�������, O��������� ��� Gy�������y, O����������y������y ��� Op�����m����y (PEDEGO), O��� M������ R������� C����� (MRC), U��������y �� O��� ��� O��� U��������y H��p����, FI-90029 O���; 2N������� F������ B���� C�����, F�����y �� M�������, U��������y �� O���; 3C����� ��� L��� C����� H����� R�������, F�����y �� M�������, U��������y �� O���, FI-90014 O���, F������ Received July 13, 2019; Accepted April 29, 2020 DOI: 10.3892/etm.2020.8885 Abstract. The�� understanding������������� ��of the��� biological���������� and��� environ�������- RNA 1 (PROSER2-AS1) and PROSER2, thus suggesting that mental risk factors of fractures in pediatrics is limited. Previous these m�y be novel candidate genes associated with the risk of studies have reported that fractures involve heritable traits, but pediatric fractures. the genetic factors contributing to the risk of fractures remain elusive. Furthermore, genetic influences specific to immature Introduction bone have not been thoroughly studied. Therefore, the aim of the present study was to identify genetic variations that are The incidence of fractures is highest �mong the young and associated with fractures in early childhood. The present study older p�pulations (1). O� note, fractures in pediatric patients used � prospective Northern Finland Birth Cohort (year 1986; are common injuries that result in pain, as well as short‑ and �=9,432). The study p�pulation was comprised of 3,230 cohort long‑term morbidity; it has been estimated that 27-50% of all m�mbers with available genotyp� data. A total of 48 m�mbers of individuals �xperience � fracture prior to turning 18 years the cohort (1.5%) had in‑hospital treated bone fractures during old (2,3). Goulding et al (4) estimated the risk of recurrent their first 6 years of life. Furthermore, individuals without fracture to be 12-20% in pediatrics patients aged <16 years. fracture (�=3,182) were used as controls. A genome‑wide Furthermore, these figures may indicate that pediatric patients association study (GWAS) was performed using � frequentist who do not have any diseases affecting bone integrity m�y association test. I� the GWAS analysis, � linear regression still be at risk to suffer � bone fracture or even recurrent model was fitted to test for additive effects of single‑nucleotide fractures (2-4). The causes for this m�y arise from the diet, polymorphism� (SNP�; genotyp� dosage) adjusting for sex and preferred free‑tim� activities or sex, but m�y also be linked to performing population stratification using genotypic principal genetic factors (3). components. Using the GWAS analysis, the present study iden- T�� causes and etiopathogenetic pathw�y� of pediatric tified one locus with a significant association with fractures fractures are not well studied (5). I� has been previously during childhood on chromosom� 10 (rs112635931) and six loci reported that bone‑associated and bone‑independent factors with � suggested �mplication. The lead SNP rs112635931 was contribute to the fracture risk (2,6). Furthermore, bone located near proline‑ and serine‑rich 2 (PROSER2) antisense mineral content (BMC) and bone mineral density (BMD) have been indicated to be highly heritable in genetic studies (7-11). Mora and Gilsanz (12) revealed that 80% of the variance of the BMC is influenced by genetics. Furthermore, BMD has been reported to be � risk factor for fractures in adults (1), but there Correspondence to: M�. Roop� Parviainen, D�partment of Children is limited research on pediatric fractures. Previous studies and Adolescents, Oulu Childhood Fracture and Sports I�jury Study, have linked � low BMD to increased fracture risk during Research Unit for Pediatrics, Pediatric Neurology, Pediatric Surgery, Child childhood (3,13); however, in other studies, this association P�ychiatry, Dermatology, Clinical Genetics, Obstetrics and Gynecology, was not identified (6,14). In adults, genome‑wide association Otorhinolaryngology and Ophthalmology (PEDEGO), Oulu Medical studies (GWAS) have identified genes from ≥11 different Research Center (MRC), University of Oulu and Oulu University Hospital, P.O. B�x 5, FI-90029 Oulu, Finland chromosomes (chrs) that m�y affect BMD either locally or E-mail: roop�[email protected] throughout the body (15-21). H�wever, with regards to pedi- atrics, only few studies have reported on genetic loci that Key words: fractures, risk, children, genetics, genom�-wide influence BMD or cortical bone thickness (7,8). Furthermore, association study, cohort the heritability of the fractures and BMD differ, and thus, it has been suggested that fracture risk should be analyzed inde- pendently of BMD (22). PARVIAINEN et al: A SINGLE GENETIC LOCUS ASSOCIATED WITH PEDIATRIC FRACTURES 1717 BMD and BMC are useful when �x�mining the factors T���� I. R�p����� ICD-9 �����. underlying fractures (23,24); however, their association with fractures in childhood remains to be fully elucidated. ICD-9 ���� Exp�������� Therefore, the present study focused on fractures rather than BMD or BMC. T� the best of our knowledge, only few studies 800 F������� �� ����� �� �k��� have �x�mined the association between genetics and child- 802 F������� �� ������ ����� hood fracture risk. Therefore, the aim of the present study 810 F������� �� �������� was to identify whether any genetic loci are associated with 812 F������� �� ��m���� fractures prior to the age of 7 years using GWAS. 813 F������� �� ������ �� ���� 816 F������� �� ��� �� m��� p�������� �� ��� ���� Patients and methods 821 Fracture of other or unspecified parts of femur 823 Fracture of tibia or fibula Patient population. The study p�pulation was � part of the Northern Finland Birth Cohort 1986 (NFBC1986) that ICD, International Classification of Diseases. originally included all pregnant females (�=9,362) living in the geographic area of the �w� northernmost provinces of Finland with �xpected date of delivery between 1st July 1985 and 30th June 1986. These mothers and their live‑born register worldwide and thoroughly covers acute injuries of the infants (�=9,432) were followed �p regularly since preg- population (annual coverage of injuries, ~100%) and classifies nancy. Immigration during this period to northern Finland these injuries accurately (annual accuracy, >95%) (30). was minimal, and therefore, the cohort was predominantly I� the study area, pediatric fractures are �xclusively treated Finnish (25). The detailed description of the cohort is provided in public hospitals and it is mandatory to send information in the study published �y Järvelin et al (26) and on the NFBC of these injuries to the NHDR. During the study period, website (27). out‑patient visits were not recorded in the NHDR and any O� the cohort p�pulation, 6,721 consented to the use of patients treated outside the hospital were not included. S�mple their information for research purposes. The information for fractures, including non‑displaced fracture of wrist or foot, the individuals who did not provide consent was not available. were not included in the NHDR, as these are treated �y Study subjects with diseases of the bone, including osteogen- general practitioners in primary healthcare centers. All the in esis �mperfecta, or possible bone‑affecting malignancies, were hospital treated fractures were recognized for the analysis and �xcluded from the original study p�pulation (�=3). Informed the patients were classified according to whether they had a consent from the parents was provided orally during antenatal fracture or not. The International Classification of Diseases, clinic visits. A� the 16-year follow‑up (in 2001), the cohort version‑9 (ICD-9) was used to identify the fractures (31). I� participants and their parents provided written informed the patient had several hospitalizations with the sam� ICD consent for the use of all of their collected data for scientific code, the fractures were determined to be caused by ≥2 sepa- purposes. T�� participation was voluntary. rate injuries if there was an interval of ≥6 months between the hospitalization dates. The ICD-9 codes reported for the Genotype data. Genotypes were determined for the cohort fractures are presented in Table I. m�mbers who had provided consent and their blood samples The cause of the fracture m�y also be included in the were used for this purpose. The genotyp� data were available NHDR data. Out of the 48 individuals who had � fracture in the for 3,230 individuals, including 48 cases who had � fracture and p�pulation of the present study, 37 had � reported mechanism. 3,182 controls without fractures. Genotyping was performed Furthermore, 15 patients had low‑energy accidents, including using the Illumina H�manOmniExpressEx�me v1.2 bead falling onto � plane, tripping, slipping or falling from � height chip (Illumina, Inc.) and �mputation was performed using the of <1 m. For the other fractures, it was not possible to reliably 1000G phase 3 reference panel (28). After quality control (QC), determine the energy involved in the injury. the study included 3,515,000 single‑nucleotide polymor- phism�
Recommended publications
  • Whole-Genome Microarray Detects Deletions and Loss of Heterozygosity of Chromosome 3 Occurring Exclusively in Metastasizing Uveal Melanoma
    Anatomy and Pathology Whole-Genome Microarray Detects Deletions and Loss of Heterozygosity of Chromosome 3 Occurring Exclusively in Metastasizing Uveal Melanoma Sarah L. Lake,1 Sarah E. Coupland,1 Azzam F. G. Taktak,2 and Bertil E. Damato3 PURPOSE. To detect deletions and loss of heterozygosity of disease is fatal in 92% of patients within 2 years of diagnosis. chromosome 3 in a rare subset of fatal, disomy 3 uveal mela- Clinical and histopathologic risk factors for UM metastasis noma (UM), undetectable by fluorescence in situ hybridization include large basal tumor diameter (LBD), ciliary body involve- (FISH). ment, epithelioid cytomorphology, extracellular matrix peri- ϩ ETHODS odic acid-Schiff-positive (PAS ) loops, and high mitotic M . Multiplex ligation-dependent probe amplification 3,4 5 (MLPA) with the P027 UM assay was performed on formalin- count. Prescher et al. showed that a nonrandom genetic fixed, paraffin-embedded (FFPE) whole tumor sections from 19 change, monosomy 3, correlates strongly with metastatic death, and the correlation has since been confirmed by several disomy 3 metastasizing UMs. Whole-genome microarray analy- 3,6–10 ses using a single-nucleotide polymorphism microarray (aSNP) groups. Consequently, fluorescence in situ hybridization were performed on frozen tissue samples from four fatal dis- (FISH) detection of chromosome 3 using a centromeric probe omy 3 metastasizing UMs and three disomy 3 tumors with Ͼ5 became routine practice for UM prognostication; however, 5% years’ metastasis-free survival. to 20% of disomy 3 UM patients unexpectedly develop metas- tases.11 Attempts have therefore been made to identify the RESULTS. Two metastasizing UMs that had been classified as minimal region(s) of deletion on chromosome 3.12–15 Despite disomy 3 by FISH analysis of a small tumor sample were found these studies, little progress has been made in defining the key on MLPA analysis to show monosomy 3.
    [Show full text]
  • Characterization of Dysregulated Lncrna-Mrna Network Based on Cerna Hypothesis to Reveal the Occurrence and Recurrence of Myocar
    Zhang et al. Cell Death Discovery (2018) 4:35 DOI 10.1038/s41420-018-0036-7 Cell Death Discovery ARTICLE Open Access Characterization of dysregulated lncRNA- mRNA network based on ceRNA hypothesis to reveal the occurrence and recurrence of myocardial infarction Guangde Zhang1,HaoranSun2, Yawei Zhang2, Hengqiang Zhao2, Wenjing Fan1,JianfeiLi3,YingliLv2, Qiong Song2, Jiayao Li2,MingyuZhang1 and Hongbo Shi2 Abstract Accumulating evidence has demonstrated that long non-coding RNAs (lncRNAs) acting as competing endogenous RNAs (ceRNAs) play important roles in initiation and development of human diseases. However, the mechanism of ceRNA regulated by lncRNA in myocardial infarction (MI) remained unclear. In this study, we performed a multi-step computational method to construct dysregulated lncRNA-mRNA networks for MI occurrence (DLMN_MI_OC) and recurrence (DLMN_MI_Re) based on “ceRNA hypothesis”. We systematically integrated lncRNA and mRNA expression profiles and miRNA-target regulatory interactions. The constructed DLMN_MI_OC and DLMN_MI_Re both exhibited biological network characteristics, and functional analysis demonstrated that the networks were specific for MI. Additionally, we identified some lncRNA-mRNA ceRNA modules involved in MI occurrence and recurrence. Finally, two new panel biomarkers defined by four lncRNAs (RP1-239B22.5, AC135048.13, RP11-4O1.2, RP11-285F7.2) from 1234567890():,; 1234567890():,; DLMN_MI_OC and three lncRNAs (RP11-363E7.4, CTA-29F11.1, RP5-894A10.6) from DLMN_MI_Re with high classification performance were, respectively, identified in distinguishing controls from patients, and patients with recurrent events from those without recurrent events. This study will provide us new insight into ceRNA-mediated regulatory mechanisms involved in MI occurrence and recurrence, and facilitate the discovery of candidate diagnostic and prognosis biomarkers for MI.
    [Show full text]
  • Germline Mutations Causing Familial Lung Cancer
    Journal of Human Genetics (2015) 60, 597–603 & 2015 The Japan Society of Human Genetics All rights reserved 1434-5161/15 www.nature.com/jhg ORIGINAL ARTICLE Germline mutations causing familial lung cancer Koichi Tomoshige1,2, Keitaro Matsumoto1, Tomoshi Tsuchiya1, Masahiro Oikawa1, Takuro Miyazaki1, Naoya Yamasaki1, Hiroyuki Mishima2, Akira Kinoshita2, Toru Kubo3, Kiyoyasu Fukushima3, Koh-ichiro Yoshiura2 and Takeshi Nagayasu1 Genetic factors are important in lung cancer, but as most lung cancers are sporadic, little is known about inherited genetic factors. We identified a three-generation family with suspected autosomal dominant inherited lung cancer susceptibility. Sixteen individuals in the family had lung cancer. To identify the gene(s) that cause lung cancer in this pedigree, we extracted DNA from the peripheral blood of three individuals and from the blood of one cancer-free control family member and performed whole-exome sequencing. We identified 41 alterations in 40 genes in all affected family members but not in the unaffected member. These were considered candidate mutations for familial lung cancer. Next, to identify somatic mutations and/or inherited alterations in these 40 genes among sporadic lung cancers, we performed exon target enrichment sequencing using 192 samples from sporadic lung cancer patients. We detected somatic ‘candidate’ mutations in multiple sporadic lung cancer samples; MAST1, CENPE, CACNB2 and LCT were the most promising candidate genes. In addition, the MAST1 gene was located in a putative cancer-linked locus in the pedigree. Our data suggest that several genes act as oncogenic drivers in this family, and that MAST1 is most likely to cause lung cancer.
    [Show full text]
  • Supplementary Table S4. FGA Co-Expressed Gene List in LUAD
    Supplementary Table S4. FGA co-expressed gene list in LUAD tumors Symbol R Locus Description FGG 0.919 4q28 fibrinogen gamma chain FGL1 0.635 8p22 fibrinogen-like 1 SLC7A2 0.536 8p22 solute carrier family 7 (cationic amino acid transporter, y+ system), member 2 DUSP4 0.521 8p12-p11 dual specificity phosphatase 4 HAL 0.51 12q22-q24.1histidine ammonia-lyase PDE4D 0.499 5q12 phosphodiesterase 4D, cAMP-specific FURIN 0.497 15q26.1 furin (paired basic amino acid cleaving enzyme) CPS1 0.49 2q35 carbamoyl-phosphate synthase 1, mitochondrial TESC 0.478 12q24.22 tescalcin INHA 0.465 2q35 inhibin, alpha S100P 0.461 4p16 S100 calcium binding protein P VPS37A 0.447 8p22 vacuolar protein sorting 37 homolog A (S. cerevisiae) SLC16A14 0.447 2q36.3 solute carrier family 16, member 14 PPARGC1A 0.443 4p15.1 peroxisome proliferator-activated receptor gamma, coactivator 1 alpha SIK1 0.435 21q22.3 salt-inducible kinase 1 IRS2 0.434 13q34 insulin receptor substrate 2 RND1 0.433 12q12 Rho family GTPase 1 HGD 0.433 3q13.33 homogentisate 1,2-dioxygenase PTP4A1 0.432 6q12 protein tyrosine phosphatase type IVA, member 1 C8orf4 0.428 8p11.2 chromosome 8 open reading frame 4 DDC 0.427 7p12.2 dopa decarboxylase (aromatic L-amino acid decarboxylase) TACC2 0.427 10q26 transforming, acidic coiled-coil containing protein 2 MUC13 0.422 3q21.2 mucin 13, cell surface associated C5 0.412 9q33-q34 complement component 5 NR4A2 0.412 2q22-q23 nuclear receptor subfamily 4, group A, member 2 EYS 0.411 6q12 eyes shut homolog (Drosophila) GPX2 0.406 14q24.1 glutathione peroxidase
    [Show full text]
  • SETMAR Antibody
    Product Datasheet SETMAR Antibody Catalog No: #43086 Orders: [email protected] Description Support: [email protected] Product Name SETMAR Antibody Host Species Rabbit Clonality Polyclonal Purification Antigen affinity purification. Applications WB IHC Species Reactivity Hu Specificity The antibody detects endogenous levels of total SETMAR protein. Immunogen Type protein Immunogen Description Fusion protein of human SETMAR Target Name SETMAR Other Names Mar1; HsMar1; METNASE Accession No. Swiss-Prot#: Q53H47Gene ID: 6419 Calculated MW 78kd Concentration 1mg/ml Formulation Rabbit IgG in pH7.4 PBS, 0.05% NaN3, 40% Glycerol. Storage Store at -20°C Application Details Western blotting: 1:200-1:1000 Immunohistochemistry: 1:20-1:100 Images Gel: 6%SDS-PAGE Lysate: 40 µg Lane: Jurkat cell Primary antibody: 1/400 dilution Secondary antibody: Goat anti rabbit IgG at 1/8000 dilution Exposure time: 30 seconds Address: 8400 Baltimore Ave., Suite 302, College Park, MD 20740, USA http://www.sabbiotech.com 1 Immunohistochemical analysis of paraffin-embedded Human liver cancer tissue using #43086 at dilution 1/20. Immunohistochemical analysis of paraffin-embedded Human thyroid cancer tissue using #43086 at dilution 1/20. Background This gene encodes a fusion protein that contains an N-terminal histone-lysine N-methyltransferase domain and a C-terminal mariner transposase domain. The encoded protein binds DNA and functions in DNA repair activities including non-homologous end joining and double strand break repair. The SET domain portion of this protein specifically methylates histone H3 lysines 4 and 36. This gene exists as a fusion gene only in anthropoid primates, other organisms lack mariner transposase domain. Note: This product is for in vitro research use only and is not intended for use in humans or animals.
    [Show full text]
  • Open Data for Differential Network Analysis in Glioma
    International Journal of Molecular Sciences Article Open Data for Differential Network Analysis in Glioma , Claire Jean-Quartier * y , Fleur Jeanquartier y and Andreas Holzinger Holzinger Group HCI-KDD, Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Auenbruggerplatz 2/V, 8036 Graz, Austria; [email protected] (F.J.); [email protected] (A.H.) * Correspondence: [email protected] These authors contributed equally to this work. y Received: 27 October 2019; Accepted: 3 January 2020; Published: 15 January 2020 Abstract: The complexity of cancer diseases demands bioinformatic techniques and translational research based on big data and personalized medicine. Open data enables researchers to accelerate cancer studies, save resources and foster collaboration. Several tools and programming approaches are available for analyzing data, including annotation, clustering, comparison and extrapolation, merging, enrichment, functional association and statistics. We exploit openly available data via cancer gene expression analysis, we apply refinement as well as enrichment analysis via gene ontology and conclude with graph-based visualization of involved protein interaction networks as a basis for signaling. The different databases allowed for the construction of huge networks or specified ones consisting of high-confidence interactions only. Several genes associated to glioma were isolated via a network analysis from top hub nodes as well as from an outlier analysis. The latter approach highlights a mitogen-activated protein kinase next to a member of histondeacetylases and a protein phosphatase as genes uncommonly associated with glioma. Cluster analysis from top hub nodes lists several identified glioma-associated gene products to function within protein complexes, including epidermal growth factors as well as cell cycle proteins or RAS proto-oncogenes.
    [Show full text]
  • Mouse Setmar Conditional Knockout Project (CRISPR/Cas9)
    https://www.alphaknockout.com Mouse Setmar Conditional Knockout Project (CRISPR/Cas9) Objective: To create a Setmar conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Setmar gene (NCBI Reference Sequence: NM_178391 ; Ensembl: ENSMUSG00000034639 ) is located on Mouse chromosome 6. 2 exons are identified, with the ATG start codon in exon 1 and the TAG stop codon in exon 2 (Transcript: ENSMUST00000049246). Exon 2 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Setmar gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP23-93D20 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Exon 2 covers 82.85% of the coding region. Start codon is in exon 1, and stop codon is in exon 2. The size of intron 1 for 5'-loxP site insertion: 10466 bp. The size of effective cKO region: ~2167 bp. The cKO region does not have any other known gene. Page 1 of 7 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele 5' gRNA region gRNA region 3' 1 2 Targeting vector Targeted allele Constitutive KO allele (After Cre recombination) Legends Exon of mouse Setmar Homology arm cKO region loxP site Page 2 of 7 https://www.alphaknockout.com Overview of the Dot Plot Window size: 10 bp Forward Reverse Complement Sequence 12 Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats.
    [Show full text]
  • Single-Cell Transcriptome Profiling of the Kidney Glomerulus Identifies Key Cell Types and Reactions to Injury
    BASIC RESEARCH www.jasn.org Single-Cell Transcriptome Profiling of the Kidney Glomerulus Identifies Key Cell Types and Reactions to Injury Jun-Jae Chung ,1 Leonard Goldstein ,2 Ying-Jiun J. Chen,2 Jiyeon Lee ,1 Joshua D. Webster,3 Merone Roose-Girma,2 Sharad C. Paudyal,4 Zora Modrusan,2 Anwesha Dey,5 and Andrey S. Shaw1 Due to the number of contributing authors, the affiliations are listed at the end of this article. ABSTRACT Background The glomerulus is a specialized capillary bed that is involved in urine production and BP control. Glomerular injury is a major cause of CKD, which is epidemic and without therapeutic options. Single-cell transcriptomics has radically improved our ability to characterize complex organs, such as the kidney. Cells of the glomerulus, however, have been largely underrepresented in previous single-cell kidney studies due to their paucity and intractability. Methods Single-cell RNA sequencing comprehensively characterized the types of cells in the glomerulus from healthy mice and from four different disease models (nephrotoxic serum nephritis, diabetes, doxo- rubicin toxicity, and CD2AP deficiency). Results Allcelltypesintheglomeruluswereidentified using unsupervised clustering analysis. Novel marker genes and gene signatures of mesangial cells, vascular smooth muscle cells of the afferent and efferent arteri- oles, parietal epithelial cells, and three types of endothelial cells were identified. Analysis of the disease models revealed cell type–specific and injury type–specific responses in the glomerulus, including acute activation of the Hippo pathway in podocytes after nephrotoxic immune injury. Conditional deletion of YAP or TAZ resulted in more severe and prolonged proteinuria in response to injury, as well as worse glomerulosclerosis.
    [Show full text]
  • Birth of a Chimeric Primate Gene by Capture of the Transposase Gene
    Birth of a chimeric primate gene by capture of the SEE COMMENTARY transposase gene from a mobile element Richard Cordaux*, Swalpa Udit†, Mark A. Batzer*, and Ce´ dric Feschotte†‡ *Department of Biological Sciences, Biological Computation and Visualization Center, Center for BioModular Multi-Scale Systems, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803; and †Department of Biology, University of Texas, Arlington, TX 76019 Edited by Susan R. Wessler, University of Georgia, Athens, GA, and approved March 27, 2006 (received for review February 10, 2006) The emergence of new genes and functions is of central impor- SETMAR transcript, which consists of these three exons, is tance to the evolution of species. The contribution of various types predicted to encode a protein of 671 amino acids and is of duplications to genetic innovation has been extensively inves- supported by 48 human cDNA clones from 18 different normal tigated. Less understood is the creation of new genes by recycling and͞or cancerous tissues (Table 1, which is published as sup- of coding material from selfish mobile genetic elements. To inves- porting information on the PNAS web site; refs. 14 and 15). tigate this process, we reconstructed the evolutionary history of These data suggest that the SETMAR protein is broadly ex- SETMAR, a new primate chimeric gene resulting from fusion of a pressed and has an important, yet unknown, function in human. SET histone methyltransferase gene to the transposase gene of a Recently, it was shown that the SET domain of the SETMAR mobile element. We show that the transposase gene was recruited protein exhibits histone methyltransferase activity (15), as do all as part of SETMAR 40–58 million years ago, after the insertion of known SET domains (16, 17).
    [Show full text]
  • Genome-Wide DNA Methylation Dynamics During Epigenetic
    Gómez‑Redondo et al. Clin Epigenet (2021) 13:27 https://doi.org/10.1186/s13148‑021‑01003‑x RESEARCH Open Access Genome‑wide DNA methylation dynamics during epigenetic reprogramming in the porcine germline Isabel Gómez‑Redondo1*† , Benjamín Planells1†, Sebastián Cánovas2,3, Elena Ivanova4, Gavin Kelsey4,5 and Alfonso Gutiérrez‑Adán1 Abstract Background: Prior work in mice has shown that some retrotransposed elements remain substantially methylated during DNA methylation reprogramming of germ cells. In the pig, however, information about this process is scarce. The present study was designed to examine the methylation profles of porcine germ cells during the time course of epigenetic reprogramming. Results: Sows were artifcially inseminated, and their fetuses were collected 28, 32, 36, 39, and 42 days later. At each time point, genital ridges were dissected from the mesonephros and germ cells were isolated through magnetic‑ activated cell sorting using an anti‑SSEA‑1 antibody, and recovered germ cells were subjected to whole‑genome bisulphite sequencing. Methylation levels were quantifed using SeqMonk software by performing an unbiased analysis, and persistently methylated regions (PMRs) in each sex were determined to extract those regions showing 50% or more methylation. Most genomic elements underwent a dramatic loss of methylation from day 28 to day 36, when the lowest levels were shown. By day 42, there was evidence for the initiation of genomic re‑methylation. We identifed a total of 1456 and 1122 PMRs in male and female germ cells, respectively, and large numbers of transpos‑ able elements (SINEs, LINEs, and LTRs) were found to be located within these PMRs. Twenty‑one percent of the introns located in these PMRs were found to be the frst introns of a gene, suggesting their regulatory role in the expression of these genes.
    [Show full text]
  • Supplemental Data
    SUPPLEMENTAL DATA A transcriptomic continuum of differentiation arrest identifies myeloid interface acute leukemias with poor prognosis Bond et al Contents: • Supplementary Table Legends • Supplementary Tables S6, S8 - S11 • Supplementary Figures S1 - S4 • Supplementary Methods • Supplementary References Supplementary Table Legends: Supplementary Table S1: Gene-sets used for GSEA analysis in this study (see Excel file). Supplementary Table S2: Details of patient cohort (see Excel file). Supplementary Table S3: Differential gene expression analysis comparing AML-like T- ALLs with other T-ALLs (see Excel file). Positive values denote genes with higher expression in AML-like T-ALLs. Supplementary Table S4: Differential gene expression analysis for the comparisons of thymic subset populations indicated in each tab (see Excel files). Supplementary Table S5: ICGS output (see Excel file). First row contains sample names, second row the ICGS clusters, and the following rows guide genes and their normalized expression. The second column indicates the guide genes groups as indicated by the black and white bars in Figure 2A (first tab), Supplementary Figure S3A (second tab (1)) and S3C (third tab (2)). Supplementary Table S6: Genes included in the targeted NGS panel. Supplementary Table S7: Mutational status by NGS (see Excel file). 0 = no mutation, 1 = mutation of known significance, 2 = mutation of unknown significance. Supplementary Table S8: List of genes used for IAL score Supplementary Table S9: Impact of IAL score on outcome according to ELN subgroup. Supplementary Table S10: Comparison of clinicobiological characteristics and mutational profiles of cases with high and low IAL scores in the ALFA-0701 cohort (3). Supplemental Table S11: Univariate analyses of Overall Survival in the ALFA-0701 cohort (3).
    [Show full text]
  • Bioinformatics Tools for the Analysis of Gene-Phenotype Relationships Coupled with a Next Generation Chip-Sequencing Data Processing Pipeline
    Bioinformatics Tools for the Analysis of Gene-Phenotype Relationships Coupled with a Next Generation ChIP-Sequencing Data Processing Pipeline Erinija Pranckeviciene Thesis submitted to the Faculty of Graduate and Postdoctoral Studies in partial fulfillment of the requirements for the Doctorate in Philosophy degree in Cellular and Molecular Medicine Department of Cellular and Molecular Medicine Faculty of Medicine University of Ottawa c Erinija Pranckeviciene, Ottawa, Canada, 2015 Abstract The rapidly advancing high-throughput and next generation sequencing technologies facilitate deeper insights into the molecular mechanisms underlying the expression of phenotypes in living organisms. Experimental data and scientific publications following this technological advance- ment have rapidly accumulated in public databases. Meaningful analysis of currently avail- able data in genomic databases requires sophisticated computational tools and algorithms, and presents considerable challenges to molecular biologists without specialized training in bioinfor- matics. To study their phenotype of interest molecular biologists must prioritize large lists of poorly characterized genes generated in high-throughput experiments. To date, prioritization tools have primarily been designed to work with phenotypes of human diseases as defined by the genes known to be associated with those diseases. There is therefore a need for more prioritiza- tion tools for phenotypes which are not related with diseases generally or diseases with which no genes have yet been associated in particular. Chromatin immunoprecipitation followed by next generation sequencing (ChIP-Seq) is a method of choice to study the gene regulation processes responsible for the expression of cellular phenotypes. Among publicly available computational pipelines for the processing of ChIP-Seq data, there is a lack of tools for the downstream analysis of composite motifs and preferred binding distances of the DNA binding proteins.
    [Show full text]