Materials and Methods Reference Genome Sampling and DNA

Total Page:16

File Type:pdf, Size:1020Kb

Materials and Methods Reference Genome Sampling and DNA Materials and Methods Reference genome Sampling and DNA extraction In 2012, we extracted DNA from the blood of a male vinous-throated parrotbill (S. w. bulomachus) sampled from Taiwan (24°06’N, 119°57’E) using a QIAGEN Genomic-tip 100/G kit and following the manufacturer’s instructions. Since avian species are of the ZW sex chromosome system, we chose a male sample, which has ZZ sex chromosomes, to ensure obtaining equal coverage of autosomes and sex chromosomes when sequenced and assembled de novo. Genomic library preparation, sequencing and reference de novo assembly Libraries with insert sizes 180 bp, 3 kb and 5 kb were constructed (with the Paired-End DNA Sample Prep Kit for 180 bp library and Mate Pair Library Prep Kit v2 for 3 kb and 5 kb libraries) and sequenced on an Illumina HiSeq 2000 platform by BGI (Beijing Genomic Institute, Hong Kong) and Genomics BioSci & Tech (Taipei, Taiwan) (table S1). The genome was assembled with ALLPATHS-LG (1) version-44099 using parameters PLOIDY=2 and PHRED_64=1. ALLPATHS-LG‘s correction step CleanCorrectedReads and ErrorCorrectJump removed 0.5% of paired-end reads and corrected 71.1% of mate-paired reads with the criteria of low frequency k-mers (K=25 and 96 for paired-end and mate-paired reads separately) (detailed of de novo assembly see table S1 and fig. S1). Masking repeated sequences and removing contaminating DNA A total of 72 Mb (6.8%) of the genome sequence was masked by RepeatMasker (table S2). To remove potential contaminating DNA in the draft genome, we followed the procedure in Ellegren et al. (2). We sampled every one kb without unknown base “N” within 100 kb on each genome scaffold to check for the presence of contaminants. A www.pnas.org/cgi/doi/10.1073/pnas.1813597116 total of 16,115 one kb fragments were sampled and blasted against the NCBI nt database with BLASTN (v. 2.2.30) with an e-value cut-off of 10-5. According to the BLASTN result, fragments of top hit species that were neither birds nor lizards were considered as contaminants. Contigs that contained these fragments were removed from the scaffolds. In total, contaminating DNA was found in 26 scaffolds (15 Mb), which included bacteria (N= 1), fishes (N= 5), arthropods (N= 2), amphibians (N= 6), and mammals (N= 12). Fifteen of them were directly removed from our analysis as the size of contaminating scaffolds equaled the contig sizes, while the rest of them were excluded from the scaffold they were located on. Genome annotation Gene structure and functional annotation We used Trinity (3, 4) to re-analyze 13 Gb of RNA-seq reads from a male S. w. bulomachus (5). Using both de novo assembly and genome guide (using the parrotbill genome assembly as a reference) methods, we generated 122 Mb and 78 Mb of the RNA assembly, respectively. Then, we merged the two assemblies with PASA (6), and finally obtained 89 Mb of parrotbill RNA assembly as gene model to train Augustus (7) for gene prediction. Augustus predicted 20,149 genes - 278 Mb of genes (26% of total genome) with 253 Mb of introns and 25 Mb of coding regions (CDS) - on the parrotbill genome. We extracted and blasted these CDSs to the zebra finch (Taeniopygia guttata) genome in the NCBI nr database with BLAT (standalone version v.34x12) using the following criteria: identity >90% and alignment length >40%; if a single CDS was mapped to multiple genes, the gene with the longest alignment was considered the best match and retained; if multiple CDSs were mapped to the same gene, CDSs with overlapping genomic locations or whose distance from one another was less than the length of the gene were considered parts of the same gene; otherwise, they were considered independent repetitive genes. To explore possible adaptation occurring in the non-coding parts of the genome, we used the ALDB database (8) for chicken lncRNAs; and criteria for lncRNA identification and alignment are both set to 50% of sequence identity. We totally identified 607 chicken lncRNA analogues in the parrotbill genome. After accounting for multiple transcripts of the same gene, we obtained 497 parrotbill lncRNAs. Population re-sequencing Whole genome re-sequencing We extracted DNA from blood samples with traditional chloroform and LiCl extraction precipitation (modified from Gemmell and Akiyama, (9) . The whole genome re-sequencing library with insert size 500 bp was constructed for each individual. Our sequencing data was generated from three different platforms: samples from western Taiwan (Taichung city and Nantou county) were sequenced on an Illumina HiSeq 2000 platform (BGI), samples from eastern Taiwan (Hualien county) were sequenced on an Illumina NextSeq 500 platform (Genomics BioSci & Tech) and samples from the mainland were sequenced on an Illumina HiSeq 2500 platform (Genomics BioSci & Tech). Re-sequencing mapping The algorithm BWA-MEM allows mapping longer reads (read length can be up to 1Mb) on the reference genome. Because six females were included in the sample from a high altitude local population (Hualien County), we excluded one set of the putative haplotype of Z chromosome of these individuals based on the result of Satsuma. Variant calling After mapping raw reads to the reference genome, we used Samtools (9) for variant calling. We set four criteria to filter out SNPs for improving the accuracy of variant calling: (1) alternate bases should be higher than one third of the average coverage; (2) the minimum read depth should not be lower than one third of the average coverage; (3) the maximum read depth should not be twice as high as the average coverage; and (4) INDEL sites were excluded. We used Vcf-tools (10) to generate a consensus sequence for each individual, and then randomly assigned one allele from heterozygous genotypes to two putative haplotypes. Divergence demography of the vinous-throated parrotbill Inferring effective population size PSMC ran with parameters -N25 -t15 -r5 -p "4+25*2+4+6", and the mutation rate and generation time were set to be 4.6 × 10-9 mutations per site per generation (11) and two years a generation, respectively. We also performed bootstrap analysis with option –b, which generated 100 rounds of bootstrapping (fig. S4). Divergence demography inferred by G-PhoCS We sampled one kb sequences without unknown base “N”, with 50 kb jumping size on all of the autosomal sequences. Total of 18,188 one kb sequences were sampled. The model was set by default parameters with standard priors described by Gronau et al.(11), except that we assumed there was no gene flow between populations. The Markov Chain was first run for 100,000 iterations as burn-in; then the parameter values were sampled every 100 iterations in the following 1,000,000 iterations, and finally resulted in 10,000 sampling iterations. Demographic parameters were recalibrated by a substitution rate of 4.6*10-9 per site per generation and an average generation time of 2 years. Genome scan Calculating summary statistics of genetic variation Because a long elapsed time allows recombination to break down haplotypes, standing genetic variation usually only leaves a signature of recent positive selection within a much shorter genomic region than does novel mutation. Therefore, we used 10 kb sliding windows to allow the detection of candidate regions carried by both ancient/short and recent/long haplotypes. Due to the uncertainty of the scaffolds’ position and orientation on the chromosome, sliding windows were generated on a per-scaffold basis with the first window starting at position one of each scaffold. The draft genome was divided into 103,658 non-overlapping windows in sizes of 10 kb. Finding candidate regions We calculated the zFST of autosomes and the Z chromosome separately, because both the homozygosity and high variance of male reproductive success could confer a lower effective population size to Z chromosome than to autosomes. Additionally, genetic drift is expected to operate more efficiently and render a higher level of FST value for genomic regions on the Z chromosome than on autosomes (12). 휋Between− 휋Within The formula for FST is (πBetween: average number of pairwise 휋Between differences between populations; πWithin: average number of pairwise differences within population). In our study, because the πWithin value was small, it brought out the possibility that high FST values could be attributed to linked selection within populations (a low recombination rate could result in low 휋Within values) rather than divergent selection between populations (13, 14). In addition, we also found that the 2 level of linkage disequilibrium (r ) was positively correlated with the FST value (r = 0.06, p < 1 × 10-15; fig. S7) in both pairs of altitudinal population pairs, and negatively correlated with the value (r = -0.27 to -0.29, all of the p values were under 1 × 10-15; fig. S8) in each population. Therefore, we further implemented ΔFST (15, 16) to eliminate the effect of linked selection. The low altitude populations, rather than the high altitude ones, were chosen as controls for population pairs from both sides of the CMR, because the high altitude sampling site in the east is 500m higher than that in the west. Fig. S1. Distribution of the length of (a) contigs (over 50 kb) and (b) scaffolds (over 50 kb). We assembled 68,427 contigs with N50 size of 41.1 kb and total length of 0.99 GB (contig size: 1kb ~ 370kb), and these contigs were concatenated into 6,512 scaffolds with N50 size of 1.94 Mb (scaffold size: 1kb ~ 16Mb). 7 Fig. S2. Length of chromosomes of the (a) zebra finch (Taeniopygia guttata) and (b) vinous-throated parrotbill (aligned by Satusma).
Recommended publications
  • Myopia in African Americans Is Significantly Linked to Chromosome 7P15.2-14.2
    Genetics Myopia in African Americans Is Significantly Linked to Chromosome 7p15.2-14.2 Claire L. Simpson,1,2,* Anthony M. Musolf,2,* Roberto Y. Cordero,1 Jennifer B. Cordero,1 Laura Portas,2 Federico Murgia,2 Deyana D. Lewis,2 Candace D. Middlebrooks,2 Elise B. Ciner,3 Joan E. Bailey-Wilson,1,† and Dwight Stambolian4,† 1Department of Genetics, Genomics and Informatics and Department of Ophthalmology, University of Tennessee Health Science Center, Memphis, Tennessee, United States 2Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland, United States 3The Pennsylvania College of Optometry at Salus University, Elkins Park, Pennsylvania, United States 4Department of Ophthalmology, University of Pennsylvania, Philadelphia, Pennsylvania, United States Correspondence: Joan E. PURPOSE. The purpose of this study was to perform genetic linkage analysis and associ- Bailey-Wilson, NIH/NHGRI, 333 ation analysis on exome genotyping from highly aggregated African American families Cassell Drive, Suite 1200, Baltimore, with nonpathogenic myopia. African Americans are a particularly understudied popula- MD 21131, USA; tion with respect to myopia. [email protected]. METHODS. One hundred six African American families from the Philadelphia area with a CLS and AMM contributed equally to family history of myopia were genotyped using an Illumina ExomePlus array and merged this work and should be considered co-first authors. with previous microsatellite data. Myopia was initially measured in mean spherical equiv- JEB-W and DS contributed equally alent (MSE) and converted to a binary phenotype where individuals were identified as to this work and should be affected, unaffected, or unknown.
    [Show full text]
  • Genetic Profiling of a Cone-Dominated Retina and Cone Photoreceptor Subtypes
    Genetic profiling of a cone-dominated retina and cone photoreceptor subtypes Vincent P. Kunze Date of birth June 8, 1986 Place of birth Kempten (Allgäu) Accepted thesis Doctor rerum naturalium (Dr. rer. nat.) Institute for Biology and Environmental Sciences Carl von Ossietzky Universität Oldenburg First examiner: Prof. Dr. Karl-Wilhelm Koch Second examiner: apl. Prof. Dr. Karin Dedek Date of thesis defense: December 18, 2017 2 Abstract Abstract Mammals have two major types of sensory neurons in the retina: rods, specialized for vision in dim-light, and cones for vision in well-lit conditions and the perception of color. In my thesis, I characterized features of cone photoreceptor subtypes and the cone-dominated retinae of the tree shrew and the thirteen-lined ground squirrel. In tree shrew, I described morphological and transcriptomic changes during development. Compared to mice, the general morphology of the developing retina was similar. There were subtle differences in the onset of transcription factors and in the relative timepoints of photoreceptor genesis. The transcriptomic analysis revealed and onset Wif1-expression after birth, a gene that could potentially suppress rod development in the tree shrew. Furthermore, I looked at molecular differences in cone photoreceptor subtypes. Most mammals have two cone types, namely S- and M-cones. They diverge in their sensitivity to different wavelengths of light, based on their expression of different light-sensitive proteins: S-opsin for blue light and M-opsin for green light. Until now, cones have been classified mostly by the opsin they express. The purpose of this project was to identify genetic differences in cones and, more specifically, to find genes that are involved in cone synapse formation.
    [Show full text]
  • FSHD Region Gene 1 (FRG1) Is Crucial for Angiogenesis Linking FRG1 to Facioscapulohumeral Muscular Dystrophy-Associated Vasculopathy
    University of Massachusetts Medical School eScholarship@UMMS Peter Jones Lab Publications Cell and Developmental Biology Laboratories 2009-05-01 FSHD region gene 1 (FRG1) is crucial for angiogenesis linking FRG1 to facioscapulohumeral muscular dystrophy-associated vasculopathy Ryan Wuebbles University of Illinois at Urbana-Champaign Et al. Let us know how access to this document benefits ou.y Follow this and additional works at: https://escholarship.umassmed.edu/peterjones Part of the Cell Biology Commons, Developmental Biology Commons, Molecular Biology Commons, Molecular Genetics Commons, Musculoskeletal Diseases Commons, and the Nervous System Diseases Commons Repository Citation Wuebbles R, Hanel ML, Jones PL. (2009). FSHD region gene 1 (FRG1) is crucial for angiogenesis linking FRG1 to facioscapulohumeral muscular dystrophy-associated vasculopathy. Peter Jones Lab Publications. https://doi.org/10.1242/dmm.002261. Retrieved from https://escholarship.umassmed.edu/ peterjones/11 Creative Commons License This work is licensed under a Creative Commons Attribution 3.0 License. This material is brought to you by eScholarship@UMMS. It has been accepted for inclusion in Peter Jones Lab Publications by an authorized administrator of eScholarship@UMMS. For more information, please contact [email protected]. Disease Models & Mechanisms 2, 267-274 (2009) doi:10.1242/dmm.002261 RESEARCH ARTICLE FSHD region gene 1 (FRG1) is crucial for angiogenesis linking FRG1 to facioscapulohumeral muscular dystrophy-associated vasculopathy Ryan D. Wuebbles1,*, Meredith L. Hanel1,* and Peter L. Jones1,‡ SUMMARY The genetic lesion that is diagnostic for facioscapulohumeral muscular dystrophy (FSHD) results in an epigenetic misregulation of gene expression, which ultimately leads to the disease pathology. FRG1 (FSHD region gene 1) is a leading candidate for a gene whose misexpression might lead to FSHD.
    [Show full text]
  • Gene Expression During Normal and FSHD Myogenesis Tsumagari Et Al
    Gene expression during normal and FSHD myogenesis Tsumagari et al. Tsumagari et al. BMC Medical Genomics 2011, 4:67 http://www.biomedcentral.com/1755-8794/4/67 (27 September 2011) Tsumagari et al. BMC Medical Genomics 2011, 4:67 http://www.biomedcentral.com/1755-8794/4/67 RESEARCHARTICLE Open Access Gene expression during normal and FSHD myogenesis Koji Tsumagari1, Shao-Chi Chang1, Michelle Lacey2,3, Carl Baribault2,3, Sridar V Chittur4, Janet Sowden5, Rabi Tawil5, Gregory E Crawford6 and Melanie Ehrlich1,3* Abstract Background: Facioscapulohumeral muscular dystrophy (FSHD) is a dominant disease linked to contraction of an array of tandem 3.3-kb repeats (D4Z4) at 4q35. Within each repeat unit is a gene, DUX4, that can encode a protein containing two homeodomains. A DUX4 transcript derived from the last repeat unit in a contracted array is associated with pathogenesis but it is unclear how. Methods: Using exon-based microarrays, the expression profiles of myogenic precursor cells were determined. Both undifferentiated myoblasts and myoblasts differentiated to myotubes derived from FSHD patients and controls were studied after immunocytochemical verification of the quality of the cultures. To further our understanding of FSHD and normal myogenesis, the expression profiles obtained were compared to those of 19 non-muscle cell types analyzed by identical methods. Results: Many of the ~17,000 examined genes were differentially expressed (> 2-fold, p < 0.01) in control myoblasts or myotubes vs. non-muscle cells (2185 and 3006, respectively) or in FSHD vs. control myoblasts or myotubes (295 and 797, respectively). Surprisingly, despite the morphologically normal differentiation of FSHD myoblasts to myotubes, most of the disease-related dysregulation was seen as dampening of normal myogenesis- specific expression changes, including in genes for muscle structure, mitochondrial function, stress responses, and signal transduction.
    [Show full text]
  • SNF2 Chromatin Remodeler-Family Proteins FRG1 and -2 Are Required for RNA-Directed DNA Methylation
    SNF2 chromatin remodeler-family proteins FRG1 and -2 are required for RNA-directed DNA methylation Martin Grotha, Hume Strouda,1, Suhua Fenga,b,c, Maxim V. C. Greenberga,2, Ajay A. Vashishtd, James A. Wohlschlegeld, Steven E. Jacobsena,b,c,3, and Israel Ausine,3 aDepartment of Molecular, Cell, and Developmental Biology, bEli & Edythe Broad Center of Regenerative Medicine & Stem Cell Research, dDepartment of Biological Chemistry, David Geffen School of Medicine, cHoward Hughes Medical Institute, University of California, Los Angeles, CA 90095; and eBasic Forestry and Biotechnology Center, Fujian Agriculture and Forestry University, Fujian, Fuzhou 350002, China Contributed by Steven E. Jacobsen, October 29, 2014 (sent for review August 2, 2014) DNA methylation in Arabidopsis thaliana is maintained by at least four (6, 7). Recruitment of Pol V is mediated by methyl-CG–binding different enzymes: DNA METHYLTRANSFERASE1 (MET1), CHROMO- noncatalytic Su(var)3-9 histone methyltransferase homologs METHYLASE3 (CMT3), DOMAINS REARRANGED METHYLTRANSFER- SUVH2 and SUVH9, and a putative chromatin remodeling ASE2 (DRM2), and CHROMOMETHYLASE2 (CMT2). However, DNA complex called the DRD1-DMS3-RDM1 complex (8, 9). In ad- methylation is established exclusively by the enzyme DRM2, which dition to SUVH2 and SUVH9, SUVR2 from the same family of acts in the RNA-directed DNA methylation (RdDM) pathway. Some SET-domain proteins was also identified as an RdDM factor by RdDM components belong to gene families and have partially redun- a systematic analysis of DNA methylation defects in mutants of dant functions, such as the endoribonucleases DICER-LIKE 2, 3,and4, Su(var)3-9 homologs (10). However, the precise molecular function and INVOLVED IN DE NOVO2 (IDN2) interactors IDN2-LIKE 1 and 2.
    [Show full text]
  • Alzheimer's Disease Neuroimaging Initiative Biomarkers As Quantitative
    Alzheimer’s & Dementia 6 (2010) 265–273 Alzheimer’s Disease Neuroimaging Initiative biomarkers as quantitative phenotypes: Genetics core aims, progress, and plans Andrew J. Saykina,b,*, Li Shena,c, Tatiana M. Foroudb, Steven G. Potkind, Shanker Swaminathana,b, Sungeun Kima,c, Shannon L. Risachera, Kwangsik Nhoa,e, Matthew J. Huentelmanf, David W. Craigf, Paul M. Thompsong, Jason L. Steing, Jason H. Mooreh,i, Lindsay A. Farrerj, Robert C. Greenj, Lars Bertramk, Clifford R. Jack, Jr.l, Michael W. Weinerm,n,o,p; and the Alzheimer’s Disease Neuroimaging Initiative aDepartment of Radiology and Imaging Sciences, Center for Neuroimaging, Indiana University School of Medicine, Indianapolis, IN, USA bDepartment of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA cCenter for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, USA dDepartment of Psychiatry and Human Behavior, University of California, Irvine, CA, USA eDivision of Medical Informatics, Regenstrief Institute, Indianapolis, IN, USA fNeurogenomics Division, The Translational Genomics Research Institute, Phoenix, AZ, USA gLaboratory of Neuro Imaging, UCLA School of Medicine, Los Angeles, CA, USA hDepartment of Genetics, Computational Genetics Laboratory, Dartmouth Medical School, Lebanon, NH, USA iDepartment of Community and Family Medicine, Computational Genetics Laboratory, Dartmouth Medical School, Lebanon, NH, USA jDepartments of Medicine (Genetics Program), Neurology, Epidemiology, Genetics,
    [Show full text]
  • Neuroblastoma Cancer Stem Cells: the Role of NXPH1 and Its Receptor Α-NRXN1
    Neuroblastoma cancer stem cells: The role of NXPH1 and its receptor α-NRXN1 Lucía Fanlo Escudero Aquesta tesi doctoral està subjecta a la llicència Reconeixement- NoComercial 4.0. Espanya de Creative Commons. Esta tesis doctoral está sujeta a la licencia Reconocimiento - NoComercial 4.0. España de Creative Commons. This doctoral thesis is licensed under the Creative Commons Attribution-NonCommercial 4.0. Spain License. UNIVERSITAT DE BARCELONA FACULTAD DE FARMÀCIA I CIÈNCIES DE L’ALIMENTACIÓ Neuroblastoma cancer stem cells: The role of NXPH1 and its receptor α-NRXN1 Lucía Fanlo Escudero 2019 UNIVERSITAT DE BARCELONA FACULTAT DE FARMÀCIA I CIÈNCIES DE L’ALIMENTACIÓ PROGRAMA DE DOCTORAT EN BIOMEDICINA Neuroblastoma cancer stem cells: The role of NXPH1 and its receptor α-NRXN1 Memoria presentada por Lucía Fanlo Escudero para optar al título de Doctora por la Universitat de Barcelona Este trabajo ha sido realizado bajo la dirección de la Dra. Elisa Martí Gorostiza y del Dr. Gwenvael Le Dréau, en el Instituto de Biología Molecular de Barcelona (IBMB-CSIC) Codirectores: Dra. Elisa Martí Gorostiza Dr. Gwenvael Le Dréau Doctoranda: Tutor: Lucía Fanlo Escudero Dr. Carles Enrich Bastús 2019 […] CLARA.- (Piensa:) ¿Cómo es posible que alguien quiera dormir cuando hay tanta hermosura en el aire, en las rocas, en los árboles? Yo no quiero dormir nunca, nunca; no quiero perder ni un segundo de vida, por que ahora que he visto la maravilla de esta noche, me figuro que siempre deben estar su- cediendo cosas maravillosas. Y si sucede una que no pueda ocurrir más que una vez y yo, por estarme durmiendo, no la veo, no me consolaré en mi vida entera.
    [Show full text]
  • Developing a Neural Implant to Enable Controlled Alterations to Brain Architecture Nicholas J. Weir (N0389094) a Thesis Submitte
    Developing a Neural Implant to Enable Controlled Alterations to Brain Architecture Nicholas J. Weir (N0389094) A thesis submitted in partial fulfilment of the requirements of Nottingham Trent University for the degree of Doctor of Philosophy December 2019 1 Copyright Statement This work is the intellectual property of the author. You may copy up to 5% of this work for private study, or personal, non-commercial research. Any re-use of the information contained within this document should be fully referenced, quoting the author, title, university, degree level and pagination. Queries or requests for any other use, or if a more substantial copy is required, should be directed in the owner(s) of the Intellectual Property Rights. 2 Acknowledgements First, I would like to thank my Director of Studies, Chris Tinsley, for his support and encouragement throughout the course of the PhD. His kind words and ability to find the silver lining in any situation made the past few years that much easier. Thanks also to my supervisory team, Alan Hargreaves, Bob Stevens and Martin McGinnity for their advice and thoughts on experiments and data that helped shape the PhD. Whilst not on my supervisory team, I’m also grateful to Amanda Miles for her help with mass spectrometry and endless technical knowledge and to Rich Hulse for his advice and support throughout. Special thanks go to my colleagues. Notably, to Awais and Jordan for their ability to distract and take my mind out of the lab during stressful times and to Charlotte for being a role model of scientific rigour and discipline.
    [Show full text]
  • The DNA Sequence and Comparative Analysis of Human Chromosome 20
    articles The DNA sequence and comparative analysis of human chromosome 20 P. Deloukas, L. H. Matthews, J. Ashurst, J. Burton, J. G. R. Gilbert, M. Jones, G. Stavrides, J. P. Almeida, A. K. Babbage, C. L. Bagguley, J. Bailey, K. F. Barlow, K. N. Bates, L. M. Beard, D. M. Beare, O. P. Beasley, C. P. Bird, S. E. Blakey, A. M. Bridgeman, A. J. Brown, D. Buck, W. Burrill, A. P. Butler, C. Carder, N. P. Carter, J. C. Chapman, M. Clamp, G. Clark, L. N. Clark, S. Y. Clark, C. M. Clee, S. Clegg, V. E. Cobley, R. E. Collier, R. Connor, N. R. Corby, A. Coulson, G. J. Coville, R. Deadman, P. Dhami, M. Dunn, A. G. Ellington, J. A. Frankland, A. Fraser, L. French, P. Garner, D. V. Grafham, C. Grif®ths, M. N. D. Grif®ths, R. Gwilliam, R. E. Hall, S. Hammond, J. L. Harley, P. D. Heath, S. Ho, J. L. Holden, P. J. Howden, E. Huckle, A. R. Hunt, S. E. Hunt, K. Jekosch, C. M. Johnson, D. Johnson, M. P. Kay, A. M. Kimberley, A. King, A. Knights, G. K. Laird, S. Lawlor, M. H. Lehvaslaiho, M. Leversha, C. Lloyd, D. M. Lloyd, J. D. Lovell, V. L. Marsh, S. L. Martin, L. J. McConnachie, K. McLay, A. A. McMurray, S. Milne, D. Mistry, M. J. F. Moore, J. C. Mullikin, T. Nickerson, K. Oliver, A. Parker, R. Patel, T. A. V. Pearce, A. I. Peck, B. J. C. T. Phillimore, S. R. Prathalingam, R. W. Plumb, H. Ramsay, C. M.
    [Show full text]
  • Human Proteins That Interact with RNA/DNA Hybrids
    Downloaded from genome.cshlp.org on October 4, 2021 - Published by Cold Spring Harbor Laboratory Press Resource Human proteins that interact with RNA/DNA hybrids Isabel X. Wang,1,2 Christopher Grunseich,3 Jennifer Fox,1,2 Joshua Burdick,1,2 Zhengwei Zhu,2,4 Niema Ravazian,1 Markus Hafner,5 and Vivian G. Cheung1,2,4 1Howard Hughes Medical Institute, Chevy Chase, Maryland 20815, USA; 2Life Sciences Institute, University of Michigan, Ann Arbor, Michigan 48109, USA; 3Neurogenetics Branch, National Institute of Neurological Disorders and Stroke, NIH, Bethesda, Maryland 20892, USA; 4Department of Pediatrics, University of Michigan, Ann Arbor, Michigan 48109, USA; 5Laboratory of Muscle Stem Cells and Gene Regulation, National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, Maryland 20892, USA RNA/DNA hybrids form when RNA hybridizes with its template DNA generating a three-stranded structure known as the R-loop. Knowledge of how they form and resolve, as well as their functional roles, is limited. Here, by pull-down assays followed by mass spectrometry, we identified 803 proteins that bind to RNA/DNA hybrids. Because these proteins were identified using in vitro assays, we confirmed that they bind to R-loops in vivo. They include proteins that are involved in a variety of functions, including most steps of RNA processing. The proteins are enriched for K homology (KH) and helicase domains. Among them, more than 300 proteins preferred binding to hybrids than double-stranded DNA. These proteins serve as starting points for mechanistic studies to elucidate what RNA/DNA hybrids regulate and how they are regulated.
    [Show full text]
  • Supplemental Table 1
    Carlisle et al. 1 Supplementary Table 1. Primers for PCR amplification of bacterial 16S rRNA. Primers for Group A TA-27FMID1 CGTATCGCCTCCCTCGCGCCATCAGACGAGTGCGTAGAGTTTGATCCTGGCTCAG TA-27FMID2 CGTATCGCCTCCCTCGCGCCATCAGACGCTCGACAAGAGTTTGATCCTGGCTCAG TA-27FMID3 CGTATCGCCTCCCTCGCGCCATCAGAGACGCACTCAGAGTTTGATCCTGGCTCAG TA-27FMID4 CGTATCGCCTCCCTCGCGCCATCAGAGCACTGTAGAGAGTTTGATCCTGGCTCAG TA-27FMID5 CGTATCGCCTCCCTCGCGCCATCAGATCAGACACGAGAGTTTGATCCTGGCTCAG TA-27FMID6 CGTATCGCCTCCCTCGCGCCATCAGATATCGCGAGAGAGTTTGATCCTGGCTCAG TA-27FMID7 CGTATCGCCTCCCTCGCGCCATCAGCGTGTCTCTAAGAGTTTGATCCTGGCTCAG TA-27FMID8 CGTATCGCCTCCCTCGCGCCATCAGCTCGCGTGTCAGAGTTTGATCCTGGCTCAG TA-27FMID9 CGTATCGCCTCCCTCGCGCCATCAGTAGTATCAGCAGAGTTTGATCCTGGCTCAG TA-27FMID10 CGTATCGCCTCCCTCGCGCCATCAGTCTCTATGCGAGAGTTTGATCCTGGCTCAG TA-27FMID11 CGTATCGCCTCCCTCGCGCCATCAGTGATACGTCTAGAGTTTGATCCTGGCTCAG TA-27FMID13 CGTATCGCCTCCCTCGCGCCATCAGCATAGTAGTGAGAGTTTGATCCTGGCTCAG TA-27FMID14 CGTATCGCCTCCCTCGCGCCATCAGCGAGAGATACAGAGTTTGATCCTGGCTCAG TB-338R CTATGCGCCTTGCCAGCCCGCTCAGTGCTGCCTCCCGTAGGAGT Primers for Group B TB-27FMID1 CTATGCGCCTTGCCAGCCCGCTCAGACGAGTGCGTAGAGTTTGATCCTGGCTCAG TB-27FMID2 CTATGCGCCTTGCCAGCCCGCTCAGACGCTCGACAAGAGTTTGATCCTGGCTCAG TB-27FMID3 CTATGCGCCTTGCCAGCCCGCTCAGAGACGCACTCAGAGTTTGATCCTGGCTCAG TB-27FMID4 CTATGCGCCTTGCCAGCCCGCTCAGAGCACTGTAGAGAGTTTGATCCTGGCTCAG TB-27FMID5 CTATGCGCCTTGCCAGCCCGCTCAGATCAGACACGAGAGTTTGATCCTGGCTCAG TB-27FMID6 CTATGCGCCTTGCCAGCCCGCTCAGATATCGCGAGAGAGTTTGATCCTGGCTCAG TB-27FMID7 CTATGCGCCTTGCCAGCCCGCTCAGCGTGTCTCTAAGAGTTTGATCCTGGCTCAG TB-27FMID8 CTATGCGCCTTGCCAGCCCGCTCAGCTCGCGTGTCAGAGTTTGATCCTGGCTCAG
    [Show full text]
  • Cisplatin Treatment of Testicular Cancer Patients Introduces Long-Term Changes in the Epigenome Cecilie Bucher-Johannessen1, Christian M
    Bucher-Johannessen et al. Clinical Epigenetics (2019) 11:179 https://doi.org/10.1186/s13148-019-0764-4 RESEARCH Open Access Cisplatin treatment of testicular cancer patients introduces long-term changes in the epigenome Cecilie Bucher-Johannessen1, Christian M. Page2,3, Trine B. Haugen4 , Marcin W. Wojewodzic1, Sophie D. Fosså1,5,6, Tom Grotmol1, Hege S. Haugnes7,8† and Trine B. Rounge1,9*† Abstract Background: Cisplatin-based chemotherapy (CBCT) is part of standard treatment of several cancers. In testicular cancer (TC) survivors, an increased risk of developing metabolic syndrome (MetS) is observed. In this epigenome- wide association study, we investigated if CBCT relates to epigenetic changes (DNA methylation) and if epigenetic changes render individuals susceptible for developing MetS later in life. We analyzed methylation profiles, using the MethylationEPIC BeadChip, in samples collected ~ 16 years after treatment from 279 Norwegian TC survivors with known MetS status. Among the CBCT treated (n = 176) and non-treated (n = 103), 61 and 34 developed MetS, respectively. We used two linear regression models to identify if (i) CBCT results in epigenetic changes and (ii) epigenetic changes play a role in development of MetS. Then we investigated if these changes in (i) and (ii) links to genes, functional networks, and pathways related to MetS symptoms. Results: We identified 35 sites that were differentially methylated when comparing CBCT treated and untreated TC survivors. The PTK6–RAS–MAPk pathway was significantly enriched with these sites and infers a gene network of 13 genes with CACNA1D (involved in insulin release) as a network hub. We found nominal MetS-associations and a functional gene network with ABCG1 and NCF2 as network hubs.
    [Show full text]