Downloaded from the UCSC Website
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Reference Genome Sequence of the Model Plant Setaria
UC Davis UC Davis Previously Published Works Title Reference genome sequence of the model plant Setaria Permalink https://escholarship.org/uc/item/2rv1r405 Journal Nature Biotechnology, 30(6) ISSN 1087-0156 1546-1696 Authors Bennetzen, Jeffrey L Schmutz, Jeremy Wang, Hao et al. Publication Date 2012-05-13 DOI 10.1038/nbt.2196 Peer reviewed eScholarship.org Powered by the California Digital Library University of California ARTICLES Reference genome sequence of the model plant Setaria Jeffrey L Bennetzen1,13, Jeremy Schmutz2,3,13, Hao Wang1, Ryan Percifield1,12, Jennifer Hawkins1,12, Ana C Pontaroli1,12, Matt Estep1,4, Liang Feng1, Justin N Vaughn1, Jane Grimwood2,3, Jerry Jenkins2,3, Kerrie Barry3, Erika Lindquist3, Uffe Hellsten3, Shweta Deshpande3, Xuewen Wang5, Xiaomei Wu5,12, Therese Mitros6, Jimmy Triplett4,12, Xiaohan Yang7, Chu-Yu Ye7, Margarita Mauro-Herrera8, Lin Wang9, Pinghua Li9, Manoj Sharma10, Rita Sharma10, Pamela C Ronald10, Olivier Panaud11, Elizabeth A Kellogg4, Thomas P Brutnell9,12, Andrew N Doust8, Gerald A Tuskan7, Daniel Rokhsar3 & Katrien M Devos5 We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ~400-Mb assembly covers ~80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. -
Spontaneous Puberty in 46,XX Subjects with Congenital Lipoid Adrenal Hyperplasia
Spontaneous puberty in 46,XX subjects with congenital lipoid adrenal hyperplasia. Ovarian steroidogenesis is spared to some extent despite inactivating mutations in the steroidogenic acute regulatory protein (StAR) gene. K Fujieda, … , T Sugawara, J F Strauss 3rd J Clin Invest. 1997;99(6):1265-1271. https://doi.org/10.1172/JCI119284. Research Article Congenital lipoid adrenal hyperplasia (lipoid CAH) is the most severe form of CAH in which the synthesis of all gonadal and adrenal cortical steroids is markedly impaired. We report here the clinical, endocrinological, and molecular analyses of two unrelated Japanese kindreds of 46,XX subjects affected with lipoid CAH who manifested spontaneous puberty. Phenotypic female infants with 46,XX karyotypes were diagnosed with lipoid CAH as newborns based on a clinical history of failure to thrive, hyperpigmentation, hyponatremia, hyperkalemia, and low basal values of serum cortisol and urinary 17-hydroxycorticosteroid and 17-ketosteroid. These patients responded to treatment with glucocorticoid and 9alpha- fludrocortisone. Spontaneous thelarche occurred in association with increased serum estradiol levels at the age of 10 and 11 yr, respectively. Pubic hair developed at the age of 12 yr 11 mo in one subject and menarche was at the age of 12 yr in both cases. Both subjects reported periodic menstrual bleeding and subsequently developed polycystic ovaries. To investigate the molecular basis of the steroidogenic lesion in these patients, the StAR gene was characterized by PCR and direct DNA sequence -
Y Chromosome Dynamics in Drosophila
Y chromosome dynamics in Drosophila Amanda Larracuente Department of Biology Sex chromosomes X X X Y J. Graves Sex chromosome evolution Proto-sex Autosomes chromosomes Sex Suppressed determining recombination Differentiation X Y Reviewed in Rice 1996, Charlesworth 1996 Y chromosomes • Male-restricted • Non-recombining • Degenerate • Heterochromatic Image from Willard 2003 Drosophila Y chromosome D. melanogaster Cen Hoskins et al. 2015 ~40 Mb • ~20 genes • Acquired from autosomes • Heterochromatic: Ø 80% is simple satellite DNA Photo: A. Karwath Lohe et al. 1993 Satellite DNA • Tandem repeats • Heterochromatin • Centromeres, telomeres, Y chromosomes Yunis and Yasmineh 1970 http://www.chrombios.com Y chromosome assembly challenges • Repeats are difficult to sequence • Underrepresented • Difficult to assemble Genome Sequence read: Short read lengths cannot span repeats Single molecule real-time sequencing • Pacific Biosciences • Average read length ~15 kb • Long reads span repeats • Better genome assemblies Zero mode waveguide Eid et al. 2009 Comparative Y chromosome evolution in Drosophila I. Y chromosome assemblies II. Evolution of Y-linked genes Drosophila genomes 2 Mya 0.24 Mya Photo: A. Karwath P6C4 ~115X ~120X ~85X ~95X De novo genome assembly • Assemble genome Iterative assembly: Canu, Hybrid, Quickmerge • Polish reference Quiver x 2; Pilon 2L 2R 3L 3R 4 X Y Assembled genome Mahul Chakraborty, Ching-Ho Chang 2L 2R 3L 3R 4 X Y Y X/A heterochromatin De novo genome assembly species Total bp # contigs NG50 D. simulans 154,317,203 161 21,495729 -
The Bacteria Genome Pipeline (BAGEP): an Automated, Scalable Workflow for Bacteria Genomes with Snakemake
The Bacteria Genome Pipeline (BAGEP): an automated, scalable workflow for bacteria genomes with Snakemake Idowu B. Olawoye1,2, Simon D.W. Frost3,4 and Christian T. Happi1,2 1 Department of Biological Sciences, Faculty of Natural Sciences, Redeemer's University, Ede, Osun State, Nigeria 2 African Centre of Excellence for Genomics of Infectious Diseases (ACEGID), Redeemer's University, Ede, Osun State, Nigeria 3 Microsoft Research, Redmond, WA, USA 4 London School of Hygiene & Tropical Medicine, University of London, London, United Kingdom ABSTRACT Next generation sequencing technologies are becoming more accessible and affordable over the years, with entire genome sequences of several pathogens being deciphered in few hours. However, there is the need to analyze multiple genomes within a short time, in order to provide critical information about a pathogen of interest such as drug resistance, mutations and genetic relationship of isolates in an outbreak setting. Many pipelines that currently do this are stand-alone workflows and require huge computational requirements to analyze multiple genomes. We present an automated and scalable pipeline called BAGEP for monomorphic bacteria that performs quality control on FASTQ paired end files, scan reads for contaminants using a taxonomic classifier, maps reads to a reference genome of choice for variant detection, detects antimicrobial resistant (AMR) genes, constructs a phylogenetic tree from core genome alignments and provide interactive short nucleotide polymorphism (SNP) visualization across -
Lawrence Berkeley National Laboratory Recent Work
Lawrence Berkeley National Laboratory Recent Work Title 1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life. Permalink https://escholarship.org/uc/item/7cx5710p Journal Nature biotechnology, 35(7) ISSN 1087-0156 Authors Mukherjee, Supratim Seshadri, Rekha Varghese, Neha J et al. Publication Date 2017-07-01 DOI 10.1038/nbt.3886 Peer reviewed eScholarship.org Powered by the California Digital Library University of California RESOU r CE OPEN 1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life Supratim Mukherjee1,10, Rekha Seshadri1,10, Neha J Varghese1, Emiley A Eloe-Fadrosh1, Jan P Meier-Kolthoff2 , Markus Göker2 , R Cameron Coates1,9, Michalis Hadjithomas1, Georgios A Pavlopoulos1 , David Paez-Espino1 , Yasuo Yoshikuni1, Axel Visel1 , William B Whitman3, George M Garrity4,5, Jonathan A Eisen6, Philip Hugenholtz7 , Amrita Pati1,9, Natalia N Ivanova1, Tanja Woyke1, Hans-Peter Klenk8 & Nikos C Kyrpides1 We present 1,003 reference genomes that were sequenced as part of the Genomic Encyclopedia of Bacteria and Archaea (GEBA) initiative, selected to maximize sequence coverage of phylogenetic space. These genomes double the number of existing type strains and expand their overall phylogenetic diversity by 25%. Comparative analyses with previously available finished and draft genomes reveal a 10.5% increase in novel protein families as a function of phylogenetic diversity. The GEBA genomes recruit 25 million previously unassigned metagenomic proteins from 4,650 samples, improving their phylogenetic and functional interpretation. We identify numerous biosynthetic clusters and experimentally validate a divergent phenazine cluster with potential new chemical structure and antimicrobial activity. -
Telomere-To-Telomere Assembly of a Complete Human X Chromosome W
https://doi.org/10.1038/s41586-020-2547-7 Accelerated Article Preview Telomere-to-telomere assembly of a W complete human X chromosome E VI Received: 30 July 2019 Karen H. Miga, Sergey Koren, Arang Rhie, Mitchell R. Vollger, Ariel Gershman, Andrey Bzikadze, Shelise Brooks, Edmund Howe, David Porubsky, GlennisE A. Logsdon, Accepted: 29 May 2020 Valerie A. Schneider, Tamara Potapova, Jonathan Wood, William Chow, Joel Armstrong, Accelerated Article Preview Published Jeanne Fredrickson, Evgenia Pak, Kristof Tigyi, Milinn Kremitzki,R Christopher Markovic, online 14 July 2020 Valerie Maduro, Amalia Dutra, Gerard G. Bouffard, Alexander M. Chang, Nancy F. Hansen, Amy B. Wilfert, Françoise Thibaud-Nissen, Anthony D. Schmitt,P Jon-Matthew Belton, Cite this article as: Miga, K. H. et al. Siddarth Selvaraj, Megan Y. Dennis, Daniela C. Soto, Ruta Sahasrabudhe, Gulhan Kaya, Telomere-to-telomere assembly of a com- Josh Quick, Nicholas J. Loman, Nadine Holmes, Matthew Loose, Urvashi Surti, plete human X chromosome. Nature Rosa ana Risques, Tina A. Graves Lindsay, RobertE Fulton, Ira Hall, Benedict Paten, https://doi.org/10.1038/s41586-020-2547-7 Kerstin Howe, Winston Timp, Alice Young, James C. Mullikin, Pavel A. Pevzner, (2020). Jennifer L. Gerton, Beth A. Sullivan, EvanL E. Eichler & Adam M. Phillippy C This is a PDF fle of a peer-reviewedI paper that has been accepted for publication. Although unedited, the Tcontent has been subjected to preliminary formatting. Nature is providing this early version of the typeset paper as a service to our authors and readers. The text andR fgures will undergo copyediting and a proof review before the paper is published in its fnal form. -
Nonclassic Congenital Lipoid Adrenal Hyperplasia Diagnosed at 17 Months in a Korean Boy with Normal Male Genitalia: Emphasis on Pigmentation As a Diagnostic Clue
Case report https://doi.org/10.6065/apem.2020.25.1.46 Ann Pediatr Endocrinol Metab 2020;25:4651 Nonclassic congenital lipoid adrenal hyperplasia diagnosed at 17 months in a Korean boy with normal male genitalia: emphasis on pigmentation as a diagnostic clue Hosun Bae, MD1, Congenital lipoid adrenal hyperplasia (CLAH) is one of the most fatal conditions Min-Sun Kim, MD1, caused by an abnormality of adrenal and gonadal steroidogenesis. CLAH results Hyojung Park, MD1, from loss-of-function mutations of the steroidogenic acute regulatory (STAR) Ja-Hyun Jang, MD, PhD2, gene; the disease manifests with electrolyte imbalances and hyperpigmentation Jong-Moon Choi, MD, PhD2, in neonates or young infants due to adrenocortical hormone deficiencies, and 46, Sae-Mi Lee, MD, PhD2, XY genetic male CLAH patients can be phenotypically female. Meanwhile, some Sung Yoon Cho, MD, PhD1, patients with STAR mutations develop hyperpigmentation and mild signs of adrenal insufficiency, such as hypoglycemia, after infancy. These patients are classified as Dong-Kyu Jin, MD, PhD1 having nonclassic CLAH (NCCLAH) caused by STAR mutations that retain partial 1Department of Pediatrics, Samsung activity of STAR. We present the case of a Korean boy with normal genitalia who Medical Center, Sungkyunkwan Univer was diagnosed with NCCLAH. He presented with whole-body hyperpigmentation sity School of Medicine, Seoul, Korea and electrolyte abnormalities, which were noted at the age of 17 months after 2GC Genome, Yongin, Korea an episode of sepsis with peritonitis. The compound heterozygous mutations p.Gly221Ser and c.653C>T in STAR were identified by targeted gene-panel sequencing. Skin hyperpigmentation should be considered an important clue for diagnosing NCCLAH. -
Epigenomics Profiling Services
EPIGENOMICS PROFILING SERVICES • Chromatin analysis • DNA methylation analysis • RNA-seq analysis Diagenode helps you uncover the mysteries of epigenetics PAGE 3 Integrative epigenomics analysis DNA methylation analysis • Reduced representation bisulfite sequencing (RRBS) • Whole genome bisulfite sequencing analysis (WGBS) • Genome-wide DNA methylation • Differentially methylated region analysis DNA Chromatin RNA ChIP-seq/ChIP-qPCR analysis RNA-seq analysis • Histone modification ChIP-seq analysis • Small RNA-sequencing • Promoter analysis • mRNA analysis • Enhancer analysis • Whole transcriptome analysis • Transcription factor ChIP-seq analysis • Gene expression profiling • Customized NGS service • Chromatin accessibility (ATAC-seq) www.diagenode.com | PAGE 4 DIAGENODE EPIGENOMICS PROFILING SERVICES Expertise that you can trust Our Epigenomics Profiling Services assure the sample preparation expertise and quality data that you seek. We provide epigenome-wide analyses for understanding epigenetic mechanisms, epigenetics-related drug discovery, epigenetic biomarker identification, and functional epigenomics. Why Diagenode? • Expertise and trust: Recognized epigenetics leader, official partner of BLUEPRINT, IHEC and FAANG • Innovative technology: Utilization of the signature Bioruptor® sonication device for optimal chromatin and DNA shearing and the IP-Star® Automation device give reproducible and reliable optimization and results • Quality: Multiple QC steps in all workflows and validated antibodies plus reagents deliver superior data Human -
Long-Term Follow-Up in a Chinese Child with Congenital Lipoid Adrenal
Zhao et al. BMC Endocrine Disorders (2018) 18:78 https://doi.org/10.1186/s12902-018-0307-6 CASEREPORT Open Access Long-term follow-up in a Chinese child with congenital lipoid adrenal hyperplasia due to a StAR gene mutation Xiu Zhao1,ZheSu1* , Xia Liu1,JianmingSong2,YungenGan3, Pengqiang Wen4,ShoulinLi5,LiWang1 and Lili Pan1 Abstract Background: Congenital lipoid adrenal hyperplasia (CLAH) is an extremely rare and the most severe form of congenital adrenal hyperplasia. Typical features include disorder of sex development, early-onset adrenal crisis and enlarged adrenal glands with fatty accumulation. Case presentation: We report a case of CLAH caused by mutations in the steroidogenic acute regulatory protein (StAR) gene. The patient had typical early-onset adrenal crisis at 2 months of age. She had normal-appearing female genitalia and a karyotype of 46, XY. The serum cortisol and adrenal steroids levels were always nearly undetectable, but the adrenocorticotropic hormone levels were extremely high. Genetic analysis revealed compound heterozygous mutations at c. 229C > T (p.Q77X) in exon 3 and c. 722C > T (p.Q258X) in exon 7 of the StAR gene. The former mutation was previously detected in only two other Chinese CLAH patients. Both mutations cause truncation of the StAR protein.Thecasereportedhereappearstobeaclassicexample of CLAH with very small adrenal glands and is the second reported CLAH case with small adrenal glands thus far. In a 15-year follow-up, the patient’sheight wasapproximatelyaverageforfemalesbeforeage4andfellto− 1 SDS at 10 years of age. Her bone age was similar to her chronological age from age 4 to age 15 years. Conclusions: In conclusion, this is a classic case of CLAH with exceptionally small adrenal glands. -
"The Genecards Suite: from Gene Data Mining to Disease Genome Sequence Analyses". In: Current Protocols in Bioinformat
The GeneCards Suite: From Gene Data UNIT 1.30 Mining to Disease Genome Sequence Analyses Gil Stelzer,1,5 Naomi Rosen,1,5 Inbar Plaschkes,1,2 Shahar Zimmerman,1 Michal Twik,1 Simon Fishilevich,1 Tsippi Iny Stein,1 Ron Nudel,1 Iris Lieder,2 Yaron Mazor,2 Sergey Kaplan,2 Dvir Dahary,2,4 David Warshawsky,3 Yaron Guan-Golan,3 Asher Kohn,3 Noa Rappaport,1 Marilyn Safran,1 and Doron Lancet1,6 1Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel 2LifeMap Sciences Ltd., Tel Aviv, Israel 3LifeMap Sciences Inc., Marshfield, Massachusetts 4Toldot Genetics Ltd., Hod Hasharon, Israel 5These authors contributed equally to the paper 6Corresponding author GeneCards, the human gene compendium, enables researchers to effectively navigate and inter-relate the wide universe of human genes, diseases, variants, proteins, cells, and biological pathways. Our recently launched Version 4 has a revamped infrastructure facilitating faster data updates, better-targeted data queries, and friendlier user experience. It also provides a stronger foundation for the GeneCards suite of companion databases and analysis tools. Improved data unification includes gene-disease links via MalaCards and merged biological pathways via PathCards, as well as drug information and proteome expression. VarElect, another suite member, is a phenotype prioritizer for next-generation sequencing, leveraging the GeneCards and MalaCards knowledgebase. It au- tomatically infers direct and indirect scored associations between hundreds or even thousands of variant-containing genes and disease phenotype terms. Var- Elect’s capabilities, either independently or within TGex, our comprehensive variant analysis pipeline, help prepare for the challenge of clinical projects that involve thousands of exome/genome NGS analyses. -
Human Genome Reference Program (HGRP)
Frequently Asked Questions – Human Genome Reference Program (HGRP) Funding Announcements • RFA-HG-19-002: High Quality Human Reference Genomes (HQRG) • RFA-HG-19-003: Research and Development for Genome Reference Representations (GRR) • RFA-HG-19-004: Human Genome Reference Center (HGRC) • NOT-HG-19-011: Emphasizing Opportunity for Developing Comprehensive Human Genome Sequencing Methodologies Concept Clearance Slides https://www.genome.gov/pages/about/nachgr/september2018agendadocuments/sept2018council_hg _reference_program.pdf. *Note that this presentation includes a Concept for a fifth HGRP component seeking development of informatics tools for the pan-genome. Eligibility Questions 1. Are for-profit entities eligible to apply? a. Only higher education institutes, governments, and non-profits are eligible to apply for the HGRC (RFA-HG-19-004). b. For-profit entities are eligible to apply for HQRG and GRR (RFA-HG-19-002 and 003), and for the Notice for Comprehensive Human Genome Sequencing Methodologies (NOT-HG- 19-011). The Notice also allows SBIR applications. 2. Can foreign institutions apply or receive subcontracts? a. Foreign institutions are eligible to apply to the HQRG and GRR announcements, and Developing Comprehensive Sequencing Methodologies Notice. b. Foreign institutions, including non-domestic (U.S.) components of U.S. organizations, are not eligible for HGRC. However, the FOA does allow foreign components. c. For more information, please see the NIH Grants Policy Statement. 3. Will applications with multiple sites be considered? Yes, applications with multiple sites, providing they are eligible institutions, will be considered. Application Questions 1. How much funding is available for this program? NHGRI has set aside ~$10M total costs per year for the Human Genome Reference Program. -
De Novo Genome Assembly Versus Mapping to a Reference Genome
De novo genome assembly versus mapping to a reference genome Beat Wolf PhD. Student in Computer Science University of Würzburg, Germany University of Applied Sciences Western Switzerland [email protected] 1 Outline ● Genetic variations ● De novo sequence assembly ● Reference based mapping/alignment ● Variant calling ● Comparison ● Conclusion 2 What are variants? ● Difference between a sample (patient) DNA and a reference (another sample or a population consensus) ● Sum of all variations in a patient determine his genotype and phenotype 3 Variation types ● Small variations ( < 50bp) – SNV (Single nucleotide variation) – Indel (insertion/deletion) 4 Structural variations 5 Sequencing technologies ● Sequencing produces small overlapping sequences 6 Sequencing technologies ● Difference read lengths, 36 – 10'000bp (150-500bp is typical) ● Different sequencing technologies produce different data And different kinds of errors – Substitutions (Base replaced by other) – Homopolymers (3 or more repeated bases) ● AAAAA might be read as AAAA or AAAAAA – Insertion (Non existent base has been read) – Deletion (Base has been skipped) – Duplication (cloned sequences during PCR) – Somatic cells sequenced 7 Sequencing technologies ● Standardized output format: FASTQ – Contains the read sequence and a quality for every base http://en.wikipedia.org/wiki/FASTQ_format 8 Recreating the genome ● The problem: – Recreate the original patient genome from the sequenced reads ● For which we dont know where they came from and are noisy ● Solution: – Recreate the genome