Glossary of Genomics and Bioinformatics Terms∗ A
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Genome Analysis of the Smallest Free-Living Eukaryote Ostreococcus
Genome analysis of the smallest free-living eukaryote SEE COMMENTARY Ostreococcus tauri unveils many unique features Evelyne Derellea,b, Conchita Ferrazb,c, Stephane Rombautsb,d, Pierre Rouze´ b,e, Alexandra Z. Wordenf, Steven Robbensd, Fre´ de´ ric Partenskyg, Sven Degroeved,h, Sophie Echeynie´ c, Richard Cookei, Yvan Saeysd, Jan Wuytsd, Kamel Jabbarij, Chris Bowlerk, Olivier Panaudi, BenoıˆtPie´ gui, Steven G. Ballk, Jean-Philippe Ralk, Franc¸ois-Yves Bougeta, Gwenael Piganeaua, Bernard De Baetsh, Andre´ Picarda,l, Michel Delsenyi, Jacques Demaillec, Yves Van de Peerd,m, and Herve´ Moreaua,m aObservatoire Oce´anologique, Laboratoire Arago, Unite´Mixte de Recherche 7628, Centre National de la Recherche Scientifique–Universite´Pierre et Marie Curie-Paris 6, BP44, 66651 Banyuls sur Mer Cedex, France; cInstitut de Ge´ne´ tique Humaine, Unite´Propre de Recherche 1142, Centre National de la Recherche Scientifique, 141 Rue de Cardonille, 34396 Montpellier Cedex 5, France; dDepartment of Plant Systems Biology, Flanders Interuniversity Institute for Biotechnology and eLaboratoire Associe´de l’Institut National de la Recherche Agronomique (France), Ghent University, Technologiepark 927, 9052 Ghent, Belgium; fRosenstiel School of Marine and Atmospheric Science, University of Miami, 4600 Rickenbacker Causeway, Miami, FL 33149; gStation Biologique, Unite´Mixte de Recherche 7144, Centre National de la Recherche Scientifique–Universite´Pierre et Marie Curie-Paris 6, BP74, 29682 Roscoff Cedex, France; hDepartment of Applied Mathematics, Biometrics and -
The Genomic Structure and Expression of MJD, the Machado-Joseph Disease Gene
J Hum Genet (2001) 46:413–422 © Jpn Soc Hum Genet and Springer-Verlag 2001 ORIGINAL ARTICLE Yaeko Ichikawa · Jun Goto · Masahira Hattori Atsushi Toyoda · Kazuo Ishii · Seon-Yong Jeong Hideji Hashida · Naoki Masuda · Katsuhisa Ogata Fumio Kasai · Momoki Hirai · Patrícia Maciel Guy A. Rouleau · Yoshiyuki Sakaki · Ichiro Kanazawa The genomic structure and expression of MJD, the Machado-Joseph disease gene Received: March 7, 2001 / Accepted: April 17, 2001 Abstract Machado-Joseph disease (MJD) is an autosomal relative to the MJD gene in B445M7 indicate that there are dominant neurodegenerative disorder that is clinically char- three alternative splicing sites and eight polyadenylation acterized by cerebellar ataxia and various associated symp- signals in MJD that are used to generate the differently toms. The disease is caused by an unstable expansion of the sized transcripts. CAG repeat in the MJD gene. This gene is mapped to chromosome 14q32.1. To determine its genomic structure, Key words Machado-Joseph disease (MJD) · 14q32.1 · we constructed a contig composed of six cosmid clones and CAG repeat · Genome structure · Alternative splicing · eight bacterial artificial chromosome (BAC) clones. It spans mRNA expression approximately 300kb and includes MJD. We also deter- mined the complete sequence (175,330bp) of B445M7, a human BAC clone that contains MJD. The MJD gene was found to span 48,240bp and to contain 11 exons. Northern Introduction blot analysis showed that MJD mRNA is ubiquitously expressed in human tissues, and in at least four different Machado-Joseph disease (MJD) is an autosomal dominant sizes; namely, 1.4, 1.8, 4.5, and 7.5kb. -
Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the Cy Subunit of CFI (Atpa)And the Proteolipid Subunit of Cfo (Atph)
Copyright 0 1987 by the Genetics Society of America Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI (atpA)and the Proteolipid Subunit of CFo (atpH) Steven R. Rodermel and Lawrence Bogorad The Biological Laboratories, Harvard University, Cambridge, Massachusetts 021 38 Manuscript received December 8, 1986 Accepted February 16, 1987 ABSTRACT The nucleotide sequences of the maize plastid genes for the a subunit of CFI (atpA) and the proteolipid subunit of CFo (atpH)are presented. The evolution of these genes among higher plants is characterized by a transition mutation bias of about 2:l and by rates of synonymous and nonsynony- mous substitution which are much lower than similar rates for genes from other sources. This is consistent with the notion that the plastid genome is evolving conservatively in primary sequence. Yet, the mode and tempo of sequence evolution of these and other plastidencoded coupling factor genes are not the same. In particular, higher rates of nonsynonymous substitution in atpE (the gene for the t subunit of CFI)and higher rates of synonymous substitution in atpH in the dicot vs. monocot lineages of higher plants indicate that these sequences are likely subject to different evolutionary constraints in these two lineages. The 5‘- and 3‘- transcribed flanking regions of atpA and atpH from maize, wheat and tobacco are conserved in size, but contain few putative regulatory elements which are conserved either in their spatial arrangement or sequence complexity. However, these regions likely contain variable numbers of “species-specific”regulatory elements. The present studies thus suggest that the plastid genome is not a passive participant in an evolutionary process governed by a more rapidly changing, readily adaptive, nuclear compartment, but that novel strategies for the coordinate expression of genes in the plastid genome may arise through rapid evolution of the flanking sequences of these genes. -
Discovery of Regulatory Elements by a Computational Method for Phylogenetic Footprinting Mathieu Blanchette and Martin Tompa
Discovery of Regulatory Elements by a Computational Method for Phylogenetic Footprinting Mathieu Blanchette and Martin Tompa Presented by Ben Bachman What is a regulatory element? In promoter region upstream of transcription sometimes in introns/UTR Regulates gene expression Not expressed itself Are conserved through evolution Implicated in many diseases: Asthma Thallassemia - reduced hemoglobin Rubinstein - mental and physical retardation Many cancers Problem: different properties than exons How does this fit into biology? G. Orphnides and D. Reinberg (2002) A Unified Theory of Gene Expression. Cell 108: 439-451. How does this fit into biology? http://kachkeis.com/img/essay3_pic1.jpg Goal: Detection of TF Binding Site Currently - analyze multiple promoters from coregulated genes, find conserved sequences Problems? Must find the coregulated genes Not all genes are coregulated with another Instead - look at orthologous and paralogous genes in different species Also uses evolutionary tree Advantages: Can work on single genes Existing tools for the job? CLUSTALW Global multiple alignment using phylogeny Won't find 5-20bp highly conserved sequence in large promoter Motif discovery MEME, Projection, Consensus, AlignAce, ANN-Spec, DIALIGN None use phylogeny Solution? New tool "FootPrinter" Method - Algorithm Dynamic programming For two related leaves, find the most parsimonious way to have all possible k-mers (4^k) for some value of k Continue up the tree Return k-mers under max parsimony score for clade Work back to find locations Only allowed -
A Clone-Array Pooled Shotgun Strategy for Sequencing Large Genomes
Downloaded from genome.cshlp.org on September 24, 2021 - Published by Cold Spring Harbor Laboratory Press Perspective A Clone-Array Pooled Shotgun Strategy for Sequencing Large Genomes Wei-Wen Cai,1,2 Rui Chen,1,2 Richard A. Gibbs,1,2,5 and Allan Bradley1,3,4 1Department of Molecular and Human Genetics, 2Human Genome Sequencing Center, and 3Howard Hughes Medical Institute, Baylor College of Medicine, Houston, Texas 77030, USA A simplified strategy for sequencing large genomes is proposed. Clone-Array Pooled Shotgun Sequencing (CAPSS) is based on pooling rows and columns of arrayed genomic clones,for shotgun library construction. Random sequences are accumulated,and the data are processed by sequential comparison of rows and columns to assemble the sequence of clones at points of intersection. Compared with either a clone-by-clone approach or whole-genome shotgun sequencing,CAPSS requires relatively few library constructions and only minimal computational power for a complete genome assembly. The strategy is suitable for sequencing large genomes for which there are no sequence-ready maps,but for which relatively high resolution STS maps and highly redundant BAC libraries are available. It is immediately applicable to the sequencing of mouse,rat,zebrafish, and other important genomes,and can be managed in a cooperative fashion to take advantage of a distributed international DNA sequencing capacity. Advances in DNA sequencing technology in recent years have Drosophila genome, and the computational requirements to greatly increased the throughput and reduced the cost of ge- perform the necessary pairwise comparisons increase approxi- nome sequencing. Sequencing of a complex genome the size mately as a square of the size of the genome (see Appendix). -
Gene Linkage and Genetic Mapping 4TH PAGES © Jones & Bartlett Learning, LLC
© Jones & Bartlett Learning, LLC © Jones & Bartlett Learning, LLC NOT FOR SALE OR DISTRIBUTION NOT FOR SALE OR DISTRIBUTION © Jones & Bartlett Learning, LLC © Jones & Bartlett Learning, LLC NOT FOR SALE OR DISTRIBUTION NOT FOR SALE OR DISTRIBUTION © Jones & Bartlett Learning, LLC © Jones & Bartlett Learning, LLC NOT FOR SALE OR DISTRIBUTION NOT FOR SALE OR DISTRIBUTION © Jones & Bartlett Learning, LLC © Jones & Bartlett Learning, LLC NOT FOR SALE OR DISTRIBUTION NOT FOR SALE OR DISTRIBUTION Gene Linkage and © Jones & Bartlett Learning, LLC © Jones & Bartlett Learning, LLC 4NOTGenetic FOR SALE OR DISTRIBUTIONMapping NOT FOR SALE OR DISTRIBUTION CHAPTER ORGANIZATION © Jones & Bartlett Learning, LLC © Jones & Bartlett Learning, LLC NOT FOR4.1 SALELinked OR alleles DISTRIBUTION tend to stay 4.4NOT Polymorphic FOR SALE DNA ORsequences DISTRIBUTION are together in meiosis. 112 used in human genetic mapping. 128 The degree of linkage is measured by the Single-nucleotide polymorphisms (SNPs) frequency of recombination. 113 are abundant in the human genome. 129 The frequency of recombination is the same SNPs in restriction sites yield restriction for coupling and repulsion heterozygotes. 114 fragment length polymorphisms (RFLPs). 130 © Jones & Bartlett Learning,The frequency LLC of recombination differs © Jones & BartlettSimple-sequence Learning, repeats LLC (SSRs) often NOT FOR SALE OR DISTRIBUTIONfrom one gene pair to the next. NOT114 FOR SALEdiffer OR in copyDISTRIBUTION number. 131 Recombination does not occur in Gene dosage can differ owing to copy- Drosophila males. 115 number variation (CNV). 133 4.2 Recombination results from Copy-number variation has helped human populations adapt to a high-starch diet. 134 crossing-over between linked© Jones alleles. & Bartlett Learning,116 LLC 4.5 Tetrads contain© Jonesall & Bartlett Learning, LLC four products of meiosis. -
SOP Template Northern Blotting with P-32
Standard Operating Procedure Procedure Northern Blotting using 32P Department Location SOP Prepared By: Section 1: Purpose Northern blotting is a standard method for the detection and quantification of RNA from a cell. This is done by isolating and purifying RNA and using a radioactively-labeled DNA or RNA probe to hybridize to and detect the RNA. Using this technique, temporal and spatial location of RNA expression can be found8. Unlike RT-PCR (real-time PCR), which quantifies the amount of RNA or DNA at various times, Northern blotting can be used not only to quantify but also determine the size of the RNA. It is also useful to perform Northern blotting when studying transfer RNA (tRNA), which appears as the two lowest bands on a gel1. The probes used in Northern blotting do not have to be radioactive, but radioactive probes still have the greatest sensitivity. Quantifying mRNA of tRNA can provide insight into gene expression in cells that are exposed to different environments. Section 2: Personal Protective Equipment and Survey Equipment PPE: • Nitrile Gloves • Lab coat or lab gown • Proper enclosed shoes • Safety glasses Other Equipment: • Geiger counter • Personal chest dosimeter Section 3: Radioactive Material 32P, specific type varies according to what procedure is followed for probe design Supplier: Perkin Elmer Starting activity: 10 mCi/mL Typical use quantities: no more than 10uCi per experiment • Radioactive material (RAM) benchtop exposure time: ~10-20 minutes Activity used per experiment: _______________ RAM handling time: _______________ Frequency of experiment: _______________ Section 4: Potential Hazards • 32P is a high-energy beta emitter and has a half-life of 14.29 days. -
Complete Article
The EMBO Journal Vol. I No. 12 pp. 1539-1544, 1982 Long terminal repeat-like elements flank a human immunoglobulin epsilon pseudogene that lacks introns Shintaro Ueda', Sumiko Nakai, Yasuyoshi Nishida, lack the entire IVS have been found in the gene families of the Hiroshi Hisajima, and Tasuku Honjo* mouse a-globin (Nishioka et al., 1980; Vanin et al., 1980), the lambda chain (Hollis et al., 1982), Department of Genetics, Osaka University Medical School, Osaka 530, human immunoglobulin Japan and the human ,B-tubulin (Wilde et al., 1982a, 1982b). The mouse a-globin processed gene is flanked by long terminal Communicated by K.Rajewsky Received on 30 September 1982 repeats (LTRs) of retrovirus-like intracisternal A particles on both sides, although their orientation is opposite to each There are at least three immunoglobulin epsilon genes (C,1, other (Lueders et al., 1982). The human processed genes CE2, and CE) in the human genome. The nucleotide sequences described above have poly(A)-like tails -20 bases 3' to the of the expressed epsilon gene (CE,) and one (CE) of the two putative poly(A) addition signal and are flanked by direct epsilon pseudogenes were compared. The results show that repeats of several bases on both sides (Hollis et al., 1982; the CE3 gene lacks the three intervening sequences entirely and Wilde et al., 1982a, 1982b). Such direct repeats, which were has a 31-base A-rich sequence 16 bases 3' to the putative also found in human small nuclear RNA pseudogenes poly(A) addition signal, indicating that the CE3 gene is a pro- (Arsdell et al., 1981), might have been formed by repair of cessed gene. -
Blotting Techniques Blotting Is the Technique in Which Nucleic Acids Or
Blotting techniques Blotting is the technique in which nucleic acids or proteins are immobilized onto a solid support generally nylon or nitrocellulose membranes. Blotting of nucleic acid is the central technique for hybridization studies. Nucleic acid labeling and hybridization on membranes have formed the basis for a range of experimental techniques involving understanding of gene expression, organization, etc. Identifying and measuring specific proteins in complex biological mixtures, such as blood, have long been important goals in scientific and diagnostic practice. More recently the identification of abnormal genes in genomic DNA has become increasingly important in clinical research and genetic counseling. Blotting techniques are used to identify unique proteins and nucleic acid sequences. They have been developed to be highly specific and sensitive and have become important tools in both molecular biology and clinical research. General principle The blotting methods are fairly simple and usually consist of four separate steps: electrophoretic separation of protein or of nucleic acid fragments in the sample; transfer to and immobilization on paper support; binding of analytical probe to target molecule on paper; and visualization of bound probe. Molecules in a sample are first separated by electrophoresis and then transferred on to an easily handled support medium or membrane. This immobilizes the protein or DNA fragments, provides a faithful replica of the original separation, and facilitates subsequent biochemical analysis. After being transferred to the support medium the immobilized protein or nucleic acid fragment is localized by the use of probes, such as antibodies or DNA, that specifically bind to the molecule of interest. Finally, the position of the probe that is bound to the immobilized target molecule is visualized usually by autoradiography. -
Probabilities and Statistics in Shotgun Sequencing Shotgun Sequencing
Probabilities and Statistics in Shotgun Sequencing Shotgun Sequencing. Before any analysis of a DNA sequence can take place it is first necessary to determine the actual sequence itself, at least as accurately as is reasonably possible. Unfortunately, technical considerations make it impossible to sequence very long pieces of DNA all at once. Current sequencing technologies allow accurate reading of no more than 500 to 800bp of contiguous DNA sequence. This means that the sequence of an entire genome must be assembled from collections of comparatively short subsequences. This process is called DNA sequence “assembly ”. One approach of sequence assembly is to produce the sequence of a DNA segment (called as a “contig”, or perhaps a genome) from a large number of randomly chosen sequence reads (many overlapping small pieces, each on the order of 500-800 bases). One difficulty of this process is that the locations of the fragments within the genome and with respect to each other are not generally known. However, if enough fragments are sequenced so that there will be many overlaps between them, the fragments can be matched up and assembled. This method is called “ shotgun sequencing .” Shotgun sequencing approaches, including the whole-genome shotgun approach, are currently a central part of all genome-sequencing efforts. These methods require a high level of automation in sample preparation and analysis and are heavily reliant on the power of modern computers. There is an interplay between substrates to be sequenced (genomes and their representation in clone libraries), the analytical tools for generating a DNA sequence, the sequencing strategies, and the computational methods. -
Mitosis Vs. Meiosis
Mitosis vs. Meiosis In order for organisms to continue growing and/or replace cells that are dead or beyond repair, cells must replicate, or make identical copies of themselves. In order to do this and maintain the proper number of chromosomes, the cells of eukaryotes must undergo mitosis to divide up their DNA. The dividing of the DNA ensures that both the “old” cell (parent cell) and the “new” cells (daughter cells) have the same genetic makeup and both will be diploid, or containing the same number of chromosomes as the parent cell. For reproduction of an organism to occur, the original parent cell will undergo Meiosis to create 4 new daughter cells with a slightly different genetic makeup in order to ensure genetic diversity when fertilization occurs. The four daughter cells will be haploid, or containing half the number of chromosomes as the parent cell. The difference between the two processes is that mitosis occurs in non-reproductive cells, or somatic cells, and meiosis occurs in the cells that participate in sexual reproduction, or germ cells. The Somatic Cell Cycle (Mitosis) The somatic cell cycle consists of 3 phases: interphase, m phase, and cytokinesis. 1. Interphase: Interphase is considered the non-dividing phase of the cell cycle. It is not a part of the actual process of mitosis, but it readies the cell for mitosis. It is made up of 3 sub-phases: • G1 Phase: In G1, the cell is growing. In most organisms, the majority of the cell’s life span is spent in G1. • S Phase: In each human somatic cell, there are 23 pairs of chromosomes; one chromosome comes from the mother and one comes from the father. -
An Introduction to Recurrent Nucleotide Interactions in RNA Blake A
Overview An introduction to recurrent nucleotide interactions in RNA Blake A. Sweeney,1 Poorna Roy2 and Neocles B. Leontis2∗ RNA secondary structure diagrams familiar to molecular biologists summarize at a glance the folding of RNA chains to form Watson–Crick paired double helices. However, they can be misleading: First of all, they imply that the nucleotides in loops and linker segments, which can amount to 35% to 50% of a structured RNA, do not significantly interact with other nucleotides. Secondly, they give the impression that RNA molecules are loosely organized in three-dimensional (3D) space. In fact, structured RNAs are compactly folded as a result of numerous long-range, sequence-specific interactions, many of which involve loop or linker nucleotides. Here, we provide an introduction for students and researchers of RNA on the types, prevalence, and sequence variations of inter-nucleotide interactions that structure and stabilize RNA 3D motifs and architectures, using Escherichia coli (E. coli) 16S ribosomal RNA as a concrete example. The picture that emerges is that almost all nucleotides in structured RNA molecules, including those in nominally single-stranded loop or linker regions, form specific interactions that stabilize functional structures or mediate interactions with other molecules. The small number of noninteracting, ‘looped-out’ nucleotides make it possible for the RNA chain to form sharp turns. Base-pairing is the most specific interaction in RNA as it involves edge-to-edge hydrogen bonding (H-bonding) of the bases. Non-Watson–Crick base pairs are a significant fraction (30% or more) of base pairs in structured RNAs. © 2014 John Wiley & Sons, Ltd.