Mouse Cutc Conditional Knockout Project (CRISPR/Cas9)

Total Page:16

File Type:pdf, Size:1020Kb

Mouse Cutc Conditional Knockout Project (CRISPR/Cas9) https://www.alphaknockout.com Mouse Cutc Conditional Knockout Project (CRISPR/Cas9) Objective: To create a Cutc conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Cutc gene (NCBI Reference Sequence: NM_001113562 ; Ensembl: ENSMUSG00000025193 ) is located on Mouse chromosome 19. 9 exons are identified, with the ATG start codon in exon 1 and the TAG stop codon in exon 9 (Transcript: ENSMUST00000112047). Exon 5~6 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Cutc gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP24-63D21 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Exon 5 starts from about 49.14% of the coding region. The knockout of Exon 5~6 will result in frameshift of the gene. The size of intron 4 for 5'-loxP site insertion: 629 bp, and the size of intron 6 for 3'-loxP site insertion: 1953 bp. The size of effective cKO region: ~2818 bp. The cKO region does not have any other known gene. Page 1 of 7 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele 5' gRNA region gRNA region 3' 1 3 4 5 6 7 9 Targeting vector Targeted allele Constitutive KO allele (After Cre recombination) Legends Exon of mouse Cutc Homology arm cKO region loxP site Page 2 of 7 https://www.alphaknockout.com Overview of the Dot Plot Window size: 10 bp Forward Reverse Complement Sequence 12 Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats. Tandem repeats are found in the dot plot matrix. It may be difficult to construct this targeting vector. Overview of the GC Content Distribution Window size: 300 bp Sequence 12 Summary: Full Length(9318bp) | A(27.98% 2607) | C(20.21% 1883) | T(31.69% 2953) | G(20.12% 1875) Note: The sequence of homologous arms and cKO region is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis. Page 3 of 7 https://www.alphaknockout.com BLAT Search Results (up) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 3000 1 3000 3000 100.0% chr19 + 43757440 43760439 3000 browser details YourSeq 568 8 930 3000 86.8% chrX - 150759692 150760586 895 browser details YourSeq 565 1 929 3000 88.5% chr7 + 97908609 97909524 916 browser details YourSeq 553 1 929 3000 88.5% chr13 - 106781027 106781944 918 browser details YourSeq 551 1 879 3000 88.2% chr19 + 23685293 23686155 863 browser details YourSeq 548 31 1080 3000 86.8% chr2 + 91608258 91609627 1370 browser details YourSeq 546 31 928 3000 88.9% chr2 + 155455985 155456860 876 browser details YourSeq 545 2 923 3000 86.6% chr16 - 16005980 16006871 892 browser details YourSeq 543 1 902 3000 87.3% chr10 - 109908842 109909726 885 browser details YourSeq 522 2 898 3000 88.0% chr19 - 30369964 30370849 886 browser details YourSeq 520 21 904 3000 89.6% chr2 + 144879947 144880812 866 browser details YourSeq 512 1 929 3000 87.1% chr2 - 24236260 24237164 905 browser details YourSeq 497 19 917 3000 89.1% chr7 + 108958696 108959571 876 browser details YourSeq 497 1 898 3000 89.3% chr6 + 65844408 65845312 905 browser details YourSeq 489 62 929 3000 87.4% chr3 + 67649853 67650657 805 browser details YourSeq 487 1 929 3000 91.6% chr8 - 31365432 31366362 931 browser details YourSeq 487 60 929 3000 83.2% chr4 + 17018749 17019588 840 browser details YourSeq 486 1 930 3000 87.6% chr16 - 87064652 87065566 915 browser details YourSeq 485 20 902 3000 85.9% chr11 + 92838285 92839145 861 browser details YourSeq 484 173 929 3000 90.1% chr10 - 11270978 11271748 771 Note: The 3000 bp section upstream of Exon 5 is BLAT searched against the genome. No significant similarity is found. BLAT Search Results (down) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 3000 1 3000 3000 100.0% chr19 + 43763258 43766257 3000 browser details YourSeq 116 1324 1487 3000 83.3% chr1 - 191545386 191545536 151 browser details YourSeq 113 1304 1474 3000 83.5% chr15 - 23941843 23942000 158 browser details YourSeq 111 1303 1479 3000 85.8% chr14 - 47712914 47713075 162 browser details YourSeq 110 1303 1479 3000 81.1% chr13 + 57121249 57121408 160 browser details YourSeq 108 1325 1479 3000 82.9% chr9 + 94642623 94642764 142 browser details YourSeq 104 1300 1472 3000 86.6% chr14 - 56089056 56089226 171 browser details YourSeq 103 1334 1465 3000 88.1% chr11 + 121093516 121093645 130 browser details YourSeq 103 1341 1487 3000 82.2% chr11 + 8820872 8821012 141 browser details YourSeq 100 1323 1465 3000 82.7% chr11 - 83338300 83338429 130 browser details YourSeq 100 1323 1469 3000 83.1% chr11 + 97425330 97425468 139 browser details YourSeq 99 1345 1474 3000 89.7% chr13 - 98997767 99071364 73598 browser details YourSeq 99 1303 1465 3000 78.6% chr10 + 8920094 8920229 136 browser details YourSeq 99 1321 1479 3000 84.2% chr1 + 128059044 128059191 148 browser details YourSeq 98 1324 1465 3000 83.5% chr11 - 85651308 85651435 128 browser details YourSeq 95 1331 1468 3000 87.8% chr11 - 90673250 90673386 137 browser details YourSeq 95 1336 1465 3000 84.0% chr11 - 20345224 20345349 126 browser details YourSeq 94 1342 1474 3000 82.9% chr6 - 87472569 87472697 129 browser details YourSeq 94 1297 1452 3000 79.4% chr11 - 96754288 96754419 132 browser details YourSeq 93 1324 1465 3000 82.8% chr16 - 10253194 10253322 129 Note: The 3000 bp section downstream of Exon 6 is BLAT searched against the genome. No significant similarity is found. Page 4 of 7 https://www.alphaknockout.com Gene and protein information: Cutc cutC copper transporter [ Mus musculus (house mouse) ] Gene ID: 66388, updated on 12-Aug-2019 Gene summary Official Symbol Cutc provided by MGI Official Full Name cutC copper transporter provided by MGI Primary source MGI:MGI:1913638 See related Ensembl:ENSMUSG00000025193 Gene type protein coding RefSeq status VALIDATED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as CGI-32; AI326282; 2310039I18Rik Expression Ubiquitous expression in testis adult (RPKM 6.5), bladder adult (RPKM 4.1) and 28 other tissues See more Orthologs human all Genomic context Location: 19; 19 C3 See Cutc in Genome Data Viewer Exon count: 9 Annotation release Status Assembly Chr Location 108 current GRCm38.p6 (GCF_000001635.26) 19 NC_000085.6 (43753023..43768638) Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 19 NC_000085.5 (43827513..43843128) Chromosome 19 - NC_000085.6 Page 5 of 7 https://www.alphaknockout.com Transcript information: This gene has 4 transcripts Gene: Cutc ENSMUSG00000025193 Description cutC copper transporter [Source:MGI Symbol;Acc:MGI:1913638] Gene Synonyms 2310039I18Rik, CGI-32 Location Chromosome 19: 43,752,996-43,768,638 forward strand. GRCm38:CM001012.2 About this gene This gene has 4 transcripts (splice variants), 192 orthologues and is a member of 1 Ensembl protein family. Transcripts Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags Cutc-201 ENSMUST00000026199.13 1298 262aa ENSMUSP00000026199.7 Protein coding CCDS29837 F8WHX2 TSL:1 GENCODE basic Cutc-202 ENSMUST00000112047.9 1233 272aa ENSMUSP00000107678.3 Protein coding CCDS50443 Q9D8X1 TSL:1 GENCODE basic APPRIS P1 Cutc-204 ENSMUST00000153295.1 788 254aa ENSMUSP00000118906.1 Protein coding - D3YY50 CDS 3' incomplete TSL:3 Cutc-203 ENSMUST00000123564.1 685 No protein - Retained intron - - TSL:3 35.64 kb Forward strand 43.75Mb 43.76Mb 43.77Mb Genes Cutc-201 >protein coding (Comprehensive set... Cutc-202 >protein coding Cutc-204 >protein coding Cutc-203 >retained intron Contigs < AC141888.4 Genes < Cox15-201protein coding (Comprehensive set... Regulatory Build 43.75Mb 43.76Mb 43.77Mb Reverse strand 35.64 kb Regulation Legend CTCF Open Chromatin Promoter Promoter Flank Transcription Factor Binding Site Gene Legend Protein Coding merged Ensembl/Havana Ensembl protein coding Non-Protein Coding processed transcript Page 6 of 7 https://www.alphaknockout.com Transcript: ENSMUST00000112047 15.54 kb Forward strand Cutc-202 >protein coding ENSMUSP00000107... Superfamily Copper homeostasis (CutC) domain superfamily Pfam Copper homeostasis CutC domain PANTHER Copper homeostasis protein CutC HAMAP Copper homeostasis protein CutC Gene3D Copper homeostasis (CutC) domain superfamily All sequence SNPs/i... Sequence variants (dbSNP and all other sources) Variant Legend missense variant synonymous variant Scale bar 0 40 80 120 160 200 272 We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC. Page 7 of 7.
Recommended publications
  • Alterations of Genetic Variants and Transcriptomic Features of Response to Tamoxifen in the Breast Cancer Cell Line
    Alterations of Genetic Variants and Transcriptomic Features of Response to Tamoxifen in the Breast Cancer Cell Line Mahnaz Nezamivand-Chegini Shiraz University Hamed Kharrati-Koopaee Shiraz University https://orcid.org/0000-0003-2345-6919 seyed taghi Heydari ( [email protected] ) Shiraz University of Medical Sciences https://orcid.org/0000-0001-7711-1137 Hasan Giahi Shiraz University Ali Dehshahri Shiraz University of Medical Sciences Mehdi Dianatpour Shiraz University of Medical Sciences Kamran Bagheri Lankarani Shiraz University of Medical Sciences Research Keywords: Tamoxifen, breast cancer, genetic variants, RNA-seq. Posted Date: August 17th, 2021 DOI: https://doi.org/10.21203/rs.3.rs-783422/v1 License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License Page 1/33 Abstract Background Breast cancer is one of the most important causes of mortality in the world, and Tamoxifen therapy is known as a medication strategy for estrogen receptor-positive breast cancer. In current study, two hypotheses of Tamoxifen consumption in breast cancer cell line (MCF7) were investigated. First, the effect of Tamoxifen on genes expression prole at transcriptome level was evaluated between the control and treated samples. Second, due to the fact that Tamoxifen is known as a mutagenic factor, there may be an association between the alterations of genetic variants and Tamoxifen treatment, which can impact on the drug response. Methods In current study, the whole-transcriptome (RNA-seq) dataset of four investigations (19 samples) were derived from European Bioinformatics Institute (EBI). At transcriptome level, the effect of Tamoxifen was investigated on gene expression prole between control and treatment samples.
    [Show full text]
  • Role and Regulation of the P53-Homolog P73 in the Transformation of Normal Human Fibroblasts
    Role and regulation of the p53-homolog p73 in the transformation of normal human fibroblasts Dissertation zur Erlangung des naturwissenschaftlichen Doktorgrades der Bayerischen Julius-Maximilians-Universität Würzburg vorgelegt von Lars Hofmann aus Aschaffenburg Würzburg 2007 Eingereicht am Mitglieder der Promotionskommission: Vorsitzender: Prof. Dr. Dr. Martin J. Müller Gutachter: Prof. Dr. Michael P. Schön Gutachter : Prof. Dr. Georg Krohne Tag des Promotionskolloquiums: Doktorurkunde ausgehändigt am Erklärung Hiermit erkläre ich, dass ich die vorliegende Arbeit selbständig angefertigt und keine anderen als die angegebenen Hilfsmittel und Quellen verwendet habe. Diese Arbeit wurde weder in gleicher noch in ähnlicher Form in einem anderen Prüfungsverfahren vorgelegt. Ich habe früher, außer den mit dem Zulassungsgesuch urkundlichen Graden, keine weiteren akademischen Grade erworben und zu erwerben gesucht. Würzburg, Lars Hofmann Content SUMMARY ................................................................................................................ IV ZUSAMMENFASSUNG ............................................................................................. V 1. INTRODUCTION ................................................................................................. 1 1.1. Molecular basics of cancer .......................................................................................... 1 1.2. Early research on tumorigenesis ................................................................................. 3 1.3. Developing
    [Show full text]
  • MOCHI Enables Discovery of Heterogeneous Interactome Modules in 3D Nucleome
    Downloaded from genome.cshlp.org on October 4, 2021 - Published by Cold Spring Harbor Laboratory Press MOCHI enables discovery of heterogeneous interactome modules in 3D nucleome Dechao Tian1,# , Ruochi Zhang1,# , Yang Zhang1, Xiaopeng Zhu1, and Jian Ma1,* 1Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA #These two authors contributed equally *Correspondence: [email protected] Contact To whom correspondence should be addressed: Jian Ma School of Computer Science Carnegie Mellon University 7705 Gates-Hillman Complex 5000 Forbes Avenue Pittsburgh, PA 15213 Phone: +1 (412) 268-2776 Email: [email protected] 1 Downloaded from genome.cshlp.org on October 4, 2021 - Published by Cold Spring Harbor Laboratory Press Abstract The composition of the cell nucleus is highly heterogeneous, with different constituents forming complex interactomes. However, the global patterns of these interwoven heterogeneous interactomes remain poorly understood. Here we focus on two different interactomes, chromatin interaction network and gene regulatory network, as a proof-of-principle, to identify heterogeneous interactome modules (HIMs), each of which represents a cluster of gene loci that are in spatial contact more frequently than expected and that are regulated by the same group of transcription factors. HIM integrates transcription factor binding and 3D genome structure to reflect “transcriptional niche” in the nucleus. We develop a new algorithm MOCHI to facilitate the discovery of HIMs based on network motif clustering in heterogeneous interactomes. By applying MOCHI to five different cell types, we found that HIMs have strong spatial preference within the nucleus and exhibit distinct functional properties. Through integrative analysis, this work demonstrates the utility of MOCHI to identify HIMs, which may provide new perspectives on the interplay between transcriptional regulation and 3D genome organization.
    [Show full text]
  • Genetic Analyses of Epigenetic Predictors That Estimate Aging, 2 Metabolic Traits, and Lifespan 3 Khyobeni Mozhui 1, 2*, Ake T
    bioRxiv preprint doi: https://doi.org/10.1101/2021.06.23.449634; this version posted June 24, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1 Genetic Analyses of Epigenetic Predictors that Estimate Aging, 2 Metabolic Traits, and Lifespan 3 Khyobeni Mozhui 1, 2*, Ake T. Lu3, Caesar Z Li 3, Amin Haghani 4, Jose Vladimir Sandoval-Sierra 1, 4 Robert W. Williams 2, Steve Horvath3, 4 5 6 1 Department of Preventive Medicine, University of Tennessee Health Science Center, College of 7 Medicine, Memphis, Tennessee, USA 8 2 Department of Genetics, Genomics and Informatics, University of Tennessee Health Science 9 Center, College of Medicine, Memphis, Tennessee, USA 10 3 Department of Human Genetics, David Geffen School of Medicine, University of California Los 11 Angeles, Los Angeles, CA, USA 12 4 Department of Biostatistics, Fielding School of Public Health, University of California Los 13 Angeles, Los Angeles, CA, USA 14 15 *Correspondence: 16 Khyobeni Mozhui 17 [email protected] 1 bioRxiv preprint doi: https://doi.org/10.1101/2021.06.23.449634; this version posted June 24, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 18 Abstract 19 DNA methylation (DNAm) clocks are accurate molecular biomarkers of aging.
    [Show full text]
  • Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations Fall 2010 Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress Renuka Nayak University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Computational Biology Commons, and the Genomics Commons Recommended Citation Nayak, Renuka, "Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress" (2010). Publicly Accessible Penn Dissertations. 1559. https://repository.upenn.edu/edissertations/1559 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/1559 For more information, please contact [email protected]. Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress Abstract Genes interact in networks to orchestrate cellular processes. Here, we used coexpression networks based on natural variation in gene expression to study the functions and interactions of human genes. We asked how these networks change in response to stress. First, we studied human coexpression networks at baseline. We constructed networks by identifying correlations in expression levels of 8.9 million gene pairs in immortalized B cells from 295 individuals comprising three independent samples. The resulting networks allowed us to infer interactions between biological processes. We used the network to predict the functions of poorly-characterized human genes, and provided some experimental support. Examining genes implicated in disease, we found that IFIH1, a diabetes susceptibility gene, interacts with YES1, which affects glucose transport. Genes predisposing to the same diseases are clustered non-randomly in the network, suggesting that the network may be used to identify candidate genes that influence disease susceptibility.
    [Show full text]
  • ABSTRACTS 2013 Prise.Harvard.Edu Layout and Design: Ved Topkar ‘16 • Jennifer Tu ‘16 -4- TABLE of CONTENTS
    -2- ABSTRACTS 2013 prise.harvard.edu LAYOUT AND DESIGN: Ved Topkar ‘16 • Jennifer Tu ‘16 -4- TABLE OF CONTENTS Letter from the Director 7 Letter from the Editors 7 Abstracts BLISS 8 PRIMO 13 PRISE Astronomy and Astrophysics 17 Chemistry and Biochemistry 19 Computer Science 23 Earth and Planetary Sciences 25 Engineering and Bioengineering 26 Mathematics and Statistics 30 Microbiology and Immunology 34 Molecular and Cellular Biology 37 Neuroscience and Psychology 45 Organismic and Evolutionary Biology 53 Physics and Biophysics 54 Stem Cell and Regenerative Biology 57 SHARP 62 Acknowledgments 65 -5- -6- Letter from the Director Harvard’s Summer Undergraduate Research Village 2013 has been an outstanding collaboration of College scholars across the full spectrum of academic disciplines conducting formative and substantive research projects with Harvard faculty in the Faculty of Arts and Sciences, virtually all the other Schools in the Harvard universe, the teaching hospitals, and affiliated research enterprises across the Boston area. Along with the Program for Research in Science and Engineering (PRISE, now in its 8th year), the Behavioral Laboratory in the Social Sciences (BLISS, year 3), and the Program for Research in Markets and Organizations (PRIMO, co-hosted by the Harvard Business School, year 3), we have added the pilot SHARP: the Summer Humanities and Arts Research Program. I continue to be gratified and amazed by the consistently high degrees of enthusiasm, inclusivity, creativity, and energy in the Research Village residents of Mather House—the mind-boggling, diverse calendar of fellow-initiated activities is a testimony of the great effort and works in building community among these talented young investigators.
    [Show full text]
  • Guo, Updegrove Et Al., Supplemental Information 1
    Guo, Updegrove et al., Supplemental Information 1 Supplemental Information MicL, a new σE-dependent sRNA, combats envelope stress by repressing synthesis of Lpp, the major outer membrane lipoprotein 1,4 2,4 1,5 3 Monica S. Guo, Taylor B. Updegrove, Emily B. Gogol, Svetlana A. Shabalina, Carol A. Gross,1,6 and Gisela Storz2,6 1Department of Microbiology and Immunology, University of California, San Francisco, CA 94158, USA; 2Cell Biology and Metabolism Program, Eunice Kennedy Shriver National Institutes of Health, Bethesda, MD 20892, USA; 3National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA 4These authors contributed equally to this work. 5Current affiliation: Genentech 6Corresponding authors Email [email protected] Email [email protected] Guo, Updegrove et al., Supplemental Information 2 Supplemental Figures Guo, Updegrove et al., Supplemental Information 3 Supplemental Figure S1. Re-annotation of cutC region. (A) Transcription of the yecM-cutC region as observed in mRNA-seq. cutC appears to be co-transcribed with yecM in all conditions tested. (B) Sequence of micL-cutC genomic region. MicL (chromosome positions 1956771 to 1956462) and MicL-S (chromosome positions 1956544 to 1956462) terminate within the poly(T) stretch of the rho-independent terminator of cutC (stem indicated by arrows). The predicted -10 and -35 σE promoter elements of micL are underlined and the site of cleavage to generate MicL-S is indicated by the vertical arrow. A previous study reported a σE-dependent promoter for cutC at position 1957010 (Dartigalongue et al., 2001). We find no evidence for this site in our tiling array, deep sequencing and Northern data upon σE overexpression.
    [Show full text]
  • Statistical Methods and Analysis to Identify Disease-Related Variants
    Statistical Methods and Analysis to Identify Disease-Related Variants from Genetics Studies by Sai Chen A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy (Bioinformatics) in the University of Michigan 2017 Doctoral Committee: Professor Goncalo Abecasis, Chair Assisstant Professor Hyun Min Kang Assisstant Professor Jefferey M. Kidd Associate Professor Jun Li Professor Kerby Shedden © Sai Chen 2017 [email protected] ORCID: 0000-0003-3106-5643 To my love Bing Dedication ii Acknowledgements I would like to express my greatest gratitude to Dr. Goncalo Abecasis for his guidance in my research. When I meet difficulties in my research, his suggestions always opened a brand new view, and finally helped me solve the problem in a precise and simple way. He patiently gave me time and flexibility to try and to learn from such a broad range, and thanks to him, I finally found my true motivation and interests. I am so grateful to have such an understanding and brilliant mentor during my PhD. At the same time, I would like to thank Dr Hyun Min Kang. He guided me all the way through the GECCO project, and is always willing to teach me any new coding and analysis skills. I would like to thank my committee Dr Jun Li, Dr Jefferey Kidd and Dr Kerby Shedden. The rotations in their groups were my first step in Bioinformatics. I would like to thank Kerby Shedden for valuable suggestions on the statistical components in my thesis. Apart from my committee members, I would like to thank all my collaborators.
    [Show full text]
  • Network Mining Approach to Cancer Biomarker Discovery
    NETWORK MINING APPROACH TO CANCER BIOMARKER DISCOVERY THESIS Presented in Partial Fulfillment of the Requirements for the Degree Master of Science in the Graduate School of The Ohio State University By Praneeth Uppalapati, B.E. Graduate Program in Computer Science and Engineering The Ohio State University 2010 Thesis Committee: Dr. Kun Huang, Advisor Dr. Raghu Machiraju Copyright by Praneeth Uppalapati 2010 ABSTRACT With the rapid development of high throughput gene expression profiling technology, molecule profiling has become a powerful tool to characterize disease subtypes and discover gene signatures. Most existing gene signature discovery methods apply statistical methods to select genes whose expression values can differentiate different subject groups. However, a drawback of these approaches is that the selected genes are not functionally related and hence cannot reveal biological mechanism behind the difference in the patient groups. Gene co-expression network analysis can be used to mine functionally related sets of genes that can be marked as potential biomarkers through survival analysis. We present an efficient heuristic algorithm EigenCut that exploits the properties of gene co- expression networks to mine functionally related and dense modules of genes. We apply this method to brain tumor (Glioblastoma Multiforme) study to obtain functionally related clusters. If functional groups of genes with predictive power on patient prognosis can be identified, insights on the mechanisms related to metastasis in GBM can be obtained and better therapeutical plan can be developed. We predicted potential biomarkers by dividing the patients into two groups based on their expression profiles over the genes in the clusters and comparing their survival outcome through survival analysis.
    [Show full text]
  • Download the Supplementary Figures and Tables
    Supplementary Figures and Tables The EFI Web Resource for Genomic Enzymology Web Tools: Leveraging Protein, Genome, and Metagenome Databases to Discover Novel Enzymes and Metabolic Pathways Rémi Zallot1, Nils O. Oberg1, and John A. Gerlt*,1, 2, 3 1Institute for Genomic Biology, 2Department of Biochemistry, and 3Department of Chemistry, University of Illinois at Urbana-Champaign, 1206 West Gregory Drive, Urbana, Illinois 61801, United States. Corresponding author: John A. Gerlt ([email protected]). Phone: +1-217-244-7414 180 160 140 TrEMBL 120 SwissProt 100 80 Millions 60 40 20 0 1996 1999 2002 2005 2008 2011 2014 2017 Figure S1. The growth of the UniProtKB database. The web tools use the UniProtKB database for sequences used to generate SSNs and bioinformatic data included as node attributes in the SSNs. UniProtKB is the aggregate of the UniProtKB/TrEMBL database that contains computationally annotated entries [156,077,686 in Release 2019_04 (08-May-2019)] and the UniProtKB/SwissProt database that contains manually curated entries (560,118 in Release 2019_04). The decrease in 2015 is the result of archiving sequences from redundant proteomes in UniParc to manage the growth of the database. A B C Figure S2. The home pages for the web tools. Panel A, EFI-EST for generating sequence similarity networks (SSNs), accessible at https://efi.igb.illinois.edu/efi-est/. Panel B, EFI-GNT for collecting and analyzing genome context for bacterial, archaeal, and fungal proteins in SSNs, accessible at https://efi.igb.illinois.edu/efi-gnt/. Panel C, EFI-CGFP for mapping metagenome abundance to SSNs clusters, accessible at https://efi.igb.illinois.edu/efi-cgfp/.
    [Show full text]
  • FARE2015 WINNERS Sorted by Study Section
    FARE2015 WINNERS Sorted By Study Section Biochemistry - Proteins Andrezza Campos-Chagas Visiting Fellow NIAID Sicpin, the first immunomodulatory salivary protein described in black flies significantly reduced T and B cell proliferation and directly binds to soluble CD4 receptor Hematophagy is key to blood feeding arthropods reproductive success and an important link in pathogen transmission cycles. Salivary gland homogenates from blackflies have been shown to contain immunomodulatory activity on murine splenocytes. However, the molecule(s) responsible for this salivary activity remains elusive thus far. Here, we report the first immunosuppressive protein from blackfly salivary glands. Sicpin (Simulium cell proliferation inhibitor) was produced in E. coli and purified using size exclusion and ion exchange chromatography. Purified Sicpin was LPS-decontaminated and its sequence verified by N-terminal sequencing and LC-MS/MS analysis. Sicpin inhibited cell proliferation in a dose-response manner independently of the mitogen utilized (ConA, LPS, CD3/CD28 and Pokeweed). LPS or ConA stimulated cells had a significant lower proliferation rates (P&lt;0.001) in the presence of Sicpin (IC50=0.5uM) with 10uM completely abrogating cell proliferation. Flow cytometry analysis showed that Sicpin inhibited proliferation of CD19+ B-cells and CD4+/CD8+ T-cells. Sicpin did not induce apoptosis or necrosis in mitogen-induced proliferative responses by murine splenocytes as determined by flow cytometry. Sicpin also inhibits antigen-specific cell proliferation without inducing apoptosis in resting or mitogen-induced splenocytes. We demonstrate that the production IFN-alpha, IL4, IL5, IL6 and IL10 by splenocytes stimulated by ConA or LPS was dose-dependently reduced by Sicpin. Reduction of cytokines in presence of Sicpin could lead to a retardation of B and T cell activity.
    [Show full text]
  • Genome Sequencing and Analysis of the First Spontaneous Nanosilver 4 Resistant Bacterium Proteus Mirabilis Strain SCDR1
    bioRxiv preprint doi: https://doi.org/10.1101/089961; this version posted November 27, 2016. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 Running Title: Nanosilver resistant Proteus mirabilis 2 3 Genome sequencing and analysis of the first spontaneous Nanosilver 4 resistant bacterium Proteus mirabilis strain SCDR1 5 6 Amr T. M. Saeb 1*, Khalid A. Al-Rubeaan1, Mohamed Abouelhoda 2, 3, Manojkumar Selvaraju 3, 4, and 7 Hamsa T. Tayeb 2, 3 8 9 1. Genetics and Biotechnology Department, Strategic Center for Diabetes Research, College of 10 medicine, King Saud University, KSA. 11 2. Genetics Department, King Faisal Specialist Hospital and Research Center, Riyadh, KSA. 12 3. Saudi Human Genome Project, King Abdulaziz City for Science and Technology (KACST), 13 Riyadh, KSA 14 4. Integrated Gulf Biosystems, Riyadh, KSA. 15 5. Department of Pathology & Laboratory Medicine, King Faisal Specialist Hospital and Research 16 Center, Riyadh, KSA. 17 6. Department of Medicine, King Faisal Specialist Hospital and Research Center, Riyadh, KSA 18 19 20 21 22 23 24 *Corresponding Author: 25 Amr T. M. Saeb, Ph.D. 26 Genetics and Biotechnology Department, 27 Strategic Center for Diabetes Research, 28 College of medicine, King Saud University, KSA. 29 Tel: +966-566263979 30 Fax: +966-11-4725682 31 Email: [email protected] 32 bioRxiv preprint doi: https://doi.org/10.1101/089961; this version posted November 27, 2016. The copyright holder for this preprint (which was not certified by peer review) is the author/funder.
    [Show full text]