Mouse Cenpb Knockout Project (CRISPR/Cas9)

Total Page:16

File Type:pdf, Size:1020Kb

Mouse Cenpb Knockout Project (CRISPR/Cas9) https://www.alphaknockout.com Mouse Cenpb Knockout Project (CRISPR/Cas9) Objective: To create a Cenpb knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Cenpb gene (NCBI Reference Sequence: NM_007682 ; Ensembl: ENSMUSG00000068267 ) is located on Mouse chromosome 2. 1 exon is identified, with the ATG start codon in exon 1 and the TGA stop codon in exon 1 (Transcript: ENSMUST00000089510). Exon 1 will be selected as target site. Cas9 and gRNA will be co-injected into fertilized eggs for KO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Mice homozygous for a knock-out allele display decreased body weight, small testis, oligospermia, and an age- and background-dependent reduction in female reproductive competence associated with abnormalities in uterus morphology, metral environment, gestational length, and parturition. Exon 1 starts from about 0.06% of the coding region. Exon 1 covers 100.0% of the coding region. The size of effective KO region: ~1795 bp. The KO region does not have any other known gene. Page 1 of 8 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele 5' gRNA region gRNA region 3' 1 Legends Exon of mouse Cenpb Knockout region Page 2 of 8 https://www.alphaknockout.com Overview of the Dot Plot (up) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 2000 bp section upstream of start codon is aligned with itself to determine if there are tandem repeats. Tandem repeats are found in the dot plot matrix. The gRNA site is selected outside of these tandem repeats. Overview of the Dot Plot (down) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 2000 bp section downstream of stop codon is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis. Page 3 of 8 https://www.alphaknockout.com Overview of the GC Content Distribution (up) Window size: 300 bp Sequence 12 Summary: Full Length(2000bp) | A(24.6% 492) | C(25.9% 518) | T(24.7% 494) | G(24.8% 496) Note: The 2000 bp section upstream of start codon is analyzed to determine the GC content. Significant high GC-content regions are found. The gRNA site is selected outside of these high GC-content regions. Overview of the GC Content Distribution (down) Window size: 300 bp Sequence 12 Summary: Full Length(2000bp) | A(21.65% 433) | C(27.85% 557) | T(24.25% 485) | G(26.25% 525) Note: The 2000 bp section downstream of stop codon is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis. Page 4 of 8 https://www.alphaknockout.com BLAT Search Results (up) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 2000 1 2000 2000 100.0% chr2 - 131179877 131181876 2000 browser details YourSeq 170 27 586 2000 82.3% chr5 - 136097389 136097661 273 browser details YourSeq 163 24 642 2000 91.4% chr12 + 66951510 66952137 628 browser details YourSeq 163 27 215 2000 93.6% chr11 + 6537536 6537895 360 browser details YourSeq 159 24 214 2000 93.5% chr1 - 93639194 93639396 203 browser details YourSeq 158 25 211 2000 93.0% chr11 - 69869917 69870106 190 browser details YourSeq 156 26 214 2000 91.6% chr11 + 103949831 103950023 193 browser details YourSeq 155 27 214 2000 89.1% chr6 - 49183118 49183299 182 browser details YourSeq 155 27 214 2000 89.2% chr1 - 153110919 153111102 184 browser details YourSeq 154 10 199 2000 90.6% chr8 - 87923909 87924098 190 browser details YourSeq 154 21 214 2000 91.5% chr11 - 62172442 62172638 197 browser details YourSeq 153 25 195 2000 94.8% chr4 + 149763966 149764136 171 browser details YourSeq 153 26 209 2000 92.4% chr3 + 19082891 19083077 187 browser details YourSeq 153 27 621 2000 82.7% chr10 + 71472755 71473146 392 browser details YourSeq 152 26 208 2000 90.1% chr7 - 29834341 29834521 181 browser details YourSeq 152 26 212 2000 91.4% chr3 - 129399712 129399901 190 browser details YourSeq 152 26 208 2000 97.0% chr11 + 120162332 120162522 191 browser details YourSeq 152 26 208 2000 92.7% chr11 + 46791902 46792090 189 browser details YourSeq 152 26 210 2000 91.4% chr10 + 33961731 33961918 188 browser details YourSeq 151 23 195 2000 93.7% chr9 + 100436514 100436686 173 Note: The 2000 bp section upstream of start codon is BLAT searched against the genome. No significant similarity is found. BLAT Search Results (down) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 2000 1 2000 2000 100.0% chr2 - 131176080 131178079 2000 browser details YourSeq 22 1689 1710 2000 100.0% chr6 - 24394246 24394267 22 browser details YourSeq 22 31 55 2000 96.0% chr1 - 51616326 51616353 28 browser details YourSeq 20 1390 1409 2000 100.0% chr1 - 13133681 13133700 20 browser details YourSeq 20 943 962 2000 100.0% chr1 + 97857068 97857087 20 browser details YourSeq 20 670 689 2000 100.0% chr1 + 43051768 43051787 20 browser details YourSeq 20 1664 1683 2000 100.0% chr1 + 38768372 38768391 20 Note: The 2000 bp section downstream of stop codon is BLAT searched against the genome. No significant similarity is found. Page 5 of 8 https://www.alphaknockout.com Gene and protein information: Cenpb centromere protein B [ Mus musculus (house mouse) ] Gene ID: 12616, updated on 10-Oct-2019 Gene summary Official Symbol Cenpb provided by MGI Official Full Name centromere protein B provided by MGI Primary source MGI:MGI:88376 See related Ensembl:ENSMUSG00000068267 Gene type protein coding RefSeq status VALIDATED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as CENP-B Orthologs human all Genomic context Location: 2 F1; 2 63.29 cM See Cenpb in Genome Data Viewer Exon count: 1 Annotation release Status Assembly Chr Location 108 current GRCm38.p6 (GCF_000001635.26) 2 NC_000068.7 (131177289..131180067, complement) Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 2 NC_000068.6 (131003025..131005748, complement) Chromosome 2 - NC_000068.7 Page 6 of 8 https://www.alphaknockout.com Transcript information: This gene has 1 transcript Gene: Cenpb ENSMUSG00000068267 Description centromere protein B [Source:MGI Symbol;Acc:MGI:88376] Location Chromosome 2: 131,175,182-131,180,067 reverse strand. GRCm38:CM000995.2 About this gene This gene has 1 transcript (splice variant), 222 orthologues, 8 paralogues, is a member of 1 Ensembl protein family and is associated with 16 phenotypes. Transcripts Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags Cenpb-201 ENSMUST00000089510.4 4886 599aa ENSMUSP00000086938.3 Protein coding CCDS16757 P27790 TSL:NA GENCODE basic APPRIS P1 24.89 kb Forward strand 131.17Mb 131.18Mb 131.19Mb Genes Gm14232-201 >lncRNA Cdc25b-203 >retained intron (Comprehensive set... Cdc25b-201 >protein coding Cdc25b-202 >protein coding Contigs AL831736.16 > Genes < Spef1-201protein coding (Comprehensive set... < Spef1-202protein coding < Spef1-203protein coding< Cenpb-201protein coding Regulatory Build 131.17Mb 131.18Mb 131.19Mb Reverse strand 24.89 kb Regulation Legend CTCF Enhancer Open Chromatin Promoter Promoter Flank Gene Legend Protein Coding merged Ensembl/Havana Ensembl protein coding Non-Protein Coding RNA gene processed transcript Page 7 of 8 https://www.alphaknockout.com Transcript: ENSMUST00000089510 < Cenpb-201protein coding Reverse strand 4.89 kb ENSMUSP00000086... MobiDB lite Low complexity (Seg) Coiled-coils (Ncoils) Superfamily Homeobox-like domain superfamily CENP-B, dimerisation domain superfamily SMART HTH CenpB-type DNA-binding domain Pfam DNA binding HTH domain, Psq-type DDE superfamily endonuclease domain Centromere protein CENP-B, C-terminal domain HTH CenpB-type DNA-binding domain PROSITE profiles DNA binding HTH domain, Psq-type HTH CenpB-type DNA-binding domain PANTHER PTHR19303 Major centromere autoantigen B Gene3D 1.10.10.60 CENP-B, dimerisation domain superfamily All sequence SNPs/i... Sequence variants (dbSNP and all other sources) Variant Legend inframe insertion missense variant synonymous variant Scale bar 0 60 120 180 240 300 360 420 480 599 We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC. Page 8 of 8.
Recommended publications
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • Detecting Remote, Functional Conserved Domains in Entire Genomes by Combining Relaxed Sequence-Database Searches with Fold Recognition
    HMMerThread: Detecting Remote, Functional Conserved Domains in Entire Genomes by Combining Relaxed Sequence-Database Searches with Fold Recognition Charles Richard Bradshaw1¤a, Vineeth Surendranath1, Robert Henschel2,3, Matthias Stefan Mueller2, Bianca Hermine Habermann1,4*¤b 1 Bioinformatics Laboratory, Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Saxony, Germany, 2 Center for Information Services and High Performance Computing (ZIH), Technical University, Dresden, Saxony, Germany, 3 High Performance Applications, Pervasive Technology Institute, Indiana University, Bloomington, Indiana, United States of America, 4 Bioinformatics Laboratory, Scionics c/o Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Saxony, Germany Abstract Conserved domains in proteins are one of the major sources of functional information for experimental design and genome-level annotation. Though search tools for conserved domain databases such as Hidden Markov Models (HMMs) are sensitive in detecting conserved domains in proteins when they share sufficient sequence similarity, they tend to miss more divergent family members, as they lack a reliable statistical framework for the detection of low sequence similarity. We have developed a greatly improved HMMerThread algorithm that can detect remotely conserved domains in highly divergent sequences. HMMerThread combines relaxed conserved domain searches with fold recognition to eliminate false positive, sequence-based identifications. With an accuracy of 90%, our software is able to automatically predict highly divergent members of conserved domain families with an associated 3-dimensional structure. We give additional confidence to our predictions by validation across species. We have run HMMerThread searches on eight proteomes including human and present a rich resource of remotely conserved domains, which adds significantly to the functional annotation of entire proteomes.
    [Show full text]
  • Thai Hun Taiteit Ut La in Tutta Latoare
    THAI HUN TAITEIT US009970056B2UT LA IN TUTTA LATOARE (12 ) United States Patent ( 10 ) Patent No. : US 9 ,970 , 056 B2 Potashkin et al. ( 45 ) Date of Patent: May 15 , 2018 ( 54 ) METHODS AND KITS FOR DIAGNOSING , Aviles- Olmos , et al. “ Parkinson ' s Disease , Insulin Resistance and PROGNOSING AND MONITORING Novel Agents of Neuroprotection ” , Brain , 2013 , vol. 136 ( Pt 2 ) , pp . PARKINSON ' S DISEASE 37414 -- 384 .. Bassil, et al, “ Insulin , IGF - 1 and GLP - 1 Signaling in Neurodegenerative Disorders : Targets for Disease Modification ? ” , ( 71 ) Applicant: Rosalind Franklin University of Prog Neurobiol , Jul. 2014 , vol. 118 , pp . 1 - 18 . ( Abstract only ) . Medicine and Science , North Chicago , Bonnefond , et al. , “ The Emerging Genetics of Type 2 Diabetes ” , IL (US ) Trends Mol Med , Sep . 2010 , vol. 16 ( 9 ) , pp . 407 -416 . ( Abstract only ) . (72 ) Inventors : Judith Ann Potashkin , North Chicago , Cereda, et al. , “ Diabetes and Risk of Parkinson ' s Disease : A IL (US ) ; Jose Alfredo Santiago , North Systematic Review and Meta - Analysis ” , Diabetes Care, Dec . 2011 , Chicago , IL (US ) vol . 34 , pp . 2614 - 2623 . Cereda , et al. , “ Diabetes and Risk of Parkinson ' s Disease ” , Move ( 73 ) Assignee : Rosalind Franklin University of ment Disorders, 2013 , vol. 28 , pp . 257 - 261 ( Abstract only ) . Medicine and Science , North Chicago , Cereda , et al . , “ Clinical Features of Parkinson ' s Disease When Onset of Diabetes Came First: A Case - Control Study” , Neurology , IL (US ) May 8 , 2012 , vol . 78 ( 19 ), pp . 1507 - 1511 , doi: 10 .1212 /WNL . Ob013e3182553cc9 , Epub Apr. 25 , 2012 . ( Abstract only ) . ( * ) Notice : Subject to any disclaimer , the term of this Chen - Plotkin , et al. , “ Plasma Epidermal Growth Factor Levels patent is extended or adjusted under 35 Predict Cognitive Decline in Parkinson ' s Disease " , Ann Neurol , U .
    [Show full text]
  • WO 2012/174282 A2 20 December 2012 (20.12.2012) P O P C T
    (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization International Bureau (10) International Publication Number (43) International Publication Date WO 2012/174282 A2 20 December 2012 (20.12.2012) P O P C T (51) International Patent Classification: David [US/US]; 13539 N . 95th Way, Scottsdale, AZ C12Q 1/68 (2006.01) 85260 (US). (21) International Application Number: (74) Agent: AKHAVAN, Ramin; Caris Science, Inc., 6655 N . PCT/US20 12/0425 19 Macarthur Blvd., Irving, TX 75039 (US). (22) International Filing Date: (81) Designated States (unless otherwise indicated, for every 14 June 2012 (14.06.2012) kind of national protection available): AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BR, BW, BY, BZ, English (25) Filing Language: CA, CH, CL, CN, CO, CR, CU, CZ, DE, DK, DM, DO, Publication Language: English DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, IL, IN, IS, JP, KE, KG, KM, KN, KP, KR, (30) Priority Data: KZ, LA, LC, LK, LR, LS, LT, LU, LY, MA, MD, ME, 61/497,895 16 June 201 1 (16.06.201 1) US MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, 61/499,138 20 June 201 1 (20.06.201 1) US OM, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SC, SD, 61/501,680 27 June 201 1 (27.06.201 1) u s SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, 61/506,019 8 July 201 1(08.07.201 1) u s TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW.
    [Show full text]
  • Comparative Transcriptome Analysis of Obligately Asexual and Cyclically
    Hanson et al. BMC Genomics 2013, 14:412 http://www.biomedcentral.com/1471-2164/14/412 RESEARCH ARTICLE Open Access Comparative transcriptome analysis of obligately asexual and cyclically sexual rotifers reveals genes with putative functions in sexual reproduction, dormancy, and asexual egg production Sara J Hanson1, Claus-Peter Stelzer2, David B Mark Welch3 and John M Logsdon Jr1* Abstract Background: Sexual reproduction is a widely studied biological process because it is critically important to the genetics, evolution, and ecology of eukaryotes. Despite decades of study on this topic, no comprehensive explanation has been accepted that explains the evolutionary forces underlying its prevalence and persistence in nature. Monogonont rotifers offer a useful system for experimental studies relating to the evolution of sexual reproduction due to their rapid reproductive rate and close relationship to the putatively ancient asexual bdelloid rotifers. However, little is known about the molecular underpinnings of sex in any rotifer species. Results: We generated mRNA-seq libraries for obligate parthenogenetic (OP) and cyclical parthenogenetic (CP) strains of the monogonont rotifer, Brachionus calyciflorus, to identify genes specific to both modes of reproduction. Our differential expression analysis identified receptors with putative roles in signaling pathways responsible for the transition from asexual to sexual reproduction. Differential expression of a specific copy of the duplicated cell cycle regulatory gene CDC20 and specific copies of histone H2A suggest that such duplications may underlie the phenotypic plasticity required for reproductive mode switch in monogononts. We further identified differential expression of genes involved in the formation of resting eggs, a process linked exclusively to sex in this species.
    [Show full text]
  • Using Ontology Fingerprints to Evaluate Genome-Wide Association Results
    Using Ontology Fingerprints to Evaluate Genome-wide Association Results Lam Tsoi, Michael Boehnke, Richard Klein, W. Jim Zheng Medical University of South Carolina ICBO, Buffalo, 2009 1 Overview Genome-wide association study Ontology fingerprints Using ontology fingerprints to quantify the relationship between genes and disease/phenotypes/traits Ontology fingerprints derived gene networks to identify polygenic model for diseases 2 Genome-wide Association Study 3 GWA Studies In Action 4 What is a GWA Study? A genome-wide association study is an approach that involves rapidly scanning markers across the complete sets of DNA, or genomes, of many people to find genetic variations associated with a particular disease. Once new genetic associations are identified, researchers can use the information to develop better strategies to detect, treat and prevent the disease. Such studies are particularly useful in finding genetic variations that contribute to common, complex diseases, such as asthma, cancer, diabetes, heart disease and mental illnesses 5 What is a GWA Study? Method for interrogating all 10 million variable points across human genome Variation inherited in groups, or blocks, so not all 10 million points have to be tested 6 Linkage Disequilibrium Blocks Can Have Many Genes 7 8 Genome Wide Association Study for LDL, HDL and TG 16876 individuals Clinical observations considered – Height, weight, BMI, LDL, HDL and TG level etc 9 Genome Wide Association Study for LDL, HDL and TG Identify genes that falls into the loci that are significantly associated with phenotype HDL – 237 genes from top 201 LD blocks LDL -- 212 genes from top 199 LD blocks TG -- 221 genes from top 200 LD blocks Challenge – Which genes are more relevant? ? Gene Disease Comprehensive Quantitative 10 Ontology Fingerprints 11 Biomedical Ontology Many ontologies have been developed: Gene Ontology Cell Ontology Foundation Model of Anatomy Disease Ontology … 12 Annotation of Apolipoprotein A-4 Gene annotation by Gene Ontology has been used extensively by microarray data analysis.
    [Show full text]
  • Chromatin Reader Dido3 Regulates the Genetic Network of B Cell Differentiation
    bioRxiv preprint doi: https://doi.org/10.1101/2021.02.23.432411; this version posted March 12, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Chromatin reader Dido3 regulates the genetic network of B cell differentiation Fernando Gutiérrez del Burgo1, Tirso Pons1, Enrique Vázquez de Luis2, Carlos Martínez-A1 & Ricardo Villares1,* 1 Centro Nacional de Biotecnología/CSIC, Darwin 3, Cantoblanco, E‐28049, Madrid, Spain 2 Centro Nacional de Investigaciones Cardiovasculares, Instituto de Salud Carlos III, Melchor Fernández Almagro 3, Madrid 28029, Spain. *email: [email protected] 1 bioRxiv preprint doi: https://doi.org/10.1101/2021.02.23.432411; this version posted March 12, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. ABSTRACT The development of hematopoietic lineages is based on a complex balance of transcription factors whose expression depends on the epigenetic signatures that characterize each differentiation step. The B cell lineage arises from hematopoietic stem cells through the stepwise silencing of stemness genes and balanced expression of mutually regulated transcription factors, as well as DNA rearrangement. Here we report the impact on B cell differentiation of the lack of DIDO3, a reader of chromatin status, in the mouse hematopoietic compartment.
    [Show full text]
  • Bayesian Hidden Markov Tree Models for Clustering Genes with Shared Evolutionary History
    Submitted to the Annals of Applied Statistics arXiv: arXiv:0000.0000 BAYESIAN HIDDEN MARKOV TREE MODELS FOR CLUSTERING GENES WITH SHARED EVOLUTIONARY HISTORY By Yang Liy,{,∗, Shaoyang Ningy,∗, Sarah E. Calvoz,x,{, Vamsi K. Moothak,z,x,{ and Jun S. Liuy Harvard Universityy, Broad Institutez, Harvard Medical Schoolx, Massachusetts General Hospital{, and Howard Hughes Medical Institutek Determination of functions for poorly characterized genes is cru- cial for understanding biological processes and studying human dis- eases. Functionally associated genes are often gained and lost together through evolution. Therefore identifying co-evolution of genes can predict functional gene-gene associations. We describe here the full statistical model and computational strategies underlying the orig- inal algorithm CLustering by Inferred Models of Evolution (CLIME 1.0) recently reported by us [Li et al., 2014]. CLIME 1.0 employs a mixture of tree-structured hidden Markov models for gene evolution process, and a Bayesian model-based clustering algorithm to detect gene modules with shared evolutionary histories (termed evolutionary conserved modules, or ECMs). A Dirichlet process prior was adopted for estimating the number of gene clusters and a Gibbs sampler was developed for posterior sampling. We further developed an extended version, CLIME 1.1, to incorporate the uncertainty on the evolution- ary tree structure. By simulation studies and benchmarks on real data sets, we show that CLIME 1.0 and CLIME 1.1 outperform traditional methods that use simple metrics (e.g., the Hamming distance or Pear- son correlation) to measure co-evolution between pairs of genes. 1. Introduction. The human genome encodes more than 20,000 protein- coding genes, of which a large fraction do not have annotated function to date [Galperin and Koonin, 2010].
    [Show full text]
  • PTK7) in Cancer Development and Cellular Fitness
    Characterizing the Role of Protein Tyrosine Kinase 7 (PTK7) in Cancer Development and Cellular Fitness by Sachin Anand Kumar A thesis submitted in conformity with the requirements for the degree of Master of Science Department of Molecular Genetics University of Toronto © Copyright by Sachin Anand Kumar (2015) Characterizing the Role of Protein Tyrosine Kinase 7 (PTK7) in Cancer Development and Cellular Fitness Sachin Anand Kumar Master of Science Department of Molecular Genetics University of Toronto 2015 Abstract Protein Tyrosine Kinase 7 has been recently affiliated with the development and progression of various cancers. Contradictory results about its expression, localization and processing across various cancer types has left a major gap in understanding its molecular mechanism and function. In my thesis, I utilized large data sets from essentiality screens and RNAseq, as well as antibody-related tools, to investigate the role of PTK7 in cellular fitness. I observed a negative correlation between PTK7 processing and its degree of essentiality. Furthermore, I demonstrated that the PTK7 intracellular domain (ICD) is integral to its function. Treatment of mouse xenografts with PTK7 antibodies, capable of blocking PTK7 processing, inhibited tumour growth in an ICD-expression dependent manner. Lastly, mRNA profiling of PTK7-knockdown cells and ICD over-expression cells provided a list of putative downstream gene targets under PTK7 regulation. Collectively, these findings implicate PTK7 in cellular transformation and provide insight into its signaling mechanism. II Acknowledgments First and foremost, I would like to thank my supervisor Dr. Jason Moffat. You gave me an opportunity to step outside of my comfort zone and partake in research that pushed the boundary of science, technology and discovery.
    [Show full text]
  • Comprehensive Analysis Reveals Novel Gene Signature in Head and Neck Squamous Cell Carcinoma: Predicting Is Associated with Poor Prognosis in Patients
    5892 Original Article Comprehensive analysis reveals novel gene signature in head and neck squamous cell carcinoma: predicting is associated with poor prognosis in patients Yixin Sun1,2#, Quan Zhang1,2#, Lanlin Yao2#, Shuai Wang3, Zhiming Zhang1,2 1Department of Breast Surgery, The First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, China; 2School of Medicine, Xiamen University, Xiamen, China; 3State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China Contributions: (I) Conception and design: Y Sun, Q Zhang; (II) Administrative support: Z Zhang; (III) Provision of study materials or patients: Y Sun, Q Zhang; (IV) Collection and assembly of data: Y Sun, L Yao; (V) Data analysis and interpretation: Y Sun, S Wang; (VI) Manuscript writing: All authors; (VII) Final approval of manuscript: All authors. #These authors contributed equally to this work. Correspondence to: Zhiming Zhang. Department of Surgery, The First Affiliated Hospital of Xiamen University, Xiamen, China. Email: [email protected]. Background: Head and neck squamous cell carcinoma (HNSC) remains an important public health problem, with classic risk factors being smoking and excessive alcohol consumption and usually has a poor prognosis. Therefore, it is important to explore the underlying mechanisms of tumorigenesis and screen the genes and pathways identified from such studies and their role in pathogenesis. The purpose of this study was to identify genes or signal pathways associated with the development of HNSC. Methods: In this study, we downloaded gene expression profiles of GSE53819 from the Gene Expression Omnibus (GEO) database, including 18 HNSC tissues and 18 normal tissues.
    [Show full text]
  • Rank-Based Molecular Prognosis and Network-Guided Biomarker Discovery for Breast Cancer Yunlong Jiao
    Rank-based Molecular Prognosis and Network-guided Biomarker Discovery for Breast Cancer Yunlong Jiao To cite this version: Yunlong Jiao. Rank-based Molecular Prognosis and Network-guided Biomarker Discovery for Breast Cancer. Cancer. Université Paris sciences et lettres, 2017. English. NNT : 2017PSLEM027. tel- 01744747 HAL Id: tel-01744747 https://pastel.archives-ouvertes.fr/tel-01744747 Submitted on 27 Mar 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. THÈSE DE DOCTORAT de l’Université de recherche Paris Sciences et Lettres PSL Research University Préparée à MINES ParisTech Rank-based Molecular Prognosis and Network-guided Biomarker Discovery for Breast Cancer Pronostic moléculaire basé sur l’ordre des gènes et découverte de biomarqueurs guidé par des réseaux pour le cancer du sein École doctorale no432 SCIENCES DES MÉTIERS DE L’INGÉNIEUR Spécialité BIO-INFORMATIQUE COMPOSITION DU JURY : M. Francis BACH INRIA, Président M. Stéphan CLEMENCON Télécom ParisTech, Rapporteur M. Risi KONDOR Soutenue par Yunlong JIAO University of Chicago, Rapporteur le 11 septembre 2017 Mme Chloé-Agathe AZENCOTT MINES ParisTech, Examinateur Dirigée par Jean-Philippe VERT M. Joaquin DOPAZO Fundación Progreso y Salud, Examinateur M. Jean-Philippe VERT MINES ParisTech, Examinateur Abstract i Abstract Breast cancer is the second most common cancer worldwide and the leading cause of women’s death from cancer.
    [Show full text]
  • Program Book
    The Genetics Society of America Conferences 15th International Xenopus Conference August 24-28, 2014 • Pacific Grove, CA PROGRAM GUIDE LEGEND Information/Guest Check-In Disabled Parking E EV Charging Station V Beverage Vending Machine N S Ice Machine Julia Morgan Historic Building W Roadway Pedestrian Pathway Outdoor Group Activity Area Program and Abstracts Meeting Organizers Carole LaBonne, Northwestern University John Wallingford, University of Texas at Austin Organizing Committee: Julie Baker, Stanford Univ Chris Field, Harvard Medical School Carmen Domingo, San Francisco State Univ Anna Philpott, Univ of Cambridge 9650 Rockville Pike, Bethesda, Maryland 20814-3998 Telephone: (301) 634-7300 • Fax: (301) 634-7079 E-mail: [email protected] • Web site: genetics-gsa.org Thank You to the Following Companies for their Generous Support Platinum Sponsor Gold Sponsors Additional Support Provided by: Carl Zeiss Microscopy, LLC Monterey Convention & Gene Tools, LLC Visitors Bureau Leica Microsystems Xenopus Express 2 Table of Contents General Information ........................................................................................................................... 5 Schedule of Events ............................................................................................................................. 6 Exhibitors ........................................................................................................................................... 8 Opening Session and Plenary/Platform Sessions ............................................................................
    [Show full text]