Mouse Trim32 Conditional Knockout Project (CRISPR/Cas9)

Total Page:16

File Type:pdf, Size:1020Kb

Mouse Trim32 Conditional Knockout Project (CRISPR/Cas9) https://www.alphaknockout.com Mouse Trim32 Conditional Knockout Project (CRISPR/Cas9) Objective: To create a Trim32 conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Trim32 gene (NCBI Reference Sequence: NM_053084 ; Ensembl: ENSMUSG00000051675 ) is located on Mouse chromosome 4. 2 exons are identified, with the ATG start codon in exon 2 and the TAA stop codon in exon 2 (Transcript: ENSMUST00000050850). Exon 2 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Trim32 gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP23-234I22 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Mice homozygous for a gene trapped allele exhibit mild myopathy with sarcotubular myopathy, decreased fertility, and decreased axon diameter. Mice homozygous for a knock-out allele exhibit impaired adult muscle regeneration and myopathy. Exon 2 covers 100.0% of the coding region. Start codon is in exon 2, and stop codon is in exon 2. The size of effective cKO region: ~2317 bp. The cKO region does not have any other known gene. Page 1 of 7 https://www.alphaknockout.com Overview of the Targeting Strategy gRNA region Wildtype allele T A 5' gRNA region A 3' 1 2 Targeting vector T A A Targeted allele T A A Constitutive KO allele (After Cre recombination) Legends Exon of mouse Trim32 Homology arm cKO region loxP site Page 2 of 7 https://www.alphaknockout.com Overview of the Dot Plot Window size: 10 bp Forward Reverse Complement Sequence 12 Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats. Tandem repeats are found in the dot plot matrix. It may be difficult to construct this targeting vector. Overview of the GC Content Distribution Window size: 300 bp Sequence 12 Summary: Full Length(7965bp) | A(25.26% 2012) | C(22.16% 1765) | T(28.71% 2287) | G(23.87% 1901) Note: The sequence of homologous arms and cKO region is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis. Page 3 of 7 https://www.alphaknockout.com BLAT Search Results (up) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 3000 1 3000 3000 100.0% chr4 + 65610209 65613208 3000 browser details YourSeq 45 2702 2746 3000 100.0% chr3 + 126347505 126347549 45 browser details YourSeq 25 2862 2890 3000 96.3% chr15 + 76113497 76113527 31 browser details YourSeq 21 2587 2608 3000 100.0% chr1 - 10062001 10062023 23 Note: The 3000 bp section upstream of Exon 2 is BLAT searched against the genome. No significant similarity is found. BLAT Search Results (down) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 3000 1 3000 3000 100.0% chr4 + 65615174 65618173 3000 browser details YourSeq 27 156 199 3000 72.5% chr11 + 80018503 80018538 36 Note: The 3000 bp section downstream of Exon 2 is BLAT searched against the genome. No significant similarity is found. Page 4 of 7 https://www.alphaknockout.com Gene and protein information: Trim32 tripartite motif-containing 32 [ Mus musculus (house mouse) ] Gene ID: 69807, updated on 10-Oct-2019 Gene summary Official Symbol Trim32 provided by MGI Official Full Name tripartite motif-containing 32 provided by MGI Primary source MGI:MGI:1917057 See related Ensembl:ENSMUSG00000051675 Gene type protein coding RefSeq status VALIDATED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as 3f3; BBS11; Zfp117; 1810045E12Rik Expression Broad expression in CNS E18 (RPKM 39.0), whole brain E14.5 (RPKM 30.7) and 23 other tissues See more Orthologs human all Genomic context Location: 4 C1; 4 34.43 cM See Trim32 in Genome Data Viewer Exon count: 4 Annotation release Status Assembly Chr Location 108 current GRCm38.p6 (GCF_000001635.26) 4 NC_000070.6 (65604986..65616240) Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 4 NC_000070.5 (65266020..65277274) Chromosome 4 - NC_000070.6 Page 5 of 7 https://www.alphaknockout.com Transcript information: This gene has 4 transcripts Gene: Trim32 ENSMUSG00000051675 Description tripartite motif-containing 32 [Source:MGI Symbol;Acc:MGI:1917057] Gene Synonyms 1810045E12Rik, 3f3, BBS11, Zfp117 Location Chromosome 4: 65,604,986-65,616,238 forward strand. GRCm38:CM000997.2 About this gene This gene has 4 transcripts (splice variants), 183 orthologues, 73 paralogues, is a member of 1 Ensembl protein family and is associated with 20 phenotypes. Transcripts Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags Trim32-201 ENSMUST00000050850.13 3194 655aa ENSMUSP00000062277.7 Protein coding CCDS18270 Q3TLR3 Q8CH72 TSL:1 GENCODE basic APPRIS P1 Trim32-202 ENSMUST00000107366.1 3191 655aa ENSMUSP00000102989.1 Protein coding CCDS18270 Q3TLR3 Q8CH72 TSL:1 GENCODE basic APPRIS P1 Trim32-204 ENSMUST00000156922.1 660 183aa ENSMUSP00000121949.1 Protein coding - A2AGS1 CDS 3' incomplete TSL:5 Trim32-203 ENSMUST00000155978.1 516 136aa ENSMUSP00000119579.1 Protein coding - A2AGS0 CDS 3' incomplete TSL:3 31.25 kb Forward strand 65.60Mb 65.61Mb 65.62Mb Genes (Comprehensive set... Trim32-202 >protein coding Trim32-201 >protein coding Trim32-204 >protein coding Trim32-203 >protein coding Contigs AL691456.7 > Genes < Astn2-202protein coding (Comprehensive set... < Astn2-201protein coding Regulatory Build 65.60Mb 65.61Mb 65.62Mb Reverse strand 31.25 kb Regulation Legend CTCF Promoter Promoter Flank Gene Legend Protein Coding merged Ensembl/Havana Ensembl protein coding Page 6 of 7 https://www.alphaknockout.com Transcript: ENSMUST00000050850 11.25 kb Forward strand Trim32-201 >protein coding ENSMUSP00000062... MobiDB lite Low complexity (Seg) Coiled-coils (Ncoils) Superfamily SSF57850 SSF57845 SSF101898 SMART B-box-type zinc finger Zinc finger, RING-type Pfam RING-type zinc-finger, LisH dimerisation motif NHL repeat PROSITE profiles Zinc finger, RING-type NHL repeat, subgroup B-box-type zinc finger PROSITE patterns Zinc finger, RING-type, conserved site PANTHER PTHR25464 PTHR25464:SF3 Gene3D Zinc finger, RING/FYVE/PHD-type Six-bladed beta-propeller, TolB-like CDD cd16587 B-box-type zinc finger cd14961 All sequence SNPs/i... Sequence variants (dbSNP and all other sources) Variant Legend synonymous variant Scale bar 0 60 120 180 240 300 360 420 480 540 655 We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC. Page 7 of 7.
Recommended publications
  • Non-Cellulosomal Cohesin- and Dockerin-Like Modules in the Three Domains of Life Ayelet Peera, Steven P
    1 Non-cellulosomal Cohesin- and Dockerin-like Modules in the Three Domains of Life Ayelet Peera, Steven P. Smithb, Edward A. Bayerc,*, Raphael Lameda and Ilya Borovoka aDepartment of Molecular Microbiology and Biotechnology, Tel Aviv University, Ramat Aviv 69978 Israel bDepartment of Biochemistry, Queen’s University Kingston Ontario Canada K7L 3N6 cDepartment of Biological Sciences, Weizmann Institute of Science, Rehovot 76100 Israel *Corresponding author: Edward A. Bayer Tel: (+972) -8-934-2373 Fax: (+972)-8-946-8256 Email: [email protected] Supplementary Table S1. Compendium of cohesins and dockerins in the three domains of life. In order to discover new putative cohesin/dockerin-containing proteins, we used sequences of all the classical cohesin and dockerin modules from C. thermocellum, C. cellulovorans, C. cellulolyticum, B. cellulosolvens and Acetivibrio cellulolyticus as well as cohesins and dockerins recently discovered in rumen bacteria, Ruminococcus albus and R. flavefaciens as BlastP queries for the main NCBI Blast server against all non- redundant protein sequences deposited in GenBank/EMBL/DDBJ databases. We also performed extensive searches using the TblastN algorithm through all publicly available microbial genome databases including those attached to the NCBI BLAST server for bacterial genomes (http://www.ncbi.nlm.nih.gov/sutils/genom_table.cgi?), as well as several additional microbial genome databases – Microbial Genomics at the DOE Joint Genome Institute (http://genome.jgi-psf.org/mic_home.html), the Rumenomics database at TIGR/JCVI (http://tigrblast.tigr.org/rumenomics/index.cgi) and Bacterial Genomes at the Sanger Centre (http://www.sanger.ac.uk/Projects/Microbes/). Once a putative cohesin or dockerin-encoding gene product was identified, gene-walking techniques were employed to analyze and locate possible cellulosome-like gene clusters.
    [Show full text]
  • Sequence and Structural Analysis of the Asp-Box Motif and Asp-Box Beta
    BMC Structural Biology BioMed Central Research article Open Access Sequence and structural analysis of the Asp-box motif and Asp-box beta-propellers; a widespread propeller-type characteristic of the Vps10 domain family and several glycoside hydrolase families Esben M Quistgaard1,2 and Søren S Thirup*1 Address: 1MIND Centre, Department of Molecular Biology, University of Aarhus, Gustav Wieds Vej 10C, DK 8000 Århus C, Denmark and 2Department of Medical Biochemistry and Biophysics, Karolinska Institute, 17177 Stockholm, Sweden Email: Esben M Quistgaard - [email protected]; Søren S Thirup* - [email protected] * Corresponding author Published: 13 July 2009 Received: 14 May 2009 Accepted: 13 July 2009 BMC Structural Biology 2009, 9:46 doi:10.1186/1472-6807-9-46 This article is available from: http://www.biomedcentral.com/1472-6807/9/46 © 2009 Quistgaard and Thirup; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Background: The Asp-box is a short sequence and structure motif that folds as a well-defined β- hairpin. It is present in different folds, but occurs most prominently as repeats in β-propellers. Asp- box β-propellers are known to be characteristically irregular and to occur in many medically important proteins, most of which are glycosidase enzymes, but they are otherwise not well characterized and are only rarely treated as a distinct β-propeller family. We have analyzed the sequence, structure, function and occurrence of the Asp-box and s-Asp-box -a related shorter variant, and provide a comprehensive classification and computational analysis of the Asp-box β- propeller family.
    [Show full text]
  • Protein Family Expansions and Biological Complexity
    Protein Family Expansions and Biological Complexity Christine Vogel1,2*, Cyrus Chothia1 1 Medical Research Council Laboratory of Molecular Biology, Cambridge, United Kingdom, 2 Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas, United States of America During the course of evolution, new proteins are produced very largely as the result of gene duplication, divergence and, in many cases, combination. This means that proteins or protein domains belong to families or, in cases where their relationships can only be recognised on the basis of structure, superfamilies whose members descended from a common ancestor. The size of superfamilies can vary greatly. Also, during the course of evolution organisms of increasing complexity have arisen. In this paper we determine the identity of those superfamilies whose relative sizes in different organisms are highly correlated to the complexity of the organisms. As a measure of the complexity of 38 uni- and multicellular eukaryotes we took the number of different cell types of which they are composed. Of 1,219 superfamilies, there are 194 whose sizes in the 38 organisms are strongly correlated with the number of cell types in the organisms. We give outline descriptions of these superfamilies. Half are involved in extracellular processes or regulation and smaller proportions in other types of activity. Half of all superfamilies have no significant correlation with complexity. We also determined whether the expansions of large superfamilies correlate with each other. We found three large clusters of correlated expansions: one involves expansions in both vertebrates and plants, one just in vertebrates, and one just in plants.
    [Show full text]
  • The E3 Ligase TRIM32 Is an Effector of the RAS Family Gtpase RAP2
    The E3 Ligase TRIM32 is an effector of the RAS family GTPase RAP2 Berna Demiray A thesis submitted towards the degree of Doctor of Philosophy Cancer Institute University College London 2014 Declaration I, Berna Demiray, confirm that the work presented in this thesis is my own. Where information has been derived from other sources, I confirm that this has been indicated. London, 2014 The E3 Ligase TRIM32 is an Effector of the RAS family GTPase RAP2 Classical RAS oncogenes are mutated in approximately 30% of human tumours and RAP proteins are closely related to classical RAS proteins. RAP1 has an identical effector domain to RAS whereas RAP2 differs by one amino acid. RAP2 not only shares effectors with other classical RAS family members, but it also has its own specific effectors that do not bind to RAP1 or classical RAS family proteins. Thus, although closely related, RAP2 performs distinct functions, although these have been poorly characterised. Using RAP2 as bait in Tandem Affinity Purifications, we have identified several RAP2 interacting proteins including TRIM32; a protein implicated in diverse pathological processes such as Limb-Girdle Muscular Dystrophy (LGMD2H), and Bardet-Biedl syndrome (BBS). TRIM32 was shown to interact specifically with RAP2 in an activation- and effector domain-dependent manner; demonstrating stronger interaction with the RAP2 V12 mutant than the wild-type RAP2 and defective binding to the effector mutant RAP2 V12A38. The interaction was mapped to the C-terminus of TRIM32 (containing the NHL domains) while mutations found in LGMD2H (R394H, D487N, ∆588) were found to disrupt binding to RAP2. The TRIM32 P130S mutant linked to BBS did not affect binding to RAP2, suggesting that the RAP2-TRIM32 interaction may be functionally involved in LGMD2H.
    [Show full text]
  • Suppl Figure 1
    Suppl Table 2. Gene Annotation (October 2011) for the selected genes used in the study. Locus Identifier Gene Model Description AT5G51780 basic helix-loop-helix (bHLH) DNA-binding superfamily protein; FUNCTIONS IN: DNA binding, sequence-specific DNA binding transcription factor activity; INVOLVED IN: regulation of transcription; LOCATED IN: nucleus; CONTAINS InterPro DOMAIN/s: Helix-loop-helix DNA-binding domain (InterPro:IPR001092), Helix-loop-helix DNA-binding (InterPro:IPR011598); BEST Arabidopsis thaliana protein match is: basic helix-loop-helix (bHLH) D AT3G53400 BEST Arabidopsis thaliana protein match is: conserved peptide upstream open reading frame 47 (TAIR:AT5G03190.1); Has 285 Blast hits to 285 proteins in 23 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 0; Plants - 279; Viruses - 0; Other Eukaryotes - 5 (source: NCBI BLink). AT1G44760 Adenine nucleotide alpha hydrolases-like superfamily protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to stress; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: UspA (InterPro:IPR006016), Rossmann-like alpha/beta/alpha sandwich fold (InterPro:IPR014729); BEST Arabidopsis thaliana protein match is: Adenine nucleotide alpha hydrolases-li AT4G19950 unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G44860.1); Has 338 Blast hits to 330 proteins in 72 species: Archae - 2; Bacteria - 94; Metazoa - 7; Fungi - 0; Plants - 232; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). AT3G14280
    [Show full text]
  • Table S5.Xlsx
    Table S5. List of PFAM domains identified in predicted V. nonalfalfae secretome PFAM description PFAM ID Number of hits EFFECTOR-SPECIFIC PFAM 66 Calcineurin-like phosphoesterase PF00149 6 Cerato-platanin PF07249 2 CFEM (Common in Fungal Extracellular Membranes) domain PF05730 13 Chitin recongnition protein PF00187 1 Chitin recongnition protein PF03067 3 CVNH (Cyanovirin-N) domain PF08881 2 Cysteine-rich secretory protein family PF00188 4 Fungal hydrophobin PF06766 4 LysM domain PF01476 9 Lytic transglycolase PF03330 3 Necrosis inducing protein (NPP1) PF05630 6 PAN domain PF00024 1 Hce2 (Homologs of C.
    [Show full text]
  • Phd Thesis in the Laboratory of Dr
    Control of pluripotency during the oocyte-to-embryo transition in Caenorhabditis elegans Inauguraldissertation zur Erlangung der Würde einer Doktorin der Philosophie vorgelegt der Philosophisch-Naturwisschenschaftlichen Fakultät der Universität Basel von Cristina Tocchini aus Italien Basel, 2015 Original document stored on the publication server of the University of Basel edoc.unibas.ch This work is licensed under a Creative Commons Attribution 4.0 International License 1 Genehmigt von der Philosophisch-Naturwissenschaftlichen Fakultät der Universität Basel auf Antrag von: Prof. Dr. Susan M. Gasser Dr. Rafal Ciosk Dr. Anne Ephrussi Basel, den 24 März 2015 Prof. Dr. Jörg Schibler (Dekan der Philosophisch-Naturwissenschaftlichen Fakultät der Universität Basel) 2 Table of contents Summary…………………………………………………………………………………………………………………………. 5 Introduction……………………………………………………………………………………………………………………. 7 1. Pluripotency and stem cells……………………………………………………………………………………………… 8 1.1. Why studying stem cells……………………………………………………………………………………….. 8 1.2. Type of stem cells: an overview…………………………………………………………………………… 9 1.3. Pluripotency and germ cells………………………………………………………………………………… 10 1.3.1 Cytoplasmic factors controlling pluripotency in germ cells……………………………. 10 2. The oocyte-to-embryo transition ……………………………………………………………………………………… 12 2.1. Oocyte maturation……………………………………………………………………………………………. 13 2.2. Degradation of maternal factors………………………………………………………………... …......... 15 2.3. The embryonic genome activation………………………………………………………………………… 17 3. Caenorhabditis elegans
    [Show full text]
  • Supplementary File
    Putative pol III type 3 transcription units identified by COMPASSS CHR PROMOTERS Direction position in Position in Length TATA poly-T Contig from to chromosome distance distance NT_039169.7 (1) 1 0 5'-3' 811630 811630 812004 3.811.630 374 10 364 1 5'-3' 1590858 1590858 1591301 4.590.858 443 24 419 2 5'-3' 1994160 1994160 1994497 4.994.160 337 6 327 3 5'-3' 6527800 6527800 6528025 9.527.800 225 10 215 4 5'-3' 8682780 8682780 8683043 11.682.780 263 16 253 5 5'-3' 8745619 8745619 8745996 11.745.619 377 13 367 6 5'-3' 11727442 11727442 11728016 14.727.442 574 28 564 7 5'-3' 14263375 14263375 14263763 17.263.375 388 12 378 8 5'-3' 16442270 16442270 16442708 19.442.270 438 12 428 9 5'-3' 18841135 18841135 18841497 21.841.135 362 26 352 10 3'-5' 2775186 2775382 2775819 5.775.186 437 20 427 11 3'-5' 3993842 3994000 3994475 6.993.842 475 24 465 12 3'-5' 6596021 6596163 6596654 9.596.021 491 8 481 13 3'-5' 7061605 7061823 7062238 10.061.605 415 6 405 14 3'-5' 7105421 7105466 7106054 10.105.421 588 21 578 15 3'-5' 13717998 13718323 13718631 16.717.998 308 18 298 16 3'-5' 16915979 16916131 16916612 19.915.979 481 9 471 17 3'-5' 16942929 16943195 16943562 19.942.929 367 27 357 18 3'-5' 17101314 17101456 17101947 20.101.314 491 30 481 NT_039170.7 (2) 19 5'-3' 2321295 2321295 2321704 24.794.644 409 20 399 20 5'-3' 2528703 2528703 2529112 25.002.052 409 10 399 21 5'-3' 5830337 5830337 5830706 28.303.686 369 15 359 22 5'-3' 5958024 5958024 5958404 28.431.373 380 21 370 23 5'-3' 6171558 6171558 6171882 28.644.907 324 10 314 24 5'-3' 6908469 6908469 6908796
    [Show full text]