Mouse Clcn7 Conditional Knockout Project (CRISPR/Cas9)*


http://www.alphaknockout.com/

Objective: To create a Clcn7 conditional knockout mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering.

Strategy summary:

The Clcn7 gene (NCBI Reference Sequence: NM_011930; Ensembl: ENSMUSG00000036636) is located on mouse chromosome 17. 25 exons are identified, with the ATG start codon in exon 1 and the TGA stop codon in exon 25 (Transcript: ENSMUST00000040729).

Exons 2~5 will be selected as the conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the mouse Clcn7 gene.

To engineer the targeting vector, the homologous arms and the cKO region will be generated by PCR using BAC clone RP23-99I16 as template. Cas9, gRNA and the targeting vector will be co-injected into fertilized eggs for cKO mouse production. The pups will be genotyped by PCR followed by sequencing analysis.

Note:

Mice homozygous for a knock-out allele exhibit postnatal lethality, abnormal bone formation, including osteopetrosis, and retinal degeneration. Mice homozygous for a conditional allele exhibit lysosomal defects with neuronal degeneration and accumulation of giant lysosomes in renal tubule cells.

The knockout of exons 2~5 will result in a frameshift of the gene and covers 14.11% of the coding region.
The size of intron 1 for 5'-loxP site insertion: 10815 bp, and the size of intron 5 for 3'-loxP site insertion: 320 bp.
The size of the effective cKO region: ~2525 bp.

This strategy is designed based on genetic information in existing databases. Due to the complexity of biological processes, the risks of loxP insertion for gene transcription, RNA splicing and protein translation cannot be fully predicted at the current technological level.
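The frameshift statement above can be sanity-checked with a little arithmetic. The sketch below is illustrative only: it takes the 803-aa protein length listed for ENSMUST00000040729 in the transcript table of this report, assumes the coding length is counted without the stop codon, and back-calculates the deleted coding length from the stated 14.11%. The resulting base-pair figure is an estimate, not a number given in the report.

```python
# Protein length of ENSMUST00000040729 as listed in this report's transcript table.
PROTEIN_AA = 803
# Share of the coding region covered by exons 2~5, as stated in the strategy note.
CKO_FRACTION = 0.1411


def coding_bp(protein_aa: int) -> int:
    """Coding length in bp, counting sense codons only (stop codon excluded: an assumption)."""
    return protein_aa * 3


def estimate_deleted_bp(protein_aa: int, fraction: float) -> int:
    """Back-calculate the deleted coding length from the reported percentage."""
    return round(coding_bp(protein_aa) * fraction)


def causes_frameshift(deleted_bp: int) -> bool:
    """A deletion shifts the reading frame when its length is not a multiple of 3."""
    return deleted_bp % 3 != 0


deleted = estimate_deleted_bp(PROTEIN_AA, CKO_FRACTION)
print(f"estimated deleted coding length: {deleted} bp")
print(f"remainder modulo 3: {deleted % 3}")
print(f"frameshift expected: {causes_frameshift(deleted)}")
```

Under these assumptions the estimate is not a multiple of three, which is consistent with the frameshift stated in the note. The exact exon boundaries should be taken from the Ensembl transcript record rather than from this back-calculation.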
Overview of the Targeting Strategy

[Figure: schematic of the wildtype allele (5' to 3', exons 1 to 7 and 25 drawn, two gRNA regions marked), the targeting vector, the targeted allele, and the constitutive KO allele after Cre recombination. Legend: exon of mouse Clcn7; homology arm; cKO region; loxP site.]

Overview of the Dot Plot

[Figure: dot plot of the sequence against itself, forward and reverse complement. Window size: 10 bp.]

Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats. Tandem repeats are found in the dot plot matrix. It may be difficult to construct this targeting vector.

Overview of the GC Content Distribution

[Figure: GC content distribution along the sequence. Window size: 300 bp.]

Summary: Full Length (8840 bp) | A (22.31%, 1972) | C (24.79%, 2191) | G (26.57%, 2349) | T (26.33%, 2328)

Note: The sequence of homologous arms and cKO region is analyzed to determine the GC content. No significant high GC-content region is found, so this region is suitable for PCR screening or sequencing analysis.
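The GC-content summary can be reproduced from the base counts, and the 300 bp window scan can be sketched in a few lines. This is a minimal illustration, not the vendor's pipeline: the function names are hypothetical, the step size is an assumption, and the 8840 bp sequence itself is not included in this report, so the window scan is shown on a toy string.

```python
def composition_percent(counts: dict[str, int]) -> dict[str, float]:
    """Convert base counts to percentages of the total length, rounded to 2 decimals."""
    total = sum(counts.values())
    return {base: round(100 * n / total, 2) for base, n in counts.items()}


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty sequence)."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def sliding_gc(seq: str, window: int = 300, step: int = 1) -> list[float]:
    """GC fraction of every full window along the sequence."""
    return [
        gc_fraction(seq[i:i + window])
        for i in range(0, len(seq) - window + 1, step)
    ]


# Base counts copied from the summary line of this report.
REPORT_COUNTS = {"A": 1972, "C": 2191, "G": 2349, "T": 2328}

percent = composition_percent(REPORT_COUNTS)
overall_gc = (REPORT_COUNTS["G"] + REPORT_COUNTS["C"]) / sum(REPORT_COUNTS.values())

print(percent)
print(f"overall GC: {overall_gc:.2%}")

# Toy sequence only; the real input would be the homology arms plus cKO region.
print(sliding_gc("GGCCAATT", window=4, step=2))
```

The counts sum to the stated full length of 8840 bp and give an overall GC content of roughly 51%, which fits the note that no high-GC region stands out. A per-window threshold for "significantly high" GC is not stated in the report and would have to be chosen by the user.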
BLAT Search Results (up)

QUERY    SCORE  START  END   QSIZE  IDENTITY  CHROM  STRAND  START      END        SPAN
----------------------------------------------------------------------------------------
YourSeq  3000   1      3000  3000   100.0%    chr17  +       25141181   25144180   3000
YourSeq  219    369    2450  3000   91.7%     chr4   +       38381161   38713313   332153
YourSeq  171    211    493   3000   92.6%     chr7   +       92754465   92873137   118673
YourSeq  137    339    504   3000   88.6%     chr11  -       107297467  107297624  158
YourSeq  132    246    498   3000   89.5%     chr8   +       110861821  110862071  251
YourSeq  128    246    466   3000   89.6%     chr6   +       61289215   61289666   452
YourSeq  127    359    503   3000   92.2%     chr17  -       74098930   74099071   142
YourSeq  125    2299   2454  3000   92.1%     chr6   -       18118015   18118167   153
YourSeq  125    361    531   3000   86.5%     chr16  -       94303870   94304024   155
YourSeq  125    2295   2453  3000   90.9%     chr13  +       111365957  111366116  160
YourSeq  123    2295   2454  3000   90.8%     chr19  -       14100550   14100706   157
YourSeq  121    2299   2456  3000   90.6%     chr4   -       129042179  129042333  155
YourSeq  120    360    525   3000   87.4%     chr7   -       100664352  100664766  415
YourSeq  120    360    530   3000   84.5%     chr15  -       38645227   38645380   154
YourSeq  119    367    504   3000   91.0%     chr9   +       44731276   44731409   134
YourSeq  119    2320   2450  3000   97.6%     chr7   +       29891457   29891591   135
YourSeq  119    363    493   3000   95.5%     chr16  +       3487910    3488040    131
YourSeq  118    368    504   3000   93.5%     chr7   -       126783605  126783744  140
YourSeq  118    2320   2454  3000   96.2%     chr1   -       4866726    4866862    137
YourSeq  118    358    505   3000   87.2%     chr7   +       35516763   35516903   141

Note: The 3000 bp section upstream of Exon 2 is BLAT searched against the genome. No significant similarity is found.
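One way to read a BLAT table like the one above is to parse each row and compare every secondary hit with the query length. The sketch below is an illustration under stated assumptions: the `BlatHit` record, the helper names, and the 50% score cutoff are all hypothetical and are not the criterion used in this report, which does not say how "significant similarity" was defined. Only three rows from the upstream table are used as sample input.

```python
from typing import NamedTuple


class BlatHit(NamedTuple):
    score: int
    q_start: int
    q_end: int
    q_size: int
    identity: float  # percent identity
    chrom: str
    strand: str
    t_start: int
    t_end: int
    span: int


def parse_hit(line: str) -> BlatHit:
    """Parse one whitespace-separated result row (first field is the query name)."""
    f = line.split()
    return BlatHit(
        int(f[1]), int(f[2]), int(f[3]), int(f[4]),
        float(f[5].rstrip("%")), f[6], f[7],
        int(f[8]), int(f[9]), int(f[10]),
    )


def is_self_hit(hit: BlatHit) -> bool:
    """The query matching its own locus: full-length score at 100% identity."""
    return hit.score == hit.q_size and hit.identity == 100.0


def off_target_hits(hits: list[BlatHit], min_score_fraction: float = 0.5) -> list[BlatHit]:
    """Secondary hits whose score reaches an illustrative fraction of the query size."""
    return [
        h for h in hits
        if not is_self_hit(h) and h.score >= min_score_fraction * h.q_size
    ]


# Three rows copied from the upstream BLAT table in this report.
rows = [
    "YourSeq 3000 1 3000 3000 100.0% chr17 + 25141181 25144180 3000",
    "YourSeq 219 369 2450 3000 91.7% chr4 + 38381161 38713313 332153",
    "YourSeq 171 211 493 3000 92.6% chr7 + 92754465 92873137 118673",
]
hits = [parse_hit(r) for r in rows]

best_secondary = max(h.score for h in hits if not is_self_hit(h))
print(f"best secondary score: {best_secondary} of {hits[0].q_size}")
print(f"hits above cutoff: {off_target_hits(hits)}")
```

With this sample the strongest secondary hit scores well under a tenth of the query length, so nothing passes the illustrative cutoff. A different cutoff, or a filter on identity or span, could reasonably be chosen depending on how the flanking sequence will be used.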
BLAT Search Results (down)

QUERY    SCORE  START  END   QSIZE  IDENTITY  CHROM  STRAND  START      END        SPAN
----------------------------------------------------------------------------------------
YourSeq  3000   1      3000  3000   100.0%    chr17  +       25146611   25149610   3000
YourSeq  28     2664   2697  3000   96.7%     chr1   +       158922142  158922191  50
YourSeq  24     2812   2843  3000   87.5%     chr5   +       68135315   68135346   32
YourSeq  21     1456   1476  3000   100.0%    chr4   +       36265811   36265831   21
YourSeq  20     1238   1259  3000   95.5%     chr1   -       31709920   31709941   22

Note: The 3000 bp section downstream of Exon 5 is BLAT searched against the genome. No significant similarity is found.

Gene and protein information:

Clcn7 chloride channel, voltage-sensitive 7 [ Mus musculus (house mouse) ]
Gene ID: 26373, updated on 12-Feb-2021

Gene summary

Official Symbol: Clcn7 (provided by MGI)
Official Full Name: chloride channel, voltage-sensitive 7 (provided by MGI)
Primary source: MGI:MGI:1347048
See related: Ensembl:ENSMUSG00000036636
Gene type: protein coding
RefSeq status: VALIDATED
Organism: Mus musculus
Lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as: ClC-; ClC-7; D17Wsu51e
Expression: Ubiquitous expression in genital fat pad adult (RPKM 32.4), kidney adult (RPKM 17.7) and 28 other tissues
Orthologs: human, all

Genomic context

Location: 17 A3.3; 17 12.53 cM
Exon count: 26

Annotation release  Status             Assembly                      Chr  Location
109                 current            GRCm39 (GCF_000001635.27)     17   NC_000083.7 (25352353..25381077)
108.20200622        previous assembly  GRCm38.p6 (GCF_000001635.26)  17   NC_000083.6 (25133378..25162103)
Build 37.2          previous assembly  MGSCv37 (GCF_000001635.18)    17   NC_000083.5 (25270339..25299044)

[Figure: genomic context of Clcn7 on Chromosome 17 - NC_000083.7]

Transcript information:

This gene has 7 transcripts

Gene: Clcn7 ENSMUSG00000036636

Description: chloride channel, voltage-sensitive 7 [Source:MGI Symbol;Acc:MGI:1347048]
Gene Synonyms: ClC-7
Location: Chromosome 17: 25,352,365-25,381,078 forward strand. GRCm39:CM001010.3
About this gene: This gene has 7 transcripts (splice variants), 200 orthologues, 8 paralogues and is associated with 68 phenotypes.

Transcripts

Name       Transcript ID         bp    Protein     Translation ID        Biotype                  CCDS       UniProt Match   Flags
Clcn7-201  ENSMUST00000040729.9  4071  803aa       ENSMUSP00000035964.3  Protein coding           CCDS28509  O70496, Q6RUT9  TSL:1, GENCODE basic, APPRIS P3
Clcn7-204  ENSMUST00000160961.8  3983  783aa       ENSMUSP00000124194.2  Protein coding           CCDS84282  E9PYL4          TSL:1, GENCODE basic, APPRIS ALT2
Clcn7-206  ENSMUST00000162862.3  4237  860aa       ENSMUSP00000124527.3  Protein coding           -          F6SUM2          TSL:5, GENCODE basic, APPRIS ALT2
Clcn7-203  ENSMUST00000159773.2  605   202aa       ENSMUSP00000125546.2  Protein coding           -          F7BK14          CDS 5' and 3' incomplete, TSL:5
Clcn7-207  ENSMUST00000233633.2  4067  391aa       ENSMUSP00000156968.2  Nonsense mediated decay  -          A0A3B2W4I8      -
Clcn7-202  ENSMUST00000159426.2  1006  No protein  -                     Retained intron          -          -               TSL:5
Clcn7-205  ENSMUST00000162722.2  584   No protein  -                     Retained intron          -          -               TSL:2
[Figure: Ensembl genomic view of the Clcn7 locus, 48.71 kb, chromosome 17 from 25.35 Mb to 25.39 Mb, forward and reverse strands. Tracks: genes (comprehensive set), contigs (AC130711.3), regulatory build. Transcripts shown: Ptx4-201, Ptx4-202 and Ptx4-203 (protein coding), Ptx4-204 (nonsense mediated decay); Clcn7-201, Clcn7-203, Clcn7-204 and Clcn7-206 (protein coding), Clcn7-202 and Clcn7-205 (retained intron), Clcn7-207 (nonsense mediated decay); Ccdc154-201, Ccdc154-202 and Ccdc154-203 (protein coding), Ccdc154-204 (nonsense mediated decay); mmu-mir-12188.1-201 (miRNA). Regulation legend: CTCF; enhancer; open chromatin; promoter; promoter flank. Gene legend: protein coding (merged Ensembl/Havana; Ensembl protein coding); non-protein coding (processed transcript; RNA gene).]

Transcript: ENSMUST00000040729

[Figure: protein domain view of Clcn7-201 (protein coding, 28.71 kb, forward strand), scale 0 to 803 aa. Annotation tracks: transmembrane helices; MobiDB lite; low complexity (Seg); Superfamily (Chloride channel, core; SSF54631); SMART (CBS domain); Prints (Chloride channel ClC-7; Chloride channel, voltage gated); Pfam (Chloride channel, voltage gated; CBS domain); PROSITE profiles (CBS domain); PANTHER (PTHR11689; PTHR11689:SF92); Gene3D (Chloride channel, core; 3.10.580.10); CDD (cd03685; cd04591); sequence variants (dbSNP and all other sources). Variant legend: stop gained; missense variant; synonymous variant.]

We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC, VectorBuilder.