Mouse Ldhc Conditional Knockout Project (CRISPR/Cas9)

Total Page:16

File Type:pdf, Size:1020Kb

Mouse Ldhc Conditional Knockout Project (CRISPR/Cas9) https://www.alphaknockout.com Mouse Ldhc Conditional Knockout Project (CRISPR/Cas9) Objective: To create a Ldhc conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Ldhc gene (NCBI Reference Sequence: NM_013580 ; Ensembl: ENSMUSG00000030851 ) is located on Mouse chromosome 7. 8 exons are identified, with the ATG start codon in exon 2 and the TAA stop codon in exon 8 (Transcript: ENSMUST00000014545). Exon 3 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Ldhc gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP24-169L3 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Homozygous male mice are infertile. Spermatogenesis appears normal, but sperm motility decreases rapidly after their release from the epididymus. In vitro fertilization is blocked unless the zona pellucida is removed; even then, the rate of sperm penetration is lower than for wild-type sperm. Exon 3 starts from about 12.75% of the coding region. The knockout of Exon 3 will result in frameshift of the gene. The size of intron 2 for 5'-loxP site insertion: 4513 bp, and the size of intron 3 for 3'-loxP site insertion: 3072 bp. The size of effective cKO region: ~618 bp. The cKO region does not have any other known gene. Page 1 of 8 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele gRNA region 5' gRNA region 3' 1 3 8 Targeting vector Targeted allele Constitutive KO allele (After Cre recombination) Legends Exon of mouse Ldhc Homology arm cKO region loxP site Page 2 of 8 https://www.alphaknockout.com Overview of the Dot Plot Window size: 10 bp Forward Reverse Complement Sequence 12 Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis. Overview of the GC Content Distribution Window size: 300 bp Sequence 12 Summary: Full Length(7118bp) | A(29.56% 2104) | C(21.96% 1563) | T(26.66% 1898) | G(21.82% 1553) Note: The sequence of homologous arms and cKO region is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis. Page 3 of 8 https://www.alphaknockout.com BLAT Search Results (up) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 3000 1 3000 3000 100.0% chr7 + 46863160 46866159 3000 browser details YourSeq 322 738 1723 3000 95.8% chr7 - 46863897 46864882 986 browser details YourSeq 269 334 924 3000 89.9% chr5 - 114186795 114187422 628 browser details YourSeq 267 489 924 3000 93.3% chr14 + 56772160 56772638 479 browser details YourSeq 263 488 919 3000 86.4% chr16 - 93915863 93916198 336 browser details YourSeq 254 514 924 3000 89.0% chr16 - 18520913 18521220 308 browser details YourSeq 251 497 906 3000 87.5% chr4 - 152057355 152057662 308 browser details YourSeq 250 496 915 3000 88.0% chr1 + 167338352 167338670 319 browser details YourSeq 248 513 914 3000 87.3% chr18 + 61924410 61924700 291 browser details YourSeq 244 489 906 3000 90.1% chr11 + 103757065 103877621 120557 browser details YourSeq 240 601 1349 3000 91.3% chr3 - 105920742 105921431 690 browser details YourSeq 239 513 924 3000 94.8% chr17 + 32261634 32727458 465825 browser details YourSeq 235 493 917 3000 86.1% chr8 + 26181555 26181881 327 browser details YourSeq 224 519 925 3000 89.9% chr12 - 65056935 65057237 303 browser details YourSeq 224 1284 1717 3000 87.9% chr11 - 107499223 107499558 336 browser details YourSeq 223 551 920 3000 90.6% chr6 + 39431181 39431481 301 browser details YourSeq 219 630 924 3000 91.4% chr4 + 108244306 108352809 108504 browser details YourSeq 217 630 924 3000 90.7% chr19 - 37183079 37183668 590 browser details YourSeq 217 539 924 3000 90.4% chr1 - 181135880 181136228 349 browser details YourSeq 214 630 920 3000 93.2% chr2 - 84684190 84701316 17127 Note: The 3000 bp section upstream of Exon 3 is BLAT searched against the genome. No significant similarity is found. BLAT Search Results (down) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN -------------------------------------------------------------------------------------------------------------- browser details YourSeq 3000 1 3000 3000 100.0% chr7 + 46866778 46869777 3000 browser details YourSeq 378 219 1822 3000 92.7% chr11 + 68928551 69137720 209170 browser details YourSeq 263 569 1215 3000 92.6% chr15 + 78931946 78932632 687 browser details YourSeq 250 620 1215 3000 84.5% chr1 - 36008085 36008582 498 browser details YourSeq 231 619 1189 3000 93.0% chr12 + 109467398 109468041 644 browser details YourSeq 229 621 1215 3000 84.4% chr4 - 99025654 99025977 324 browser details YourSeq 220 624 1482 3000 93.0% chr18 - 24465234 24559339 94106 browser details YourSeq 218 643 1215 3000 85.0% chr14 - 121392688 121393095 408 browser details YourSeq 212 619 1195 3000 86.6% chr3 + 69015436 69015867 432 browser details YourSeq 167 348 776 3000 85.2% chr5 + 149573486 149573755 270 browser details YourSeq 162 624 1048 3000 95.1% chr19 - 6033547 6034074 528 browser details YourSeq 162 620 795 3000 94.8% chr13 + 33345131 33345304 174 browser details YourSeq 161 620 797 3000 97.1% chr18 - 21606675 21606853 179 browser details YourSeq 158 623 802 3000 94.4% chr9 + 65008503 65008692 190 browser details YourSeq 158 619 797 3000 94.3% chr13 + 58552644 58552821 178 browser details YourSeq 156 620 807 3000 90.6% chr13 - 75705968 75706137 170 browser details YourSeq 154 643 1180 3000 83.4% chr9 - 88535657 88535922 266 browser details YourSeq 154 623 790 3000 95.9% chr2 + 26328372 26328539 168 browser details YourSeq 153 620 800 3000 93.3% chr1 + 105913815 105913998 184 browser details YourSeq 152 617 803 3000 90.4% chr10 - 18398291 18398456 166 Note: The 3000 bp section downstream of Exon 3 is BLAT searched against the genome. No significant similarity is found. Page 4 of 8 https://www.alphaknockout.com Gene and protein information: Ldhc lactate dehydrogenase C [ Mus musculus (house mouse) ] Gene ID: 16833, updated on 24-Oct-2019 Gene summary Official Symbol Ldhc provided by MGI Official Full Name lactate dehydrogenase C provided by MGI Primary source MGI:MGI:96764 See related Ensembl:ENSMUSG00000030851 Gene type protein coding RefSeq status VALIDATED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as Ldh3; Ldh-3; Ldh-x; Ldhc4; LDH-C4 Expression Restricted expression toward testis adult (RPKM 1111.1) See more Orthologs human all Genomic context Location: 7 B3; 7 30.6 cM See Ldhc in Genome Data Viewer Exon count: 10 Annotation release Status Assembly Chr Location 108 current GRCm38.p6 (GCF_000001635.26) 7 NC_000073.6 (46861263..46878147) Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 7 NC_000073.5 (54116633..54133512) Chromosome 7 - NC_000073.6 Page 5 of 8 https://www.alphaknockout.com Transcript information: This gene has 5 transcripts Gene: Ldhc ENSMUSG00000030851 Description lactate dehydrogenase C [Source:MGI Symbol;Acc:MGI:96764] Gene Synonyms Ldh-3, Ldh3, Ldhc4 Location Chromosome 7: 46,861,203-46,878,142 forward strand. GRCm38:CM001000.2 About this gene This gene has 5 transcripts (splice variants), 99 orthologues, 4 paralogues, is a member of 1 Ensembl protein family and is associated with 6 phenotypes. Transcripts Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags Ldhc-201 ENSMUST00000014545.10 1370 332aa ENSMUSP00000014545.4 Protein coding CCDS21290 P00342 Q548Z6 TSL:1 GENCODE basic APPRIS P1 Ldhc-204 ENSMUST00000210585.1 1071 274aa ENSMUSP00000147492.1 Protein coding - A0A1B0GRE9 TSL:1 GENCODE basic Ldhc-202 ENSMUST00000126004.2 872 236aa ENSMUSP00000115652.2 Protein coding - D3YZE4 CDS 3' incomplete TSL:5 Ldhc-205 ENSMUST00000211784.1 829 171aa ENSMUSP00000148038.1 Protein coding - A0A1B0GSR2 TSL:1 GENCODE basic Ldhc-203 ENSMUST00000148565.7 767 235aa ENSMUSP00000114206.1 Protein coding - D3YVR7 CDS 3' incomplete TSL:3 Page 6 of 8 https://www.alphaknockout.com 36.94 kb Forward strand 46.86Mb 46.87Mb 46.88Mb Genes (Comprehensive set... Ldha-208 >protein coding Ldhc-201 >protein coding Ldha-202 >protein coding Ldhc-204 >protein coding Ldha-214 >protein coding Ldhc-205 >protein coding Ldha-209 >nonsense mediated decay Ldhc-203 >protein coding Ldha-215 >protein coding Ldhc-202 >protein coding Ldha-203 >protein coding Ldha-210 >protein coding Ldha-201 >protein coding Ldha-213 >protein coding Ldha-211 >retained intron Contigs < AC090123.68 Regulatory Build 46.86Mb 46.87Mb 46.88Mb Reverse strand 36.94 kb Regulation Legend CTCF Enhancer Promoter Flank Transcription Factor Binding Site Gene Legend Protein Coding Ensembl protein coding merged Ensembl/Havana Non-Protein Coding processed transcript Page 7 of 8 https://www.alphaknockout.com Transcript: ENSMUST00000014545 16.94 kb Forward strand Ldhc-201 >protein coding ENSMUSP00000014... PDB-ENSP mappings TIGRFAM L-lactate dehydrogenase Superfamily NAD(P)-binding domain superfamily Lactate dehydrogenase/glycoside hydrolase, family 4, C-terminal Prints L-lactate/malate dehydrogenase Pfam Lactate/malate dehydrogenase, N-terminal Lactate/malate dehydrogenase, C-terminal PROSITE profiles PS51257 PROSITE patterns L-lactate dehydrogenase, active site PIRSF L-lactate/malate dehydrogenase PANTHER PTHR43128 PTHR43128:SF5 HAMAP L-lactate dehydrogenase Gene3D 3.40.50.720 Lactate dehydrogenase/glycoside hydrolase, family 4, C-terminal CDD cd05293 All sequence SNPs/i... Sequence variants (dbSNP and all other sources) Variant Legend missense variant synonymous variant Scale bar 0 40 80 120 160 200 240 280 332 We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC.
Recommended publications
  • PIR Brochure
    Protein Information Resource Integrated Protein Informatics Resource for Genomic & Proteomic Research For four decades the Protein Information Resource (PIR) has provided databases and protein sequence analysis tools to the scientific community, including the Protein Sequence Database, which grew out from the Atlas of Protein Sequence and Structure, edited by Margaret Dayhoff [1965-1978]. Currently, PIR major activities include: i) UniProt (Universal Protein Resource) development, ii) iProClass protein data integration and ID mapping, iii) PIRSF protein pir.georgetown.edu classification, and iv) iProLINK protein literature mining and ontology development. UniProt – Universal Protein Resource What is UniProt? UniProt is the central resource for storing and UniProt (Universal Protein Resource) http://www.uniprot.org interconnecting information from large and = + + disparate sources and the most UniProt: the world's most comprehensive catalog of information on proteins comprehensive catalog of protein sequence and functional annotation. UniProt Knowledgebase UniProt Reference UniProt Archive (UniProtKB) Clusters (UniRef) (UniParc) When to use UniProt databases Integration of Swiss-Prot, TrEMBL Non-redundant reference A stable, and PIR-PSD sequences clustered from comprehensive Use UniProtKB to retrieve curated, reliable, Fully classified, richly and accurately UniProtKB and UniParc for archive of all publicly annotated protein sequences with comprehensive or fast available protein comprehensive information on proteins. minimal redundancy and extensive sequence searches at 100%, sequences for Use UniRef to decrease redundancy and cross-references 90%, or 50% identity sequence tracking from: speed up sequence similarity searches. TrEMBL section UniRef100 Swiss-Prot, Computer-annotated protein sequences TrEMBL, PIR-PSD, Use UniParc to access to archived sequences EMBL, Ensembl, IPI, and their source databases.
    [Show full text]
  • Cdna Cloning, Expression and Homology Modeling of a Luciferase from the Firefly Lampyroidea Maculata
    Journal of Biochemistry and Molecular Biology, Vol. 39, No. 5, September 2006, pp. 578-585 cDNA Cloning, Expression and Homology Modeling of a Luciferase from the Firefly Lampyroidea maculata Abdo Rahman Emamzadeh1, Saman Hosseinkhani1,*, Majid Sadeghizadeh2, Maryam Nikkhah3, Mohammad Javad Chaichi4 and Mojtaba Mortazavi1 1Department of Biochemistry, Faculty of Basic Sciences, Tarbiat Modarres University, Tehran, Iran 2Department of Genetics, Faculty of Basic Sciences, Tarbiat Modarres University, Tehran, Iran 3Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran 4Department of Chemistry, Mazandaran University, Babolsar, Iran Received 7 April 2006, Accepted 23 May 2006 The cDNA of a firefly luciferase from lantern mRNA of Introduction Lampyroidea maculata has been cloned, sequenced and functionally expressed. The cDNA has an open reading Firefly luciferase (EC 1.13.12.7) is a well-characterized enzyme frame of 1647 bp and codes for a 548-residue-long that is responsible for the bioluminescence reaction. It polypeptide. Noteworthy, sequence comparison as well as catalyzes the oxidation of firefly luciferin with molecular homology modeling showed the highest degree of similarity oxygen in the presence of ATP and Mg2+ to emit yellow-green with H. unmunsana and L. mingrelica luciferases, light (McElroy, 1969; White et al., 1971; DeLuca, 1976; suggesting a close phylogenetic relationship despite the Wood, 1995). The initial reaction catalyzed by firefly geographical distance separation. The deduced amino acid luciferase is the formation of luciferyl adenylate with the sequence of the luciferase gene of firefly L. maculata release of inorganic pyrophosphate. The luciferase-bound showed 93% identity to H. unmunsana. Superposition of luciferyl adenylate reacts rapidly with molecular oxygen to the three-dimensional model of L.
    [Show full text]
  • Letters to Nature
    letters to nature Received 7 July; accepted 21 September 1998. 26. Tronrud, D. E. Conjugate-direction minimization: an improved method for the re®nement of macromolecules. Acta Crystallogr. A 48, 912±916 (1992). 1. Dalbey, R. E., Lively, M. O., Bron, S. & van Dijl, J. M. The chemistry and enzymology of the type 1 27. Wolfe, P. B., Wickner, W. & Goodman, J. M. Sequence of the leader peptidase gene of Escherichia coli signal peptidases. Protein Sci. 6, 1129±1138 (1997). and the orientation of leader peptidase in the bacterial envelope. J. Biol. Chem. 258, 12073±12080 2. Kuo, D. W. et al. Escherichia coli leader peptidase: production of an active form lacking a requirement (1983). for detergent and development of peptide substrates. Arch. Biochem. Biophys. 303, 274±280 (1993). 28. Kraulis, P.G. Molscript: a program to produce both detailed and schematic plots of protein structures. 3. Tschantz, W. R. et al. Characterization of a soluble, catalytically active form of Escherichia coli leader J. Appl. Crystallogr. 24, 946±950 (1991). peptidase: requirement of detergent or phospholipid for optimal activity. Biochemistry 34, 3935±3941 29. Nicholls, A., Sharp, K. A. & Honig, B. Protein folding and association: insights from the interfacial and (1995). the thermodynamic properties of hydrocarbons. Proteins Struct. Funct. Genet. 11, 281±296 (1991). 4. Allsop, A. E. et al.inAnti-Infectives, Recent Advances in Chemistry and Structure-Activity Relationships 30. Meritt, E. A. & Bacon, D. J. Raster3D: photorealistic molecular graphics. Methods Enzymol. 277, 505± (eds Bently, P. H. & O'Hanlon, P. J.) 61±72 (R. Soc. Chem., Cambridge, 1997).
    [Show full text]
  • The Interpro Database, an Integrated Documentation Resource for Protein
    The InterPro database, an integrated documentation resource for protein families, domains and functional sites R Apweiler, T K Attwood, A Bairoch, A Bateman, E Birney, M Biswas, P Bucher, L Cerutti, F Corpet, M D Croning, et al. To cite this version: R Apweiler, T K Attwood, A Bairoch, A Bateman, E Birney, et al.. The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Research, Oxford University Press, 2001, 29 (1), pp.37-40. 10.1093/nar/29.1.37. hal-01213150 HAL Id: hal-01213150 https://hal.archives-ouvertes.fr/hal-01213150 Submitted on 7 Oct 2015 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. © 2001 Oxford University Press Nucleic Acids Research, 2001, Vol. 29, No. 1 37–40 The InterPro database, an integrated documentation resource for protein families, domains and functional sites R. Apweiler1,*, T. K. Attwood2,A.Bairoch3, A. Bateman4,E.Birney1, M. Biswas1, P. Bucher5, L. Cerutti4,F.Corpet6, M. D. R. Croning1,2, R. Durbin4,L.Falquet5,W.Fleischmann1, J. Gouzy6,H.Hermjakob1,N.Hulo3, I. Jonassen7,D.Kahn6,A.Kanapin1, Y. Karavidopoulou1, R.
    [Show full text]
  • Bioinformatics Analysis and Characterization of a Secretory Cystatin from Thelohanellus Kitauei
    Bioinformatics analysis and characterization of a secretory cystatin from Thelohanellus kitauei Fengli Zhang Chinese Academy of Agricultural Sciences Feed Research Institute Yalin Yang Chinese Academy of Agricultural Sciences Feed Research Institute Chenchen Gao Chinese Academy of Agricultural Sciences Feed Research Institute Yuanyuan Yao Chinese Academy of Agricultural Sciences Feed Research Institute Rui Xia Chinese Academy of Agricultural Sciences Feed Research Institute Juan Hu Chinese Academy of Agricultural Sciences Feed Research Institute Chao Ran Chinese Academy of Agricultural Sciences Feed Research Institute Zhen Zhang Chinese Academy of Agricultural Sciences Feed Research Institute Zhigang Zhou ( [email protected] ) Chinese Academy of Agricultural Sciences Feed Research Institute Original article Keywords: TK-cystatin, Thelohanellus kitauei, Bioinformatics analysis, Protease inhibitor, Inhibitory activity, Molecular docking Posted Date: June 15th, 2020 DOI: https://doi.org/10.21203/rs.3.rs-26747/v2 License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License Version of Record: A version of this preprint was published on June 23rd, 2020. See the published version at https://doi.org/10.1186/s13568-020-01052-0. Page 1/20 Abstract Thelohanellus kitauei , is a group of obligate parasitic Myxozoans, which causes intestinal giant-cystic disease of common carp ( Cyprinus carpio) and has resulted in signicant economic losses in carp farms. Cystatin secreted by parasites can regulate the immune response of host to facilitate parasite’s survival. In this study, the secretory TK-cystatin gene, encoding a protein of 120 amino acid residues (13.65 kDa), was cloned from T. kitauei genome. Phylogenetic analysis showed that TK-cystatin gene is closely related to the cystatin-A from Hydra vulgaris .
    [Show full text]
  • Data Resources and Tools
    6/22/2012 Introductory Bioinformatic Techniques 2012 Wits Bioinformatics Shaun Aron Sequence Structure Function Introductory Bioinformatics 2012 - Shaun Aron Introductory Bioinformatics 2012 - Shaun Aron A variety of protein resources online Evolutionary relationships between proteins Several websites/portals dedicated to Detection of local similarities between providing a single interface to multiple proteins to detect functional domains resources Functional predictions Important to differentiate between Protein structural predictions databases, websites and portals Protein-Protein interactions Protein-Nucleotide interactions Protein engineering and design Introductory Bioinformatics 2012 - Shaun Aron Introductory Bioinformatics 2012 - Shaun Aron Domain identification Protein classification Multiple domains in eukaryotic proteins . Families Sequence based methods Domain Prediction Structure based methods . Sequence based . SCOP . Structure based . CATH Introductory Bioinformatics 2012 - Shaun Aron Introductory Bioinformatics 2012 - Shaun Aron 1 6/22/2012 Experimental determination of structures . X-ray crystallography (requires crystallisation of protein) Sequence and information databases . NMR (for smaller proteins) . NCBI Entrez Protein Database – contains protein . EM (For large complex structures) Structural comparisons sequences from GenBank, RefSeq , as well as Protein folding records from SwissProt, PIR, PRF, and PDB . Folding simulations . UniProtKB – the “Protein knowledgebase”, a Structure prediction comprehensive
    [Show full text]
  • Mouse Ldhb Conditional Knockout Project (CRISPR/Cas9)
    https://www.alphaknockout.com Mouse Ldhb Conditional Knockout Project (CRISPR/Cas9) Objective: To create a Ldhb conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Ldhb gene (NCBI Reference Sequence: NM_008492 ; Ensembl: ENSMUSG00000030246 ) is located on Mouse chromosome 6. 8 exons are identified, with the ATG start codon in exon 2 and the TGA stop codon in exon 8 (Transcript: ENSMUST00000032373). Exon 3 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Ldhb gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP23-134H20 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Electrophoretic variants of LDHB are determined by: the a allele with fast anodal mobility in all inbred strains tested; and the b allele with slower mobility in Peru-Coppock stock. Three additional variants are known in wild M. spretus from southern France and Spain. Alleles are codominant. Exon 3 starts from about 12.97% of the coding region. The knockout of Exon 3 will result in frameshift of the gene. The size of intron 2 for 5'-loxP site insertion: 3954 bp, and the size of intron 3 for 3'-loxP site insertion: 2661 bp. The size of effective cKO region: ~1353 bp. The transcript Ldhb-202 may not be affected by deleting this cKO region.
    [Show full text]
  • Predicting Protein Structure and Function with Interpro
    Predicting protein structure and function with InterPro Phil Jones, Alex Mitchell, Amaia Sangrador (V4.0, Jun 2011) www.ebi.ac.uk Contents Course Information ........................................................................................... 3 Course learning objectives ............................................................................... 3 An introduction to InterPro ............................................................................... 4 I. Searching InterPro using a text search ........................................................ 7 Learning objectives ............................................................................................. 7 What information can be found in the InterPro entry page? ................................. 8 How do I interpret an InterPro protein view? ........................................................ 9 Summary ........................................................................................................... 11 Exercises .......................................................................................................... 11 Exercise 1 – Searching InterPro using a UniPro identifier ........................... 11 Exercise 2 – Exploring InterPro entries: General annotation ....................... 12 Exercise 3 – Exploring InterPro entries: Relationships ................................ 13 Exercise 4 – Exploring InterPro entries: Structure ....................................... 14 II. InterPro Scan ..............................................................................................
    [Show full text]
  • Introduction to Bioinformatics
    INTRODUCTION TO BIOINFORMATICS > Please take the initial BIOINF525 questionnaire: < http://tinyurl.com/bioinf525-questions Barry Grant University of Michigan www.thegrantlab.org BIOINF 525 http://bioboot.github.io/bioinf525_w16/ 12-Jan-2016 Barry Grant, Ph.D. [email protected] Ryan Mills, Ph.D. [email protected] Hongyang Li (GSI) [email protected] COURSE LOGISTICS Lectures: Tuesdays 2:30-4:00 PM Rm. 2062 Palmer Commons Labs: Session I: Thursdays 2:30 - 4:00 PM Session II: Fridays 10:30 - 12:00 PM Rm. 2036 Palmer Commons Website: http://tinyurl.com/bioinf525-w16 Lecture, lab and background reading material plus homework and course announcements MODULE OVERVIEW Objective: Provide an introduction to the practice of bioinformatics as well as a practical guide to using common bioinformatics databases and algorithms 1.1. ‣ Introduction to Bioinformatics 1.2. ‣ Sequence Alignment and Database Searching 1.3 ‣ Structural Bioinformatics 1.4 ‣ Genome Informatics: High Throughput Sequencing Applications and Analytical Methods TODAYS MENU Overview of bioinformatics • The what, why and how of bioinformatics? • Major bioinformatics research areas. • Skepticism and common problems with bioinformatics. Bioinformatics databases and associated tools • Primary, secondary and composite databases. • Nucleotide sequence databases (GenBank & RefSeq). • Protein sequence database (UniProt). • Composite databases (PFAM & OMIM). Database usage vignette • Searching with ENTREZ and BLAST. • Reference slides and handout on major databases. HOMEWORK Complete the initial
    [Show full text]
  • Lactate Dehydrogenase – Wikipedia
    8/13/2021 Lactate dehydrogenase - Wikipedia Lactate dehydrogenase Lactate dehydrogenase (LDH or LD) is an enzyme found in nearly all living cells. LDH catalyzes the conversion of lactate to pyruvate and Lactate dehydrogenase back, as it converts NAD+ to NADH and back. A dehydrogenase is an enzyme that transfers a hydride from one molecule to another. LDH exists in four distinct enzyme classes. This article is specifically about the NAD(P)-dependent L-lactate dehydrogenase. Other LDHs act on D-lactate and/or are dependent on cytochrome c: D-lactate dehydrogenase (cytochrome) and L-lactate dehydrogenase (cytochrome). LDH is expressed extensively in body tissues, such as blood cells and heart muscle. Because it is released during tissue damage, it is a marker of common injuries and disease such as heart failure. Lactate dehydrogenase M tetramer (LDH5), Human Identifiers Contents EC no. 1.1.1.27 (https://www.qmul.ac. Reaction uk/sbcs/iubmb/enzyme/EC1/1/ Active site 1/27.html) Isozymes CAS no. 9001-60-9 (http://www.commo nchemistry.org/ChemicalDetai Protein families l.aspx?ref=9001-60-9&title=) Interactive pathway map Databases Enzyme regulation IntEnz view (https://www.ebi.a Ethanol-induced hypoglycemia IntEnz c.uk/intenz/query?cmd=Searc Substrate regulation hEC&ec=1.1.1.27) Transcriptional regulation BRENDA BRENDA entry (http://www.br Genetics enda-enzymes.org/enzyme.ph Mutations p?ecno=1.1.1.27) Role in muscular fatigue ExPASy NiceZyme view (https://enzym Blood test e.expasy.org/EC/1.1.1.27) Testing in cancer KEGG KEGG entry (https://www.gen Hemolysis ome.jp/dbget-bin/www_bget?e Tissue turnover nzyme+1.1.1.27) HIV MetaCyc metabolic pathway (https://bio Testing in other body fluids cyc.org/META/substring-searc Exudates and transudates h?type=NIL&object=1.1.1.27) Meningitis and encephalitis PRIAM profile (http://priam.prabi.fr/cgi In cancer treatment -bin/PRIAM_profiles_CurrentR Prokaryotes elease.pl?EC=1.1.1.27) See also PDB RCSB PDB (https://www.rcsb.
    [Show full text]
  • Cloning and Sequence Analysis of Glyceraldehyde-3-Phosphate Dehydrogenase Gene in Yak*
    1673 Asian-Aust. J. Anim. Sci. Vol. 21, No. 11 : 1673 - 1679 November 2008 www.ajas.info Cloning and Sequence Analysis of Glyceraldehyde-3-Phosphate Dehydrogenase Gene in *Yak Sheng-wei Li, Ming-feng Jiang**, Yong-tao Liu, Tu-feng Yang, Yong Wang and Jin-cheng Zhong College of Life Sciences and Technology, Southwest University for Nationalities, ChengDu, 610041, China ABSTRACT :In order to study the biological function of gapdh gene in yak, and prove whether the gapdh gene was a useful intra­ reference gene that can be given an important role in molecular biology research of yak, the cDNA sequence encoding glyceraldehyde- 3-phosphate dehydrogenase from yak was cloned by the RT-PCR method using gene specific PCR primers. The sequence results indicated that the cloned cDNA fragment (1,008 bp) contained a 1,002 bp open reading frame, encoding 333 amino acids (AAs) with a molecular mass of 35.753 kDa. The deduced amino acids sequence showed a high level of sequence identity to Bos Taurus (99.70%), Xenopus laevis (94.29%), Homo sapiens (97.01%), Mus musculus (97.90%) and Sus scrofa (98.20%). The expression of yak’s gapdh gene in heart, spleen, kidney and brain tissues was also detected; the results showed that the gapdh gene was expressed in all these tissues. Further analysis of yak GAPDH amino acid sequence implied that it contained a complete glyceraldehyde-3-phosphate dehydrogenase active site (ASCTTNCL) which ranged from 148 to 155 amino acid residues. It also contained two conserved domains, a NAD binding domain in its N-terminal and a complete catalytic domain of sugar transport in its C-terminal.
    [Show full text]
  • The Genome Sequence of the Facultative Intracellular Pathogen Brucella Melitensis
    The genome sequence of the facultative intracellular pathogen Brucella melitensis Vito G. DelVecchio*†, Vinayak Kapatral‡, Rajendra J. Redkar*, Guy Patra*, Cesar Mujer*, Tamara Los‡, Natalia Ivanova‡, Iain Anderson‡, Anamitra Bhattacharyya‡, Athanasios Lykidis‡, Gary Reznik‡, Lynn Jablonski‡, Niels Larsen‡, Mark D’Souza‡, Axel Bernal‡, Mikhail Mazur‡, Eugene Goltsman‡, Eugene Selkov‡, Philip H. Elzer§, Sue Hagius§, David O’Callaghan¶, Jean-Jacques Letessonʈ, Robert Haselkorn‡, Nikos Kyrpides‡, and Ross Overbeek‡ *Institute of Molecular Biology and Medicine, University of Scranton, Scranton, PA 18510; ‡Integrated Genomics, Incorporated, 2201 West Campbell Park Drive, Chicago, IL 60612; §Department of Veterinary Science, Louisiana State University Ag Center, 111 Dalrymple Building, Baton Rouge, LA 70803; ¶Institut National de la Sante´et de la Recherche Me´dicale, U431, Faculte´deMe´ decine, Avenue Kennedy, 30900 Nıˆmes,France; and ʈUnite´de Recherche en Biologie Mole´culaire, Laboratoire d’Immunologie et de Microbiologie, Universite´of Namur, 61 Rue de Bruxelles, 5000 Namur, Belgium Contributed by Robert Haselkorn, October 29, 2001 Brucella melitensis is a facultative intracellular bacterial pathogen Brucella melitensis (15–17). However, each biovar exhibits a pref- that causes abortion in goats and sheep and Malta fever in humans. erence for specific hosts: B. melitensis infects goats and sheep, The genome of B. melitensis strain 16M was sequenced and found Brucella abortus causes abortion in cattle, and Brucella suis infects to contain 3,294,935 bp distributed over two circular chromosomes swine. Brucella canis, Brucella ovis, and Brucella neotomae are of 2,117,144 bp and 1,177,787 bp encoding 3,197 ORFs. By using the pathogenic to dogs, sheep, and desert wood rats, respectively.
    [Show full text]