Institutionen För Datavetenskap Department of Computer and Information Science

Total Page:16

File Type:pdf, Size:1020Kb

Institutionen För Datavetenskap Department of Computer and Information Science Institutionen för datavetenskap Department of Computer and Information Science Examensarbete A Lexicon for Gene Normalization av Maria Lingemark LIU-IDA/LITH-EX-A--09/038 2009-09-01 Linköpings universitet Linköpings universitet SE-581 83 Linköping, Sweden 581 83 Linköping Linköpings universitet Institutionen för datavetenskap Examensarbete A Lexicon for Gene Normalization av Maria Lingemark LIU-IDA/LITH-EX-A--09/038 2009-09-01 Handledare: He Tan Examinator: He Tan Datum Avdelning, institution Date Division, department Institutionen för datavetenskap Department of Computer and Information Science 2009-09-01 Linköpings universitet Språk Rapporttyp ISBN Language Report category Svenska/Swedish Licentiatavhandling ISRN LIU-IDA/LITH-EX-A--09/038 X Engelska/English x Examensarbete C-uppsats Serietitel och serienummer ISSN D-uppsats Title of series, numbering Övrig rapport URL för elektronisk version http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-20250 Titel Title A Lexicon for Gene Normalization Författare Author Maria Lingemark Sammanfattning Abstract Researchers tend to use their own or favourite gene names in scientific literature, even though there are official names. Some names may even be used for more than one gene. This leads to problems with ambiguity when automatically mining biological literature. To disambiguate the gene names, gene normalization is used. In this thesis, we look into an existing gene normalization system, and develop a new method to find gene candidates for the ambiguous genes. For the new method a lexicon is created, using information about the gene names, symbols and synonyms from three different databases. The gene mention found in the scientific literature is used as input for a search in the lexicon, and all genes in the lexicon that match the mention are returned as gene candidates for that mention. These candidates are then used in the systems disambiguation step. Results show that the new method gives a better over all result from the system, with an increase in precision and a small decrease in recall. Nyckelord Keywords Bioinformatics, gene normalization, string matching, text mining Abstract Researchers tend to use their own or favourite gene names in scientific literature, even though there are official names. Some names may even be used for more than one gene. This leads to problems with ambiguity when automatically min- ing biological literature. To disambiguate the gene names, gene normalization is used. In this thesis, we look into an existing gene normalization system, and develop a new method to find gene candidates for the ambiguous genes. For the new method a lexicon is created, using information about the gene names, symbols and synonyms from three different databases. The gene mention found in the scientific literature is used as input for a search in this lexicon, and all genes in the lexicon that match the mention are returned as gene candidates for that mention. These candidates are then used in the system’s disambiguation step. Results show that the new method gives a better over all result from the system, with an increase in precision and a small decrease in recall. Contents 1 Introduction 1 1.1 Background.............................. 1 1.2 Motivation .............................. 2 1.3 Problemformulation ......................... 2 1.4 Limitations .............................. 2 1.5 Method ................................ 2 1.6 Structure ............................... 3 2 Background 5 2.1 Text Mining and Biological Text Mining . 5 2.2 GeneNames.............................. 6 2.3 GeneNormalization ......................... 6 2.3.1 OurSystem.......................... 7 2.4 LiteratureResources ......................... 8 2.5 Databases............................... 9 2.5.1 EntrezGene.......................... 9 2.5.2 HGNC............................. 9 2.5.3 SwissProt ........................... 10 3 Related Work 11 4 Creating the lexicon 13 4.1 Preprocessing ............................. 13 4.2 Merging ................................ 15 4.2.1 Step 1: Combining Entrez and HGNC . 16 4.2.2 Step 2: Combining combo file and SwissProt . 16 4.3 Cleaning................................ 19 4.4 Rules.................................. 19 4.5 Implementation............................ 21 5 Searching the Lexicon 23 5.1 ExactStringMatching. 23 5.2 ApproximateStringMatching. 23 5.2.1 JaccardIndex......................... 24 i 5.2.2 Dice’s Coefficient . 24 5.2.3 Levenshtein Distance . 24 5.2.4 Q-Gram............................ 25 5.3 Implementation............................ 25 6 Evaluation 27 6.1 TestData ............................... 27 6.2 Method ................................ 27 6.3 Result ................................. 28 6.3.1 ExactStringMatching. 28 6.3.2 Approximate String Matching . 29 6.3.3 Evaluation of the Lexicon in the Gene Normalization System 32 7 Discussion 33 7.1 Comparing Lexicon to Database Query . 33 7.2 FutureWork ............................. 33 A Vocabulary 35 ii Chapter 1 Introduction 1.1 Background The amount of scientific literature has increased rapidly in the past decade and much of it has been published online as well as in journal articles. The online literature is an important resource for researchers, but the amount of information that can be found can be overwhelming. It is very impractical to manually identify relevant information and since the information is stored within the free text it also makes it hard to find the relevant documents. There are automated processes, such as natural language processing (NLP) that can be used to retrieve information from this vast quantity of text. When it comes to biomedical publications the problem is not only that of finding the useful information, but also that of identifying the correct genes mentioned in the text. When trying to find information the first problem is to identify the parts of the text that mention genes. When this is done the gene mentions found need to be identified as the actual genes the author is referring to by linking them to gene database entries. This is called gene normalization. Gene normalization is needed because a gene mention found is not always the official name or symbol of that particular gene. It can be a new name the author of the article decided to use for the gene, or the name or symbol used can be ambiguous. Many systems exist wchich are trying to solve this problem, but there are still room for improvement. In this thesis we will look at a method to retrieve gene candidates for gene symbol disambiguation, which means to identify the correct gene if there is an ambiguity. This thesis’ work focuses on one part of a larger system which recognizes gene mentions in raw text, retrieve gene candidates for those mentions and uses the information about the candidates and the gene mentions from the text in a disambiguation step. 1 1.2 Motivation This thesis focuses on a gene candidate retrieval step for gene normalization. In gene normalization one needs to find a way to decide which gene a mention refers to. One of the ways to do this is to retrieve gene candidates from a database, or a collection of databases, and match the information about the candidates against the information about the mention found in text. The gene candidate retrieval step aims to find the genes a mention in text might refer to. 1.3 Problem formulation When creating a lexicon for gene candidate retrieval there are a few things that need to be decided. The problem formulation can be summarized to: • Can a lexicon be used to retrieve gene candidates with a better result than directly querying databases? • Which databases should be used in the creation of the lexicon? • What structure should the lexicon have? • How can we search the lexicon? Is exact string matching enough or do we need to use approximate string matching? 1.4 Limitations The only organism considered in this thesis is human, which makes querying databases easier since it is possible to focus on only human genes. Some genes also share the same name with the corresponding gene from other species, but we will not have to deal with that problem since we already know it is a human gene we are looking for. There are only three different databases included in the creation of the lexicon which limits the number of synonyms found. 1.5 Method The gene candidate retrieval is done by using a lexicon which is built by com- bining information, such as gene names and synonyms, from three different databases. The lexicon is used to search for all the genes a given gene mention might refer to. The search is done by using exact string matching as well as a few different approximate string matching algorithms in two different search methods. A lexicon can hold a combination of information, such as gene name and synonyms, from several different gene databases. This can increase the chance of finding the right gene in the gene candidate retrieval step since there can be synonyms which only can be found in one database. 2 1.6 Structure The thesis will start with an introduction and then some background informa- tion in the areas relevant to the creation of the lexicon. The following chapter is a short description of related work. Chapter 4 and 5 will cover the creation and usage of the lexicon. Chapter 6 is an evaluation of the gene candidate re- trieval results using the lexicon and a comparison to the results from querying the database Entrez Gene. In the last chapter the results are discussed along with future work. 3 4 Chapter 2 Background 2.1 Text Mining and Biological Text Mining Text mining is a process where one tries to automatically extract useful informa- tion from text documents. The extraction is done by identifying and exploring interesting patterns that are found in the unstructured textual data. A text mining system takes raw data (text documents) as input and produces various types of output, e.g. patterns and trends (Feldman & Sanger, 2007). Text mining can be divided in different steps. Example of steps are information retrieval (IR), natural language processing (NLP) and information extraction (IE)(National Text Mining Centre & Redfearn, 2008).
Recommended publications
  • Influence of Serum Amyloid a (SAA1) And
    Influence of Serum Amyloid A (SAA1) and SAA2 Gene Polymorphisms on Renal Amyloidosis, and on SAA/ C-Reactive Protein Values in Patients with Familial Mediterranean Fever in the Turkish Population AYSIN BAKKALOGLU, ALI DUZOVA, SEZA OZEN, BANU BALCI, NESRIN BESBAS, REZAN TOPALOGLU, FATIH OZALTIN, and ENGIN YILMAZ ABSTRACT. Objective. To evaluate the effect of serum amyloid A (SAA) 1 and SAA2 gene polymorphisms on SAA levels and renal amyloidosis in Turkish patients with familial Mediterranean fever (FMF). Methods. SAA1 and SAA2 gene polymorphisms and SAA levels were determined in 74 patients with FMF (39 female, 35 male; median age 11.5 yrs, range 1.0–23.0). All patients were on colchicine therapy. SAA1 and SAA2 gene polymorphisms were analyzed using polymerase chain reaction restriction fragment length polymorphism (PCR-RFLP). SAA and C-reactive protein (CRP) values were measured and SAA/CRP values were calculated. Results. The median SAA level was 75 ng/ml (range 10.2–1500). SAA1 gene polymorphisms were: α/α genotype in 23 patients (31.1%), α/ß genotype in 30 patients (40.5%), α/γ genotype in one patient (1.4 %), ß/ß genotype in 14 patients (18.9%), ß/γ genotype in 5 patients (6.8 %), and γ/γ geno- type in one patient (1.4%). Of the 23 patients who had α/α genotype for the SAA1 polymorphism, 7 patients had developed renal amyloidosis (30.4%) compared to only one patient without this geno- type (1/51; 2.0%); p < 0.001. SAA2 had no effect on renal amyloidosis. SAA1 and SAA2 genotypes had no significant effect on SAA levels.
    [Show full text]
  • Diversity and Complexity of the Mouse Saa1 and Saa2 Genes
    Exp. Anim. 63(1), 99–106, 2014 —Original— Diversity and Complexity of the Mouse Saa1 and Saa2 genes Masayuki MORI1), Geng TIAN1), Akira ISHIKAWA2), and Keiichi HIGUCHI1) 1)Department of Aging Biology, Institute of Pathogenesis and Disease Prevention, Shinshu University Graduate School of Medicine, 3–1–1 Asahi, Matsumoto, Nagano 390-8621, Japan 2)Laboratory of Animal Genetics, Division of Applied Genetics and Physiology, Graduate School of Bioagricultural Sciences, Nagoya University, Chikusa, Nagoya, Aichi 464-8601, Japan Abstract: Mouse strains show polymorphisms in the amino acid sequences of serum amyloid A 1 (SAA1) and serum amyloid A 2 (SAA2). Major laboratory mouse strains are classified based on the sequence as carrying the A haplotype (e.g., BALB/c) or B haplotype (e.g., SJL/J) of the Saa1 and Saa2 gene unit. We attempted to elucidate the diversity of the mouse Saa1 and Saa2 family genes at the nucleotide sequence level by a systematic survey of 6 inbred mouse strains from 4 Mus subspecies, including Mus musculus domesticus, Mus musculus musculus, Mus musculus castaneus, and Mus spretus. Saa1 and Saa2 genes were obtained from the mouse genome by PCR amplification, and each full-length nucleotide sequence was determined. We found that Mus musculus castaneus mice uniquely possess 2 divergent Saa1 genes linked on chromosome 7. Overall, the mouse strains had distinct composite patterns of amino acid substitutions at 9 positions in SAA1 and SAA2 isoforms. The mouse strains also had distinct composite patterns of 2 polymorphic upstream regulatory elements that influenced gene transcription in in vitro reporter assays. B haplotype mice were revealed to possess an LTR insertion in the downstream region of Saa1.
    [Show full text]
  • Human Serum Amyloid a (SAA)
    Clinical and Inflammation Research Area Human serum amyloid A (SAA) erum amyloid A recombinant SAA as well as purified endogenous apolipoprotein SAA has a tendency to aggregate and form oligomers Sfamily consists of (4-6). Presumably, the association of SAA molecules three members that in is mediated by amino acid residues located within human beings are cod- α-helix regions 1 (residues 2-8) and 3 (residues ed by different genes: 52-59) (4). SAA1, SAA2, and SAA4 (reviewed in 1-3). SAA1 The biological function of SAA and SAA2 are so-called acute phase isoforms. The biological function of SAA in inflammation is Their expression is in- unclear. It has been suggested that SAA is involved creased in response to in the recycling of cholesterol from damaged tissues. inflammation. SAA4 is It might play the role of a signaling molecule that a constitutive isoform, redirects HDL particles to activated macrophages the expression of which does not change during an and mediates the removal of stored cholesterol from acute-phase response. In addition, one more related them. Released cholesterol is then transferred to HDL gene (SAA3) has been identified, although this gene to be used again in the membranes of new cells that are is not expressed in human beings. required during acute inflammation and tissue repair (7). Besides that, published studies demonstrate Biochemical properties of SAA that recombinant SAA exhibits significant proinflammatory activity by inducing the synthesis SAA1 and SAA2 are synthesized in the liver of several cytokines and promoting chemotaxis for and secreted to the blood. When in the blood, monocytes and neutrophils in vitro (1, 8).
    [Show full text]
  • (SAA2) (NM 030754) Human Recombinant Protein Product Data
    OriGene Technologies, Inc. 9620 Medical Center Drive, Ste 200 Rockville, MD 20850, US Phone: +1-888-267-4436 [email protected] EU: [email protected] CN: [email protected] Product datasheet for TP304977 serum amyloid A2 (SAA2) (NM_030754) Human Recombinant Protein Product data: Product Type: Recombinant Proteins Description: Recombinant protein of human serum amyloid A2 (SAA2), transcript variant 1 Species: Human Expression Host: HEK293T Tag: C-Myc/DDK Predicted MW: 13.3 kDa Concentration: >50 ug/mL as determined by microplate BCA method Purity: > 80% as determined by SDS-PAGE and Coomassie blue staining Buffer: 25 mM Tris.HCl, pH 7.3, 100 mM glycine, 10% glycerol Bioactivity: Cell treatment (PMID: 29757436) Preparation: Recombinant protein was captured through anti-DDK affinity column followed by conventional chromatography steps. Storage: Store at -80°C. Stability: Stable for 12 months from the date of receipt of the product under proper storage and handling conditions. Avoid repeated freeze-thaw cycles. RefSeq: NP_110381 Locus ID: 6289 UniProt ID: P0DJI9 RefSeq Size: 594 Cytogenetics: 11p15.1 RefSeq ORF: 366 Synonyms: SAA; SAA1 This product is to be used for laboratory only. Not for diagnostic or therapeutic use. View online » ©2021 OriGene Technologies, Inc., 9620 Medical Center Drive, Ste 200, Rockville, MD 20850, US 1 / 2 serum amyloid A2 (SAA2) (NM_030754) Human Recombinant Protein – TP304977 Summary: This gene encodes a member of the serum amyloid A family of apolipoproteins. The encoded preproprotein is proteolytically processed to generate the mature protein. This protein is a major acute phase protein that is highly expressed in response to inflammation and tissue injury.
    [Show full text]
  • Influence of Serum Amyloid a (SAA1) and SAA2 Gene Polymorphisms On
    Influence of Serum Amyloid A (SAA1) and SAA2 Gene Polymorphisms on Renal Amyloidosis, and on SAA/ C-Reactive Protein Values in Patients with Familial Mediterranean Fever in the Turkish Population AYSIN BAKKALOGLU, ALI DUZOVA, SEZA OZEN, BANU BALCI, NESRIN BESBAS, REZAN TOPALOGLU, FATIH OZALTIN, and ENGIN YILMAZ ABSTRACT. Objective. To evaluate the effect of serum amyloid A (SAA) 1 and SAA2 gene polymorphisms on SAA levels and renal amyloidosis in Turkish patients with familial Mediterranean fever (FMF). Methods. SAA1 and SAA2 gene polymorphisms and SAA levels were determined in 74 patients with FMF (39 female, 35 male; median age 11.5 yrs, range 1.0–23.0). All patients were on colchicine therapy. SAA1 and SAA2 gene polymorphisms were analyzed using polymerase chain reaction restriction fragment length polymorphism (PCR-RFLP). SAA and C-reactive protein (CRP) values were measured and SAA/CRP values were calculated. Results. The median SAA level was 75 ng/ml (range 10.2–1500). SAA1 gene polymorphisms were: α/α genotype in 23 patients (31.1%), α/ß genotype in 30 patients (40.5%), α/γ genotype in one patient (1.4 %), ß/ß genotype in 14 patients (18.9%), ß/γ genotype in 5 patients (6.8 %), and γ/γ geno- type in one patient (1.4%). Of the 23 patients who had α/α genotype for the SAA1 polymorphism, 7 patients had developed renal amyloidosis (30.4%) compared to only one patient without this geno- type (1/51; 2.0%); p < 0.001. SAA2 had no effect on renal amyloidosis. SAA1 and SAA2 genotypes had no significant effect on SAA levels.
    [Show full text]
  • Comparison of Gene Expression Profiles
    [CANCER RESEARCH 62, 3939–3944, July 15, 2002] Advances in Brief Comparison of Gene Expression Profiles between Hepatitis B Virus- and Hepatitis C Virus-infected Hepatocellular Carcinoma by Oligonucleotide Microarray Data on the Basis of a Supervised Learning Method1 Norio Iizuka, Masaaki Oka,2 Hisafumi Yamada-Okabe, Naohide Mori, Takao Tamesa, Toshimasa Okada, Norikazu Takemoto, Akira Tangoku, Kenji Hamada, Hironobu Nakayama, Takanobu Miyamoto, Shunji Uchimura, and Yoshihiko Hamamoto Departments of Surgery II [N. I., M. O., N. M., T. T., T. O., N. T., A. T.] and Bioregulatory Function [N. I.], Yamaguchi University School of Medicine, Yamaguchi 755-8505; Department of Computer Science and Systems Engineering, Faculty of Engineering, Yamaguchi University, Yamaguchi 755-8611 [T. M., S. U., Y. H.]; and Department of Oncology, Nippon Roche Research Center, Kanagawa 247-8530 [H. Y-O., K. H., H. N.], Japan Abstract nisms responsible for the pathogenesis of HCC differ between HBV and HCV infections. Several studies compared gene expression be- Gene expression profiles of hepatocellular carcinomas (HCCs) associ- tween nontumorous liver and HCC and revealed gene expression ated with hepatitis B virus (HBV) and hepatitis C virus (HCV) were patterns that are rather specific to HCC (10–14). However, there is analyzed and compared. Oligonucleotide microarrays containing >6000 genes and subsequent gene selection by a supervised learning method only one study that compared gene expression patterns between HCC yielded 83 genes for which expression differed between the two types of with HBV infection (B-type HCC) and HCC with HCV infection HCCs. Expression levels of 31 of these 83 genes were increased in HBV- (C-type HCC; 14), and only a limited number of specimens were associated HCCs, and expression levels of the remaining 52 genes were analyzed.
    [Show full text]
  • (12) United States Patent (10) Patent No.: US 7,662,389 B2 Clark Et Al
    USOO7662389B2 (12) United States Patent (10) Patent No.: US 7,662,389 B2 Clark et al. (45) Date of Patent: Feb. 16, 2010 (54) USE OF SERUM AMYLOIDA GENE IN Cotton et al., Proc. Natl. Acad. Sci. USA 85.4397 (1988): “Reactivity DAGNOSIS AND TREATMENT OF of cytosine and thymine in single-base-pair mismatches with GLAUCOMIA AND IDENTIFICATION OF hydroxylamine and osmium tetroxide and its application to the study ANT-GLAUCOMAAGENTS of mutations'. Cotton, Mutat. Res. 285:125-144 (1993); "Current methods of muta tion detection'. (75) Inventors: Abbot F. Clark, Arlington, TX (US); Croninet al., Human Mutation 7:244-255 (1996); "Cystic Fibrosis Wan-Heng Wang, Grapevine, TX (US); Mutation Detection by Hybridization to Light-Generated DNA Loretta Graves McNatt, Hurst, TX Probe Arrays”. (US) Ermilov et al. (1993); Arkh Patol.; “Senile amyloidosis of the eye as a manifestation of senile cerebral amyloidosis' abstract with article (73) Assignee: Alcon, Inc., Hunenberg (CH) in Russian 55(6):43-45. Furlenato, CJ, and Campa A, Biochem. Biophys. Res. Commun 268:405-408 (2002), “A novel function of serum amyloid A. apotent (*) Notice: Subject to any disclaimer, the term of this stimulus for the release of tumor necrosis factor-alpha, interleukin-1 patent is extended or adjusted under 35 beta, and interleukin-8 by human blood neutrophil”. U.S.C. 154(b) by 130 days. Gasparini et al., Mol. Cell Probes 6:1-7 (1992); "Restriction site generating-polymerase chain reaction (RG-PCR) for the probeless (21) Appl. No.: 11/615.454 detection of hidden genetic variation: application to the study of some common cystic fibrosis mutations'.
    [Show full text]
  • Serum Amyloid a (SAA1) Mouse Monoclonal Antibody [Clone ID: 585] Product Data
    OriGene Technologies, Inc. 9620 Medical Center Drive, Ste 200 Rockville, MD 20850, US Phone: +1-888-267-4436 [email protected] EU: [email protected] CN: [email protected] Product datasheet for AM09286PU-N Serum Amyloid A (SAA1) Mouse Monoclonal Antibody [Clone ID: 585] Product data: Product Type: Primary Antibodies Clone Name: 585 Applications: ELISA, WB Recommended Dilution: ELISA. Western Blot: Use of this SSA antibody at a concentration of 0.1-0.5 µg will allow visualization of 100 ng/lane of recombinant Human SAA. Reactivity: Human Host: Mouse Isotype: IgG2b Clonality: Monoclonal Immunogen: Highly purified recombinant Human SAA. Specificity: Reacts with natural and recombinant Human SAA. Does not show any cross-reaction with other Human Cytokines or Growth Factors tested such as IL1 beta, IL-8, MCAF, TGF beta and EGF. Formulation: 0.01 M PBS, pH 7.2 without preservatives. State: Aff - Purified State: Lyophilized purified IgG fraction. Reconstitution Method: Restore with Double distillated water to adjust the final concentration to 1.0 mg/ml. Purification: Affinity Chromatography on Protein G. Conjugation: Unconjugated Storage: Store the antibody at -20°C. Avoid repeated freezing and thawing. Stability: Shelf life: one year from despatch. Gene Name: Homo sapiens serum amyloid A1 (SAA1), transcript variant 1 Database Link: Entrez Gene 6288 Human P0DJI8 This product is to be used for laboratory only. Not for diagnostic or therapeutic use. View online » ©2021 OriGene Technologies, Inc., 9620 Medical Center Drive, Ste 200, Rockville, MD 20850, US 1 / 2 Serum Amyloid A (SAA1) Mouse Monoclonal Antibody [Clone ID: 585] – AM09286PU-N Background: The Serum Amyloid A (SAA) family comprises a number of differentially expressed lipoproteins, acute phase SAA1 and SAA2, the former being a major component in plasma, and constitutive SAA's (C-SAAs).
    [Show full text]
  • Recombinant Mouse Serum Amyloid A1 Catalog Number: 2948-SA
    Recombinant Mouse Serum Amyloid A1 Catalog Number: 2948-SA DESCRIPTION Source E. coli-derived mouse Serum Amyloid A1 protein Gly20-Tyr122 Accession # P05366 N-terminal Sequence Gly20 Analysis Predicted Molecular 11.8 kDa Mass SPECIFICATIONS SDS-PAGE 11 kDa, reducing conditions Activity Measured by its ability to induce TNF-α secretion by J774A.1 mouse reticulum cell sarcoma macrophage cells. The ED50 for this effect is 1.5-7.5 μg/mL. Endotoxin Level <0.10 EU per 1 μg of the protein by the LAL method. Purity >95%, by SDS-PAGE visualized with Silver Staining and quantitative densitometry by Coomassie® Blue Staining. Formulation Lyophilized from a 0.2 μm filtered solution in Tris-HCl, NaCl, PEG and Imidazole. See Certificate of Analysis for details. PREPARATION AND STORAGE Reconstitution Reconstitute at 100 μg/mL in PBS. Shipping The product is shipped at ambient temperature. Upon receipt, store it immediately at the temperature recommended below. Stability & Storage Use a manual defrost freezer and avoid repeated freeze-thaw cycles. 12 months from date of receipt, -20 to -70 °C as supplied. 1 month, 2 to 8 °C under sterile conditions after reconstitution. 3 months, -20 to -70 °C under sterile conditions after reconstitution. BACKGROUND Mouse Serum Amyloid A protein-1 (SAA1; previously SAA2 in mouse) is a multifunctional apolipoprotein produced by hepatocytes in response to pro-inflammatory cytokines (1 - 4). It is secreted as a 12 kDa, 103 amino acid (aa), nonglycosylated protein and circulates as part of the HDL complex (1 - 4). The SAA1 gene is one of five SAA genes in mouse (3).
    [Show full text]
  • Mouse Anti-Human Serum Amyloid a Monoclonal Antibody, Clone 4I84 (DCABH-6007) This Product Is for Research Use Only and Is Not Intended for Diagnostic Use
    Mouse Anti-Human Serum Amyloid A Monoclonal Antibody, clone 4I84 (DCABH-6007) This product is for research use only and is not intended for diagnostic use. PRODUCT INFORMATION Specificity Recognize human SAA1. Do not recognize human SAA2, human SAA4, feline, canine, equine or bovine SAA. Immunogen Human serum amyloid A (SAA) Isotype IgG1 Source/Host Mouse Species Reactivity Human Clone 4I84 Purity Protein G purified Purification Purity≥ 95 % Conjugate Unconjugated Applications ELISA(Cap), ELISA(Det), LFIA We recommend the following for sandwich ELISA (Capture - Detection): DMAB8184 - DCABH-6007; DMAB8186 - DCABH-6007; DCABH-6007 - CABT-54364MH; CABT-54364MH - DCABH-6007 Epitope Recognizes aa 64-71 but also reacts with aa 28-38 indicating that the epitope may be partly conformational. Format Liquid Concentration Lot specific Size 1 mg Buffer 50 mM Na-citrate, pH 6.0, 0.9 % NaCI, 0.095 % NaN3 as a preservative. Preservative 0.095% sodium azide Storage Store at 2-8℃. 45-1 Ramsey Road, Shirley, NY 11967, USA Email: [email protected] Tel: 1-631-624-4882 Fax: 1-631-938-8221 1 © Creative Diagnostics All Rights Reserved Ship Wet ice Warnings This product is for research use only and is not intended for diagnostic use. BACKGROUND Introduction Serum amyloid A (SAA) proteins are a family of apolipoproteins associated with high-density lipoprotein (HDL) in plasma. Different isoforms of SAA are expressed constitutively (constitutive SAAs) at different levels or in response to inflammatory stimuli (acute phase SAAs). These proteins are produced predominantly by the liver. The conservation of these proteins throughout invertebrates and vertebrates suggests that SAAs play a highly essential role in all animals.
    [Show full text]
  • Complete Primary Structures of Two Major Murine Serum Amyloid A
    Proc. Nati. Acad. Sci. USA Vol. 82, pp. 2915-2919, May 1985 Immunology Complete primary structures of two major murine serum amyloid A proteins deduced from cDNA sequences (amyloidosis/cDNA cloning/sequence/acute-phase protein) KEN-ICHI YAMAMOTO AND SHUNSUKE MIGITA Department of Molecular Immunology, Cancer Research Institute, Kanazawa University, Kanazawa 920, Japan Communicated by Frank W. Putnam, December 26, 1984 ABSTRACT cDNA clones encoding two major mouse mouse amyloid A protein has shown that mouse amyloid A serum amyloid A proteins, SAA1 and SAA2, were isolated from protein contains only a single type of amino-terminal amino a liver cDNA library of the lipopolysaccharide-stimulated acid sequence, which is identical with that of SAA2, indicat- BALB/c mouse, and their nucleotide sequences were deter- ing that amyloid A protein is derived predominantly from mined. The insert of the SAA2 cDNA clone contained 607 SAA2 (13). However, it is not known what structural differ- nucleotides with a 5' untranslated region of 36 nucleotides, a ences are responsible for selective deposition of SAA2 in signal peptide region corresponding to 19 amino acids, a amyloid tissues. In the present study, we have isolated two mature protein region corresponding to 103 amino acids, and cDNA clones corresponding to mouse SAA1 and SAA2 a 3' untranslated region of 202 nucleotides. The SAA1 cDINA mRNA and have determined the nucleotide sequences. A insert contained 549 nucleotides specifying a part of a signal comparison of the amino acid sequences deduced from the peptide region, a mature protein region, and a 3' untranslated cDNA sequences reveals amino acid differences in nine region.
    [Show full text]
  • SAA2 and SAA1 Amyloid a Genes, Activation of the Human Acute
    Differential Glucocorticoid Enhancement of the Cytokine-Driven Transcriptional Activation of the Human Acute Phase Serum Amyloid A Genes, SAA1 and SAA2 This information is current as of September 28, 2021. Caroline F. Thorn and Alexander S. Whitehead J Immunol 2002; 169:399-406; ; doi: 10.4049/jimmunol.169.1.399 http://www.jimmunol.org/content/169/1/399 Downloaded from References This article cites 47 articles, 20 of which you can access for free at: http://www.jimmunol.org/content/169/1/399.full#ref-list-1 http://www.jimmunol.org/ Why The JI? Submit online. • Rapid Reviews! 30 days* from submission to initial decision • No Triage! Every submission reviewed by practicing scientists • Fast Publication! 4 weeks from acceptance to publication by guest on September 28, 2021 *average Subscription Information about subscribing to The Journal of Immunology is online at: http://jimmunol.org/subscription Permissions Submit copyright permission requests at: http://www.aai.org/About/Publications/JI/copyright.html Email Alerts Receive free email-alerts when new articles cite this article. Sign up at: http://jimmunol.org/alerts The Journal of Immunology is published twice each month by The American Association of Immunologists, Inc., 1451 Rockville Pike, Suite 650, Rockville, MD 20852 Copyright © 2002 by The American Association of Immunologists All rights reserved. Print ISSN: 0022-1767 Online ISSN: 1550-6606. The Journal of Immunology Differential Glucocorticoid Enhancement of the Cytokine-Driven Transcriptional Activation of the Human Acute Phase Serum Amyloid A Genes, SAA1 and SAA2 Caroline F. Thorn and Alexander S. Whitehead1 The human acute phase serum amyloid A (A-SAA) genes, SAA1 and SAA2, have a high degree of sequence identity that extends ϳ450 bp upstream of their transcription start sites.
    [Show full text]