Overview of the Chemical Compound and Drug Name Recognition (CHEMDNER) Task

Total Page:16

File Type:pdf, Size:1020Kb

Overview of the Chemical Compound and Drug Name Recognition (CHEMDNER) Task Overview of the chemical compound and drug name recognition (CHEMDNER) task Martin Krallinger1⋆, Florian Leitner1, Obdulia Rabal2, Miguel Vazquez1, Julen Oyarzabal2, and Alfonso Valencia1 1 Structural Computational Biology Group, Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre, Madrid, Spain 2 Small Molecule Discovery Platform, Center for Applied Medical Research (CIMA), University of Navarra {mkrallinger, fleitner, avalencia}@cnio.es http://www.cnio.es Abstract. There is an increasing need to facilitate automated access to information relevant for chemical compounds and drugs described in text, including scientific articles, patents or health agency reports. A number of recent efforts have implemented natural language processing (NLP)andtextminingtechnologiesforthechemicaldomain(ChemNLP orchemicaltextmining).DuetothelackofmanuallylabeledGoldStan- dard datasets together with comprehensive annotation guidelines, both theimplementationaswellasthecomparativeassessmentofChemNLP technologiesBSFopaque.Twokeycomponentsformostchemicaltextmin- ing technologies are the indexing of documents with chemicals (chemi- cal document indexing - CDI) and finding the mentions of chemicals in text (chemical entity mention recognition - CEM). These two tasks formed part of the chemical compound and drug named entity recog- nition (CHEMDNER) task introduced at the fourth BioCreative chal- lenge, a community effort to evaluate biomedical text mining applica- tions.Forthistask,theCHEMDNERtextcorpuswasconstructed,con- sistingof10,000abstractscontainingatotalof84,355mentionsofchem- ical compounds and drugs that have been manually labeled by domain expertsfollowingspecificannotationguidelines.Thiscorpuscoversrep- resentative abstracts from major chemistry-related sub-disciplines such asmedicinalchemistry,biochemistry,organicchemistryandtoxicology. A total of 27 teams – 23 academic and 4 commercial HSPVQT, comprisedof 87 researchers – submitted results for this task. Of these teams, 26 provided submissions for the CEM subtask and 23 for the CDI subtask. Teams were provided with the manual annotations of 7,000 abstracts toimplement and train their systems and then had to return predictions for the 3,000 test set abstracts during a short period of time. When comparing exact matches of the automated results against the manually labeled Gold Standard annotations, the best teams reached an F-score ⋆ Corresponding author Proceedings of the fourth BioCreative challenge evaluation workshop, vol. 2 of 87.39% JO the CEM task and of 88.20% JO the CDI task. This can beregardedasaverycompetitiveresultwhencomparedtotheexpected upper boundary, the agreement between to human annotators, at 91%. In general, the technologies used to detect chemicals and drugs by the teams included machine learning methods (particularly CRFs using a considerablerangeofdifferentfeatures),interactionofchemistry-related lexical resources and manual rules (e.g., to cover abbreviations, chemi- calformulaorchemicalidentifiers).Bypromotingtheavailabilityofthe software of the participating systems as well as through the release of the CHEMDNER corpus to enable implementation of new tools, this work fosters the development of text mining applications like the au- tomatic extraction of biochemical reactions, toxicological properties of compounds,orthedetectionofassociationsbetweengenesormutations BOEdrugsinthecontextpharmacogenomics. Key words: CHEMDNER; ChemNLP; BioCreative; Named Entity Recog- nition; Chemical compounds; Drugs; Text Mining 1 Introduction There is an increasing interest, both on the academic side as well as the in- dustry, to facilitate more efficient access to information on chemical compounds and drugs (chemical entities) described in text repositories, including scientific articles, patents, health agency reports, or the Web. Chemicals and drugs are also among the entity types most frequently searched in the PubMed database. With the rapid accumulation of scientific literature, linking chemical entities to articles through manually curation cannot cope with the amount of newly pub- lished articles. If text mining methods should improve information access for this domain, two fundamental initial tasks need to be addressed first, namely (1) identifying mentions of chemical compounds within the text and (2) indexing the documents with the compounds described in them. The recognition of chem- ical entities is also crucial for other subsequent text processing strategies, such as detection of drug-protein interactions, adverse effects of chemical compounds and their associations to toxicological endpoints, or the extraction of pathway and metabolic reaction relations. Most of the research carried out in biomedical sciences and also bioinformat- ics is centered on a set of entities, mainly genes, proteins, chemicals and drugs. There is a considerable demand in facilitating a more efficient retrieval of docu- ments and sentences that describe these entities. Also a more granular extraction of certain types of descriptions and relationships from the literature on these bio- entities has attracted considerable attention [8]. For finding bio-entity relevant documents by text retrieval (TR) systems as well as for the automatic extraction of particular relationships using information extraction (IE) methods a key step is the correct identification of bio-entity referring expressions in text. Finding the mentions of entities in text is commonly called named entity recognition (NER). Originally NER systems were built for the identification in text of pre- defined categories of proper names from the general domain (newswire texts), BioCreative IV - CHEMDNER track such as organizations, locations or person names [10]. Among the bio-entities that were studied in more detail so far are mentions of genes, proteins, DNA, RNA, cell lines, cell types, drugs, chemical compounds and mutations [9, 6, 15, 8]. Some effort has also been dedicated to find medication and disease names as well as mentions of species and organisms [3] in text. A common characteristic of most named entities is that they generally refer to some sort of real world object or object class. From a linguistic perspective, named entities are often characterized by being referenced in text by their proper names. Most of the previous research related to the recognition of biomedical entities focused mainly on the detection of gene/protein mentions. For the detection of gene/protein mentions in text, previous BioCreative challenges (BioCreative I and II) could attracted a considerable interest and served to determine the performance of various methods under a controlled evaluation setting [13]. Despite its importance, only a rather limited number of publicly accessible chemical compound recognition systems are currently available [15]. Among the main bottlenecks at present encountered to implement and compare the perfor- mance of such systems are: (a) the lack of suitable training/test data, (b) the intrinsic difficulty in defining annotation guidelines of what actually constitutes a chemical compound or drug, (c) the heterogeneity in terms of scope and tex- tual data sources used, as well as (d) the lack of comparative evaluation efforts for this topic. To addresses these issues, the CHEMDNER task was organized, a commu- nity challenge to examine the efficiency of automatic recognition of chemical entities in text. From the point of view of the teams that submitted results for this task, the main motivations for participation provided by them included using the framework offered by the CHEMDNER task as a way to (a) im- prove/demonstrate the performance of their software, (b) push the state of the art for this topic, (c) test their tools on a different corpus, (d) adapt their system to detect (new) types of chemical entities as defined in the CHEMDNER cor- pus, (e) to be able compare their system to other strategies, (f) to improve the scalability of their system and (g) to explore the integration of different NER system for this task. In order to regulate the naming of entities in natural sciences, one strategy is to define some nomenclature and terminological guidelines to constrain how such entities are correctly constructed. For instance the International Union of Pure and Applied Chemistry (IUPAC) has defined a set of rules for the chemical nomenclature. Unfortunately naming standards are not sufficiently followed in practice when examining the scientific literature [14]. Chemical science is characterized by a considerable degree of specialization, resulting in implicit variability across diGGerent sub-disciplines. 5IFSFGPSF an almostarbitrary collection of expressions may be used in the literature to refer to thevery same chemical compound. This variability can be explained by the use ofaliases, e.g. diGGerent synonyms used for the same entity. In addition one can encounter variability through alternative typographical expressions. The problem of variability has an eGGect on the resulting recall of text mining systems, Proceedings of the fourth BioCreative challenge evaluation workshop, vol. 2 requiring mapping of the various synonyms and alternative variants into a single canonical form. On the other hand, a given word can correspond to a chemical entity or to some other concept. This results in ambiguities that can only be resolved by taking into account the context of the mentions (context-based sense disam- biguation). With this respect an important source of ambiguity is the heavy use of acronyms, abbreviations,
Recommended publications
  • General Inorganic Chemistry
    General Inorganic Chemistry Pre DP Chemistry Period 1 • Teacher: Annika Nyberg • [email protected] • Urgent messages via Wilma! Klicka här för att ändra format på underrubrik i bakgrunden • Course book: • CliffsNotes: Chemistry Quick Review http://www.chem1.com/acad/webtext/virtualtextbook.html Content • Introduction • The Structure of Matter (Chapter 1) • The Atom (Chapter 2 and 3) • Chemical bonding (Chapter 5) • The Mole (Chapter 2) • Solutions (Chapter 9) • Acids and bases (Chapter 10) • Quiz • Revision • EXAM 9.00-11.45 Assessment Exam: 80 % Quiz: 20% + practical work, activity and absences 1. Chemistry: a Science for the twenty-first century ● Chemistry has ancient roots, but is now a modern and active, evolving science. ● Chemistry is often called the central science, because a basic knowledge of chemistry is essential for students in biology, physics, geology and many other subjects. https://www.youtube.com/watch?v=tTlnrhiadnI ● Chemical research and development has provided us with new substances with specific properties. These substances have improved the quality of our lives. Health and medicine vaccines sanitation systems antibiotics anesthesia and all other drugs Energy new alternative energy sources (e.g. solar energy to electric energy, nuclear fission) electric cars with long lasting batteries Environment greenhouse gases acid rain and smog Materials and Technology ● polymers (rubber and nylon), ceramics (cookware), liquid crystals (electronic displays), adhesives (Post-It notes), coatings (latex-paint), silicon chips (computers) Food and Agriculture substances for biotechnology ● The purpose of this course is to make you understand how chemists see the world. ● In other words, if you see one thing (in the macroscopic world) you think another (visualize the particles and events in the microscopic world).
    [Show full text]
  • Basic Chemistry
    CH 2- THE CHEMISTRY OF LIFE Atoms . The study of chemistry begins with the basic unit of matter, the atom. The Greek philosopher Democritus called the smallest fragment of matter the atom, from the Greek word atomos. Atoms (cont.) . Placed side by side, 100 million atoms would make a row only about 1 centimeter long. Atoms contain subatomic particles that are even smaller. Atoms (cont.) . What three subatomic particles make up atoms? Atoms (cont.) . The subatomic particles that make up atoms are . protons . neutrons . electrons Atoms (cont.) . Smallest property of an element that still has the properties of that element . “The building blocks of matter” . Atoms are made of smaller (subatomic) particles arranged in a particular way . p+ (proton) . n° (neutron) . e- (electron) Atoms (cont.) . Protons and neutrons have about the same mass. Protons are positively charged particles (+). Neutrons carry no charge (◦). Strong forces bind protons and neutrons together to form the nucleus, which is at the center of the atom. Atoms (cont.) . The electron is a negatively charged particle (−) with 1/1840 the mass of a proton. Electrons are in constant motion in the space surrounding the nucleus (e- cloud). Atoms (cont.) . The subatomic particles in a helium atom. Atoms (cont.) • Electrons are attracted to the positively charged nucleus but remain outside the nucleus because of the energy of their motion. • Because atoms have equal numbers of electrons and protons, and because these subatomic particles have equal but opposite charges, atoms are neutral. Atoms (cont.) . Atomic number- # of p+ AND electrons in an atom . Mass number- total # of p+ + n° in an atom .
    [Show full text]
  • All About the Chemical Bonds and Compounds
    All about the Chemical Bonds and Compounds Video Transcript Almost everything we see or touch in daily life – such as the food we eat, the water we drink, the air we breathe, and so on – is the result of chemical bonding. In other words, the world around us is generally not composed of isolated atoms. Instead, atoms bond to one another to form molecules and hence chemical compounds, which make up the world around us. Chemical bonding is the physical process that causes atoms and molecules to be attracted to each other and held together in more stable chemical compounds. There are three primary types of chemical bonds and compounds: Ionic bonds, covalent bonds, and metallic bonds. With ionic bonds, electrons are exchanged or transferred between atoms. They exist in ionic compounds. Generally, they’re a metal and a nonmetal – sodium chloride, magnesium oxide, etc. With covalent bonds, electrons are shared among the atoms. They exist in covalent and molecular compounds. Generally, they are nonmetals – carbon dioxide, dihydrogen monoxide, etc. With metallic bonds, a pool of electrons roam freely across entire molecule. They exist in metallic compounds. Generally, they’re metals and alloys – copper, gold, etc. A chemical compound is a group of two or more different atoms that are attracted to each other. Compounds can be divided into ionic compounds, covalent compounds, and metallic compounds. This table lists some key properties of each. Be aware that it is the valence electrons (those in the outermost level) that are involved in bonding. Atoms try to fill their outer energy levels because it’s energetically favorable for atoms to be in that configuration and it makes them stable.
    [Show full text]
  • Chemical Formula
    Chemical Formula Jean Brainard, Ph.D. Say Thanks to the Authors Click http://www.ck12.org/saythanks (No sign in required) AUTHOR Jean Brainard, Ph.D. To access a customizable version of this book, as well as other interactive content, visit www.ck12.org CK-12 Foundation is a non-profit organization with a mission to reduce the cost of textbook materials for the K-12 market both in the U.S. and worldwide. Using an open-content, web-based collaborative model termed the FlexBook®, CK-12 intends to pioneer the generation and distribution of high-quality educational content that will serve both as core text as well as provide an adaptive environment for learning, powered through the FlexBook Platform®. Copyright © 2013 CK-12 Foundation, www.ck12.org The names “CK-12” and “CK12” and associated logos and the terms “FlexBook®” and “FlexBook Platform®” (collectively “CK-12 Marks”) are trademarks and service marks of CK-12 Foundation and are protected by federal, state, and international laws. Any form of reproduction of this book in any format or medium, in whole or in sections must include the referral attribution link http://www.ck12.org/saythanks (placed in a visible location) in addition to the following terms. Except as otherwise noted, all CK-12 Content (including CK-12 Curriculum Material) is made available to Users in accordance with the Creative Commons Attribution-Non-Commercial 3.0 Unported (CC BY-NC 3.0) License (http://creativecommons.org/ licenses/by-nc/3.0/), as amended and updated by Creative Com- mons from time to time (the “CC License”), which is incorporated herein by this reference.
    [Show full text]
  • Chapter 10 – Chemical Reactions Notes
    Chapter 8 – Chemical Reactions Notes Chemical Reactions: Chemical reactions are processes in which the atoms of one or more substances are rearranged to form different chemical compounds. How to tell if a chemical reaction has occurred (recap): Temperature changes that can’t be accounted for. o Exothermic reactions give off energy (as in fire). o Endothermic reactions absorb energy (as in a cold pack). Spontaneous color change. o This happens when things rust, when they rot, and when they burn. Appearance of a solid when two liquids are mixed. o This solid is called a precipitate. Formation of a gas / bubbling, as when vinegar and baking soda are mixed. Overall, the most important thing to remember is that a chemical reaction produces a whole new chemical compound. Just changing the way that something looks (breaking, melting, dissolving, etc) isn’t enough to qualify something as a chemical reaction! Balancing Equations Notes: Things to keep in mind when looking at the recipes for chemical reactions: 1) The stuff before the arrow is referred to as the “reactants” or “reagents”, and the stuff after the arrow is called the “products.” 2) The number of atoms of each element is the same on both sides of the arrow. Even though there may be different numbers of molecules, the number of atoms of each element needs to remain the same to obey the law of conservation of mass. 3) The numbers in front of the formulas tell you how many molecules or moles of each chemical are involved in the reaction. 4) Equations are nothing more than chemical recipes.
    [Show full text]
  • 1 5. Chemical Bonding
    5. Chemical Bonding: The Covalent Bond Model 5.1 The Covalent Bond Model Almost all chemical substances are found as aggregates of atoms in the form of molecules and ions produced through the reactions of various atoms of elements except the noble-gas elements which are stable mono-atomic gases. Chemical bond is a term that describes the attractive force that is holding the atoms of the same or different kind of atoms in forming a molecule or ionic solid that has more stability than the individual atoms. Depending on the kinds of atoms participating in the interaction there seem to be three types of bonding: Gaining or Losing Electrons: Ionic bonding: Formed between many ions formed by metal and nonmetallic elements. Sharing Electrons: Covalent bonding: sharing of electrons between two atoms of non-metals. Metallic Bonding: sharing of electrons between many atoms of metals. Ionic Compounds Covalent Compounds Metallic Compounds 1. Metal and non-meal Non-metal and non-meal Metal of one type or, element combinations. elements combinations. combinations of two or metal elements combinations. 2. High melting brittle Gases, liquids, or waxy, low Conducting, high melting, crystalline solids. melting soft solids. malleable, ductile crystalline solids. 3. Do not conduct as a solid Do not conduct electricity at Conduct electricity at solid but conducts electricity any state. and molten states. when molten. 4. Dissolved in water produce Most are soluble in non-polar Insoluble in any type of conducting solutions solvents and few in water. solvents. (electrolytes) and few These solutions are non- are soluble in non-polar conducting (non- solvents.
    [Show full text]
  • A Science That Studies the Composition and Properties of Matter. 1. Matter
    Chemistry Basics I. Intro A. Chemistry Def- a science that studies the composition and properties of matter . 1. matter- anything that takes up space and has mass. B. Basic Terms: 1. Atom: neutral particle having one nucleus; the smallest representative sample of an element. In other words it is the base building block, of an element. 2. Element: a substance in which all of the atoms have the same atomic number. A substance that cannot be broken down by chemical reactions into anything that is both stable and simpler. Furthermore, all the atoms within an element have the same number of protons. 3. Molecule: a neutral particle composed of two or more atoms combined in a definite ratio of whole numbers. 4. Compound: a substance consisting of chemically combined atoms from two or more elements and present in a definite ratio.( Oxygen mass is always 8 times that of hydrogen) 5. Physical property: a property that can be observed without changing the chemical makeup of a substance. (eg. Color) Is melting point one? 6. Chemical reaction: chemicals, starting materials (reactants), interact with each other resulting in other substances, ending materials (products), with different properties. 7. Chemical Property: The ability of a substance, either by itself or with other substances, to undergo a change into new substances. 8. Extensive Property: a property that depends on sample size.(Volume) 9. Intensive Property: properties independent of size. ( Color, electrical conductivity, melting point…) 10. Pure substance: a substance that is always the same, regardless of its source. 11. Mixtures: a physical combination of elements or compounds that has no particular proportion by mass.
    [Show full text]
  • A Revolution That Never Happened
    Studies in History and Philosophy of Science 49 (2015) 80e90 Contents lists available at ScienceDirect Studies in History and Philosophy of Science journal homepage: www.elsevier.com/locate/shpsa A Revolution that never happened Ursula Klein Max Planck Institute for the History of Science, Germany article info abstract Article history: If we define scientific revolutions as changes of scientists’ ontologies, types of causal explanation, and Available online 19 December 2014 paradigmatic types of methods and instruments, Antoine-Laurent Lavoisier’s contribution to chemistry did not amount to a scientific revolution. Contrary to the received view that Lavoisier initiated a Keywords: “chemical revolution,” which is accepted by Chang and Kusch, I argue that Lavoisier shared with the Chemical revolution; phlogistonists their “flat ontology” of chemical substance, established decades before the 1770s, their Ontology of substances; types of explaining chemical transformation, and their quantitative methods. Based on my historical Elements; reconstruction, I criticize Chang’s argument that the late eighteenth-century phlogistic systems and Mixts; ’ Chemical compounds; Lavoisier s system belonged to two different theoretical traditions. As a consequence, I also question ’ ’ Affinity Chang s argument that the acceptance of Lavoisier s system can be explained in terms of dominance of “compositionism” over “principlism.” Ó 2014 Elsevier Ltd. All rights reserved. When citing this paper, please use the full journal title Studies in History and Philosophy of Science 1. Introduction following the path to Lavoisier’s system, I will compare the phlo- gistic and the antiphlogistic system from a broader historical and As is well known, Thomas Kuhn highlighted Lavoisier’s “chem- philosophical perspective.
    [Show full text]
  • Introduction to Chemistry
    Introduction to Chemistry Author: Tracy Poulsen Digital Proofer Supported by CK-12 Foundation CK-12 Foundation is a non-profit organization with a mission to reduce the cost of textbook Introduction to Chem... materials for the K-12 market both in the U.S. and worldwide. Using an open-content, web-based Authored by Tracy Poulsen collaborative model termed the “FlexBook,” CK-12 intends to pioneer the generation and 8.5" x 11.0" (21.59 x 27.94 cm) distribution of high-quality educational content that will serve both as core text as well as provide Black & White on White paper an adaptive environment for learning. 250 pages ISBN-13: 9781478298601 Copyright © 2010, CK-12 Foundation, www.ck12.org ISBN-10: 147829860X Except as otherwise noted, all CK-12 Content (including CK-12 Curriculum Material) is made Please carefully review your Digital Proof download for formatting, available to Users in accordance with the Creative Commons Attribution/Non-Commercial/Share grammar, and design issues that may need to be corrected. Alike 3.0 Unported (CC-by-NC-SA) License (http://creativecommons.org/licenses/by-nc- sa/3.0/), as amended and updated by Creative Commons from time to time (the “CC License”), We recommend that you review your book three times, with each time focusing on a different aspect. which is incorporated herein by this reference. Specific details can be found at http://about.ck12.org/terms. Check the format, including headers, footers, page 1 numbers, spacing, table of contents, and index. 2 Review any images or graphics and captions if applicable.
    [Show full text]
  • Chemical Compounds - Vocabulary
    CHEMICAL COMPOUNDS - VOCABULARY COVALENT COMPOUND A compound that has only covalent bonds between atoms of which it is made. The covalent bond results from sharing valence electrons between two atoms. The representative unit of a covalent compound is usually a molecule. IONIC COMPOUND A compound that is built of monatomic or polyatomic ions of opposite charges. The bonds between ions of opposite charges are called ionic bonds. Many, but not all ionic compounds contain a metallic element. The representative unit of an ionic compound is always a formula unit. There no molecules in ionic compounds. Ions per se are not compounds. FORMULA UNIT The smallest combination of atoms or ions that correctly represents the ratio in which they are combined in a chemical compound. Formula unit is a more general term than a molecule. ORGANIC COMPOUND A compound that is built of atoms of carbon and other elements (most commonly, hydrogen, nitrogen, oxygen, sulfur, and halogens) except for carbon monoxide, carbon dioxide, carbonic acid, carbonates, and hydrogen carbonates. If there is no carbon present, a compound is certainly inorganic. The simplest organic compound is CH4, methane. BINARY COMPOUND Ionic or covalent compound that is made of atoms of two different elements. Examples: MgCl2; Al2O3; HCl; H2O; CH4. DIATOMIC MOLECULE A molecule that is built of exactly two atoms either of the same element or of two different elements. Examples: O2; N2; CO; HF. MONATOMIC ION A single atom that lost or gained one or more electrons as compared to the neutral atom of the same element. The electric charge must be indicted when a formula of it is written unless the ion is combined with another ion in the formula of an ionic compound.
    [Show full text]
  • 2: Alkanes and Cycloalkanes
    (2/94)(1,2,8/95)(6,7/97)(10/98)(1,9-11/99) Neuman Chapter 2 2: Alkanes and Cycloalkanes Alkanes Alkane Systematic Nomenclature Cycloalkanes Conformations of Alkanes Conformations of Cycloalkanes Conformations of Alkylcyclohexanes Preview You learned in the Chapter 1 that all organic molecules have carbon skeletons. These carbon skeletons show great diversity in the ways that C atoms bond to each other, and in their three-dimensional shapes. Alkanes and cycloalkanes consist entirely of carbon skeletons bonded to H atoms since they have no functional groups. As a result, they serve as a basis for understanding the structures of all other organic molecules. This chapter describes the skeleltal isomerism of alkanes and cycloalkanes, their three-dimensional conformations, and their systematic nomenclature that is the basis for the names of all other organic compounds. 2.1 Alkanes We refer to alkanes as hydrocarbons because they contain only C (carbon) and H (hydrogen) atoms. Since alkanes are the major components of petroleum and natural gas, they often serve as a commercial starting point for the preparation of many other classes of organic molecules. Structures of Alkanes (2.1A) Organic chemists use a variety of different types of structures to represent alkanes such as these shown for methane (one C), ethane (two C's), and propane (three C's). [graphic 2.1] Kekulé, Electron-Dot and Three-Dimensional Structures. The structures showing C and H atoms connected by lines are Kekulé structures. Remember from Chapter 1 that these lines represent chemical bonds that are pairs of electrons located in molecular orbitals encompassing the two bonded atoms.
    [Show full text]
  • Chemical Formulas the Elements
    Chemical Formulas A chemical formula gives the numbers and types of atoms that are found in a substance. When the substance is a discrete molecule, then the chemical formula is also its molecular formula. Fe (iron) is a chemical formula Fe2O3 is a molecular formula The Elements The chemical formulas of most of the elements are simply their elemental symbol: Na (sodium) Fe (iron) He (helium) U (uranium) These chemical formulas are said to be monatomic—only an atom in chemical formula 1 The Elements There are seven elements that occur naturally as diatomic molecules—molecules that contain two atoms: H2 (hydrogen) N2 (nitrogen) O2 (oxygen) F2 (fluorine) Cl2 (chlorine) Br2 (bromine) I2 (iodine) The last four elements in this list are in the same family of the Periodic Table Binary Compounds A binary compound is one composed of only two different types of atoms. Rules for binary compound formulas 1. Element to left in Periodic Table comes first except for hydrogen: KCl PCl3 Al2S3 Fe3O4 2 Binary Compounds 2. Hydrogen comes last unless other element is from group 16 or 17: LiH, NH3, B2H4, CH4 3. If both elements are from the same group, the lower element comes first: SiC, BrF3 Other Compounds For compounds with three or more elements that are not ionic, if it contains carbon, this comes first followed by hydrogen. Other elements are then listed in alphabetical order: C2H6O C4H9BrO CH3Cl C8H10N4O2 3 Other Compounds However, the preceding rule is often ignored when writing organic formulas (molecules containing carbon, hydrogen, and maybe other elements) in order to give a better idea of how the atoms are connected: C2H6O is the molecular formula for ethanol, but nobody ever writes it this way—instead the formula is written C2H5OH to indicate one H atom is connected to the O atom.
    [Show full text]