CHEMICAL REPRESENTATION GUIDE 2016 Copyright Notice
Total Page:16
File Type:pdf, Size:1020Kb
CHEMICAL REPRESENTATION GUIDE 2016 Copyright Notice ©2015 Dassault Systèmes. All rights reserved. 3DEXPERIENCE, the Compass icon and the 3DS logo, CATIA, SOLIDWORKS, ENOVIA, DELMIA, SIMULIA, GEOVIA, EXALEAD, 3D VIA, BIOVIA and NETVIBES are commercial trademarks or registered trademarks of Dassault Systèmes or its subsidiaries in the U.S. and/or other countries. All other trademarks are owned by their respective owners. Use of any Dassault Systèmes or its subsidiaries trademarks is subject to their express written approval. Acknowledgments and References To print photographs or files of computational results (figures and/or data) obtained using BIOVIA software, acknowledge the source in an appropriate format. BIOVIA may grant permission to republish or reprint its copyrighted materials. Requests should be submitted to BIOVIA Support, either through electronic mail to [email protected] , or in writing to: BIOVIA Support 5005 Wateridge Vista Drive, San Diego, CA 92121 USA No-Structures 28 Contents Reaction Representation 29 Introduction 29 Overview 1 Reaction Mapping 29 Audience for this Guide 1 Mapping Reactions Automatically 30 Prerequisite Knowledge 1 Mapping Reactions Manually 31 Related BIOVIA Documentation 1 Stereoconfiguration Atom Properties in Molecule Representation 3 Mapped Reactions 32 Substances, Structures and Fragments 3 Properties of Bonds in Mapped Reactions 32 The BIOVIA Periodic Table 3 Simple Bond Properties 32 Atom Properties 3 Combined Bond Properties 33 Charges, Radicals and Isotopes 3 Example: Ambiguity in the Fate of Reacting Valences and Implicit Hydrogens 4 Atoms and Bonds 33 Example: Multiple Fates of Bonds in Default Valences 4 Products 34 Explicit Valence 5 Stereogroup Information in Reactants and Rules for Calculation of Valence and Implicit Products 34 Hydrogen 5 Markush (Rgroup) Structures 36 Bond Types 5 Library Representation 36 Aromaticity 7 The Root Structure 37 Tautomers 8 Rgroups and Rgroup Members 38 Salts 8 Nested Rgroups 38 Tetrahedral Stereochemistry 9 One Attachment Point 40 Grouping Related Stereogenic Centers 9 Two Attachment Points 40 Structures with One Stereogenic Center 10 Unconnected Rgroup Atom 41 Structures with Multiple Stereogenic Centers 11 Null Members 42 Chiral Labels 13 Markush Structures Differ From Markush Rules for Unambiguous Representation of Queries 43 Tetrahedral Stereochemistry 15 Enumeration of Markush Structures 43 Original Accelrys Representation of Tetrahedral Stereochemistry 16 Scaffold-based Enumeration 43 Examples of Stereogroups 16 Reaction-based Enumeration 44 Example 1: Racemic Mixture 16 Rgroup Decomposition 44 Example 2: Acquiring Increasing Amounts of Homology Groups 47 Information on the Stereochemistry of a About Homology Groups 47 Sample 17 Creating Substructure Search (SSS) Queries 47 Example 3: A Mixture of Epimers in a Limitations 48 Reaction Product 20 Biopoloymer Representation 49 Stereochemistry of Allenes and Biaryls 23 Configuring Databases for Biopolymer Cis and Trans Stereochemistry 24 Registration and Searching 49 Meso Compounds 24 Condensed Representation of Biopolymers 49 Spiro Compounds 25 Hybrid Representation 49 Chemical and Data Substance Groups *Atoms Alone 49 (Sgroups) 25 Pseudoatoms Alone 50 Data Sgroups 25 *Atoms and Pseudoatoms 51 Abbreviated Structures 26 Summary of Biopolymer Structure Multiple Groups 26 Conventions 51 Other Chemical Sgroups 27 Compatibility of Sequences Created in Structural Uncertainty 27 ISIS/Draw 52 Star Atoms (*) 27 ISIS/Draw Sequences Use the Full Structure 52 Convention Reaction Product 67 ISIS/Draw Residue Templates Lack Explicit Mixture of Stereoisomers 67 Attachment Atoms 52 Aspirin Tablet 68 Stereochemistry of ISIS/Draw Residue Ordered Mixture 69 Template 52 Polymer Representation 70 Compatibility of Condensed and Full Structure Conventions 53 Introduction to Polymer Representation 70 Required Sgroup Fields for Biopolymer Structure-based and Source-based Representation 53 Representation 70 Sgroup Field for Identifying Attachment Polymer Bracket Types 71 Atoms 53 Structural Repeating Unit (SRU) Brackets 71 Sgroup Field for *Atom Representation 54 Monomer Brackets (mon) 71 Structures Used in Biopolymer Representation 54 Mer Brackets (mer) 71 Biopolymer Residues 55 Copolymer Brackets (co) 71 Single-attachment Groups 55 Additional Polymer Brackets 72 Protecting Groups 55 Cyclization and Phase Shifting 72 PEG Molecules 55 Polymer Repeat Pattern 73 Special Features of Abbreviated Structures 55 Polymers with Two Crossing Bonds 73 Template Format for Biopolymer Residues 56 Ladder-type Polymers 74 Abbreviation Class 56 Polymers with Three or More Brackets 76 Terminal Leaving Groups 57 Polymer End Groups 77 Order and Bond Matching Attributes of Why Are Representation Conventions Attachment Atoms 58 Important? 77 Attachment Atoms on Reactive Chains 58 Guidelines for Graphical Representation of Explicit Hydrogen Leaving Groups on Polymers 78 Attachment Atoms of Reactive Chains 59 Structure-based and Source-based Explicit Hydrogen Leaving Groups on Representation 78 Histidine 61 Stereoregularity in Polymers 79 Molfile Features in Biopolymer Templates 62 Using Attached Data in Polymer Abbreviation Class (SCL) 62 Representation 79 Sgroup Subscript (SMT) 62 Required Attached Data for Polymer Representation 79 Abbreviation Attachment Atom (SAP) 62 Polymer or Copolymer Type 79 Creating and Enforcing Conventions for Biopolymer Representation 63 Stereoregularity 79 Choose One Convention for Each Chemical Guidelines for Defining Additional Attached Entity 63 Data 80 Use Standard Abbreviations 63 Examples of Structure-Based Representation 81 Subsequence Searching in Isentris Applications 64 Regular Homopolymers 81 Subsequence Search Differs from Simple Homopolymers 81 Substructure Search 64 Stereoregularity 82 BIOVIA Draw Programming Interface (API) for Ladder-type Polymers 82 subsequence search 65 Irregular Homopolymers 82 Implementing Subsequence Search for Full Structure Representations 65 Alternating and Periodic Polymers 83 Statistical, Random, and Unspecified Related Documentation for BIOVIA Draw 65 Copolymers 84 Mixture Representation 66 Unspecified Copolymers 84 Ordered and Unordered Mixtures 66 Random Copolymers 85 Using * atoms for Unspecified Structures in Mixtures 66 Statistical Copolymers 86 Examples of Mixtures 67 Regular Block Copolymers 86 Unordered Mixtures 67 Ordered Diblock 86 Block Copolymer with Junction Unit 87 Dependencies Between Switches 106 Segmented Block Copolymers 87 Examples of the Interactivity of Flexmatch Star Block Copolymers 89 Switches 106 Irregular Block Copolymers 89 Sulfones 106 Chemically Modified Polymers 90 Sulfoxides 107 Graft Polymers and Copolymers 91 Thiocarboxylic Acid Salts 107 Single Graft at a Known Site 92 Organometallic Complexes 107 Mixed Graft at a Known Site 92 Flexmatch Search of Structures with Tetrahedral Stereochemistry 109 Cross-linked Polymers 92 Absolute Configuration 111 Examples of Source-based Representation 94 Relative Stereoconfigurations (OR groups) 112 Guidelines for Source-based Representation 94 Example 1 112 Homopolymers 95 Example 2 112 Alternating and Other Periodic Polymers 95 Example 3 113 Copolymers from Monomers that Do Not Homopolymerize 96 Example 4 115 Statistical, Random, and Unspecified Example 5 115 Copolymers 96 Example 6 117 Unspecified Copolymers 96 Mixtures of Relative Stereoconfigurations Random Copolymers 97 (AND groups) 117 Statistical Copolymers 98 Flexmatch Search of Polymers 119 Block Copolymer 98 Useful Combinations of Flexmatch Switches 121 Exact Search (Flexmatch) 100 Exact Match/As Drawn 121 Flexmatch Switches 100 Exact Match/As Drawn plus Tautomers 122 How to Specify Switches 100 Tetrahedral Stereochemistry 122 Description of Switches 101 Cis/Trans Geometric Stereochemistry 122 Isotopes (MAS) 101 Exact Match/As Drawn plus Stereoisomers 123 Bonds (BON, STE, TAU) 101 Tetrahedral Stereochemistry 123 Bond (BON) 101 Cis and Trans Stereochemistry 124 Stereochemistry (STE) 101 Original Tautomer Search 124 Tautomer Bonds (TAU) 102 Exact Match/As Drawn plus Salts 124 Salts and Parent Compounds (CHA, ION, The Least Restrictive Flexmatch Switches 126 FRA, SAL) 102 Exact Match/As Drawn Polymer Search 126 Charge (CHA and ION) 102 Sourced-based and Structure-based Fragments (FRA) 102 Polymer Search 126 Hydrogen Count (HYD) 103 Copolymer Search 127 Salts (SAL) 103 Substructure Search 128 Alternative Structure Representations (MET, Definition of Substructure Search 128 RAD, VAL, IgnoreChargesInPiSystems) 103 Query Features on Atoms and Bonds 129 Metal Bonds (MET) 103 Allowing or Excluding Specific Atoms 130 Radicals (RAD) 103 Atom Query Feature: Any Atom (A) 130 Valence (VAL) 104 Atom Query Feature: Heteroatoms (Q) 130 Substance Groups (Sgroups) 104 Atom Query Feature: List 130 Attached Data (DAT) 104 Atom Query Feature: Not List 130 Polymer End Groups (end) 104 Atom Query Feature: H0 131 Mixtures (MIX) 104 Atom Query Feature: Unsaturated Atom Monomer/SRU Uniqueness (MSU) 104 (u) 131 Polymers (POL) 105 Allowing a Specific Number of Attachments 132 Polymer Type (TYP) 106 Atom Query Feature: Substitution Count 132 of Zero (s0) Reserved Names for Sgroup Fields 146 Atom Query Feature: Substitution Count Guidelines for Defining Your Own Attached of One (s1) 132 Data 147 Atom Query Feature: Substitution Count Examples of Attached Data 147 of Two (s2) 133 Distinguishing Relative Stereoisomers 147 Atom Query Feature: Substitution Count Isotopic Purity 149 of Three (s3) 133 Creating Sgroup Fields in Your Database 149 Atom Query Feature: