Small Molecules of Different Origins Have Distinct Distributions of Structural Complexity That Correlate with Protein-Binding Profiles

Total Page:16

File Type:pdf, Size:1020Kb

Small Molecules of Different Origins Have Distinct Distributions of Structural Complexity That Correlate with Protein-Binding Profiles Small molecules of different origins have distinct distributions of structural complexity that correlate with protein-binding profiles Paul A. Clemonsa,1, Nicole E. Bodycombea, Hyman A. Carrinskia, J. Anthony Wilsona, Alykhan F. Shamjia, Bridget K. Wagnera, Angela N. Koehlera, and Stuart L. Schreibera,b,c,1 aBroad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, MA 02142; and bHoward Hughes Medical Institute and cDepartment of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA 02138 Contributed by Stuart L. Schreiber, September 3, 2010 (sent for review August 9, 2010) Using a diverse collection of small molecules generated from a (ii) measured protein-binding activities of each member against variety of sources, we measured protein-binding activities of each each of 100 diverse proteins using small-molecule microarrays individual compound against each of 100 diverse (sequence-unre- (SMMs) (12, 13); and (iii) correlated these computed and lated) proteins using small-molecule microarrays. We also analyzed measured properties (Fig. 1). The resulting correlations suggest structural features, including complexity, of the small molecules. that structural features of small molecules relating to hybridiza- We found that compounds from different sources (commercial, aca- tion and stereochemistry are important contributors to binding demic, natural) have different protein-binding behaviors and that proteins selectively. these behaviors correlate with general trends in stereochemical and shape descriptors for these compound collections. Increasing Results the content of sp3-hybridized and stereogenic atoms relative to To quantify stereochemical and shape complexity, we calculated compounds from commercial sources, which comprise the majority two parameters for each compound for comparative analysis be- of current screening collections, improved binding selectivity and tween the three compound sources. While it is likely that more frequency. The results suggest structural features that synthetic complex descriptors may better physically model molecular shape chemists can target when synthesizing screening collections for complexity, one motivation of the current study was to link simple biological discovery. Because binding proteins selectively can be size-independent metrics with compound performance. First, we a key feature of high-value probes and drugs, synthesizing com- defined stereochemical complexity as the proportion of carbon pounds having features identified in this study may result in C ∕C atoms that are stereogenic ( stereogenic total). This metric pro- improved performance of screening collections. vides a size-independent global assessment of stereochemical complexity, varying on the range [0,1] for each molecule. Inspect- chemical diversity ∣ cheminformatics ∣ natural products ∣ small-molecule CHEMISTRY ∣ ing histograms of this metric as a function of compound source microarrays small-molecule probes (Fig. 2) revealed that CC is lowest in stereochemical complexity (median ¼ 0.00; mean ¼ 0.022). In contrast, NP is highest in mall-molecule probe- and drug-discovery activities in acade- stereochemical complexity (median ¼ 0.24; mean ¼ 0.24), while Smia and the pharmaceutical industry often begin with high- DC has intermediate values (median ¼ 0.11; mean ¼ 0.12). throughput screening. Many thousands of small molecules are Second, we defined shape complexity as the ratio of sp3-hybri- tested with the expectation that each has potential as a discovery dized carbon atoms to total sp3- and sp2-hybridized carbons lead. Thus, assembling or synthesizing compound collections for (Csp3∕½Csp2 þ Csp3). This metric is similar to the recently reported small-molecule screening represents an important step in discov- 3 Fsp metric (14); again, this metric is size-independent, varying BIOPHYSICS AND ery success, particularly when selecting among compounds from a A COMPUTATIONAL BIOLOGY variety of synthetic and natural sources. Unbiased methods to on the range [0,1] for each molecule. Using this metric (Fig. 3 ), we observed that CC is lowest in proportion of sp3-hybridized evaluate the assay performance of compounds from different ¼ 0 22 ¼ 0 27 NP ¼ sources, and to relate performance to chemical structure (defined carbons (median . ; mean . ), is highest (median 0 55 ¼ 0 55 DC by computed structural properties) (1, 2), can provide guidance . ; mean . ), and has an intermediate distribution ¼ 0 36 ¼ 0 39 to one element of more valuable small-molecule screening (median . ; mean . ). When we restricted our analysis to carbon atoms in the molecular scaffold (15), we observed a collections. 3 Comparative analyses between compounds often involve che- decrease in overall sp carbon proportions in all three popula- minformatic analysis of compound structures (3–5) or retrospec- tions (Fig. 3B). This effect was most striking in CC (median ¼ tive analysis of compound performance by mining the literature 0.071; mean ¼ 0.16), indicating that a substantial portion of 3 (6–8) or historical data (9, 10). For example, intermediate mole- sp carbons in these molecules are in appendages rather than cular complexity has been suggested as theoretically preferable in (predominantly flat) core skeletons. In contrast, NP molecules for drug leads (11), and this relationship is supported by evidence retain a large proportion of their sp3 carbons in core skeletons mined from historical data (9). In this study, we performed un- (median ¼ 0.50; mean ¼ 0.49). biased comparisons of compounds from natural and synthetic sources by first identifying compounds with unknown activities and then exposing them to a common assay platform. We iden- Author contributions: P.A.C., A.S., A.N.K., and S.L.S. designed research; P.A.C., N.E.B., and i A.N.K. performed research; P.A.C., N.E.B., H.A.C., J.A.W., B.K.W., and A.N.K. contributed tified a compound collection comprising three subsets: ( ) 6,152 new reagents/analytic tools; P.A.C., N.E.B., H.A.C., A.S., and A.N.K. analyzed data; and compounds from commercial sources that are representative of P.A.C. and S.L.S. wrote the paper. many common screening collections (commercial compounds; The authors declare no conflict of interest. CC ii ); ( ) 6,623 compounds assembled from the academic syn- Freely available online through the PNAS open access option. thetic chemistry community using, e.g., diversity-oriented synth- 1To whom correspondence may be addressed. E-mail: [email protected] or esis (diverse compounds; DC); and (iii) 2,477 naturally occurring [email protected]. NP i compounds (natural products; ). We then ( ) analyzed distri- This article contains supporting information online at www.pnas.org/lookup/suppl/ butions of stereochemical and shape complexity for each set; doi:10.1073/pnas.1012741107/-/DCSupplemental. www.pnas.org/cgi/doi/10.1073/pnas.1012741107 PNAS ∣ November 2, 2010 ∣ vol. 107 ∣ no. 44 ∣ 18787–18792 Downloaded by guest on September 30, 2021 Fig. 1. Study design to relate structural complexity to protein-binding profiles. Three sources of compounds were studied; diverse samples of each subset are shown to illustrate differences between the subsets (all structures in the study are presented in Dataset S1 and Dataset S2). In addition to analyzing computed properties, we sought to epitope tag as a control. For the present study, we scored as “hits” determine differences in protein-binding abilities of compounds for each protein those compounds whose average deviation from from different origins. For this study, we analyzed a dataset re- control-spot intensities exceeded a fixed threshold for statistical sulting from testing all members of the compound collection in- significance, after correction for multiple hypothesis testing (see dividually in binding assays against each of 100 different purified Materials and Methods). We also eliminated compounds that proteins using the SMM platform, which gives a preliminary bound the antibody to the control epitope tag, because these indication of protein binding. In terms of sequence, proteins were events likely do not correspond to specific binding to the 100 pa- selected having a wide range of structural types rather than re- nel proteins. These experiments resulted in a matrix of binary hit presenting a family or families of proteins. In terms of function, calls for 15,252 compounds versus 100 proteins, of which 3,433 (22.5%) compounds bound at least one protein (Fig. 4). We note the proteins were selected having varying roles in transcriptional that this high fraction of compounds binding any protein is not regulation. SMM slides were exposed in triplicate to each of 100 unexpected with 100 parallel protein-binding assays; for example, purified proteins in independent experiments and to a common if hit rates were 0.5% for each protein, and hit compounds were selected randomly for each of 100 proteins, we would expect 1 − ð0.995Þ100 ≈39% of compounds to bind at least one protein. We characterized the resulting 100 protein-binding profiles for each of CC, DC, and NP subsets using two measures of per- formance that inform how useful a small-molecule collection might be to the screening community: (i) a measure of the rate at which hits were identified from each subset, and (ii) a measure of specificity of the discovered hits from each subset, based on the number of proteins bound by a given hit. First, we examined the propensity of compounds from different sources to score positive in any protein-binding assay.
Recommended publications
  • Raman Imaging of Small Biomolecules
    BB48CH15_Min ARjats.cls April 16, 2019 13:8 Annual Review of Biophysics Raman Imaging of Small Biomolecules Yihui Shen, Fanghao Hu, and Wei Min Department of Chemistry, Columbia University, New York, NY 10027, USA; email: [email protected] Annu. Rev. Biophys. 2019. 48:347–69 Keywords First published as a Review in Advance on Raman microscopy, small molecule imaging, vibrational tags, lipid March 20, 2019 metabolism, drug imaging The Annual Review of Biophysics is online at biophys.annualreviews.org Abstract Access provided by Columbia University on 06/21/19. For personal use only. https://doi.org/10.1146/annurev-biophys-052118- Imaging techniques greatly facilitate the comprehensive knowledge of Annu. Rev. Biophys. 2019.48:347-369. Downloaded from www.annualreviews.org 115500 biological systems. Although imaging methodology for biomacromolecules Copyright © 2019 by Annual Reviews. such as protein and nucleic acids has been long established, micro- All rights reserved scopic techniques and contrast mechanisms are relatively limited for small biomolecules, which are equally important participants in biological pro- cesses. Recent developments in Raman imaging, including both microscopy and tailored vibrational tags, have created exciting opportunities for non- invasive imaging of small biomolecules in living cells, tissues, and organ- isms. Here, we summarize the principle and workflow of small-biomolecule imaging by Raman microscopy. Then, we review recent efforts in imaging, for example, lipids, metabolites, and drugs. The unique advantage of Raman imaging has been manifested in a variety of applications that have provided novel biological insights. 347 BB48CH15_Min ARjats.cls April 16, 2019 13:8 Contents THENEEDFORIMAGINGSMALLBIOMOLECULES........................ 348 IMAGINGSMALLMOLECULESBYVIBRATIONALCONTRAST............. 349 LINEAR AND NONLINEAR RAMAN SCATTERING SPECTROSCOPY ANDMICROSCOPY.........................................................
    [Show full text]
  • Small Molecules, Big Impact
    FOR RESEARCH USE ONLY. NOT INTENDED FOR HUMAN OR ANIMAL DIAGNOSTIC OR THERAPEUTIC USES. STEMCELL TECHNOLOGIES INC.’S QUALITY MANAGEMENT SYSTEM IS CERTIFIED TO ISO 13485 MEDICAL DEVICE STANDARDS. TGFβ: The TGFβ superfamily members are major regulators of self-renewal and differentiation to mesendodermal lineages. p38 / p53: p38 MAPK is activated in response to cellular stresses, and influences apoptosis and cellular differentiation. p53 is activated in Pathway Activators: IDE1, IDE2; Pathway Inhibitors: SB431542, LY364947, RepSox, Dorsomorphin, LDN193189 response to cellular stresses and regulates growth arrest, apoptosis and/or DNA repair. Inhibition of p53 is used to facilitate reprogramming. Pathway Inhibitors: BIRB-796, SB202190, SB203580, Cyclic Pifithrin-α, Pifithrin-μ Hedgehog: The Hedgehog pathway is important in the differentiation of mesenchymal subtypes (e.g. osteocytes) and in neural subtypes (e.g. dopaminergic neurons). FGF: FGF signaling via MEK/ERK is critical for self-renewal and proliferation of human PSCs. However, other receptor tyrosine kinases can Pathway Activator: Purmorphamine; Pathway Inhibitors: Cyclopamine, HPI-1 also act through the MEK/ERK and PI3K signaling pathways. Pathway Activators: Pyrintegrin, PS-48; Pathway Inhbitors: Pluripotin, PD0325901, PD98059, Reversine IWP-2 WNT: The WNT pathway regulates differentiation to the mesoendodermal lineages, induces self-renewal of PSCs, and aids in reprogramming. IWP-3 Pathway Activators: BIO, CHIR99021, Kenpaullone, SB216763; Pathway Inhibitors: IWP-2, IWP-3, IWP-4,
    [Show full text]
  • Monoclonal Antibodies and Small-Molecule Drugs: WHAT GENERAL NEUROLOGISTS NEED to KNOW
    THIS CONTENT IS SPONSORED BY AMGEN INC. AND NOVARTIS PHARMACEUTICALS CORPORATION USA-334-83498 Monoclonal Antibodies and Small-Molecule Drugs: WHAT GENERAL NEUROLOGISTS NEED TO KNOW APRIL 2020 I VOL. 28 NO. 4 A Member of the Network Introduction plasma concentration 1–8 days) than IV, but this route permits self- Numerous drug modalities have been approved to treat neurologic administration at home.9,22 diseases, including small-molecule drugs (SMDs) and biologics SMDs generally have wide distribution into tissues, organs and such as monoclonal antibodies (mAbs).1-3 While both are consid- plasma.10,23 They are metabolized by CYP450 enzymes via oxidation, ered targeted therapies,2-7 each has unique characteristics that may leading to renal elimination in the urine, and by conjugation reac- affect their use in clinical practice.5,8,9 This article describes the gen- tions (eg, glucuronidation), leading to hepatic/biliary elimination in eral characteristics of SMDs and mAbs and discusses some of their the stool.10,24,25 mAbs have a small range of distribution.10,23 Particu- safety implications relevant to their use by general neurologists in larly relevant to neurology, therapeutic mAbs do not readily cross clinical practice. the blood–brain barrier and therefore have minimal distribution in the central nervous system (CNS).8,9 Therapeutic mAbs are also Characteristics of SMDs and mAbs too large for clearance by renal or hepatic mechanisms, and instead The general characteristics and pharmacokinetics (PK) of are metabolized by two primary
    [Show full text]
  • Mechanisms of Action for Small Molecules Revealed by Structural Biology in Drug Discovery
    International Journal of Molecular Sciences Review Mechanisms of Action for Small Molecules Revealed by Structural Biology in Drug Discovery Qingxin Li 1,* and CongBao Kang 2,* 1 Guangdong Provincial Engineering Laboratory of Biomass High Value Utilization, Guangdong Provincial Bioengineering Institute (Guangzhou Sugarcane Industry Research Institute), Guangdong Academy of Sciences, Guangzhou 510316, China 2 Experimental Drug Development Centre (EDDC), Agency for Science, Technology and Research (A*STAR), 10 Biopolis Road, Chromos, #05-01, Singapore 138670, Singapore * Correspondence: [email protected] (Q.L.); [email protected] (C.K.); Tel.: +86-020-84168436 (Q.L.); +65-64070602 (C.K.) Received: 12 June 2020; Accepted: 20 July 2020; Published: 24 July 2020 Abstract: Small-molecule drugs are organic compounds affecting molecular pathways by targeting important proteins. These compounds have a low molecular weight, making them penetrate cells easily. Small-molecule drugs can be developed from leads derived from rational drug design or isolated from natural resources. A target-based drug discovery project usually includes target identification, target validation, hit identification, hit to lead and lead optimization. Understanding molecular interactions between small molecules and their targets is critical in drug discovery. Although many biophysical and biochemical methods are able to elucidate molecular interactions of small molecules with their targets, structural biology is the most powerful tool to determine the mechanisms of action for both targets and the developed compounds. Herein, we reviewed the application of structural biology to investigate binding modes of orthosteric and allosteric inhibitors. It is exemplified that structural biology provides a clear view of the binding modes of protease inhibitors and phosphatase inhibitors.
    [Show full text]
  • Small Molecule and Protein Docking Introduction
    Small Molecule and Protein Docking Introduction • A significant portion of biology is built on the paradigm sequence structure function • As we sequence more genomes and get more structural information, the next challenge will be to predict interactions and binding for two or more biomolecules (nucleic acids, proteins, peptides, drugs or other small molecules) Introduction • The questions we are interested in are: – Do two biomolecules bind each other? – If so, how do they bind? – What is the binding free energy or affinity? • The goals we have are: – Searching for lead compounds – Estimating effect of modifications – General understanding of binding – … Rationale • The ability to predict the binding site and binding affinity of a drug or compound is immensely valuable in the area or pharmaceutical design • Most (if not all) drug companies use computational methods as one of the first methods of screening or development • Computer-aided drug design is a more daunting task, but there are several examples of drugs developed with a significant contribution from computational methods Examples • Tacrine – inhibits acetylcholinesterase and boost acetylcholine levels (for treating Alzheimer’s disease) • Relenza – targets influenza • Invirase, Norvir, Crixivan – Various HIV protease inhibitors • Celebrex – inhibits Cox-2 enzyme which causes inflammation (not our fault) Docking • Docking refers to a computational scheme that tries to find the best binding orientation between two biomolecules where the starting point is the atomic coordinates of the two molecules • Additional data may be provided (biochemical, mutational, conservation, etc.) and this can significantly improve the performance, however this extra information is not required Bound vs. Unbound Docking • The simplest problem is the “bound” docking problem.
    [Show full text]
  • Reporting Data from High-Throughput Screening of Small-Molecule Libraries
    COMMENTARY Reporting data from high-throughput screening of small-molecule libraries James Inglese, Caroline E Shamu & R Kiplin Guy Publications reporting results of small-molecule screens are becoming more common as academic researchers increasingly make use of high-throughput screening (HTS) facilities. However, no standards have been formally established for reporting small-molecule screening data, and often key information important for the evaluation and interpretation of results is omitted in published HTS protocols. Here, we propose concise guidelines for reporting small-molecule HTS data. High-throughput screening (HTS) is becoming providing a basis for evaluation, comparison in the assay. Controls are used to determine a routine method for identifying probes used in and replication of small-molecule screens. We an assay ‘window’ and validate the biological chemical biology. The flexibility of the process have divided small-molecule screen protocol response. It is often desirable to provide an has allowed numerous and disparate areas of information into five categories: the assay, the indication as to the efficacy of the controls. biology to engage with an equally diverse palate library, the HTS process, the post-HTS analy- For example, here is an assay strategy descrip- of chemistry. However, with this diversity, the sis of data and compound structures, and the tion that might be written for a fluorescence many components associated with HTS—the screen results (Fig. 1). Within each category, polarization assay: bioassay, the assay format, the nature of the we describe key pieces of information that are library, reagent and sample delivery meth- important for interpretation and replication This screen was carried out to identify compounds ods, the detection instrumentation, the level of a screen (Table 1).
    [Show full text]
  • Small Molecule Drug Discovery
    Small Molecule Drug Discovery Centre for TIA Pipeline Drug Candidate Accelerator Optimisation TetraQ Other NCRIS facilities Universities, Medical research institutes Other therapeutic areas within TIA Small Molecule Discovery and Development Biological Target Compounds Australian Centre for Integrated Australia Translational Preclinical Drug Medicinal Development Chemistry Facility Australia is ideally positioned to be a major player in the global Pharma ecosystem where risk‑adverse companies are looking to in‑license assets that have demonstrated development potential. To enhance the progression of early stage discoveries through to preclinical development, TIA has formed a Small Molecules Group, an integrated capability network of National Research Infrastructure across Australia. These facilities are committed to providing high quality research infrastructure to support leading‑edge research and to boost innovation in Australia. SUPPORTED BY: CONTACT US: T: (07) 3735 4433 E: [email protected] Level 14, 330 Collins Street, Melbourne VIC 3000 www.therapeuticinnovation.com.au Compounds Australia Capability/services offered: • National compound management and logistics facility for small molecule compound libraries and natural product extract and fraction libraries. • Cost effective compound management, supporting drug discovery through high throughput screening • Provides storage and management of the national open access compound collection, GRIDD’s NatureBank collection and numerous proprietary compound collections.
    [Show full text]
  • Identification and Characterization of a Small Molecule Inhibitor of Formin
    Chemistry & Biology Article Identification and Characterization of a Small Molecule Inhibitor of Formin-Mediated Actin Assembly Syed A. Rizvi,1,5 Erin M. Neidt,2,5 Jiayue Cui,1 Zach Feiger,3 Colleen T. Skau,2 Margaret L. Gardel,3 Sergey A. Kozmin,1 and David R. Kovar2,4,* 1Department of Chemistry 2Department of Molecular Genetics and Cell Biology 3Department of Physics 4Department of Biochemistry and Molecular Biology The University of Chicago, Chicago, IL 60637, USA 5These authors contributed equally to this work *Correspondence: [email protected] DOI 10.1016/j.chembiol.2009.10.006 SUMMARY nant tumor function and are highly overexpressed in different cancer cell types including colorectal, carcinoma, leukemia, Formins stimulate actin filament assembly for funda- melanoma, and lymphoid (Favaro et al., 2003; Kitzing et al., mental cellular processes including division, adhe- 2007; Sarmiento et al., 2008; Zhu et al., 2008). Determining for- sion, establishing polarity, and motility. A formin min’s numerous roles is particularly complicated because most inhibitor would be useful because most cells express organisms express multiple isoforms (Goode and Eck, 2007). multiple formins whose functions are not known and For example, there are at least 20 formin genes in plants, 18 in because metastatic tumor formation depends on the mammals, 6 each in Drosophila melanogaster and Caenorhabdi- tis elegans, 3 in fission yeast, and 2 in budding yeast (Higgs, deregulation of formin-dependent processes. We 2005). A major challenge is to elucidate all formin-dependent identified a general small molecule inhibitor of formin cellular functions in both healthy and diseased tissues. homology 2 domains (SMIFH2) by screening com- Formins are multidomain proteins that contain a highly pounds for the ability to prevent formin-mediated conserved actin assembly formin homology 2 FH2 domain and actin assembly in vitro.
    [Show full text]
  • Cell Communication and Signaling Mechanisms Terms to Learn
    LQB383 Testbank Week 8 – Cell Communication and Signaling Mechanisms Terms to learn – match the terms to the definitions -------------------------------------------------------------------------------------------------------------------------- Adaptor Intracellular signaling protein Morphogen Endocrine cell Gap junction Neurotransmitter Hormone Receptor Extracellular signal molecule Signaling cascade Steroid hormone Synaptic signaling Paracrine signaling Second messenger Autocrine signaling Nuclear receptor family Interaction domain Contact-dependent signaling Definitions 1. General term for a protein that binds a specific extracellular molecule (ligand) and initiates a response in a cell. 2. Communicating cell-cell junction that allows ions and small molecules to pass from the cytoplasm of one cell to the cytoplasm of another. 3. Hydrophobic signaling molecule with characteristic four-ringed structure derived from cholesterol. 4. Short-range cell-cell communication via secreted local mediators that act on adjacent cells. 5. Specialised cell that secretes a hormone into the blood 6. The cell responds to its own secreted molecules. 7. Small signaling molecule secreted by the presynaptic nerve cell at a chemical synapse to relay the signal to the postsynaptic cell. 8. Small molecule that is formed in the cytosol, or released into it, in response to an extracellular signal, and that helps to relay the signal in the interior of the cell. 9. Molecule from the outside of the cell that communicates the behaviour or actions of other cells in the environment and elicits an appropriate response. 10. General term for a signal relay chain containing multiple amplification steps. 1 Receptor 2 Gap junction 3 Steroid hormone 4 Paracrine signaling 5 Endocrine cell 6 Autocrine signaling 7 Neurotransmitter 8 Second messenger 9 Extracellular signal molecule 10 Signaling cascade Multiple Choice Questions 1.
    [Show full text]
  • Using Small Molecules to Control Stem Cell Growth and Differentiation
    Using Small Molecules to Control Stem Cell Growth and Differentiation Kirsty E. Clarke1, Victoria B. Christie1, Andy Whiting2 Introduction and Stefan A. Przyborski1 1Biological Sciences, Durham University, Small molecules are routinely used to manipulate signaling South Road, Durham, DH1 3LE, UK pathways during the in vitro culture of cells. Signaling pathways that control cell proliferation and differentiation are important 2 Department of Chemistry, Durham University, targets for small molecules in the culture of stem cells. Targeting South Road, Durham, DH1 3LE, UK pathways such as the canonical Wnt, transforming growth Correspondence e-mail: [email protected] factor-β (TGF-β) and retinoic acid signaling pathways can be Kirsty Clarke and Victoria Christie are research scientists useful to enhance and maintain the proliferation of stem cells, within the laboratory of Professor Stefan Przyborski at Durham or to guide stem cell fate toward specific lineages in con trolled University. Research in Professor Przyborski’s group focuses on differentiation. This review provides a brief overview of the the development and application of technology to improve the small molecules that interact with the primary signaling path- growth, differentiation and function of cultured cells, including ways that govern stem cell proliferation and differentiation the use of small molecules that control stem cell fate. to mediate stem cell behavior, along with the role of small Professor Whiting is an organic chemist and a long-standing molecules in the dedifferentiation of somatic cells to create collaborator of Professor Przyborski, working on the design and populations of pluripotent stem cells. development of small molecules to control cellular behavior.
    [Show full text]
  • Combating Small Molecule Aggregation with Machine Learning
    Combating small molecule aggregation with machine learning Kuan Lee1, Ann Yang1, Yen-Chu Lin1,*, Daniel Reker2, Gonçalo J. L. Bernardes3,4, Tiago Rodrigues4,5,* 1. Insilico Medicine Taiwan Ltd., Keelung Rd., Xinyi Dist., Taipei, Taiwan 2. Department of Biomedical Engineering, Duke University, Durham, NC 27708, USA. 3. Department of Chemistry, University of Cambridge, Lensfield Road, CB2 1EW, Cambridge, UK. 4. Instituto de Medicina Molecular, Faculdade de Medicina da Universidade de Lisboa, Av. Prof. Egas Moniz 1649-028 Lisboa, Portugal. 5. Instituto de Investigação do Medicamento (iMed.ULisboa), Faculdade de Farmácia, Universidade de Lisboa, Av. Prof. Gama Pinto, 1649-003 Lisboa, Portugal. * Corresponding authors: [email protected] [email protected] Abstract Biological screens are plagued by false positive hits resulting from aggregation. Thus, methods to triage small colloidally aggregating molecules (SCAMs) are in high demand. Herein, we disclose a bespoke machine-learning tool to confidently and intelligibly flag such entities. Our data demonstrate an unprecedented utility of machine learning for predicting SCAMs, achieving 80% of correct predictions in a challenging out-of-sample validation. The tool outperformed a panel of expert chemists, who correctly predicted 61±7% of the same test molecules in a Turing-like test. Further, the computational routine provided insight into molecular features governing aggregation that had remained hidden to expert intuition. Leveraging our tool, we quantify that up to 15–20% of ligands in publicly available chemogenomic databases have the high potential to aggregate at typical screening concentrations, imposing caution in systems biology and drug design programs. Our approach provides a means to augment human intuition, mitigate attrition and a pathway to accelerate future molecular medicine.
    [Show full text]
  • Biological Product Definitions
    Biological Product Definitions What is a biological product? Reference Product Biological products are regulated by the Food and Drug A reference product is the single biological Administration (FDA) and are used to diagnose, prevent, product, already approved by FDA, against which a proposed biosimilar product is compared treat, and cure diseases and medical conditions. Biological products are a diverse category of products and are generally large, complex molecules. These products may Biosimilar Product be produced through biotechnology in a living system, A biosimilar is a biological product that is such as a microorganism, plant cell, or animal cell, highly similar and has no clinically meaningful and are often more difficult to characterize than small differences from an existing FDA-approved molecule drugs. There are many types of biological reference product products approved for use in the United States, including therapeutic proteins (such as filgrastim), monoclonal Interchangeable Product antibodies (such as adalimumab), and vaccines (such as An interchangeable product is a biosimilar those for influenza and tetanus). product that meets additional requirements The nature of biological products, including the inherent variations that can result from the manufacturing process, can present challenges in characterizing and What does it mean to be manufacturing these products that often do not exist in the development of small molecule drugs. Slight differences “highly similar”? between manufactured lots of the same biological product (i.e., acceptable within-product variations) are normal A manufacturer developing a proposed biosimilar and expected within the manufacturing process. As part demonstrates that its product is highly similar to of its review, FDA assesses the manufacturing process the reference product by extensively analyzing (i.e., and the manufacturer’s strategy to control within-product characterizing) the structure and function of both the variations.
    [Show full text]