Statistics and Chemometrics for Analytical Chemistry

Total Page:16

File Type:pdf, Size:1020Kb

Statistics and Chemometrics for Analytical Chemistry 0131291920_COVER 3/2/05 10:48 AM Page 1 Statistics This popular textbook gives a clear account of the principles of the Statistics and main statistical methods used in modern analytical laboratories. Such Chemometrics methods underpin high quality analyses in areas such as the safety of food, water and medicines, environmental monitoring, and chemical and manufacturing. The treatment throughout emphasises the underlying for Analytical statistical ideas, and no detailed knowledge of mathematics is required. Chemometrics Chemistry There are numerous worked examples, including the use of Microsoft Statistics and Excel and Minitab, and a large number of student exercises, many of them based on examples from the analytical literature. Fifth edition key features Chemometrics • expanded treatment of control charts • additions to cover single point calibration and method comparison techniques for Analytical • extended treatment of robust methods for • major additions to sections on multivariate regression • numerous worked examples, using Microsoft Excel and Minitab Analytical Chemistry Chemistry • an attractive two-colour text design • updated Instructors’ Manual • improved website, www.pearsoned.co.uk/miller, including examples for lecturers and students James N Miller This book is aimed at undergraduate and graduate courses in Analytical Chemistry and related topics. It will also be a valuable & Jane C Miller resource for researchers and chemists working in analytical chemistry. Professor James Miller – is Emeritus Professor of Analytical Chemistry at Loughborough Fifth edition University. He has published numerous reviews and papers on analytical techniques and been awarded the SAC Silver Medal, the Theophilus Redwood Lectureship and the SAC Gold Medal by the Royal Miller Miller & Society of Chemistry. A past President of the Analytical Division of the RSC, he is a member of the Society’s Council and has served on the editorial boards of many analytical and spectroscopic journals. Dr Jane Miller – completed a PhD at Cambridge University’s Cavendish Laboratory and is an experienced teacher of mathematics and physics at higher education and 6th form levels. She holds an MSc in Applied Statistics and is the author of several specialist A-level statistics texts. Fifth edition www.pearson-books.com www.pearsoned.co.uk/miller SCA_A01.qxd 4/7/05 5:32 PM Page i Statistics and Chemometrics for Analytical Chemistry Fifth Edition SCA_A01.qxd 4/7/05 5:32 PM Page ii We work with leading authors to develop the strongest educational materials in chemistry, bringing cutting-edge thinking and best learning practice to a global market. Under a range of well-known imprints, including Prentice Hall, we craft high quality print and electronic publications which help readers to understand and apply their content, whether studying or at work. To find out more about the complete range of our publishing please visit us on the World Wide Web at: www.pearsoned.co.uk SCA_A01.qxd 4/7/05 5:33 PM Page iii James N. Miller Jane C. Miller Statistics and Chemometrics for Analytical Chemistry Fifth Edition SCA_A01.qxd 4/7/05 5:33 PM Page iv Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Visit us on the World Wide Web at: www.pearsoned.co.uk Third edition published under the Ellis Horwood imprint 1993 Fourth edition 2000 Fifth edition 2005 © Ellis Horwood Limited 1993 © Pearson Education Limited 2000, 2005 The rights of J. N. Miller and J. C. Miller to be identified as authors of this Work have been asserted by them in accordance with the Copyright, Designs and Patents Act 1988. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without either the prior written permission of the Publishers or a licence permitting restricted copying in the United Kingdom issued by the Copyright Licensing Agency Ltd, 90 Tottenham Court Road, London W1T 4LP. All trademarks used herein are the property of their respective owners. The use of any trademark in this text does not vest in the author or publisher any trademark ownership rights in such trademarks, nor does the use of such trademarks imply any affiliation with or endorsement of this book by such owners. Microsoft is a trademark of Microsoft Corporation; MINITAB is a registered trademark of Minitab, Inc.; The Unscrambler is a trademark of Camo ASA. ISBN 0 131 29192 0 British Library Cataloguing-in-Publication Data A catalogue record for this book can be obtained from the British Library Library of Congress Cataloging-in-Publication Data Miller, J. N. (James N.), 1943– Statistics and chemometrics for analytical chemistry / James N. Miller and Jane C. Miller. — 5th ed. p. cm. Includes bibliographical references. ISBN 0-13-129192-0 (pbk.) 1. Chemistry, Analytic — Statistical methods — Textbooks. I. Miller, J. C. (Jane Charlotte) II. Title. QD75.4.S8M54 2005 543'.072 — dc22 2004060168 10987654321 09 08 07 06 05 Typeset by 68 in 9.25/12pt Stone Serif Printed in Great Britain by Ashford Colour Press, Gosport, Hants SCA_A01.qxd 4/7/05 5:33 PM Page v Contents Preface to the fifth edition ix Preface to the first edition xi Acknowledgements xiii Glossary of symbols xv 1 Introduction 1 1.1 Analytical problems 1 1.2 Errors in quantitative analysis 2 1.3 Types of error 3 1.4 Random and systematic errors in titrimetric analysis 6 1.5 Handling systematic errors 9 1.6 Planning and design of experiments 12 1.7 Calculators and computers in statistical calculations 13 Bibliography 15 Exercises 16 2 Statistics of repeated measurements 18 2.1 Mean and standard deviation 18 2.2 The distribution of repeated measurements 20 2.3 Log-normal distribution 24 2.4 Definition of a ‘sample’ 24 2.5 The sampling distribution of the mean 26 2.6 Confidence limits of the mean for large samples 27 2.7 Confidence limits of the mean for small samples 28 2.8 Presentation of results 29 2.9 Other uses of confidence limits 31 2.10 Confidence limits of the geometric mean for a log-normal distribution 31 2.11 Propagation of random errors 32 2.12 Propagation of systematic errors 35 Bibliography 37 Exercises 37 SCA_A01.qxd 4/7/05 5:33 PM Page vi vi Contents 3 Significance tests 39 3.1 Introduction 39 3.2 Comparison of an experimental mean with a known value 39 3.3 Comparison of two experimental means 41 3.4 Paired t-test 45 3.5 One-sided and two-sided tests 47 3.6 F-test for the comparison of standard deviations 49 3.7 Outliers 51 3.8 Analysis of variance 54 3.9 Comparison of several means 55 3.10 The arithmetic of ANOVA calculations 58 3.11 The chi-squared test 61 3.12 Testing for normality of distribution 63 3.13 Conclusions from significance tests 67 Bibliography 69 Exercises 69 4 The quality of analytical measurements 74 4.1 Introduction 74 4.2 Sampling 74 4.3 Separation and estimation of variances using ANOVA 76 4.4 Sampling strategy 77 4.5 Quality control methods – Introduction 78 4.6 Shewhart charts for mean values 79 4.7 Shewhart charts for ranges 81 4.8 Establishing the process capability 83 4.9 Average run length: cusum charts 86 4.10 Zone control charts (J-charts) 89 4.11 Proficiency testing schemes 90 4.12 Collaborative trials 93 4.13 Uncertainty 98 4.14 Acceptance sampling 102 Bibliography 103 Exercises 104 5 Calibration methods: regression and correlation 107 5.1 Introduction: instrumental analysis 107 5.2 Calibration graphs in instrumental analysis 108 5.3 The product–moment correlation coefficient 110 5.4 The line of regression of y on x 114 5.5 Errors in the slope and intercept of the regression line 115 5.6 Calculation of a concentration and its random error 118 5.7 Limits of detection 121 5.8 The method of standard additions 124 5.9 Use of regression lines for comparing analytical methods 126 5.10 Weighted regression lines 131 5.11 Intersection of two straight lines 135 SCA_A01.qxd 4/7/05 5:33 PM Page vii Contents vii 5.12 ANOVA and regression calculations 136 5.13 Curvilinear regression methods – Introduction 138 5.14 Curve fitting 141 5.15 Outliers in regression 145 Bibliography 146 Exercises 147 6 Non-parametric and robust methods 150 6.1 Introduction 150 6.2 The median: initial data analysis 151 6.3 The sign test 156 6.4 The Wald–Wolfowitz runs test 158 6.5 The Wilcoxon signed rank test 159 6.6 Simple tests for two independent samples 162 6.7 Non-parametric tests for more than two samples 165 6.8 Rank correlation 167 6.9 Non-parametric regression methods 169 6.10 Robust methods – Introduction 171 6.11 Robust estimates of location and spread 173 6.12 Robust regression methods 175 6.13 Re-sampling statistics 176 6.14 Conclusions 178 Bibliography 179 Exercises 179 7 Experimental design and optimization 181 7.1 Introduction 181 7.2 Randomization and blocking 182 7.3 Two-way ANOVA 183 7.4 Latin squares and other designs 186 7.5 Interactions 187 7.6 Factorial versus one-at-a-time design 192 7.7 Factorial design and optimization 193 7.8 Optimization: basic principles and univariate methods 197 7.9 Optimization using the alternating variable search method 200 7.10 The method of steepest ascent 202 7.11 Simplex optimization 205 7.12 Simulated annealing 209 Bibliography 209 Exercises 210 8 Multivariate analysis 213 8.1 Introduction 213 8.2 Initial analysis 214 8.3 Principal component analysis 215 8.4 Cluster analysis 220 8.5 Discriminant analysis 223 SCA_A01.qxd 4/7/05 5:33 PM Page viii viii Contents 8.6 K-nearest neighbour method 227 8.7
Recommended publications
  • Real-Time Wavelet Compression and Self-Modeling Curve
    REAL-TIME WAVELET COMPRESSION AND SELF-MODELING CURVE RESOLUTION FOR ION MOBILITY SPECTROMETRY A dissertation presented to the faculty of the College of Arts and Sciences of Ohio University In partial fulfillment of the requirements for the degree Doctor of Philosophy Guoxiang Chen March 2003 This dissertation entitled REAL-TIME WAVELET COMPRESSION AND SELF-MODELING CURVE RESOLUTION FOR ION MOBILITY SPECTROMETRY BY GUOXIANG CHEN has been approved for the Department of Chemistry and Biochemistry and the College of Arts and Sciences by Peter de B. Harrington Associate Professor of Chemistry and Biochemistry Leslie A. Flemming Dean, College of Arts and Sciences CHEN, GUOXIANG. Ph.D. March 2003. Analytical Chemistry Real-Time Wavelet Compression and Self-Modeling Curve Resolution for Ion Mobility Spectrometry (203 pp.) Director of Dissertation: Peter de B. Harrington Chemometrics has proven useful for solving chemistry problems. Most of the chemometric methods are applied in post-run analyses, for which data are processed after being collected and archived. However, in many applications, real-time processing is required to obtain knowledge underlying complex chemical systems instantly. Moreover, real-time chemometrics can eliminate the storage burden for large amounts of raw data that occurs in post-run analyses. These attributes are important for the construction of portable intelligent instruments. Ion mobility spectrometry (IMS) furnishes inexpensive, sensitive, fast, and portable sensors that afford a wide variety of potential applications. SIMPLe-to-use Interactive Self-modeling Mixture Analysis (SIMPLISMA) is a self-modeling curve resolution method that has been demonstrated as an effective tool for enhancing IMS measurements. However, all of the previously reported studies have applied SIMPLISMA as a post-run tool.
    [Show full text]
  • Curriculum Vitae
    Jeff Goldsmith 722 W 168th Street, 6th floor New York, NY 10032 jeff[email protected] Date of Preparation April 20, 2021 Academic Appointments / Work Experience 06/2018{Present Department of Biostatistics Mailman School of Public Health, Columbia University Associate Professor 06/2012{05/2018 Department of Biostatistics Mailman School of Public Health, Columbia University Assistant Professor 01/2009{12/2010 Department of Biostatistics Bloomberg School of Public Health, Johns Hopkins University Research Assistant (R01NS060910) 01/2008{12/2009 Department of Biostatistics Bloomberg School of Public Health, Johns Hopkins University Research Assistant (U19 AI060614 and U19 AI082637) Education 08/2007{05/2012 Johns Hopkins University PhD in Biostatistics, May 2012 Thesis: Statistical Methods for Cross-sectional and Longitudinal Functional Observations Advisors: Ciprian Crainiceanu and Brian Caffo 08/2003{05/2007 Dickinson College BS in Mathematics, May 2007 Jeff Goldsmith 2 Honors 04/2021 Dean's Excellence in Leadership Award 03/2021 COPSS Leadership Academy For Emerging Leaders in Statistics 06/2017 Tow Faculty Scholar 01/2016 Public Voices Fellow 10/2013 Calderone Junior Faculty Prize 05/2012 ASA Biometrics Section Travel Award 12/2011 Invited Paper in \Highlights of JCGS" Session at Interface 05/2011 Margaret Merrell Award for Outstanding Research by a Biostatistics Doc- toral Student 05/2011 School-wide Teaching Assistant Recognition Award 05/2011 Helen Abbey Award for Excellence in Teaching 03/2011 ENAR Distinguished Student Paper Award 05/2010 Jane and Steve Dykacz Award for Outstanding Paper in Medical Statistics 05/2009 Nominated for School-wide Teaching Assistant Recognition Award 08/2007{05/2012 Sommer Scholar 05/2007 James Fowler Rusling Prize 05/2007 Lance E.
    [Show full text]
  • Comparison of Harmonic, Geometric and Arithmetic Means for Change Detection in SAR Time Series Guillaume Quin, Béatrice Pinel-Puysségur, Jean-Marie Nicolas
    Comparison of Harmonic, Geometric and Arithmetic means for change detection in SAR time series Guillaume Quin, Béatrice Pinel-Puysségur, Jean-Marie Nicolas To cite this version: Guillaume Quin, Béatrice Pinel-Puysségur, Jean-Marie Nicolas. Comparison of Harmonic, Geometric and Arithmetic means for change detection in SAR time series. EUSAR. 9th European Conference on Synthetic Aperture Radar, 2012., Apr 2012, Germany. hal-00737524 HAL Id: hal-00737524 https://hal.archives-ouvertes.fr/hal-00737524 Submitted on 2 Oct 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. EUSAR 2012 Comparison of Harmonic, Geometric and Arithmetic Means for Change Detection in SAR Time Series Guillaume Quin CEA, DAM, DIF, F-91297 Arpajon, France Béatrice Pinel-Puysségur CEA, DAM, DIF, F-91297 Arpajon, France Jean-Marie Nicolas Telecom ParisTech, CNRS LTCI, 75634 Paris Cedex 13, France Abstract The amplitude distribution in a SAR image can present a heavy tail. Indeed, very high–valued outliers can be observed. In this paper, we propose the usage of the Harmonic, Geometric and Arithmetic temporal means for amplitude statistical studies along time. In general, the arithmetic mean is used to compute the mean amplitude of time series.
    [Show full text]
  • University of Cincinnati
    UNIVERSITY OF CINCINNATI Date:___________________ I, _________________________________________________________, hereby submit this work as part of the requirements for the degree of: in: It is entitled: This work and its defense approved by: Chair: _______________________________ _______________________________ _______________________________ _______________________________ _______________________________ Gibbs Sampling and Expectation Maximization Methods for Estimation of Censored Values from Correlated Multivariate Distributions A dissertation submitted to the Division of Research and Advanced Studies of the University of Cincinnati in partial ful…llment of the requirements for the degree of DOCTORATE OF PHILOSOPHY (Ph.D.) in the Department of Mathematical Sciences of the McMicken College of Arts and Sciences May 2008 by Tina D. Hunter B.S. Industrial and Systems Engineering The Ohio State University, Columbus, Ohio, 1984 M.S. Aerospace Engineering University of Cincinnati, Cincinnati, Ohio, 1989 M.S. Statistics University of Cincinnati, Cincinnati, Ohio, 2003 Committee Chair: Dr. Siva Sivaganesan Abstract Statisticians are often called upon to analyze censored data. Environmental and toxicological data is often left-censored due to reporting practices for mea- surements that are below a statistically de…ned detection limit. Although there is an abundance of literature on univariate methods for analyzing this type of data, a great need still exists for multivariate methods that take into account possible correlation amongst variables. Two methods are developed here for that purpose. One is a Markov Chain Monte Carlo method that uses a Gibbs sampler to es- timate censored data values as well as distributional and regression parameters. The second is an expectation maximization (EM) algorithm that solves for the distributional parameters that maximize the complete likelihood function in the presence of censored data.
    [Show full text]
  • Primena Statistike U Kliničkim Istraţivanjima Sa Osvrtom Na Korišćenje Računarskih Programa
    UNIVERZITET U BEOGRADU MATEMATIČKI FAKULTET Dušica V. Gavrilović Primena statistike u kliničkim istraţivanjima sa osvrtom na korišćenje računarskih programa - Master rad - Mentor: prof. dr Vesna Jevremović Beograd, 2013. godine Zahvalnica Ovaj rad bi bilo veoma teško napisati da nisam imala stručnu podršku, kvalitetne sugestije i reviziju, pomoć prijatelja, razumevanje kolega i beskrajnu podršku porodice. To su razlozi zbog kojih želim da se zahvalim: . Mom mentoru, prof. dr Vesni Jevremović sa Matematičkog fakulteta Univerziteta u Beogradu, koja je bila ne samo idejni tvorac ovog rada već i dugogodišnja podrška u njegovoj realizaciji. Njena neverovatna upornost, razne sugestije, neiscrpni optimizam, profesionalizam i razumevanje, predstavljali su moj stalni izvor snage na ovom master-putu. Članu komisije, doc. dr Zorici Stanimirović sa Matematičkog fakulteta Univerziteta u Beogradu, na izuzetnoj ekspeditivnosti, stručnoj recenziji, razumevanju, strpljenju i brojnim korisnim savetima. Članu komisije, mr Marku Obradoviću sa Matematičkog fakulteta Univerziteta u Beogradu, na stručnoj i prijateljskoj podršci kao i spremnosti na saradnju. Dipl. mat. Radojki Pavlović, šefu studentske službe Matematičkog fakulteta Univerziteta u Beogradu, na upornosti, snalažljivosti i kreativnosti u pronalaženju raznih ideja, predloga i rešenja na putu realizacije ovog master rada. Dugogodišnje prijateljstvo sa njom oduvek beskrajno cenim i oduvek mi mnogo znači. Dipl. mat. Zorani Bizetić, načelniku Data Centra Instituta za onkologiju i radiologiju Srbije, na upornosti, idejama, detaljnoj reviziji, korisnim sugestijama i svakojakoj podršci. Čak i kada je neverovatno ili dosadno ili pametno uporna, mnogo je i dugo volim – skoro ceo moj život. Mast. biol. Jelici Novaković na strpljenju, reviziji, bezbrojnim korekcijama i tehničkoj podršci svake vrste. Hvala na osmehu, budnom oku u sitne sate, izvrsnoj hrani koja me je vraćala u život, nes-kafi sa penom i transfuziji energije kada sam bila na rezervi.
    [Show full text]
  • Comparison of Hyperspectral Imaging and Near-Infrared Spectroscopy to Determine Nitrogen and Carbon Concentrations in Wheat
    remote sensing Article Comparison of Hyperspectral Imaging and Near-Infrared Spectroscopy to Determine Nitrogen and Carbon Concentrations in Wheat Iman Tahmasbian 1,* , Natalie K. Morgan 2, Shahla Hosseini Bai 3, Mark W. Dunlop 1 and Amy F. Moss 2 1 Department of Agriculture and Fisheries, Queensland Government, Toowoomba, QLD 4350, Australia; Scopus affiliation ID: 60028929; [email protected] 2 School of Environmental and Rural Science, University of New England, Armidale, NSW 2351, Australia; [email protected] (N.K.M.); [email protected] (A.F.M.) 3 Centre for Planetary Health and Food Security, School of Environment and Science, Griffith University, Brisbane, QLD 4111, Australia; s.hosseini-bai@griffith.edu.au * Correspondence: [email protected] Abstract: Hyperspectral imaging (HSI) is an emerging rapid and non-destructive technology that has promising application within feed mills and processing plants in poultry and other intensive animal industries. HSI may be advantageous over near infrared spectroscopy (NIRS) as it scans entire samples, which enables compositional gradients and sample heterogenicity to be visualised and analysed. This study was a preliminary investigation to compare the performance of HSI with that of NIRS for quality measurements of ground samples of Australian wheat and to identify the most important spectral regions for predicting carbon (C) and nitrogen (N) concentrations. In total, 69 samples were scanned using an NIRS (400–2500 nm), and two HSI cameras operated in Citation: Tahmasbian, I.; Morgan, 400–1000 nm (VNIR) and 1000–2500 nm (SWIR) spectral regions. Partial least square regression N.K; Hosseini Bai, S.; Dunlop, M.W; (PLSR) models were used to correlate C and N concentrations of 63 calibration samples with their Moss, A.F Comparison of spectral reflectance, with 6 additional samples used for testing the models.
    [Show full text]
  • Construction of a Neuro-Immune-Cognitive Pathway-Phenotype Underpinning the Phenome Of
    Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 20 October 2019 doi:10.20944/preprints201910.0239.v1 Construction of a neuro-immune-cognitive pathway-phenotype underpinning the phenome of deficit schizophrenia. Hussein Kadhem Al-Hakeima, Abbas F. Almullab, Arafat Hussein Al-Dujailic, Michael Maes d,e,f. a Department of Chemistry, College of Science, University of Kufa, Iraq. E-mail: [email protected]. b Medical Laboratory Technology Department, College of Medical Technology, The Islamic University, Najaf, Iraq. E-mail: [email protected]. c Senior Clinical Psychiatrist, Faculty of Medicine, Kufa University. E-mail: [email protected]. d* Department of Psychiatry, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand; e Department of Psychiatry, Medical University of Plovdiv, Plovdiv, Bulgaria; f IMPACT Strategic Research Centre, Deakin University, PO Box 281, Geelong, VIC, 3220, Australia. Corresponding author Prof. Dr. Michael Maes, M.D., Ph.D. Department of Psychiatry Faculty of Medicine Chulalongkorn University Bangkok Thailand 1 © 2019 by the author(s). Distributed under a Creative Commons CC BY license. Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 20 October 2019 doi:10.20944/preprints201910.0239.v1 https://scholar.google.co.th/citations?user=1wzMZ7UAAAAJ&hl=th&oi=ao [email protected]. 2 Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 20 October 2019 doi:10.20944/preprints201910.0239.v1 Abstract In schizophrenia, pathway-genotypes may be constructed by combining interrelated immune biomarkers with changes in specific neurocognitive functions that represent aberrations in brain neuronal circuits. These constructs provide insight on the phenome of schizophrenia and show how pathway-phenotypes mediate the effects of genome X environmentome interactions on the symptomatology/phenomenology of schizophrenia.
    [Show full text]
  • Multivariate Chemometrics As a Strategy to Predict the Allergenic Nature of Food Proteins
    S S symmetry Article Multivariate Chemometrics as a Strategy to Predict the Allergenic Nature of Food Proteins Miroslava Nedyalkova 1 and Vasil Simeonov 2,* 1 Department of Inorganic Chemistry, Faculty of Chemistry and Pharmacy, University of Sofia, 1 James Bourchier Blvd., 1164 Sofia, Bulgaria; [email protected]fia.bg 2 Department of Analytical Chemistry, Faculty of Chemistry and Pharmacy, University of Sofia, 1 James Bourchier Blvd., 1164 Sofia, Bulgaria * Correspondence: [email protected]fia.bg Received: 3 September 2020; Accepted: 21 September 2020; Published: 29 September 2020 Abstract: The purpose of the present study is to develop a simple method for the classification of food proteins with respect to their allerginicity. The methods applied to solve the problem are well-known multivariate statistical approaches (hierarchical and non-hierarchical cluster analysis, two-way clustering, principal components and factor analysis) being a substantial part of modern exploratory data analysis (chemometrics). The methods were applied to a data set consisting of 18 food proteins (allergenic and non-allergenic). The results obtained convincingly showed that a successful separation of the two types of food proteins could be easily achieved with the selection of simple and accessible physicochemical and structural descriptors. The results from the present study could be of significant importance for distinguishing allergenic from non-allergenic food proteins without engaging complicated software methods and resources. The present study corresponds entirely to the concept of the journal and of the Special issue for searching of advanced chemometric strategies in solving structural problems of biomolecules. Keywords: food proteins; allergenicity; multivariate statistics; structural and physicochemical descriptors; classification 1.
    [Show full text]
  • Copula-Based Analysis of Dependent Data with Censoring and Zero Inflation
    Copula-Based Analysis of Dependent Data with Censoring and Zero Inflation by Fuyuan Li B.S. in Telecommunication Engineering, May 2012, Beijing University of Technology M.S. in Statistics, May 2014, The George Washington University A Dissertation submitted to The Faculty of The Columbian College of Arts and Sciences of The George Washington University in partial satisfaction of the requirements for the degree of Doctor of Philosophy January 10, 2019 Dissertation directed by Huixia J. Wang Professor of Statistics The Columbian College of Arts and Sciences of The George Washington University certifies that Fuyuan Li has passed the Final Examination for the degree of Doctor of Philosophy as of December 7, 2018. This is the final and approved form of the dissertation. Copula-Based Analysis of Dependent Data with Censoring and Zero Inflation Fuyuan Li Dissertation Research Committee: Huixia J. Wang, Professor of Statistics, Dissertation Director Tapan K. Nayak, Professor of Statistics, Committee Member Reza Modarres, Professor of Statistics, Committee Member ii c Copyright 2019 by Fuyuan Li All rights reserved iii Acknowledgments This work would not have been possible without the financial support of the National Science Foundation grant DMS-1525692, and the King Abdullah University of Science and Technology office of Sponsored Research award OSR-2015-CRG4-2582. I am grateful to all of those with whom I have had the pleasure to work during this and other related projects. Each of the members of my Dissertation Committee has provided me extensive personal and professional guidance and taught me a great deal about both scientific research and life in general.
    [Show full text]
  • Predicting Plant Species Richness and Vegetation Patterns in Cultural Landscapes Using Disturbance Parameters
    Agriculture, Ecosystems and Environment 122 (2007) 446–452 www.elsevier.com/locate/agee Predicting plant species richness and vegetation patterns in cultural landscapes using disturbance parameters C. Buhk a,*, V. Retzer b, C. Beierkuhnlein b, A. Jentsch a a Disturbance Ecology and Vegetation Dynamics, Helmholtz Centre for Environmental Research – UFZ, Permoserstr. 15, 04318 Leipzig, Germany and Bayreuth University, 95440 Bayreuth, Germany b Biogeography, University of Bayreuth, Universita¨tsstr. 30, 95440 Bayreuth, Germany Received 1 August 2006; received in revised form 22 February 2007; accepted 27 February 2007 Available online 24 April 2007 Abstract A new methodological framework for plant diversity assessment at the landscape scale is presented that exhibits the following strengths: (1) potential for easily standardizable sampling procedure; (2) characterization of disturbance regime; (3) use of selected disturbance descriptors as explanatory variables which probably allow for better transferability than site specific land use types—for example, to evaluate the emerging use of energy plants that pose novel management challenges without historic precedence to many landscapes; (4) analysis of quantitative and qualitative aspects of plant species diversity (alpha and beta diversity). For data analysis, a powerful regression method (PLS- R) was applied. On this basis, after further validation and transferability tests, a practical tool for the development and validation of effective agri-environmental programmes may be developed. # 2007 Elsevier B.V. All rights reserved. Keywords: Agri-environmental programmes; Alpha diversity; Beta diversity; Fichtelgebirge; Land use; Disturbance; Pattern analysis 1. Introduction paid per ha of arable land or grassland managed, and are no longer subsidized according to the quantity of certain goods The EU Sustainable Development Strategy, launched by produced or land use activity maintained.
    [Show full text]
  • Principal Component Analysis
    Tutorial n Chemometrics and Intelligent Laboratory Systems, 2 (1987) 37-52 Elsevier Science Publishers B.V., Amsterdam - Printed in The Netherlands Principal Component Analysis SVANTE WOLD * Research Group for Chemometrics, Institute of Chemistry, Umei University, S 901 87 Urned (Sweden) KIM ESBENSEN and PAUL GELADI Norwegian Computing Center, P.B. 335 Blindern, N 0314 Oslo 3 (Norway) and Research Group for Chemometrics, Institute of Chemistry, Umed University, S 901 87 Umeci (Sweden) CONTENTS Intr~uction: history of principal component analysis ...... ............... ....... 37 Problem definition for multivariate data ................ ............... ....... 38 A chemical example .............................. ............... ....... 40 Geometric interpretation of principal component analysis ... ............... ....... 41 Majestical defi~tion of principal component analysis .... ............... ....... 41 Statistics; how to use the residuals .................... ............... ....... 43 Plots ......................................... ............... ....... 44 Applications of principal component analysis ............ ............... ....... 46 8.1 Overview (plots) of any data table ................. ............... ..*... 46 8.2 Dimensionality reduction ....................... ............... 46 8.3 Similarity models ............................. ............... , . 47 Data pre-treatment .............................. ............... *....s. 47 10 Rank, or dim~sion~ty, of a principal components model. .......................... 48
    [Show full text]
  • Incorporating a Geometric Mean Formula Into The
    Calculating the CPI Incorporating a geometric mean formula into the CPI Beginning in January 1999, a new geometric mean formula will replace the current Laspeyres formula in calculating most basic components of the Consumer Price Index; the new formula will better account for the economic substitution behavior of consumers 2 Kenneth V. Dalton, his article describes an important improve- bias” in the CPI. This upward bias was a techni- John S. Greenlees, ment in the calculation of the Consumer cal problem that tied the weight of a CPI sample and TPrice Index (CPI). The Bureau of Labor Sta- item to its expected price change. The flaw was Kenneth J. Stewart tistics plans to use a new geometric mean for- effectively eliminated by changes to the CPI mula for calculating most of the basic compo- sample rotation and substitution procedures and nents of the Consumer Price Index for all Urban to the functional form used to calculate changes Consumers (CPI-U) and the Consumer Price In- in the cost of shelter for homeowners. In 1997, a dex for Urban Wage Earners and Clerical Work- new approach to the measurement of consumer ers (CPI-W). This change will become effective prices for hospital services was introduced.3 Pric- with data for January 1999.1 ing procedures were revised, from pricing indi- The geometric mean formula will be used in vidual items (such as a unit of blood or a hospi- index categories that make up approximately 61 tal inpatient day) to pricing the combined sets of percent of total consumer spending represented goods and services provided on selected patient by the CPI-U.
    [Show full text]