Improved Model for Fullerene C60 Solubility in Organic Solvents Based on Quantum-Chemical and Topological Descriptors

Improved Model for Fullerene C60 Solubility in Organic Solvents Based on Quantum-Chemical and Topological Descriptors

J Nanopart Res (2011) 13:3235–3247 DOI 10.1007/s11051-011-0238-x RESEARCH PAPER Improved model for fullerene C60 solubility in organic solvents based on quantum-chemical and topological descriptors Tetyana Petrova • Bakhtiyor F. Rasulev • Andrey A. Toropov • Danuta Leszczynska • Jerzy Leszczynski Received: 10 April 2009 / Accepted: 17 January 2011 / Published online: 5 February 2011 Ó Springer Science+Business Media B.V. 2011 Abstract Fullerenes are sparingly soluble in many energy (HOMO), certain heteroatom fragments, and solvents. The dependence of fullerene’s solubility on geometrical parameters with solubility. Several other molecular structure of the solvent must be understood important parameters of solvents that affect the C60 in order to manage efficiently this class of compounds. solubility have been also evaluated by the QSPR To find such dependency ab initio quantum-chemical analysis. The employed GA-MLRA approach calculations in combination with quantitative struc- enhanced by application of quantum-chemical calcu- ture–property relationship (QSPR) tool were used to lations yields reliable results, allowing one to build model the solubility of fullerene C60 in 122 organic simple, interpretable models that can be used for solvents. A genetic algorithm and multiple regression predictions of C60 solubility in various organic analysis (GA-MLRA) were applied to generate cor- solvents. relation models. The best performance is accom- plished by the four-variable MLRA model with Keywords C60 Á Fullerene Á Solubility Á QSPR Á 2 prediction coefficient rtest = 0.903. This study reveals DFT Á Quantum-chemical descriptors Á a correlation of highest occupied molecular orbital Modeling and simulation Á Predictive method Electronic supplementary material The online version of this article (doi:10.1007/s11051-011-0238-x) contains supplementary material, which is available to authorized users. Introduction T. Petrova Á B. F. Rasulev (&) Á J. Leszczynski It is well known that the fullerenes are sparingly Interdisciplinary Center for Nanotoxicity, Department soluble in various solvents (Ruoff et al. 1993). This of Chemistry and Biochemistry, Jackson State University, 1400 J. R. Lynch Street, P.O. Box 17910, Jackson, phenomenon has crucial consequences for both the MS 39217, USA basic studies on fullerenes and their industrial e-mail: [email protected] applications. The dependence of fullerene’s solubility on both molecular structure of the solvent and A. A. Toropov Istituto di Ricerche Farmacologiche Mario Negri, temperature must be understood in order to efficiently Via La Masa 19, 20156 Milano, Italy separate different members of the fullerene family from each other and from their precursors or deriv- D. Leszczynska atives (Korobov and Smith 2000). Though experi- Department of Civil and Environmental Engineering, Jackson State University, 1400 J. R. Lynch Street, mental data on fullerene solubility is available, there P.O. Box 17910, Jackson, MS 39217, USA is still no reliable theory to explain (or predict) 123 3236 J Nanopart Res (2011) 13:3235–3247 absolute values of fullerene solubility and variations (Vanin et al. 2008). However, this approach is in solubility when changing the solvent or using extremely time-consuming to apply it for large different fullerenes. This serious limitation is one of dataset of solvents and therefore not desirable to the reasons why separation of fullerenes on a large use in QSPR. There were also other articles regarding scale is still rather difficult without the aid of the solubility of C60 in different solvents (Marcus chromatography. Moreover, the data on solubility of 1997, Marcus et al. 2001; Abraham et al. 2000; the fullerene C60 in organic solvents could help to Hansen and Smith 2004; Kiss et al. 2000). The recent develop its various applications in chemistry and study in this field was published by Liu et al. (2005). technology. The authors used the biggest to date dataset of 128 At present, the only approach allowing construc- solvents. They built a model using least-squares tion of predictive model of fullerene C60 solubility in support vector machine (LSSVM) method and the different organic solvents is quantitative structure– obtained correlation coefficients were r2 = 0.761 for property relationship (QSPR) based on molecular the whole set and r2 = 0.861 for the 122 compounds descriptors (i.e., parameters) calculated with molec- (six compounds, i.e., outliers, removed). The corre- ular graphs of the solvents (Estrada and Gutman lation coefficients for the splitted dataset to training 1996; Simon 1987; Balaban 1983). The first attempt (92 compounds) and test (30 compounds) sets were to explain the trends in C60 solubility was carried out 0.910 and 0.908, respectively. The authors used by Sivaraman et al. (1992) who found a correlation CODESSA software (Katritzky et al. 1994, 1995)to between experimental fullerene’s solubility and cal- generate a set of descriptors and semiempirical culated solubility parameter of solvent by studying quantum–mechanical approach to calculate quan- fullerene’s solubility for 15 solvents. The same group tum-chemical parameters for the considered solvents. also published another study devoted to fullerene They showed that such descriptors as Randic index solubility. For a series of solvents they plotted the (order 3), relative molecular weight, HOMO-1 energy fullerene solubility versus the Hildebrand solubility (highest occupied molecular orbital), relative nega- parameter (Sivaraman et al. 1994). In another study, tive charge, average bonding information content, Ruoff et al. (1993) studied the C60 solubility in 47 and average one-electron reactivity index for a C solvents. They examined the dependence of solubility atom are involved in correlation model. In our recent on the polarizability, polarity, molecular size, and study (Toropov et al. 2007a) devoted to solubility of cohesive energy density (energy difference between C60 the QSPR model based on SMILES (Simplified solid phase and free atoms system) of the solvents. Molecular Input line Entry System) notations and However, they found no distinctive parameter that optimal descriptors was developed. This model universally explains the solubility of C60 and no allows to achieve correlation coefficient for the predictive model was built in their study. Smith et al. training set of r2 = 0.861 and for the external set of 2 (1996) employed the theoretical linear solvation rtest = 0.890. The approach implemented in the energy approach with solubility dataset of 101 current study differs considerably from the previous organic solvents. The correlation coefficient was studies because in the earlier investigations only a r2 = 0.67 and statistically significant descriptors that very simple linear model with one complex param- were revealed in their study are molar volume, Lewis eter (correlation weight based on SMILES notation) acidity, Lewis basicity, and electrostatic basicity. was used that provided a good predictive ability. In Thus, a good solvent for C60 should have a large addition, recently the authors of this study in molar volume and also be both a good Lewis acid and collaboration with the experimental biology group a good Lewis base. However, it should possess published the study where they aimed to choose a minimal polarity of the regions of negative charge. better solvent for C60 fullerene in biological systems, Though it is an interesting idea, the drawback of this based on cytotoxicity tests (Cook et al. 2010). study is too low correlation coefficient of the As it was pointed out above, there are many modeled relationship. In addition, a study has been different approaches to calculate and predict C60 published where authors aimed to apply molecular solubility in organic solvents. Some of them are fully dynamics approach to model the solubility of fuller- mechanistic (Marcus 1997, Marcus et al. 2001; ene in two solvents, water and carbon disulfide Abraham et al. 2000; Hansen and Smith 2004; 123 J Nanopart Res (2011) 13:3235–3247 3237 Stukalin et al. 2003), developed from the thermody- the chemical diversity of the considered solvent com- namical point of view; others are statistically based, pounds. The descriptor’s typology involves: (a) consti- with good correlation coefficients, but not transparent tutional (atom and group fragments), (b) functional and complicated in interpretation (Kiss et al. 2000; groups, (c) atom-centered fragments, (d) empirical, Liu et al. 2005; Toropov et al. 2007a, b, 2008, 2009). (e) topological, (f) walk counts, (g) various autocorre- In this study, we aimed to find simple, transparent lations from the molecular graph, (h) Randic molecular relationship and computationally fast approach, pos- profiles from the geometry matrix, (i) geometrical, sibly mechanistically interpretable, to predict the (j) WHIM, and (k) GETAWAY descriptors, and various solubility of C60 in various solvents. In addition, indicator descriptors. The meaning of these molecular the study intends to estimate predictive potential of descriptors and the calculation procedures are the topological descriptors and quantum-chemical summarized elsewhere (Todeschini and Consonni parameters obtained by high level ab initio calcula- 2000). tions in QSPR modeling of the fullerene C60 solubil- Considering the importance of the electronical ity in organic solvents. molecular properties for QSAR/QSPR, the additional descriptors were calculated by the second approach— the quantum-chemical descriptors (Hehre et al. 1986; Methods

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    13 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us