Next Article in Journal
Block Copolyesters Containing 2,5-Furan and trans-1,4-Cyclohexane Subunits with Outstanding Gas Barrier Properties
Previous Article in Journal
Analysis of Expression Pattern of snoRNAs in Different Cancer Types with Machine Learning Algorithms
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predicting the Toxicity of Ionic Liquids toward Acetylcholinesterase Enzymes Using Novel QSAR Models

1
School of Materials Science and Energy Engineering, Foshan University, Foshan 528000, China
2
School of Material and Chemical Engineering, Zhengzhou University of Light Industry, Zhengzhou 450001, China
3
Department of Chemical Engineering, University of California, Santa Barbara, CA 93106-5080, USA
4
Department Beijing Key Laboratory of Ionic Liquids Clean Process, Key Laboratory of Green Process and Engineering, State Key Laboratory of Multiphase Complex Systems, Institute of Process Engineering, Chinese Academy of Sciences, Beijing 100190, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2019, 20(9), 2186; https://0-doi-org.brum.beds.ac.uk/10.3390/ijms20092186
Submission received: 31 March 2019 / Revised: 26 April 2019 / Accepted: 26 April 2019 / Published: 2 May 2019
(This article belongs to the Special Issue Advances in Cheminformatics and Nanoinformatics)

Abstract

:
Limited information on the potential toxicity of ionic liquids (ILs) becomes the bottleneck that creates a barrier in their large-scale application. In this work, two quantitative structure-activity relationships (QSAR) models were used to evaluate the toxicity of ILs toward the acetylcholinesterase enzyme using multiple linear regression (MLR) and extreme learning machine (ELM) algorithms. The structures of 57 cations and 21 anions were optimized using quantum chemistry calculations. The electrostatic potential surface area (SEP) and the screening charge density distribution area (Sσ) descriptors were calculated and used for prediction of IL toxicity. Performance and predictive aptitude between MLR and ELM models were analyzed. Highest squared correlation coefficient (R2), and also lowest average absolute relative deviation (AARD%) and root-mean-square error (RMSE) were observed for training set, test set, and total set for the ELM model. These findings validated the superior performance of ELM over the MLR toxicity prediction model.

1. Introduction

With ever-increasing demand for clean energy and stricter environmental regulations, scientists have always strived for sustainable and green chemical products [1]. The apparent benefits of ionic liquids (ILs) has spurred intensive research in order to replace the conventional hazardous solvents for broad scale industrial applications, such as organic reaction [2,3,4,5,6,7,8], catalytic processes [9,10,11,12], gas separation [13,14,15,16], storage materials [17,18,19,20], etc. Among the unique properties of ILs, negligible vapor pressure has primarily attracted the attention of many researchers, and the field has sought improvements in positively affecting overall efficiency of many chemical processes under the principles of green chemistry [21,22].
State-of-the-art research shows that the previously accepted advantage of low toxicity for ILs has proved to be overestimated, meaning that to some extent ILs in reality pose hazard potentials to humans and environment [23]. Rogers [22] et al. challenged the notion of green ILs because of their hazardous properties, e.g., unknown toxicity and stability. Meanwhile, Jastorff [24] et al. discussed the toxicity and ecotoxicity of ILs using quantitative structure-activity relationships (QSAR), which could be used to aid the rational design of optimal solvents from a technical and environmental perspective. Docherty [25] et al. found that ILs with hexyl- and octyl-imidazolium and pyridinium bromides had significant antimicrobial activity to pure cultures of Saccharomyces cerevisiae, Escherichia coli, Bacillus subtilis, and others. Later, Pang [26] et al. reviewed the environmentally relevant issues of ILs, such as environmental application and toxicity. Karunanithi [27] et al. studied the life cycle of aquatic ecotoxicity impacts for five common ILs by integrating physical properties, toxicity data, and transport parameters in the model. For the first time, they reported the freshwater ecotoxicity characterization factors for ILs. They found that an average of 83% of ecotoxicity impact was due to chemicals released during the upstream synthesis steps, while the remaining 17% ecotoxicity was related to the life-cycle energy consumption. All these findings underscore the need to develop sustainable and nontoxic ILs in future research. Unlike other basic physicochemical properties, such as density, viscosity, thermal stability, etc., to the best of our knowledge, research studies on the toxicity of ILs are still insufficient, and therefore need to be addressed further. In this regard, key developments in toxicity of ILs will highlight the designable feasibility of ILs to be environmentally benign with huge potential benefits for sustainable and green chemistry [28]. On the other hand, it is worthy to emphasize that around 108 ILs are accessible due to the combination of enormously different cations and anions. Consequently, challenges have been encountered when investigating the toxicity of ILs [28,29]. To deal with these issues, it is highly necessary to develop efficient models to predict the toxicity of ILs with high accuracy.
Recently, QSAR studies have been reported in correlation with, and prediction of, properties of ILs [30,31,32,33,34,35,36]. QSAR is a powerful tool for accurately predicting the physical and chemical properties when applied to a set of molecular descriptors. In this regard, QSAR is supposed to be useful for estimating and screening chemical compounds for specific applications, having the desired characteristics for elucidating the underlying relation between micro-structures and macro-properties. Since the pioneering studies by Jastorff and co-authors [37], research attempts have been devoted to improving understanding and estimating the toxicity of ILs [30,38,39,40,41,42]. In subsequent years, Pereira et al. presented their toxicological assessment of a group of environmentally-friendly ILs with benign cholinium cations and linear alkanoate anions, using filamentous fungi as model eukaryotic organisms [39]. They found that the toxicity of ILs was increased with elongation in the linear chains of the anion, while branching resulted in reduced toxicity due to depressed lipophilicity [39]. Similar conclusions were drawn by Lima and Coutinho [43]. In order to further these studies, Ranke and co-workers studied up to 74 ionic liquids with different cations and anions to show the influence of cation lipophilicity on the cytotoxicity in IPC-81 leukemia cells of rats. They found that substituents in the cation increased the toxicity of ILs [41]. Torrecilla et al. developed QSAR models based on MLR and neural network for prediction of toxicological effects of 96 ILs on IPC-81 [42]. Romero and co-workers studied EC50 values of each compound in an aqueous solution using the microtox standard procedure. They found that the short length of the R2 side chain on imidazolium cation was favorable to diminish the toxic effect, while anions had a minor effect on the toxicity of ILs [44]. Yan et al. investigated the toxicity of ILs in acetyl cholinesterase enzyme (AChE) by the QSAR method using topological indices based on the atom characters. Their results demonstrated that the MLR model was capable of predicting the logEC50 (AChE) of ILs by using a 177 unit training set and a 44 unit testing set of topological indices generated from cations and anions [45]. Recently, Singh et al. investigated the chemical attributes of a wide variety of ILs towards their inhibitory potential of AChE using a supported vector machine (SVM) and a cascade correlation network (CCN) [46]. Their results showed that the proposed QSAR models had more statistical confidence, especially with respect to external validation, which has not been focused upon in other studies. This work addresses the success in predicting different toxicity classes and precise toxicity end-points of ILs using proposed QSAR. Like other researchers, we have previously focused on establishing the toxicity database of ILs, including over 4000 pieces of data and using QSAR models based on quantum chemistry descriptors and an artificial intelligence algorithm to study the toxicity toward the IPC-81 [40]. The results revealed that the nonlinear model developed by the SVM algorithm was more reliable in the prediction of toxicity of ILs. Until now, very limited information has been available on the toxicity of ILs towards AChE. Our main objective in this work was to enhance the understanding of IL’s toxicity towards AChE by using novel extreme learning machine (ELM) algorithms with better accuracy in a bid to shed light on designing novel environmentally benign ILs for future applications.

2. Results and Discussion

2.1. Quantitative Prediction of Multiple Linear Regression (MLR) Model

To obtain the predictive model with an optimal number of descriptors, the stepwise regression approach was utilized to choose the effective input descriptors. As shown in Figure 1, the number of descriptors gradually grows as the squared correlation coefficient (R2) and adjusted squared correlation coefficient (R2adjusted) of the model grow, with simultaneous decrease in standard error (Std. Error). When the number of input parameters overtakes 11, the R2, R2adjusted, and standard error do not change evidently. Thus, the MLR model based on the optimal set of descriptors was determined. The final model is shown as Equation (1), where P0 stands for the intercept of the prediction model, while Ci and Cj mean the ith and jth coefficients respectively; SEP-h is the electrostatic potential surface area for cations or anions at the electrostatic potential h while Sσ-k means the screening charge density distribution area for cations or anions at the screening charge density k.
logEC 50 ( AChE ) = P 0 + i = 1 8 C i S E P h + j = 9 11 C j S σ k
Details of the coefficients and the parameters, including the t values of the descriptors, can be found in Table 1. In total, eight SEP parameters and three Sσ parameters were chosen for the model, where C is the cation and the numbers are the corresponding electrostatic potential and surface screening charge density, respectively. Since t values indicate the significance of the parameters, it can be seen that Sσ-C0.013 is the most vital parameter for the prediction of the toxicity of ILs. The σ-profile range can be qualitatively divided into three main parts: the hydrogen-bond (HB) donor region (σ < −0.0082 e/Å2) and the HB acceptor region (σ > 0.0082 e/ Å2), as well as the non-polar region (−0.0082 e/Å2 ≤ σ ≤ 0.0082 e/Å2). Therefore, it can be concluded that the larger the Sσ-C0.013 value of the ILs, the lower the toxicity of the ILs. The SEP-C36.25 and SEP-C82.75 are the second and third crucial parameters, while their coefficients are 0.361 and −0.213, respectively. It should be mentioned that all the selected parameters are from cations, which illustrates that the cations of ILs have a vital influence on the toxicity of ILs, while the anions have little effect. This finding is consistent with the result that the cation makes the dominant contribution to toxicity for most commercial ILs [45].
Experimental and predicted the logEC50 values of ILs towards acetyl cholinesterase enzyme (AChE) (logEC50 (AChE)) values of ILs (as shown in Supporting Information Table S1) via the MLR model for both the training set as well as the test set are plotted in Figure 2, where it can be seen that the predicted values are close to the experimental results. Figure 3 compares the absolute relative deviation (ARD%) within the range of logEC50 (AChE) values from the MLR model. It is evident that the largest proportion of the predicted values falls in the range of 1–5%, which accounts for 48.13%. Similarly, the percentages of predicted data fall in the range of 5–10% and over 10%, accounting for 28.75% and 14.38%, respectively. Only 8.75% of the predicted ARD% values are in the range of 0–1%. These conclusions showed the good performance of the MLR model to predict the toxicity of ILs.

2.2. Quantitative Prediction of Extreme Learning Machine (ELM) Model

Based upon the foundation of 11 descriptors selected via stepwise regression method and the data sets from the former model, a more effective nonlinear ELM model was established. In the ELM model, there are three layers, as shown in Figure 4. The sine function in the ELM model acted as an activation function between the input and intermediate layers, while the linear function was from the intermediate neurons to the output variables. The bias and ωij produced randomly were the parameters between the first and intermediate layers. The ωjk was the coefficient needed to calculate between the intermediate and final layer. After the determination of the functions, the next step was to acquire the ideal number of neurons between the input variables and intermediate neurons. As shown in Figure 5, in terms of the training set, the R2 value slightly increases with the growth in the number of neurons, whereas the average absolute relative deviation (AARD%) considerably drops when the number of neurons is less than 45, beyond which it dramatically rises. The variation tendency of R2 and AARD% for the test set are almost opposite to the growing number of neurons. Therefore, the best number of neurons (45) is acquired for the prediction the toxicity of ILs by the ELM model.
The comparisons of the predicted logEC50 (AChE) values by ELM (as shown in Supporting Information Table S1) in both the training and test sets with the experimental logEC50 (AChE) values are shown in the Figure 6, which reveals that the predicted values match well with the experimental data. Figure 7 illustrates the proportion of logEC50 (AChE) values in the different ARD% ranges of the ELM model. It is clear that the largest percentage of ARD% from the logEC50 (AChE) values is 54.38% within the range of 1–5%, followed by the 22.5% in the range of 0–1%. The percentage of ARD% of the logEC50 (AChE) values in the range of 5–10% and over 10% account for 18.75% and 4.38%, respectively. These results showcase the validity and reliability of the ELM model to predict the toxicity of ILs.

2.3. Comparison between Two Quantitative Structure-activity Relationships (QSAR) Models

As discussed above, satisfactory results are obtained for the two models having high R2 values. Detailed comparison of the statistical parameters of the training set, test set, and total data of different QSAR models are given in Table 2.
It is evident from Table 2 that the R2 values of the training set, test set, and total data using the ELM model are 0.969, 0.950, and 0.964, respectively, which are comparatively higher than of the R2 values of the MLR model, i.e., 0.920 for the training set, 0.914 for the test set, and 0.917 for the total set. This trend is valid for both AARD% and RMSE, meaning that the non-linear ELM shows better predictive performance than MLR. For example, AARD% values for the training set by the MLR and ELM models are 5.18 and 2.86, while the RMSE values are 1.534 and 0.950, respectively. A similar situation can easily be observed for the test set and all data sets using two models. Undoubtedly, the ELM algorithm is a powerful prediction tool with better accuracy.
In addition, according to the effective criteria used in the literature [47,48,49,50], a QSAR model with acceptable predictive ability should satisfy the conditions in Equations (2)–(5). As can be seen from Table 3, all the coefficients for both the MLR and ELM models satisfy the above-mentioned criteria, illustrating the good predictive power of the models in this study. To further verify the reliability of the models established in this study, we also predicted the toxicity values of five new ILs. The specific prediction results can be found in the Supporting Information (Table S2). It can be seen that the predicted values agree well with the experimental values, indicating that the models we built are reliable, and the ELM model has better prediction performance than the MLR model.
R 2 > 0.7
R 2 R 0 2 R 2 < 0.1   a n d   0.85   < k < 1.15
R 2 R 0 2 R 2 < 0.1   a n d   0.85   < k < 1.15
| R 0 2 R 0 2 | < 0.3
We compared our estimated results with the reported values, as summarized in Table 4. Our two QSAR models, MLR (R2 = 0.917) and ELM (R2 = 0.964), have higher R2 values compared with those of other models, except the Multiple linear regression (MLP) model (R2 = 0.973). Although MLP exhibits the highest value of R2, our models used less input parameters. Our models show AARD% values of 6.35 for MLR and 3.29 for ELM, which fall in the range of the reported values, i.e., 2.8 to 9.35 (Table 4). In addition, as can be seen from Table 4, the RMSE values of our models are also relatively low compared to the literature [45]. Taking these factors into consideration, our models are reliable and meaningful for accurate prediction of the toxicity of ILs towards AChE.

3. Framework

As we know, quantitative structure-activity relationships (QSAR) can be implemented in the form of linear and non-linear algorithms. The former are relatively easy and intuitively present the impact of each parameter upon the properties, while the latter are considered to be suitable for accurate prediction in real-world scenarios. In this work, the toxicity of ionic liquids (ILs) was predicted according to the scheme shown in Figure 4.

4. Dataset and Structural Descriptors

4.1. Dataset

Herein, toxicity data in acetyl cholinesterase enzyme (AChE) of 160 ILs was collected from the widely acknowledged ionic liquid (IL) database [40,51,52]. Concentration for 50% of maximal effect (EC50) values (μM) for AChE, written as log EC50, were converted into the form of a logarithm of half maximal effective concentration. The whole data set was divided into two parts: a training set of 80% ILs to build the model and a set of 20% ionic liquids (ILs) to evaluate the model’s predictability. Detailed information on the ILs studied in this paper, including the names, SMILES strings, log10(EC50) values, etc., can be found in Supporting Information.

4.2. Calculation of Descriptors

As established, the related compounds are represented by theoretical molecular descriptors, which are of key significance for the predictive performance of the models [40]. The screening charge density distribution area (Sσ), obtained from the histogram function of the σ-profiles given by the conductor-like screening model for real solvent (COSMO-RS) computation, is an a priori quantum chemistry descriptor that could quantitatively represent the molecule’s polar surface screen charge on the polarity scale [36]. The electrostatic potential surface area (SEP) of molecules refers to the surface areas of the molecules within the interval of the different electrostatic potential, and it can be used for the prediction of the material’s properties due to its rich information at the electron level. In this work, two descriptors SEP and Sσ-profiles were employed to develop models and were calculated by different programs based on the optimal structures of cations and anions. First of all, the SEP files of the corresponding cations were calculated using Multiwfn software with an electrostatic potential range of 0~150 kcal/mol, and was kept the same for the anions in the range of −150~0 kcal/mol, with a step size of 0.5 kcal/mol [53]. For example, the SEP-C88.75 means the molecular surface areas in the electrostatic potential scale of 88.5–89 kcal/mol. Then, the conductor-like screening model (COSMO) files of cations and anions were obtained using Gaussian 03 software [54]. Finally, Sσ-profiles files of corresponding cations and anions were calculated with a σ-profile ranging from −0.03 to 0.03 e/Å2 and with a step size of 0.001 e/Å2, respectively. We chose cation (1-(cyanomethyl)-3-methylimidazolium) and anion (1-octylsulfate) to present the representative SEP and Sσ-profiles, and the results were depicted in Figure 8; Figure 9, respectively. As shown in Figure 8, the darker shade of red for 1-(cyanomethyl)-3-methylimidazolium or blue for 1-octylsulfate indicates stronger polarity. Figure 9 presents a similar situation of electrostatic potential surfaces.

4.3. Multiple Linear Regression (MLR)

Multiple Linear Regression (MLR) is widely harnessed in quantitative structure-activity relationships (QSAR) to determinate the relationship between a dependent variable and various independent variables. The linear combination relationship of the independent variables is described by a set of coefficients in this model. The structural characteristics to the toxicity can be linked as follows:
logEC 50 = b 0 + b 1 x 1 + b 2 x 2 + + b n x n
where b0 is the intercept and b1, b2, and bn are regression coefficients of the corresponding descriptors; n is the number of descriptors used in the equation in order to figure out the optimal regression model; x denotes the descriptor used to describe the chemical structure of the compound. MLR consists of stepwise selection of descriptors, which combines the forward and backward procedures.

4.4. Extreme Learning Machine (ELM)

Extreme learning machine (ELM) architecture was proposed for the first time by Huang et al., as a single hidden layer feed-forward neural network that has a very strong learning ability for application in basic classification and regression [55]. The most key characteristic of ELM is that it has faster learning speed compared with the traditional learning algorithms. In addition, ELM can be easily used and is a fast-learning and effective algorithm for the feed-forward neural network with single hidden layer interpolation capability and universal approximation capability, which are the two important characteristics of the feed-forward neural network. To some extent, ELM is similar to an artificial neural network (ANN), and its weights and biases in the first layer are randomly initialized and kept constant, while the weights of the second layer are selected by diminishing the least squared error. In practical applications, the training data set is mainly concerned with the specific issues [56]. The data sets include actual results and the related factors. Through iterations to finish the learning process, the impact factors and corresponding results were put into the ELM for training during the training process. Afterward, using the trained ELM, we only needed to input the training data set, similar to the influencing factors. It has been reported that ELM is feasible in many fields, such as protein sequence classification, regression problems, etc., to easily achieve good performance with fast speed [57]. Evidently, increasingly more research in this regard shows that excellent predictive performance can be achieved using the non-linear model developed by the ELM algorithm. More information on the theory and application of ELM can be found elsewhere [55,56,57].

4.5. Evaluation of Quantitative Structure-activity Relationships (QSAR)

In this work, several parameters were defined to evaluate the model performance, as measured by different metrics: adjusted squared correlation coefficient (R2adjusted), squared correlation coefficient (R2), average absolute relative deviation (AARD%), root-mean-square error (RMSE), absolute relative deviation (ARD%), the coefficients of calculated values versus experimental values ( R 0 2 ), experimental values versus calculated values ( R 0 2 ), slope k of the calculated versus experimental values, and slope k’ of the experimental versus calculated values. The corresponding definition could be expressed in Equations (7)–(15).
R 2 = i = 1 N p ( y i exp y ¯ m ) 2 i = 1 N p ( y i cal y i exp ) 2 i = 1 N P ( y i exp y ¯ m ) 2
R 2 adjusted = 1 ( 1 R 2 ) ( N P 1 ) N P p 1
AARD ( % ) = 100 × i = 1 N P ( | y i cal y i exp y i exp | ) / N p
RMSE = i = 1 N P ( y i cal y i exp ) 2 / N P
ARD ( % ) = | 100 × ( y i cal / y i exp 1.0 ) |
R 0 2 = 1 i = 1 N p ( y i cal k y i cal ) 2 i = 1 N P ( y i cal y cal ¯ ) 2
R 0 2 = 1 i = 1 N p ( y i exp k y i exp ) 2 i = 1 N P ( y i exp k y exp ¯ ) 2
k = i = 1 N p y i exp y i cal i = 1 N P ( y i cal ) 2
k = i = 1 N p y i exp y i cal i = 1 N P ( y i exp ) 2
where y means logEC50 (AChE) value of ILs while the “cal” and “exp” superscripts stand for calculated and experimental values, respectively. Np denotes the number of data points for the corresponding dataset; ȳm represents the average log EC50 (AChE) value of ILs for all of the selected data.

5. Conclusions

Safe operation and benign utilization are crucial for ILs, therefore the scope of accurately predicting the toxicity of ILs is of vital interest for scientific and technological advancement. In this work, detailed data analyses of the toxicity of ILs were made based on the established database and novel QSAR models were developed for toxicity prediction of 160 ILs in AChE. Good correlation between the estimated and original data was observed for the training set, test set, and the total set. It was found that the newly used ELM model gave the highest R2 values and the lowest values for AARD% and RMSE. The results validated the superior performance of ELM for estimation of toxicity of ILs. This study will be helpful in design and screening of environmentally friendly commercial ILs.

Supplementary Materials

Supplementary materials can be found at https://0-www-mdpi-com.brum.beds.ac.uk/1422-0067/20/9/2186/s1.

Author Contributions

Writing—original draft, P.Z.; Writing—review & editing, X.K., Y.Z., U.L. and H.Z.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ranke, J.; Stolte, S.; Störmann, R.; Arning, J.; Jastorff, B. Design of sustainable chemical products the example of ionic liquids. Chem. Rev. 2007, 107, 2183–2206. [Google Scholar] [CrossRef]
  2. Niedermaier, I.; Kolbeck, C.; Taccardi, N.; Schulz, P.S.; Li, J.; Drewello, T.; Wasserscheid, P.; Steinrück, H.P.; Maier, F. Organic reactions in ionic liquids studied by in situ XPS. Chem. Phys. 2012, 13, 1725–1735. [Google Scholar] [CrossRef] [PubMed]
  3. Yao, W.; Wang, H.; Cui, G.; Li, Z.; Zhu, A.; Zhang, S.; Wang, J. Tuning the hydrophilicity and hydrophobicity of the respective cation and anion: Reversible phase transfer of ionic liquids. Angew. Chem. Int. Ed. 2016, 55, 7934–7938. [Google Scholar] [CrossRef]
  4. Chen, K.; Shi, G.; Zhang, W.; Li, H.; Wang, C. Computer-Assisted Design of Ionic Liquids for Efficient Synthesis of 3 (2 H)-Furanones: A Domino Reaction Triggered by CO2. J. Am. Chem. Soc. 2016, 138, 14198–14201. [Google Scholar] [CrossRef] [PubMed]
  5. Biswas, K.; Zhang, Q.; Chung, I.; Song, J.-H.; Androulakis, J.; Freeman, A.J.; Kanatzidis, M.G. Synthesis in ionic liquids: [Bi2Te2Br](AlCl4), a direct gap semiconductor with a cationic framework. J. Am. Chem. Soc. 2010, 132, 14760–14762. [Google Scholar] [CrossRef]
  6. Sawant, A.; Raut, D.; Darvatkar, N.; Salunkhe, M. Recent developments of task-specific ionic liquids in organic synthesis. Green Chem. Lett. Rev. 2011, 4, 41–54. [Google Scholar] [CrossRef] [Green Version]
  7. Cooper, E.R.; Andrews, C.D.; Wheatley, P.S.; Webb, P.B.; Wormald, P.; Morris, R.E. Ionic liquids and eutectic mixtures as solvent and template in synthesis of zeolite analogues. Nature 2004, 430, 1012. [Google Scholar] [CrossRef]
  8. Plechkova, N.V.; Seddon, K.R. Applications of ionic liquids in the chemical industry. Chem. Soc. Rev. 2008, 37, 123–150. [Google Scholar] [CrossRef] [PubMed]
  9. Zhang, G.R.; Munoz, M.; Etzold, B.J. Accelerating Oxygen-Reduction Catalysts through Preventing Poisoning with Non-Reactive Species by Using Hydrophobic Ionic Liquids. Angew. Chem. Int. Ed. 2016, 55, 2257–2261. [Google Scholar] [CrossRef]
  10. Asadi, M.; Kim, K.; Liu, C.; Addepalli, A.V.; Abbasi, P.; Yasaei, P.; Phillips, P.; Behranginia, A.; Cerrato, J.M.; Haasch, R. Nanostructured transition metal dichalcogenide electrocatalysts for CO2 reduction in ionic liquid. Science 2016, 353, 467–470. [Google Scholar] [CrossRef] [Green Version]
  11. Olivier-Bourbigou, H.; Magna, L.; Morvan, D. Ionic liquids and catalysis: Recent progress from knowledge to applications. Appl. Catal. A: Gen. 2010, 373, 1–56. [Google Scholar] [CrossRef]
  12. Wasserscheid, P.; Keim, W. Ionic liquids—new “solutions” for transition metal catalysis. Angew. Chem. Int. Ed. 2000, 39, 3772–3789. [Google Scholar] [CrossRef]
  13. Gurkan, B.E.; de la Fuente, J.C.; Mindrup, E.M.; Ficke, L.E.; Goodrich, B.F.; Price, E.A.; Schneider, W.F.; Brennecke, J.F. Equimolar CO2 absorption by anion-functionalized ionic liquids. J. Am. Chem. Soc. 2010, 132, 2116–2117. [Google Scholar] [CrossRef] [PubMed]
  14. Cao, L.; Gao, J.; Zeng, S.; Dong, H.; Gao, H.; Zhang, X.; Huang, J. Feasible ionic liquid-amine hybrid solvents for carbon dioxide capture. Int. J. Greenh. Gas. Control. 2017, 66, 120–128. [Google Scholar] [CrossRef]
  15. Wang, C.; Luo, H.; Jiang, D.E.; Li, H.; Dai, S. Carbon dioxide capture by superbase-derived protic ionic liquids. Angew. Chem. Int. Ed. 2010, 49, 5978–5981. [Google Scholar] [CrossRef]
  16. Cao, L.; Huang, J.; Zhang, X.; Zhang, S.; Gao, J.; Zeng, S. Imidazole tailored deep eutectic solvents for CO2 capture enhanced by hydrogen bonds. Phys. Chem. Chem. Phys. 2015, 17, 27306–27316. [Google Scholar] [CrossRef] [PubMed]
  17. Zhang, S.; Sun, J.; Zhang, X.; Xin, J.; Miao, Q.; Wang, J. Ionic liquid-based green processes for energy production. Chem. Soc. Rev. 2014, 43, 7838–7869. [Google Scholar] [CrossRef] [PubMed]
  18. Watanabe, M.; Thomas, M.L.; Zhang, S.; Ueno, K.; Yasuda, T.; Dokko, K. Application of ionic liquids to energy storage and conversion materials and devices. Chem. Rev. 2017, 117, 7190–7239. [Google Scholar] [CrossRef] [PubMed]
  19. MacFarlane, D.R.; Forsyth, M.; Howlett, P.C.; Kar, M.; Passerini, S.; Pringle, J.M.; Ohno, H.; Watanabe, M.; Yan, F.; Zheng, W. Ionic liquids and their solid-state analogues as materials for energy generation and storage. Nat. Rev. Mater. 2016, 1, 15005. [Google Scholar] [CrossRef]
  20. Guo, F.; Zhang, S.; Wang, J.; Teng, B.; Zhang, T.; Fan, M. Synthesis and applications of ionic liquids in clean energy and environment: A review. Curr. Org. Chem. 2015, 19, 455–468. [Google Scholar] [CrossRef]
  21. Anastas, P.T.; Zimmerman, J.B. Peer reviewed: Design through the 12 principles of green engineering. Environ. Sci. Technol. 2003, 37, 94A–101A. [Google Scholar] [CrossRef]
  22. Swatloski, R.P.; Holbrey, J.D.; Rogers, R.D. Ionic liquids are not always green: Hydrolysis of 1-butyl-3-methylimidazolium hexafluorophosphate. Green Chem. 2003, 5, 361–363. [Google Scholar] [CrossRef]
  23. Zhao, D.; Liao, Y.; Zhang, Z. Toxicity of ionic liquids. Clean–Soilairwater 2007, 35, 42–48. [Google Scholar] [CrossRef]
  24. Jastorff, B.; Störmann, R.; Ranke, J.; Mölter, K.; Stock, F.; Oberheitmann, B.; Hoffmann, W.; Hoffmann, J.; Nüchter, M.; Ondruschka, B. How hazardous are ionic liquids? Structure–activity relationships and biological testing as important elements for sustainability evaluation. Green Chem. 2003, 5, 136–142. [Google Scholar] [CrossRef]
  25. Docherty, K.M.; Kulpa, C.F., Jr. Toxicity and antimicrobial activity of imidazolium and pyridinium ionic liquids. Green Chem. 2005, 7, 185–189. [Google Scholar] [CrossRef]
  26. Amde, M.; Liu, J.-F.; Pang, L. Environmental application, fate, effects, and concerns of ionic liquids: A review. Environ. Sci. Technol. 2015, 49, 12611–12627. [Google Scholar] [CrossRef] [PubMed]
  27. Mehrkesh, A.; Karunanithi, A.T. Life-cycle perspectives on aquatic ecotoxicity of common ionic liquids. Environ. Sci. Technol. 2015, 50, 6814–6821. [Google Scholar] [CrossRef] [PubMed]
  28. Rogers, R.D.; Seddon, K.R. Ionic liquids--solvents of the future? Science 2003, 302, 792–793. [Google Scholar] [CrossRef]
  29. Bubalo, M.C.; Radošević, K.; Redovniković, I.R.; Halambek, J.; Srček, V.G. A brief overview of the potential environmental hazards of ionic liquids. Ecotoxicol. Environ. Saf. 2014, 99, 1–12. [Google Scholar] [CrossRef]
  30. Scire, S.; Musumarra, G. A QSPR approach to the ecotoxicity of ionic liquids (Vibrio fischeri) using VolSurf principal properties. Toxicol. Res. 2016, 5, 1090–1096. [Google Scholar]
  31. Yousefinejad, S.; Hemmateenejad, B. Chemometrics tools in QSAR/QSPR studies: A historical perspective. Chemom. Intell. Lab. Syst. 2015, 149, 177–204. [Google Scholar] [CrossRef]
  32. Gupta, S.; Basant, N.; Singh, K.P. Nonlinear QSAR modeling for predicting cytotoxicity of ionic liquids in leukemia rat cell line: An aid to green chemicals designing. Environ. Sci. Pollut. Res. 2015, 22, 1–12. [Google Scholar] [CrossRef] [PubMed]
  33. Ghaedi, A. Predicting the cytotoxicity of ionic liquids using QSAR model based on SMILES optimal descriptors. J. Mol. Liq. 2015, 208, 269–279. [Google Scholar] [CrossRef]
  34. Zhao, Y.; Gani, R.; Afzal, R.M.; Zhang, X.; Zhang, S. Ionic liquids for absorption and separation of gases: An extensive database and a systematic screening method. Aiche J. 2017, 63, 1353–1367. [Google Scholar] [CrossRef]
  35. Rybinska, A.; Sosnowska, A.; Grzonkowska, M.; Barycki, M.; Puzyn, T. Filling environmental data gaps with QSPR for ionic liquids: Modeling n-octanol/water coefficient. J. Hazard. Mater. 2016, 303, 137. [Google Scholar] [CrossRef] [PubMed]
  36. Zhao, Y.; Huang, Y.; Zhang, X.; Zhang, S. A quantitative prediction of the viscosity of ionic liquids using Sσ-profile molecular descriptors. Phys. Chem. Chem. Phys. 2015, 17, 3761–3767. [Google Scholar] [CrossRef]
  37. Ranke, J.; Jastorff, B. Multidimensional risk analysis of antifouling biocides. Environ. Sci. Pollut. Res. 2000, 7, 105–114. [Google Scholar] [CrossRef]
  38. Bailey, M.M.; Townsend, M.B.; Jernigan, P.L.; Sturdivant, J.; Hough-Troutman, W.L.; Rasco, J.F.; Swatloski, R.P.; Rogers, R.D.; Hood, R.D. Developmental toxicity assessment of the ionic liquid 1-butyl-3-methylimidazolium chloride in CD-1 mice. Green Chem. 2008, 10, 1213–1217. [Google Scholar] [CrossRef]
  39. Petkovic, M.; Ferguson, J.L.; Gunaratne, H.N.; Ferreira, R.; Leitão, M.C.; Seddon, K.R.; Rebelo, L.P.N.; Pereira, C.S. Novel biocompatible cholinium-based ionic liquids-toxicity and biodegradability. Green Chem. 2010, 12, 643–649. [Google Scholar] [CrossRef]
  40. Zhao, Y.; Zhao, J.; Huang, Y.; Zhou, Q.; Zhang, X.; Zhang, S. Toxicity of ionic liquids: Database and prediction via quantitative structure-activity relationship method. J. Hazard. Mater. 2014, 278, 320–329. [Google Scholar] [CrossRef]
  41. Ranke, J.; Bottin-Weber, M.U.; Stock, F.; Stolte, S.; Arning, J.; Stoermann, R.; Jastorff, B. Lipophilicity parameters for ionic liquid cations and their correlation to in vitro cytotoxicity. Ecotoxicol. Environ. Saf. 2007, 67, 430–438. [Google Scholar] [CrossRef]
  42. Torrecilla, J.S.; García, J.; Rojo, E.; Rodríguez, F. Estimation of toxicity of ionic liquids in Leukemia Rat Cell Line and Acetylcholinesterase enzyme by principal component analysis, neural networks and multiple lineal regressions. J. Hazard. Mater. 2009, 164, 182. [Google Scholar] [CrossRef]
  43. Oliveira, M.V.; Vidal, B.T.; Melo, C.M.; de Miranda, R.d.C.; Soares, C.M.; Coutinho, J.A.; Ventura, S.P.; Mattedi, S.; Lima, Á.S. (Eco) toxicity and biodegradability of protic ionic liquids. Chemosphere 2016, 147, 460–466. [Google Scholar] [CrossRef]
  44. Romero, A.; Santos, A.; Tojo, J.; Rodriguez, A. Toxicity and biodegradability of imidazolium ionic liquids. J. Hazard. Mater. 2008, 151, 268–273. [Google Scholar] [CrossRef]
  45. Yan, F.; Xia, S.; Wang, Q.; Ma, P. Predicting Toxicity of Ionic Liquids in Acetylcholinesterase Enzyme by the Quantitative Structure–Activity Relationship Method Using Topological Indexes. J. Chem. Eng. Data 2012, 57, 2252–2257. [Google Scholar] [CrossRef]
  46. Basant, N.; Gupta, S.; Singh, K.P. Predicting acetyl cholinesterase enzyme inhibition potential of ionic liquids using machine learning approaches: An aid to green chemicals designing. J. Mol. Liq. 2015, 209, 404–412. [Google Scholar] [CrossRef]
  47. Golbraikh, A.; Tropsha, A. Beware of q2! J. Mol. Graph. Model. 2002, 20, 269–276. [Google Scholar] [CrossRef]
  48. Afantitis, A.; Melagraki, G.; Tsoumanis, A.; Valsami-Hones, E.; Lynch, I. A nanoinformatics decision support tool for the virtual screening of gold nanoparticle cellular association using protein corona fingerprints. Nanotoxicology 2018, 12, 1148–1165. [Google Scholar] [CrossRef] [PubMed]
  49. Zhang, S.; Golbraikh, A.; Oloff, S.; Kohn, H. A novel automated lazy learning QSAR (ALL-QSAR) approach: Method development, applications, and virtual screening of chemical databases using validated ALL-QSAR models. J. Chem. Inf. Modeling 2006, 46, 1984–1995. [Google Scholar] [CrossRef]
  50. Afantitis, A.; Melagraki, G.; Sarimveis, H.; Koutentis, P.A.; Markopoulos, J. Development and evaluation of a QSPR model for the prediction of diamagnetic susceptibility. QSAR & Comb. Sci. 2008, 27, 432–436. [Google Scholar]
  51. Zhang, S.; Sun, N.; He, X.; Lu, X.; Zhang, X. Physical Properties of Ionic Liquids: Database and Evaluation. J. Phys. Chem. Ref. Data 2006, 35, 1475–1517. [Google Scholar] [CrossRef]
  52. The UFT/Merck Ionic Liquids Biological Effects Database. Available online: http://www.il-eco.uft.uni-bremen.de/ (accessed on 22 April 2014).
  53. Lu, T.; Chen, F. Multiwfn: A multifunctional wavefunction analyzer. J. Comput. Chem. 2012, 33, 580. [Google Scholar] [CrossRef] [PubMed]
  54. Frisch, M.; Schlegel, G.T.H.; Scuseria, G.; Robb, M.; Cheeseman, J.; Montgomery, J., Jr.; Vreven, T.; Kudin, K.; Burant, J. Gaussian 03, revision B. 03; revision B. 03; Gaussian Inc.: Pittsburgh, PA, USA, 2003. [Google Scholar]
  55. Chorowski, J.; Wang, J.; Zurada, J.M. Review and performance comparison of SVM- and ELM-based classifiers. Neurocomputing 2014, 128, 507–516. [Google Scholar] [CrossRef]
  56. Ding, S.; Zhao, H.; Zhang, Y.; Xu, X.; Nie, R. Extreme learning machine: Algorithm, theory and applications. Artif. Intell. Rev. 2015, 44, 103–115. [Google Scholar] [CrossRef]
  57. Huang, G.B.; Siew, C.K. Extreme learning machine: RBF network case. In Proceedings of the ICARCV 2004 8th Control, Automation, Robotics and Vision Conference, Kunming, China, 6–9 December 2004; pp. 1029–1036. [Google Scholar]
Figure 1. The squared correlation coefficient (R2) versus parameter number for the training data set.
Figure 1. The squared correlation coefficient (R2) versus parameter number for the training data set.
Ijms 20 02186 g001
Figure 2. Comparisons of predicted the logEC50 values of ILs towards acetyl cholinesterase enzyme (AChE) (logEC50 (AChE)) by multiple linear regression (MLR) in external validation with the experimental logEC50 (AChE).
Figure 2. Comparisons of predicted the logEC50 values of ILs towards acetyl cholinesterase enzyme (AChE) (logEC50 (AChE)) by multiple linear regression (MLR) in external validation with the experimental logEC50 (AChE).
Ijms 20 02186 g002
Figure 3. Percent of logEC50 (AChE) values in different absolute relative deviation (ARD%) ranges of the MLR model.
Figure 3. Percent of logEC50 (AChE) values in different absolute relative deviation (ARD%) ranges of the MLR model.
Ijms 20 02186 g003
Figure 4. Scheme of linear and non-linear models programming methodology. The screening charge density distribution area (Sσ) and the electrostatic potential surface area (SEP) of cations and anions were calculated employed to develop models optimal number of descriptors. The effective input descriptors were chosen by the stepwise regression approach and then the multiple linear regression (MLR) model was built with the obtained coefficients (Ci) and selected descriptor (Si) to predict the the toxicity values (logEC50) of ionic liquids (ILs) towards acetyl cholinesterase enzyme (AChE). The extreme learning machine (ELM) comprising three parts—the input layer, hidden layer, and output layer—was established based on the same input descriptors. The parameters wij and bias from the input layer to hidden layer were generated randomly and the parameters wjk were coefficients from the hidden layer to output layer.
Figure 4. Scheme of linear and non-linear models programming methodology. The screening charge density distribution area (Sσ) and the electrostatic potential surface area (SEP) of cations and anions were calculated employed to develop models optimal number of descriptors. The effective input descriptors were chosen by the stepwise regression approach and then the multiple linear regression (MLR) model was built with the obtained coefficients (Ci) and selected descriptor (Si) to predict the the toxicity values (logEC50) of ionic liquids (ILs) towards acetyl cholinesterase enzyme (AChE). The extreme learning machine (ELM) comprising three parts—the input layer, hidden layer, and output layer—was established based on the same input descriptors. The parameters wij and bias from the input layer to hidden layer were generated randomly and the parameters wjk were coefficients from the hidden layer to output layer.
Ijms 20 02186 g004
Figure 5. The R2 and average absolute relative deviation (AARD%) versus the parameter number for the ELM) model.
Figure 5. The R2 and average absolute relative deviation (AARD%) versus the parameter number for the ELM) model.
Ijms 20 02186 g005
Figure 6. Comparisons of the predicted logEC50 (AChE) by ELM in external validation with the experimental logEC50 (AChE).
Figure 6. Comparisons of the predicted logEC50 (AChE) by ELM in external validation with the experimental logEC50 (AChE).
Ijms 20 02186 g006
Figure 7. Percent of logEC50 (AChE) values in different ARD% ranges of the ELM model.
Figure 7. Percent of logEC50 (AChE) values in different ARD% ranges of the ELM model.
Ijms 20 02186 g007
Figure 8. The electrostatic potential surface area (SEP) of representative cations and anions of ionic liquids (ILs).
Figure 8. The electrostatic potential surface area (SEP) of representative cations and anions of ionic liquids (ILs).
Ijms 20 02186 g008
Figure 9. The screening charge density distribution area (Sσ) of representative cations and anions of ILs.
Figure 9. The screening charge density distribution area (Sσ) of representative cations and anions of ILs.
Ijms 20 02186 g009
Table 1. Coefficients (Ci), parameters (SEP-h and Sσ-k) and the t values for Equation (1).
Table 1. Coefficients (Ci), parameters (SEP-h and Sσ-k) and the t values for Equation (1).
iCoefficients-Ci1 SEP-h3t
1−0.185SEP-C88.75−6.582
2−0.266SEP-C35.25−6.731
30.361SEP-C36.2510.565
4−0.213SEP-C82.75−10.501
5−0.289SEP-C31.25−7.917
60.133SEP-C89.754.936
70.204SEP-C64.257.241
8−0.142SEP-C53.25−5.498
jCoefficients-Cj2 Sσ-kt
90.537Sσ-C0.01318.957
10−0.102Sσ-C-0.012−9.583
110.234Sσ-C-0.0168.303
P02.712
1 The electrostatic potential surface area (SEP) at the electrostatic potential h; 2 The screening charge density distribution area (Sσ) at the screening charge density k; 3 t value stands for the importance of the parameter.
Table 2. Comparison of the statistical parameters by different quantitative structure-activity relationships (QSAR) models.
Table 2. Comparison of the statistical parameters by different quantitative structure-activity relationships (QSAR) models.
ModelDatasetNo.1R22 AARD%3 RMSE
MLRTraining1280.9205.180.136
Test320.91411.050.180
Total1600.9176.350.145
ELMTraining1280.9692.860.084
Test320.9505.020.134
Total1600.9643.290.096
1 Squared correlation coefficient (R2); 2 Average absolute relative deviation (AARD%); 3 Root-mean-square error (RMSE).
Table 3. The results of external validation of the test set in the multiple linear regression (MLR) and extreme learning machine (ELM) models.
Table 3. The results of external validation of the test set in the multiple linear regression (MLR) and extreme learning machine (ELM) models.
ModelR21k2k′3R024 R 0 2 (R2-R02)/R2
MLR0.9140.98761.00580.99750.9996−0.094
ELM0.9500.99441.00180.99961.0000−0.051
1k is slope of the calculated versus experimental values; 2 k′ is slope of the experimental versus calculated values; 3 the coefficients of calculated values versus experimental values ( R 0 2 ); 4 the coefficients of experimental values versus calculated values ( R 0 2 ).
Table 4. Comparisons of our work with the reported works.
Table 4. Comparisons of our work with the reported works.
MethodNo. ParameterR2AARD%RMSERef.
1 MLR120.8147.7/[42]
2 MLP120.9732.8/[42]
3 RB120.8427.1/[42]
MLR170.8779.350.212[45]
MLR110.9176.350.145This work
4 ELM110.9643.290.096This work
1 Multiple linear regression (MLR); 2 Multilayer perceptron (MLP); 3 Radial-basis function (RB); 4 Extreme learning machine (ELM); / indicates that the RMSE value cannot be obtained from literature.

Share and Cite

MDPI and ACS Style

Zhu, P.; Kang, X.; Zhao, Y.; Latif, U.; Zhang, H. Predicting the Toxicity of Ionic Liquids toward Acetylcholinesterase Enzymes Using Novel QSAR Models. Int. J. Mol. Sci. 2019, 20, 2186. https://0-doi-org.brum.beds.ac.uk/10.3390/ijms20092186

AMA Style

Zhu P, Kang X, Zhao Y, Latif U, Zhang H. Predicting the Toxicity of Ionic Liquids toward Acetylcholinesterase Enzymes Using Novel QSAR Models. International Journal of Molecular Sciences. 2019; 20(9):2186. https://0-doi-org.brum.beds.ac.uk/10.3390/ijms20092186

Chicago/Turabian Style

Zhu, Peng, Xuejing Kang, Yongsheng Zhao, Ullah Latif, and Hongzhong Zhang. 2019. "Predicting the Toxicity of Ionic Liquids toward Acetylcholinesterase Enzymes Using Novel QSAR Models" International Journal of Molecular Sciences 20, no. 9: 2186. https://0-doi-org.brum.beds.ac.uk/10.3390/ijms20092186

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop