Nano-SAR Modeling for Predicting the Cytotoxicity of Metal Oxide Nanoparticles to PaCa2

Shi, Haihua; Pan, Yong; Yang, Fan; Cao, Jiakai; Tan, Xinlong; Yuan, Beilei; Jiang, Juncheng

doi:10.3390/molecules26082188

by Haihua Shi ¹, Yong Pan ¹,*, Fan Yang ¹, Jiakai Cao ¹, Xinlong Tan ¹, Beilei Yuan ¹ and Juncheng Jiang ¹,²

¹ Jiangsu Key Laboratory of Hazardous Chemicals Safety and Control, College of Safety Science and Engineering, Nanjing Tech University, Nanjing 210009, China
² School of Environment & Safety Engineering, Changzhou University, Changzhou 213164, China
* Author to whom correspondence should be addressed.

Molecules 2021, 26(8), 2188; https://doi.org/10.3390/molecules26082188

Submission received: 4 March 2021 / Revised: 3 April 2021 / Accepted: 6 April 2021 / Published: 10 April 2021

(This article belongs to the Special Issue Environmental Toxicology)

Abstract

The impact of engineered nanoparticles (NPs) on human health and the environment has attracted widespread attention, and it is essential to assess and predict their biological activity, toxicity, and physicochemical properties. Computational methods have been developed as efficient alternatives for understanding the adverse effects of nanoparticles on the environment and human health. Here, a classification-based structure-activity relationship model for nanoparticles (nano-SAR) was developed to predict the cellular uptake of 109 functionalized magneto-fluorescent nanoparticles by pancreatic cancer cells (PaCa2). Norm index descriptors were employed to describe the structural characteristics of the nanoparticles. The random forest (RF) algorithm, combined with recursive feature elimination (RFE), was employed to develop the nano-SAR model. The resulting model showed satisfactory statistical performance, with an accuracy (ACC) of 0.950 on the test set and 0.966 on the training set. The model was rigorously validated and compared with models in the literature. The proposed model can reasonably be expected to predict the cellular uptake of nanoparticles and to provide guidance for the design and manufacture of safer nanomaterials.

Keywords:

cellular uptake; metal oxide nanoparticles; cytotoxicity; nano-SAR; norm index descriptors

1. Introduction

In recent years, nanotechnology has been considered one of the key enabling technologies for global economic growth. With its continuous development, new kinds of nanomaterials are emerging all over the world [1,2,3]. Owing to their unique optical, electrical, and magnetic properties, nanomaterials are widely used in traditional materials, catalysis [4], medical devices [5,6], electronic equipment [7], coatings, and other industries [8,9,10]. Increasing attention has been paid to the inherent disadvantages of nanomaterials and to the hazards that may arise from exposure in the workplace, among consumers, and in the environment. Although recent studies have found that some nanomaterials may pose biological hazards, understanding of their adverse effects is still in its infancy.

In vitro and in vivo studies are commonly used to assess biological or toxic effects [11]. Nevertheless, experimental methods are laborious and time-consuming, and they sometimes raise ethical issues. Thus, there is a strong demand for a fast, high-throughput nano-toxicity evaluation system or prediction model to supplement traditional experimental methods. Among the available methods, the quantitative structure–activity relationship (QSAR) approach, proposed by Corwin Hansch in 1962 and since exploited for developing novel chemicals, primarily drugs [12], is seen as the most promising. QSAR rests on the following hypothesis: the molecular structure of a compound contains the information that determines its physical, chemical, and biological properties, and these physical and chemical properties in turn affect its biological activity. In other words, the molecular structure of a compound is associated with its biological activity. Many investigations indicate that it is urgent and essential to extend the traditional QSAR paradigm to nano-sized materials and to develop “nano-(Q)SAR” models that relate the properties of interest to the structural information of novel synthetic nanoparticles, which can provide a theoretical basis for the design of functionalized nanoparticles with desired characteristics [13].

Pancreatic cancer is the fourth leading cause of cancer death, with a five-year survival rate of less than 5%. Many studies [14,15,16,17] have reported the inhibitory effect of chemical agents such as gemcitabine, paclitaxel, and berberine on pancreatic cancer cells. However, the prognosis is still poor, and so far no chemotherapy has demonstrated efficacy in terms of survival for this cancer. Nanomaterials are increasingly used in daily life, but the safety issues they may cause, especially their biological toxicity, cannot be ignored. Because the human body is intermittently or frequently exposed to them, metal oxide nanoparticles (MNPs) may enter the body through various routes, such as inhalation, skin absorption, and ingestion [18]. Once inside the body, they may cause systemic, cellular, or genomic toxicity, and pancreatic cells may also be exposed to them. Therefore, research on the PaCa2 cell line remains necessary.

Several nano-(Q)SAR studies have been conducted to predict the cellular uptake of 109 functionalized magnetic fluorescent MNPs in the PaCa2 cell line. All of these MNPs possess the same superparamagnetic core decorated with different synthetic small molecules [19,20,21]. Chau et al. [22] developed a nano-SAR model for predicting the cellular uptake of 105 nanoparticles with a single metal core by pancreatic cancer cell lines. Four modeling methods were employed to develop candidate models, namely, support vector machine, k-nearest neighbor, logistic regression, and naïve Bayes. The final consensus models had a sensitivity of 86.7 to 98.2% and a specificity of 67.3 to 76.6%. Kar et al. [23] developed a more accurate cellular uptake model with six conceptually simple and computable descriptors through a partial least squares (PLS) regression approach. Winkler et al. [24] calculated two-dimensional Dragon descriptors and then used linear and nonlinear methods to generate four nano-QSAR models for predicting uptake by PaCa2 and human umbilical vein endothelial cell (HUVEC) lines. Ojha et al. [25] predicted uptake by PaCa2, HUVEC, and human macrophage (U937) cell lines by calculating two-dimensional Dragon descriptors and SiRMS descriptors. Toropov et al. [26] established a reliable nano-QSAR model using optimal SMILES-based descriptors, with the best parameters selected by Monte Carlo partial least squares (MC-PLS); the 109 data points were randomly divided into five groups, and QSAR models were established separately. Qi et al. [18] developed two nano-QSAR models to predict the cellular uptake of 109 nanoparticles by PaCa2 and HUVEC cell lines.

In this work, norm index descriptors were first used to describe the structural properties of the MNPs involved. Then, following the (Q)SAR modeling principles of the Organisation for Economic Co-operation and Development (OECD) [21], a nano-SAR model was developed to predict the cellular uptake endpoints of 109 MNPs with different surface modifications in the PaCa2 cell line. Finally, internal and external validation were performed to verify the developed model rigorously, and its applicability domain was defined. The model contributes to the understanding of nano-SAR and provides a theoretical basis for the design and synthesis of efficient, harmless green nanomaterials.

2. Results and Discussion

2.1. Nano-SAR Model Performance

Based on the cellular uptake data of 109 surface-modified magnetic fluorescent MNPs in the PaCa2 cell line, a nano-SAR model with the toxicity endpoint as the dependent variable was established. The performance of the model on the training and test sets was assessed with the indicators defined in Equations (1)–(4) [27]. A true positive (TP) is a toxic MNP correctly classified as toxic, and a true negative (TN) is a non-toxic MNP correctly classified as non-toxic, whereas a false positive (FP) is a non-toxic MNP incorrectly classified as toxic and a false negative (FN) is a toxic MNP incorrectly classified as non-toxic.

SE = \frac{TP}{TP + FN}

(1)

SP = \frac{TN}{TN + FP}

(2)

ACC = \frac{TN + TP}{TN + TP + FP + FN}

(3)

MCC = \frac{TP \times TN - FP \times FN}{\sqrt{(TP + FN)(TP + FP)(TN + FN)(TN + FP)}}

(4)

The detailed statistical parameters calculated with these equations are given in Table 1.

The true and predicted labels are shown in the confusion matrix in Figure 1.
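As a rough illustration of Equations (1)–(4), the sketch below computes the four indicators from confusion-matrix counts. The function name `classification_metrics` and the example counts are assumptions: the counts are not reported in the text and were chosen only because they are consistent with the test-set row of Table 1.

```python
import math

def classification_metrics(tp: int, tn: int, fp: int, fn: int) -> dict:
    """Return SE, SP, ACC and MCC as defined in Equations (1)-(4)."""
    se = tp / (tp + fn)
    sp = tn / (tn + fp)
    acc = (tn + tp) / (tn + tp + fp + fn)
    denom = math.sqrt((tp + fn) * (tp + fp) * (tn + fn) * (tn + fp))
    # MCC is undefined when a marginal total is zero; 0.0 is a common convention.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"SE": se, "SP": sp, "ACC": acc, "MCC": mcc}

# Hypothetical test-set counts (assumed, chosen to agree with Table 1).
test_metrics = classification_metrics(tp=10, tn=9, fp=0, fn=1)
print({name: round(value, 3) for name, value in test_metrics.items()})
```

With these assumed counts, the rounded values agree with the test-set row of Table 1 (SE 0.909, SP 1, ACC 0.950, MCC 0.905).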

2.2. Model Stability Validation and Results Assessment

Cross-validation is a statistical method for evaluating the stability of a model. It is more stable and comprehensive than a single split into a training set and a test set. In this work, the five-fold cross-validation result is 0.909, which demonstrates the good stability and reliability of the model and indicates that it can reasonably be used for predicting the cytotoxicity of MNPs.

The ACC of the test set is 0.950, indicating that the classifier has good classification and predictive ability. In addition, the small difference between the ACC of the training set and that of the test set (0.966 and 0.950) suggests that the model is effective and not overfitted. Furthermore, the sensitivity and specificity values are greater than 0.9 for the entire data set. Fjodorova et al. [28] recommended that a supervised model should have high sensitivity and specificity. Sensitivity is a very important parameter in a nano-SAR model: a low sensitivity value indicates that the model has a low ability to identify toxic compounds. Specificity is another important indicator: a high specificity value means that the model rarely misclassifies non-toxic compounds as toxic, that is, it produces few false positives [29].

The above results show that the model provides high classification accuracy in both internal and external validation and can be reliably employed for predicting the cytotoxicity of MNPs. Moreover, this work indicates that it is possible to predict the cytotoxicity of MNPs through the nano-SAR method using norm index descriptors. Once a reliable model is established, the cytotoxicity of MNPs can be predicted quickly from their structural parameters.

2.3. Applicability Domain of the Proposed Model

Any developed nano-SAR model should have a clearly defined applicability domain (AD), because only predictions for materials that fall within the AD can be considered reliable. In this study, for each category, all test set samples are within the applicability domain, so the predictions of the model for them are considered reliable.

2.4. Comparisons with Other Models in the Literature

The proposed model for predicting the cellular uptake of MNPs in the PaCa2 cell line is based on the same data set as that reported in the literature. The present model was compared with other reported models for the cellular uptake of MNPs (Table 2). Judged by the external predictability metrics, which indicate the prediction performance of a model, the present model matched or exceeded the previous models proposed by Singh et al. [30] on every test-set metric. In particular, the models of Singh et al. were established using eleven descriptors, whereas only five descriptors were employed in our work. From a statistical perspective, the more input descriptors a model employs, the better its statistical parameters tend to be. Nevertheless, the basic strategy of nano-SAR analysis is to find the optimal relationship between molecular structure and the desired property using as few descriptors as possible. Nano-SAR models with fewer descriptors can be considered more robust and simpler to use.

3. Materials and Methods

3.1. Data Set

The data set of the cellular uptake of 109 nanoparticles was taken from a published article [19] and is presented in Table S1. All nanoparticles had the same metal core with different surface-modifying organic molecules. The nanoparticles were made magnetofluorescent by adding fluorescein isothiocyanate (FITC) molecules to their surfaces to enable measurement of cellular uptake. Compared with other cell lines, cellular uptake in PaCa2 was found to be more diverse and highly dependent on surface modification. Thus, in our work, the uptake data of MNPs in PaCa2 were employed for model development. Cellular uptake was expressed as the logarithm of the MNP concentration (pM) in each cell and ranged from 2.23 to 4.44.

For binary classification, the criterion of Chau and Yap [22] was adopted. According to this criterion, MNPs with cellular uptake of more than 5000 NPs per cell were regarded as having good cellular uptake (positive class), whereas MNPs with cellular uptake of fewer than 5000 NPs per cell were regarded as having poor cellular uptake (negative class). As a result, 59 MNPs were in the positive class, with end-point values set to 1, and the remaining 50 MNPs were in the negative class, with end-point values set to 0.
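The labelling rule can be sketched as below. This is only an illustration under two assumptions that the text does not state explicitly: that the reported uptake values are base-10 logarithms, and that they are directly comparable with the threshold of 5000 NPs per cell. The function name `uptake_class` is hypothetical, and the treatment of a value exactly at the threshold is arbitrary.

```python
import math

THRESHOLD_NPS_PER_CELL = 5000  # criterion of Chau and Yap cited in the text

def uptake_class(log10_uptake: float) -> int:
    """Return 1 (positive class) above the threshold, otherwise 0 (negative class)."""
    # Assumption: the reported uptake is a base-10 logarithm of NPs per cell.
    return 1 if log10_uptake > math.log10(THRESHOLD_NPS_PER_CELL) else 0

# The two extremes of the reported uptake range fall in opposite classes.
labels = [uptake_class(value) for value in (2.23, 4.44)]
print(labels)
```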

3.2. Dataset Splitting

Dividing the data set is an indispensable step in nano-SAR development. Before modeling, the 109 nanoparticles in the data set were randomly divided into a training set of 89 samples and a test set of 20 samples. The training set was used to develop the nano-SAR model, whereas the test set was used to evaluate its performance.
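A random 89/20 split of this kind might look like the following minimal sketch. The seed, the function name `split_dataset`, and the use of plain index lists are assumptions made for illustration; the text does not say how the random split was generated.

```python
import random

def split_dataset(n_samples: int, n_test: int, seed: int = 0):
    """Randomly split sample indices into (train, test) index lists."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # fixed seed keeps the split reproducible
    return sorted(indices[n_test:]), sorted(indices[:n_test])

train_idx, test_idx = split_dataset(n_samples=109, n_test=20)
print(len(train_idx), len(test_idx))
```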

3.3. Molecular Descriptors Calculation

Here, we adopted a novel type of norm index descriptor reported by Wang et al. [31] to predict the cellular uptake of MNPs. The calculation proceeded as follows. First, the 3D structure of each MNP was built with ChemDraw (version 14) and optimized with the MM2 module (Allinger's class 1 molecular mechanics program). Second, for further optimization, the Gaussian program (GaussView 6.0.16) was employed to carry out density functional theory (DFT) calculations with the M06-2X functional and the 6-311+G(d,p) basis set. Then, distance matrices consisting of the step matrix DM1 and the Euclidean distance matrix DM2 were derived from the optimized structures. They are defined as follows:

DM_{1} = [a_{ij}], \quad a_{ij} = n \ \text{if the path between atoms } i \text{ and } j \text{ is } n

(5)

DM_{2} = [b_{ij}], \quad b_{ij} = \begin{cases} r_{ij} & \text{if } i \neq j \\ 0 & \text{if } i = j \end{cases}

(6)

where r_{ij} denotes the Euclidean spatial distance between atoms i and j. Moreover, to introduce the contribution of individual atoms and enhance the performance of the approach, a property matrix PM integrating several atomic properties was proposed and defined as:

PM = [SN \quad EN \quad E_{i} \quad \tanh(ac)]

(7)

where SN, EN, E_{i}, and ac are the electron shell number, electronegativity, ionization energy, and atomic charge, respectively.
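The two distance matrices in Equations (5) and (6) can be sketched for a toy structure as follows. The sketch assumes that the "path" in DM1 is the shortest path counted in bonds and that the structure is connected; the three-atom chain, its coordinates, and the function names are invented for illustration.

```python
import math
from collections import deque

def step_matrix(n_atoms, bonds):
    """DM1: number of bonds on the shortest path between each pair of atoms."""
    neighbours = {i: [] for i in range(n_atoms)}
    for a, b in bonds:
        neighbours[a].append(b)
        neighbours[b].append(a)
    dm1 = [[0] * n_atoms for _ in range(n_atoms)]
    for start in range(n_atoms):
        # Breadth-first search gives the shortest path length in bonds.
        seen = {start: 0}
        queue = deque([start])
        while queue:
            atom = queue.popleft()
            for nxt in neighbours[atom]:
                if nxt not in seen:
                    seen[nxt] = seen[atom] + 1
                    queue.append(nxt)
        for atom, steps in seen.items():
            dm1[start][atom] = steps
    return dm1

def euclidean_matrix(coords):
    """DM2: Euclidean distance r_ij off the diagonal, 0 on the diagonal."""
    return [[math.dist(p, q) for q in coords] for p in coords]

# Hypothetical three-atom chain 0-1-2 with made-up coordinates.
bonds = [(0, 1), (1, 2)]
coords = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0)]
dm1 = step_matrix(3, bonds)
dm2 = euclidean_matrix(coords)
print(dm1)
```

In practice the bond list and coordinates would come from the optimized structures rather than being typed in by hand.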

Next, by combining the distance matrices DM with the property matrix PM, three extended matrices were constructed, as defined in Equations (8)–(10):

EM_{1,m,n} = [DM_{m} \ PM(:, n)]

(8)

EM_{2,m,n} = [PM(:, n) \times PM(:, n)^{T} + DM_{m}]

(9)

EM_{3,m,n} = [(PM(:, n) \times PM(:, n)^{T}) \times DM_{m}]

(10)
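As a small illustration of Equation (9), the sketch below builds EM2 for one property column and one distance matrix. It reads PM(:, n) × PM(:, n)^T as the outer product of the n-th property column with itself, which is an interpretation rather than something the text spells out; the property values and the function name `extended_matrix_2` are hypothetical.

```python
def extended_matrix_2(property_column, dm):
    """EM2 = PM(:, n) x PM(:, n)^T + DM, read as an outer product plus DM."""
    n = len(property_column)
    return [
        [property_column[i] * property_column[j] + dm[i][j] for j in range(n)]
        for i in range(n)
    ]

# Hypothetical values: one property column for three atoms and a step matrix.
property_values = [2.0, 3.0, 1.0]
dm1 = [[0, 1, 2], [1, 0, 1], [2, 1, 0]]
em2 = extended_matrix_2(property_values, dm1)
print(em2)
```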

With these matrices, we employed the norm indexes norm (EM, 1), norm (EM, 2), and norm (EM, 3), which refer to the largest column sum, the largest singular value, and the Frobenius norm of the matrix EM, respectively. The three norm indexes are defined in Equations (11)–(13):

norm(EM, 1) = \max_{j} \left[ \sum_{i=1}^{p} |EM_{ij}| \right], \quad j = 1, \dots, q

(11)

norm(EM, 2) = \sqrt{\max(\lambda_{i}(EM^{H} \times EM))}

(12)

norm(EM, 3) = \sqrt{\sum_{j=1}^{q} \sum_{i=1}^{p} EM_{ij}^{2}}

(13)

where p and q are the numbers of rows and columns of the matrix EM, respectively; λ_{i} denotes an eigenvalue of the matrix EM^{H} × EM; and EM^{H} denotes the Hermitian (conjugate) transpose of the matrix EM.
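The three norm indexes in Equations (11)–(13) can be sketched in plain Python for a small real matrix. The largest singular value is obtained here by power iteration on EM^T EM, which is a substitute technique chosen to keep the example dependency-free and is not taken from the text; the example matrix and function names are hypothetical.

```python
import math

def norm_1(matrix):
    """Largest absolute column sum, Equation (11)."""
    return max(sum(abs(row[j]) for row in matrix) for j in range(len(matrix[0])))

def norm_fro(matrix):
    """Frobenius norm, Equation (13)."""
    return math.sqrt(sum(value * value for row in matrix for value in row))

def norm_2(matrix, iterations=500):
    """Largest singular value, Equation (12), by power iteration on M^T M."""
    q = len(matrix[0])
    # Gram matrix M^T M (equal to M^H M for the real matrices considered here).
    gram = [[sum(row[a] * row[b] for row in matrix) for b in range(q)] for a in range(q)]
    vec = [1.0] * q  # assumed not orthogonal to the dominant eigenvector
    eigenvalue = 0.0
    for _ in range(iterations):
        nxt = [sum(gram[a][b] * vec[b] for b in range(q)) for a in range(q)]
        eigenvalue = math.sqrt(sum(v * v for v in nxt))
        if eigenvalue == 0.0:
            return 0.0
        vec = [v / eigenvalue for v in nxt]
    return math.sqrt(eigenvalue)

em = [[3.0, 0.0], [0.0, 4.0]]  # hypothetical matrix
print(norm_1(em), round(norm_2(em), 6), norm_fro(em))
```

A numerical library routine would normally replace the hand-written power iteration; the point here is only to show which quantity each index measures.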

3.4. Descriptor Selection and Modeling

High-dimensional data not only increase computational complexity but also lower the efficiency of predictive classification models [32]. To establish an effective and reliable model, it is therefore essential to select the most relevant features. In this study, we reduced the dimension of the feature space using the random forest (RF) algorithm combined with recursive feature elimination (RFE) [33], which can eliminate data redundancy and generate more compact feature subsets. Figure 2 illustrates the RF-RFE procedure. First, an RF model was trained on the training data, and the importance of each feature was obtained from its contribution to the classification. Then, the features were ranked by importance from high to low. Finally, the least important feature was eliminated, the RF model was retrained with the updated feature set, and the classification performance of the current feature set was recorded. This process was repeated until the feature set was empty. As a result, a list of performance values, one for each feature subset, was generated. All of these steps were carried out in PyCharm (PyCharm Community Edition 2019.3.4).
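The elimination loop described above can be sketched as follows. To keep the example self-contained, the random forest is replaced by a placeholder callable with invented importances and scores, so this shows only the control flow of recursive feature elimination, not the authors' implementation; all names (`recursive_feature_elimination`, `toy_fit_and_score`, `d1` to `d4`) are hypothetical.

```python
def recursive_feature_elimination(features, fit_and_score):
    """Drop the least important feature repeatedly and record each subset's score.

    fit_and_score(subset) must return (importances, score), where importances
    maps each feature name in the subset to a number.
    """
    remaining = list(features)
    history = []
    while remaining:
        importances, score = fit_and_score(remaining)
        history.append((list(remaining), score))
        least_important = min(remaining, key=lambda name: importances[name])
        remaining.remove(least_important)
    return history

# Placeholder model: fixed importances and a score that penalises subset size.
# A real run would train a random forest here; these numbers are invented.
TOY_IMPORTANCE = {"d1": 0.50, "d2": 0.30, "d3": 0.15, "d4": 0.05}

def toy_fit_and_score(subset):
    importances = {name: TOY_IMPORTANCE[name] for name in subset}
    score = sum(importances.values()) - 0.1 * len(subset)
    return importances, score

history = recursive_feature_elimination(TOY_IMPORTANCE, toy_fit_and_score)
best_subset, best_score = max(history, key=lambda item: item[1])
print(best_subset)
```

The subset with the best recorded score is then taken as the selected descriptor set; with a real classifier, `fit_and_score` would return feature importances from the fitted model and a validation metric such as accuracy.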

3.5. Model Validation

Model validation is necessary to ensure the reliability of a developed nano-SAR model. According to the OECD guidance [21], only validated models can be considered reliable. Here, we adopted several validation methods to assess the fitness, robustness, and predictability of the developed nano-SAR model.

Firstly, for binary classification, the most commonly used statistical parameters such as Sensitivity (SE), Specificity (SP), Accuracy (ACC), and Matthews correlation coefficient (MCC), were used to evaluate the fitness of the nano-SAR model [34].

Secondly, the robustness of the model was assessed by k-fold cross-validation (k-CV), the most common internal validation method, in which k usually takes a value of five or ten [35,36]. The advantage of this method is that it performs reliable and fair testing on the data set [37]. In this way, both the robustness and the internal predictability of the model can be verified.
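The fold construction behind k-fold cross-validation can be sketched as below for k = 5. Applying it to the 89 training samples is an assumption, since the text does not state which samples were cross-validated; the seed and the function name `k_fold_indices` are likewise hypothetical.

```python
import random

def k_fold_indices(n_samples: int, k: int = 5, seed: int = 0):
    """Yield (train_idx, valid_idx) pairs for k-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Fold f takes every k-th shuffled index, so fold sizes differ by at most one.
    folds = [indices[f::k] for f in range(k)]
    for f in range(k):
        valid_idx = folds[f]
        train_idx = [i for g in range(k) if g != f for i in folds[g]]
        yield train_idx, valid_idx

splits = list(k_fold_indices(n_samples=89, k=5))
fold_sizes = [len(valid) for _, valid in splits]
print(fold_sizes)
```

Each sample is held out exactly once; the model is refitted on the remaining folds each time, and the k validation scores are averaged.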

In addition, a nano-SAR model is usually validated in two steps: internal validation and external validation. External validation is an important and widely used way to evaluate both the external predictability and the generalizability of a nano-SAR model for novel compounds. Here, external validation was carried out by splitting the available data set into a training set and an external test set. The training set was used for selecting descriptors and developing the model, whereas the test set was used for external validation.

3.6. Applicability Domain (AD)

According to the third OECD principle, the applicability domain (AD) of a (Q)SAR model must be defined before the model can be accepted. The AD describes the physicochemical space on which the developed model was trained and within which it can therefore be applied to make predictions. Only when the structure of a new compound is “similar” to those in the training set can a valid prediction be obtained [38]. That is, for each category (toxic and non-toxic) in this study, if the leverage value of a test set sample is within the range of the training set, the prediction is considered valid. Otherwise, the sample is considered to be beyond the applicability domain of the model, and the prediction is invalid. The leverage value h_i is defined as:

h_{i} = x_{i}^{T} (X^{T} X)^{-1} x_{i}

where x_{i} denotes the descriptor vector of the ith MNP (the corresponding row of X, written as a column vector) and X denotes the m × n matrix of descriptors for all samples.
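A leverage-based domain check of this kind can be sketched in plain Python as follows. The sketch assumes X^T X is invertible and, for brevity, ignores the per-category treatment mentioned above; the training matrix, the query vectors, and the function names are invented for illustration.

```python
def solve(a, b):
    """Solve a . z = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    aug = [list(a[i]) + [b[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(n):
            if row != col:
                factor = aug[row][col] / aug[col][col]
                aug[row] = [v - factor * p for v, p in zip(aug[row], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]

def leverage(x, train):
    """h = x^T (X^T X)^{-1} x for descriptor vector x and training matrix X."""
    n = len(x)
    xtx = [[sum(row[a] * row[b] for row in train) for b in range(n)] for a in range(n)]
    z = solve(xtx, x)  # z = (X^T X)^{-1} x
    return sum(xi * zi for xi, zi in zip(x, z))

def in_domain(x, train):
    """True if the leverage of x lies within the range of training leverages."""
    h_train = [leverage(row, train) for row in train]
    return min(h_train) <= leverage(x, train) <= max(h_train)

# Hypothetical two-descriptor training matrix (rows are samples).
X_train = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 1.0]]
print(in_domain([1.5, 1.0], X_train), in_domain([3.0, 0.0], X_train))
```

In this toy example the first query vector falls inside the range of training leverages and the second falls outside it.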

4. Conclusions

In this work, a new nano-SAR model based on norm index descriptors was developed to predict the cytotoxicity of 109 functionalized magnetic fluorescent MNPs to the PaCa2 cell line. The results indicate that the developed model provides satisfactory predictions. The robustness and predictivity of the model were rigorously validated with several internal and external validation strategies. The main findings of this study include:

The employed norm index descriptors combining the atomic distance matrices with the property matrix could accurately and effectively characterize the structural features of MNPs and lead to a nano-SAR model with satisfactory model performance.
The Random forest algorithm (RF) combined with the Recursive Feature Elimination (RFE) method could be successfully employed to explore and describe the internal relationships between the nanostructure and cytotoxicity of MNPs.
Since a considerable number of MNPs were involved in the development of the model, and a rigorous model validating process and extensive model comparisons were performed, the proposed model in this study could be reasonably considered as reliable in predicting the cytotoxicity of novel MNPs or other MNPs for which experimental data are unknown.

Supplementary Materials

The following are available online, Table S1: List of chemicals conjugated to nanoparticles and their corresponding cellular uptake.

Author Contributions

Data curation, H.S. and B.Y.; Formal analysis, H.S., J.C., X.T. and F.Y.; Software, H.S.; Methodology, H.S., F.Y. and J.C.; Writing-original draft, H.S.; Funding acquisition, Y.P.; Validation, X.T., Y.P. and B.Y.; Writing—review & editing, Y.P. and J.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by Natural Science Fund for Distinguished Young Scholars of Jiangsu Province of China (BK20190036), and Natural Science Fund of Jiangsu Higher Education Institutions of China (No. 18KJA620002).

Data Availability Statement

The data presented in this study are available in Supplementary Materials.

Conflicts of Interest

The authors declare no conflict of interest.

Sample Availability

Samples of the compounds are available from the authors.

References

1. Khojasteh, H.; Safajou, H.; Mortazavi-Derazkola, S.; Salavati-Niasari, M.; Heydaryan, K.; Yazdani, M. Economic procedure for facile and eco-friendly reduction of graphene oxide by plant extracts; a comparison and property investigation. J. Clean. Prod. 2019, 229, 1139–1147.
2. Mortazavi-Derazkola, S.; Salavati-Niasari, M.; Khojasteh, H.; Amiri, O.; Ghoreishi, S.M. Green synthesis of magnetic Fe3O4/SiO2/HAp nanocomposite for atenolol delivery and in vivo toxicity study. J. Clean. Prod. 2017, 168, 39–50.
3. Zinatloo-Ajabshir, S.; Mortazavi-Derazkola, S.; Salavati-Niasari, M. Nd2O3 nanostructures: Simple synthesis, characterization and its photocatalytic degradation of methylene blue. J. Mol. Liq. 2017, 234, 430–436.
4. Wu, J.J.X.; Li, S.R.; Wei, H. Multifunctional nanozymes: Enzyme-like catalytic activity combined with magnetism and surface plasmon resonance. Nanoscale Horiz. 2018, 3, 367–382.
5. Bajpai, V.K.; Shukla, S.; Kang, S.M.; Hwang, S.K.; Song, X.; Huh, Y.S.; Han, Y.K. Developments of cyanobacteria for nano-marine drugs: Relevance of nanoformulations in cancer therapies. Mar. Drugs 2018, 16, 179.
6. El-Sayed, I.H.; Huang, X.H.; El-Sayed, M.A. Selective laser photo-thermal therapy of epithelial carcinoma using anti-EGFR antibody conjugated gold nanoparticles. Cancer Lett. 2006, 239, 129–135.
7. Dutta, T.; Kim, K.H.; Deep, A.; Szulejko, J.E.; Vellingiri, K.; Kumar, S.; Kwon, E.E.; Yun, S.T. Recovery of nanomaterials from battery and electronic wastes: A new paradigm of environmental waste management. Renew. Sust. Energ. Rev. 2018, 82, 3694–3704.
8. Auffan, M.; Rose, J.; Bottero, J.Y.; Lowry, G.V.; Jolivet, J.P.; Wiesner, M.R. Towards a definition of inorganic nanoparticles from an environmental, health and safety perspective. Nat. Nanotechnol. 2009, 4, 634–641.
9. Dong, J.Y.; Zink, J.I. Taking the temperature of the interiors of magnetically heated nanoparticles. ACS Nano 2014, 8, 5199–5207.
10. Jiang, S.; Gao, Q.; Chen, H.C.; Roco, M.C. The roles of sharing, transfer, and public funding in nanotechnology knowledge-diffusion networks. J. Assoc. Inf. Sci. Technol. 2015, 66, 1017–1029.
11. Pan, Y.; Li, T.; Cheng, J.; Telesca, D.; Zink, J.I.; Jiang, J.C. Nano-QSAR modeling for predicting the cytotoxicity of metal oxide nanoparticles using novel descriptors. RSC Adv. 2016, 6, 25766–25775.
12. Hansch, C.; Maloney, P.P.; Fujita, T.; Muir, R.M. Correlation of Biological Activity of Phenoxyacetic Acids with Hammett Substituent Constants and Partition Coefficients. Nature 1962, 194, 178–180.
13. Winkler, D.A.; Mombelli, E.; Pietroiusti, A.; Tran, L.; Worth, A.; Fadeel, B.; McCall, M.J. Applying quantitative structure-activity relationship approaches to nanotoxicology: Current status and future potential. Toxicology 2013, 313, 15–23.
14. Park, S.H.; Sung, J.H.; Kim, E.J.; Chung, N. Berberine induces apoptosis via ROS generation in PANC-1 and MIA-PaCa2 pancreatic cell lines. Braz. J. Med. Biol. Res. 2015, 48, 111–119.
15. Doi, T.; Ishikawa, T.; Okayama, T.; Oka, K.; Mizushima, K.; Yasuda, T.; Sakamoto, N.; Katada, K.; Kamada, K.; Uchiyama, K.; et al. The JAK/STAT pathway is involved in the upregulation of PD-L1 expression in pancreatic cancer cell lines. Oncol. Rep. 2017, 37, 1545–1554.
16. Hao, C.; Zhang, X.; Zhang, H.; Shang, H.; Bao, J.; Wang, H.; Li, Z. Sugiol (12-hydroxyabieta-8,11,13-trien-7-one) targets human pancreatic carcinoma cells (Mia-PaCa2) by inducing apoptosis, G2/M cell cycle arrest, ROS production and inhibition of cancer cell migration. J. Buon 2018, 23, 205–210.
17. Brulle, L.; Vandamme, M.; Ries, D.; Martel, E.; Robert, E.; Lerondel, S.; Trichet, V.; Richard, S.; Pouvesle, J.-M.; Le Pape, A. Effects of a non thermal plasma treatment alone or in combination with gemcitabine in a MIA PaCa2-luc orthotopic pancreatic carcinoma model. PLoS ONE 2012, 7, e52653.
18. Qi, R.; Pan, Y.; Cao, J.; Jia, Z.; Jiang, J. The cytotoxicity of nanomaterials: Modeling multiple human cells uptake of functionalized magneto-fluorescent nanoparticles via nano-QSAR. Chemosphere 2020, 249.
19. Weissleder, R.; Kelly, K.; Sun, E.Y.; Shtatland, T.; Josephson, L. Cell-specific targeting of nanoparticles by multivalent attachment of small molecules. Nat. Biotechnol. 2005, 23, 1418–1423.
20. Fourches, D.; Pu, D.Q.Y.; Tassa, C.; Weissleder, R.; Shaw, S.Y.; Mumper, R.J.; Tropsha, A. Quantitative nanostructure-activity relationship modeling. ACS Nano 2010, 4, 5703–5712.
21. OECD. Guidance Document on the Validation of (Quantitative) Structure-Activity Relationship [(Q)SAR] Models. 2014. Available online: http://www.oecd.org/ (accessed on 3 September 2014).
22. Chau, Y.T.; Yap, C.W. Quantitative nanostructure-activity relationship modelling of nanoparticles. RSC Adv. 2012, 2, 8489–8496.
23. Kar, S.; Gajewicz, A.; Puzyn, T.; Roy, K. Nano-quantitative structure-activity relationship modeling using easily computable and interpretable descriptors for uptake of magnetofluorescent engineered nanoparticles in pancreatic cancer cells. Toxicol. Vitr. 2014, 28, 600–606.
24. Winkler, D.A.; Burden, F.R.; Yan, B.; Weissleder, R.; Tassa, C.; Shaw, S.; Epa, V.C. Modelling and predicting the biological effects of nanomaterials. SAR QSAR Environ. Res. 2014, 25, 161–172.
25. Ojha, P.K.; Kar, S.; Roy, K.; Leszczynski, J. Toward comprehension of multiple human cells uptake of engineered nano metal oxides: Quantitative inter cell line uptake specificity (QICLUS) modeling. Nanotoxicology 2019, 13, 14–34.
26. Toropov, A.A.; Toropova, A.P.; Puzyn, T.; Benfenati, E.; Gini, G.; Leszczynska, D.; Leszczynski, J. QSAR as a random event: Modeling of nanoparticles uptake in PaCa2 cancer cells. Chemosphere 2013, 92, 31–37.
27. Singh, K.P.; Basant, N.; Gupta, S. Support vector machines in water quality management. Anal. Chim. Acta 2011, 703, 152–162.
28. Fjodorova, N.; Vracko, M.; Novic, M.; Roncaglioni, A.; Benfenati, E. New public QSAR model for carcinogenicity. Chem. Cent. J. 2010, 4.
29. Cheng, F.; Shen, J.; Yu, Y.; Li, W.; Liu, G.; Lee, P.W.; Tang, Y. In silico prediction of Tetrahymena pyriformis toxicity for diverse industrial chemicals with substructure pattern recognition and machine learning methods. Chemosphere 2011, 82, 1636–1643.
30. Singh, K.P.; Gupta, S. Nano-QSAR modeling for predicting biological activity of diverse nanomaterials. RSC Adv. 2014, 4, 13215–13230.
31. Wang, Y.L.; Yan, F.Y.; Jia, Q.Z.; Wang, Q. Assessment for multi-endpoint values of carbon nanotubes: Quantitative nanostructure-property relationship modeling with norm indexes. J. Mol. Liq. 2017, 248, 399–405.
32. Wu, Y.; Zhang, A. Feature selection for classifying high-dimensional numerical data. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Washington, DC, USA, 27 June–2 July 2004; pp. II251–II258.
33. Chen, Q.; Meng, Z.P.; Liu, X.Y.; Jin, Q.G.; Su, R. Decision variants for the automatic determination of optimal feature subset in RF-RFE. Genes 2018, 9, 301.
34. Nasiri, A.; Omid, M.; Taheri-Garavand, A. An automatic sorting system for unwashed eggs using deep learning. J. Food Eng. 2020, 283, 9.
35. Singh, K.P.; Singh, A.K.; Gupta, S.; Rai, P. Modeling and optimization of reductive degradation of chloramphenicol in aqueous solution by zero-valent bimetallic nanoparticles. Environ. Sci. Pollut. Res. 2012, 19, 2063–2078.
36. Benigni, R.; Netzeva, T.I.; Benfenati, E.; Franke, R.; Helma, C.; Hulzebos, E.; Marchant, C.; Richard, A.M.; Woo, Y.; Yang, C. The expanding role of predictive toxicology: An update on the (Q)SAR models for mutagens and carcinogens. J. Environ. Sci. Health Part C 2007, 25, 53–97.
37. Singh, G.; Panda, R.K. Daily sediment yield modeling with artificial neural network using 10-fold cross validation method: A small agricultural watershed. Int. J. Earth Sci. Eng. 2011, 4, 443–450.
38. Kovarich, S.; Papa, E.; Gramatica, P. QSAR classification models for the prediction of endocrine disrupting activity of brominated flame retardants. J. Hazard. Mater. 2011, 190, 106–112.

Figure 1. Confusion matrix.

Figure 2. The main procedure of the recursive feature elimination (RFE) method.

Table 1. Performance metrics of the full model.

Sub-Set	n	SE	SP	ACC	MCC
Training set	89	0.958	0.976	0.966	0.933
Test set	20	0.909	1	0.950	0.905
Complete	109	0.949	0.980	0.972	0.927

Table 2. Comparison of statistical parameters between present model and past models.

Works	Method	Sub-Set	SE	SP	ACC	MCC
Singh et al. [30]	DTB	Training set	1	0.974	0.988	0.980
Singh et al. [30]	DTB	Test set	0.882	1	0.926	0.860
Singh et al. [30]	DTF	Training set	1	1	1	1
Singh et al. [30]	DTF	Test set	0.875	0.909	0.889	0.780
This work	RF	Training set	0.958	0.976	0.966	0.933
This work	RF	Test set	0.909	1	0.950	0.905

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
