Abstract
It is perhaps noteworthy in the recent literature of biomedical engineering that there have been a plethora of studies concerned with the activation of human paraoxonase1 (PON1) to reduce the high concentrations of homocysteine in the human serum. Some of diethyl p-nitrophenyl phosphate derivatives were studied at the B3LYP/6-311g(d,p), B3LYP/6-311++g(2d,2p) and B3LYP/6-311++g(3df,3dp) basis set level through the relationship between their molecular and electronic structure, as well as via their relations with paraoxonase1 activity. In an attempt to shed light on which descriptors may contribute to the activation of PON1, this paper investigates the nature of the relationship between the activity of the enzymes and the descriptors of 15 molecules, namely HOMO, LUMO, Energy gap, hardness, softness, Electronegativity, chemical potential, electrophilicity index(ω), Nucleofugality, Electrofugality. Two data analysis methods, CART Decision Tree and Artificial Neural Networks (ANNs) were used for the linear part and non-linear part of the data set, respectively. The results of the study show correlations between the activity of the enzyme and the studied descriptors. Moreover, the results of the study reveal that by using the CART method not only do we know the significant descriptors but also we have their critical values and orders.
Key words:
biomedical engineering, Paraoxonase, PON1 activation, CART Model, ANNs, derivatives of diethyl p-nitrophenyl phosphate
Introduction
Paraxonase is a group of enzymes that catalyzes the organophosphates and lactones. This group contains three genotypic forms coded as PON set, which are located in the long arm of chromosome 7 of human beings [1-3]. The three types of paraxonase are: PON1 that is synthesized in the liver and functions as an antioxidant, PON2 that is expressed as intracellular protein that can protect cells against oxidative damage, and PON3 that is similar to PON1 in activity but differs from it in substrate specificity [4,5]. Both PON1 and PON3 have clinical significance in that they are involved in lowering the risk of developing atherosclerosis and coronary artery disease [2,6]. PON1 or serum paraxonase, known also as serum aryldialkylphosphatase 1 and homocysteine thiolactonase, is an enzyme that is found in many mammalian species and in humans and is encoded by the PON1gene. Although PON1 hydrolyzes aromatic carboxylic acid esters, toxic organophosphate compounds, and lactones, its natural substrates and physiological function(s) are far from settled [3-5]. Human PON1 is a glycoprotein composed of 354 amino acids, which associates with high-density lipoprotein (
HDL
, a cholesterol carrier in the circulation) [1,3,4,7,8]. Serum PON1 is secreted mainly by the liver, local synthesis occurs in several tissues though. The structure contains 2 calcium ions which are essential for catalytic activity and enzyme stability [3-5,8,9].
PON1 is one of three members of mammalian family enzymes that contain PON2 and PON3. These enzymes were originally discovered through their involvement in the hydrolysis of organophosphate. Members of paraoxonase exhibit a wide range of physiologically important hydrolytic activities, including drug metabolism and detoxification of nerve agents. Due to the similarity between PON1 and PON3, both of these enzymes come to be associated with high-density lipoprotein (HDL) [5].
The "natural" substrates for PON1 are lactones [2]. However, PON1 has evolved to be a highly promiscuous enzyme that is capable of hydrolyzing a wide variety of substrates, such as lactones (including a number of important pharmaceutical agents viz., statins), glucuronide drugs, thiolactones, arylesters, cyclic carbonates, organophosphorus pesticides, let alone nerve gases such as sarin, soman and VX, estrogen esters and lipid-peroxides (oxidized lipids). PON1 was first discovered through its ability to hydrolyze and therefore detoxify organophosphorus compounds which are widely used as pesticides and nerve gases. Despite decades of research, it is only now becoming clear that PON1 protects humans from the acute and chronic harmful effects of these compounds [10,11].
In a similar vein, Diethyl Phosphate belongs to a group of compounds called organophosphates. Organophosphate (OP) or phosphate ester is the general name for esters of phosphoric acid. Many of the most important biochemicals are organophosphates, including DNA and RNA, as well as many of the cofactors essential for life. Organophosphates are the basis of many insecticides, herbicides, and nerve agents. These compounds are a diverse group of chemicals used in both domestic and industrial settings. In this study, Gaussian 03 was used to carry out geometry optimization for the 15diethyl p-nitrophenyl phosphate compounds via the density functional theory (DFT). The aim of this study was to theoretically shed light on how the diethyl p-nitrophenyl phosphate derivatives would activate PON1.
Material and method
The 15 descriptors mentioned above have been carried out at the B3LYP level of theory using Gaussian-03 series of program package [12,13]. The calculations were based on 6-31G (d,p) basis set. This method has been widely implemented to study the relationship between corrosion inhibition efficiency of the molecules and their electronic properties [13] In order to set up correlation between experimental data and structural and electronic characteristics of the investigated activators, the geometry of the molecules were optimized by the density functional theory (DFT) [13], with the Becke’s three parameter exchange functional(1) along with the Lee– Yang–Parr correlation functional theory (B3LYP) [14].
Results and discussion
Tables 1 and 2 delineates the calculated parameters of the diethyl p-nitrophenyl phosphate derivative molecules using B3LYP/6-311G(d,p), B3LYP/6-311++G(2d,2p) methods.
As HOMO is often associated with the electron donating ability of a molecule, high value of HOMO is likely to indicate the tendency of the molecule to donate electrons to appropriate acceptor molecules with lower energy MO [15-17]. HOMO and LUMO orbitals of the 15 molecules were obtained from the quantum chemical calculation by the DFT using B3LYP/6-311G(d,p), B3LYP/6-311++G(2d,2p) bases sets as shown in Figure 1 below [18,19].
Figure 1: Molecular structure, HOMO, LUMO & ESP of the 15 molecules.
Tables 1&2: The calculated parameters by B3LYP/6-311G(d,p) : HOMO, LUMO, Energy gap, Hardness, Softness, Electronegativity, Chemical potential, Electrophilicity index(ω), Nucleofugality, & Electrofugality.
|
HOMO |
LUMO |
Energy gap |
hardness |
softness |
Electronegativity |
chemical potential |
electrophilicity index(ω) |
Nucleofugality |
Electrofugality |
1 |
-7.423 |
-5.476 |
1.948 |
0.974 |
0.513 |
12.899 |
-12.899 |
85.419 |
73.007 |
98.80504 |
2 |
-7.749 |
-3.007 |
4.742 |
2.371 |
0.211 |
10.756 |
-10.756 |
24.398 |
14.828 |
36.34051 |
3 |
-7.123 |
-2.144 |
4.979 |
2.49 |
0.201 |
9.267 |
-9.267 |
17.246 |
9.224 |
27.75778 |
4 |
-7.19 |
-5.058 |
2.133 |
1.066 |
0.469 |
12.248 |
-12.248 |
70.334 |
58.62 |
83.1154 |
5 |
-7.305 |
-5.66 |
1.645 |
0.823 |
0.608 |
12.965 |
-12.965 |
102.174 |
89.62 |
115.5502 |
6 |
-7.091 |
-5.092 |
1.999 |
0.999 |
0.5 |
12.183 |
-12.183 |
74.267 |
62.583 |
86.94992 |
7 |
-6.745 |
-1.453 |
5.292 |
2.646 |
0.189 |
8.199 |
-8.199 |
12.702 |
5.826 |
22.22404 |
8 |
-6.967 |
-2.495 |
4.472 |
2.236 |
0.224 |
9.462 |
-9.462 |
20.02 |
11.677 |
30.60001 |
9 |
-7.087 |
-2.024 |
5.064 |
2.532 |
0.197 |
9.111 |
-9.111 |
16.394 |
8.549 |
26.77058 |
10 |
-6.931 |
-2.245 |
4.686 |
2.343 |
0.213 |
9.175 |
-9.175 |
17.968 |
9.964 |
28.3147 |
11 |
-7.254 |
-2.617 |
4.636 |
2.318 |
0.216 |
9.871 |
-9.871 |
21.015 |
12.303 |
32.04467 |
12 |
-6.922 |
-2.133 |
4.789 |
2.394 |
0.209 |
9.055 |
-9.055 |
17.12 |
9.263 |
27.37184 |
13 |
-7.055 |
-2.106 |
4.949 |
2.474 |
0.202 |
9.161 |
-9.161 |
16.957 |
9.034 |
27.35466 |
14 |
-6.885 |
-1.208 |
5.677 |
2.838 |
0.176 |
8.094 |
-8.094 |
11.54 |
4.865 |
21.0528 |
15 |
-6.682 |
-0.849 |
5.833 |
2.917 |
0.171 |
7.532 |
-7.532 |
9.725 |
3.651 |
18.71468 |
Table 2: The other calculated parameters by B3LYP/6-311G(d,p): Hyperpolarisibility, Pz valance, Alpha, Delta (alpha) & the Kcat/KM (activity).
|
Hyperpolarisibility 10-30 (edu) |
Pz (elect) |
Alpha(au) |
Delta (alpha)
esu 10-24 |
Kcat/KM |
1 |
5.68 |
0.014 |
266.983 |
39.56693 |
130 |
2 |
8.30 |
-0.051 |
234.831 |
34.802 |
4040 |
3 |
0.73 |
-0.258 |
202.744 |
30.04666 |
122 |
4 |
7.89 |
-0.280 |
236.182 |
35.00222 |
8300 |
5 |
1.12 |
0.014 |
265.589 |
39.36024 |
5600 |
6 |
9.23 |
-0.665 |
235.636 |
34.92126 |
5800 |
7 |
2.49 |
-0.147 |
198.966 |
29.48676 |
48 |
8 |
7.92 |
-0.763 |
229.294 |
33.98132 |
400 |
9 |
3.88 |
-0.743 |
229.322 |
33.98552 |
220 |
10 |
8.21 |
-0.761 |
242.487 |
35.93657 |
285 |
11 |
3.66 |
-0.534 |
232.672 |
34.48199 |
58 |
12 |
7.39 |
-0.789 |
243.966 |
36.15576 |
130 |
13 |
2.35 |
-0.673 |
227.786 |
33.75784 |
88 |
14 |
1.70 |
-0.228 |
198.971 |
29.48755 |
0.57 |
15 |
0.44 |
-0.748 |
214.728 |
31.82269 |
0.403 |
Both HOMO and LUMO of the 15 molecules were centered on the phenyl group, which illustrated the reactive orbitals of these molecules. The highest EHOMO among the 15 molecules were -6.682 eV, -6.745 eV, -6.885 eV, -6.922 eV, -6.931 eV, -6.967 eV, -7.055 eV, -7.087 eV, -7.091 eV, -7.123 eV, -7.19 eV, -7.254 eV, -7.305 eV, -7.423 eV, and -7.749 eV. These results were observed in the molecules 15, 7, 14, 12, 10, 8, 13, 9, 6, 3, 4, 11, 5, 1, and 2 respectively. The ELUMO of these molecules were -0.849 eV, -1.208 eV, -1.453 eV, -2.024 eV, -2.106 eV, -2.133 eV, -2.144 eV, -2.245 eV, -2.495 eV, -2.617 eV, -3.007 eV, -5.058 eV, -5.092 eV, -5.476 eV, and -5.66 eV for the molecules 15, 14, 7, 9, 13, 12, 3, 10, 8, 11, 2, 4, 6, 1, and 5 respectively. The energy gap results were 1.645 eV, 1.948 eV, 1.999 eV, 2.133 eV, 4.472 eV, 4.636 eV, 4.686 eV, 4.742 eV, 4.789 eV, 4.949 eV, 4.979 eV, 5.064 eV, 5.292 eV, 5.677 eV, and 5.833 eV correspondingly. In line with the results of energy gap calculated by B3LYP/6-311G(d,p) method and the interpretation of Figures (2-4), we can understand that the molecules with smaller EHOMO-ELUMO energy gap lead to lower kinetic stability and higher chemical reactivity [15]. Thus, the molecules with the highest reactivity are 5, 1, 6, 4, 8, 11, 10, 2, 12, 13, 3, 9, 7, 14, and 15.
Figure 2: HOMO & LUMO at B3LYP/6-311G(d,p)
Figure 3: HOMO & LUMO at B3LYP/6-311++G(2d,2p)
Figure 4: HOMO & LUMO at B3LYP/6-311++G(3df,3pdp).
Figure 5: the longest and shortest bond length
The negative charge as shown in the above ESP charts is around O5 of the phosphate group, for the whole molecules under investigation. In addition, these negative charges can be observed around the O29, O30, O33 and O34 of the two nitrous oxide attached to the phenyl ring in molecule (1); O30 and O31in molecule (2); O29 and O30 in molecule (4); O31, O32, O33 and O34 in molecule (5); O30 and O31 in molecule (11), and O31 and O32 in molecule (12). Furthermore, the negative charge can be detected around the O32 of the aldehyde group that is attached to the phenyl ring, O31 of the acetophenone of the molecule (10), and N31 of the cyanophenyl of the molecule (13).
With respect to Nucleofugality (defined as the propensity of an atom or group of them to depart bearing the bonding electron pair in a heterolytic cleavage process [20,21], the highest nucleofugality of the 15 molecules was 89.62, 73.007, 62.583, 58.62, 14.828, 12.303, 11.677, 9.964, 9.263, 9.224, 9.034, 8.549, 5.826, 4.865 and 3.651 for the molecules 5, 1, 6, 4, 2, 11, 8, 10, 12, 3, 13, 9, 7, 14 and 15 respectively. According to these results, the molecules that can activate the PON1 enzyme of the aforementioned group of molecules. The Table 3 below shows the correlations between the calculated parameters and the activity of the enzyme. Because of there are some parameters show a weak relationship with the activity of the enzyme that measured in the reference (Khersonsky O, Tawfik DS. 2005).
Table 3: The C0rrelations of the different parameters with the activity
Paramet
Mole. |
HOMO |
LUMO |
Hardness |
Softness |
Energy gap |
Electronegativity |
Chemical_potential |
Electrophilicity |
Nucleofugality |
Electrofugality |
Hyperpolarizability |
Pz |
Alpha |
Delta |
|
1 |
-7.423 |
-5.476 |
0.974 |
0.513 |
1.948 |
12.899 |
-12.899 |
85.419 |
73.007 |
98.80504 |
5.68 |
0.014 |
266.983 |
39.567 |
130 |
2 |
-7.749 |
-3.007 |
2.371 |
0.211 |
4.742 |
10.756 |
-10.756 |
24.398 |
14.828 |
36.34051 |
8.3 |
-0.051 |
234.831 |
34.802 |
4040 |
3 |
-7.123 |
-2.144 |
2.49 |
0.201 |
4.979 |
9.267 |
-9.267 |
17.246 |
9.224 |
27.75778 |
0.73 |
-0.258 |
202.744 |
30.047 |
122 |
4 |
-7.19 |
-5.058 |
1.066 |
0.469 |
2.133 |
12.248 |
-12.248 |
70.334 |
58.62 |
83.11540 |
7.89 |
-0.28 |
236.182 |
35.002 |
8300 |
5 |
-7.305 |
-5.660 |
0.823 |
0.608 |
1.645 |
12.965 |
-12.965 |
102.174 |
89.62 |
115.55020 |
1.12 |
0.014 |
265.589 |
39.360 |
5600 |
6 |
-7.091 |
-5.092 |
0.999 |
0.5 |
1.999 |
12.183 |
-12.183 |
74.267 |
62.583 |
86.94992 |
9.23 |
-0.665 |
235.636 |
34.921 |
5800 |
7 |
-6.745 |
-1.453 |
2.646 |
0.189 |
5.292 |
8.199 |
-8.199 |
12.702 |
5.826 |
22.22404 |
2.49 |
-0.147 |
198.966 |
29.487 |
48 |
8 |
-6.967 |
-2.495 |
2.236 |
0.224 |
4.472 |
9.462 |
-9.462 |
20.02 |
11.677 |
30.60001 |
7.92 |
-0.763 |
229.294 |
33.981 |
400 |
9 |
-7.087 |
-2.024 |
2.532 |
0.197 |
5.064 |
9.111 |
-9.111 |
16.394 |
8.549 |
26.77058 |
3.88 |
-0.743 |
229.322 |
33.986 |
220 |
10 |
-6.931 |
-2.245 |
2.343 |
0.213 |
4.686 |
9.175 |
-9.175 |
17.968 |
9.964 |
28.31470 |
8.21 |
-0.761 |
242.487 |
35.937 |
285 |
11 |
-7.254 |
-2.617 |
2.318 |
0.216 |
4.636 |
9.871 |
-9.871 |
21.015 |
12.303 |
32.04467 |
3.66 |
-0.534 |
232.672 |
34.482 |
58 |
12 |
-6.922 |
-2.133 |
2.394 |
0.209 |
4.789 |
9.055 |
-9.055 |
17.12 |
9.263 |
27.37184 |
7.39 |
-0.789 |
243.966 |
36.156 |
130 |
13 |
-7.055 |
-2.106 |
2.474 |
0.202 |
4.949 |
9.161 |
-9.161 |
16.957 |
9.034 |
27.35466 |
2.35 |
-0.673 |
227.786 |
33.758 |
88 |
14 |
-6.885 |
-1.208 |
2.838 |
0.176 |
5.677 |
8.094 |
-8.094 |
11.54 |
4.865 |
21.05280 |
1.7 |
-0.228 |
198.971 |
29.488 |
0.57 |
15 |
-6.682 |
-0.849 |
2.917 |
0.171 |
5.833 |
7.532 |
-7.532 |
9.725 |
3.651 |
18.71468 |
0.44 |
-0.748 |
214.728 |
31.823 |
0.403 |
Correlations (R2) |
0.184 |
0.538 |
0.5295 |
0.5028 |
0.5295 |
0.5275 |
0.5275 |
0.4903 |
0.4837 |
0.4945 |
0.1505 |
0.1014 |
0.1304 |
0.1304 |
Kat/km |
We know that the bond length is: the distance between two bonded atoms at their minimum potential energy, or the average distance between two bonded atoms. The correlation between the bond length and reactivity is that the longer the bond length is the more reactive it will be. The short one is more stable, however [22]. The longer bonds length includes those bonds within the phosphate group attached to the phenyl ring, mainly the bond between the phosphor atom and oxygen atom 4 (P1-O4: 1.6383Å), (P1-O2: 1.5868 Å), and (P1-O3: 1.5782 Å) that bonded to the phenyl ring, whereas the shorter bonds were (O4-C20: 1.3615 Å), (O2-C6: 1.4592 Å) and (P1-O5: 1.4609 Å) [23]. [Figure 5]
This bonds length was among the entire molecules, as shown in figure 6 below.
The total number of 15 molecules based on descriptors, namely, HOMO, LUMO, Energy Gap, Hardness, Softness, Electronegativity, Chemical Potential, Electrophilicty Index, Nucleofugality, Hyperpolarisibility, Pz, Alpha, Delta and dependent variable KCAT were investigated using CART Decision Tree and Artificial Neural Networks (ANNs) in order to examine which factors may have impact on the dependent variable, KCAT. The data set included both linear and non-linear calculations. While CART Decision Tree was conducted in order to pin down which descriptors have impact on the dependent variable, KCAT using SALFORD Predictive Modeler 8.0 for linear part, Artificial Neural Networks (ANNs) was utilized for the same purpose using SPSS 20.0 for non-linear part. CART Decision Tree tests revealed that Chemical Potential and Homo descriptors were statistically significant variables by conducting 5-fold cross-validation with 0.544 coefficient of determination.
Of the variables that have significant results that are in sync with the objective of the study are Chemical Potential and Homo. The model below delineates the output of this relationship and pinpoints these significant variables. The importance level for Chemical potential is higher since the separation starts with that descriptor. Besides, its critical value is -10.31. If the molecules display critical value less than and equal to -10.31, there will be no other significant descriptors available except Chemical Potential. By contrast, if the Chemical Potential is greater than -10.31, there will be other significant descriptors adding extra information to the model, which is called Homo. The Descriptors' contribution to the model comes with a single separating value at -6.90.
As for the non-linear part, ANNs was conducted in order to examine which descriptors may have impact on the dependent variable, KCAT. Owing to the fact that the data set we had was small, we had the model run 100 times and then the average was taken. The most significant variables were Pz, Hyperbola, Delta and Alpha with 62 percent of determination of coefficient (R2=0.62). In consequence, the total number of 15 molecules consisting of 14 descriptors and one independent variable that have been already mentioned above were investigated in order to examine which descriptors would impact the dependent variable. Since the data set contained both linear and non-linear calculations, two different statistical models, CART Decision Tree and ANNs, were utilized for the linear part and non-linear part, respectively. The results revealed that Chemical Potential and Homo were significant descriptors for the linear part. Similarly, Pz, Hyperpolarizability, Delta and Alpha were significant descriptors for the non-linear part.
To sum up, utilizing ANN2021 Copyright OAT. All rights reservthe descriptors, Pz, Hyperpola., Delta and Alpha do have an impact on the activity of the enzymes. Similarly, CART model used to measure the effect of the descriptors for the linear part on the activity of the enzymes reveals that the most significant descriptors are Chemical Potential and Homo. Most importantly, however, the results show that the order of the descriptors in the CART model bear significance. That is, while Chemical Potential with its value of greater than -10.31 was the first splitting variable having impact on dependent variable KCAT, Homo came second in importance, with a value of less than and equal to -6.90. By utilizing the CART method, thereupon, not only do we have the significant descriptors but also we have their critical values and orders.
The major aim of this paper was to examine the descriptors of the 15 aforementioned molecules that may activate PON1. The results of the study reveal that there is a strong relationships between the activity of the enzyme and the studied descriptors. In order to examine the relationships, two data analysis methods, CART Decision Tree and Artificial Neural Networks (ANNs) were used for the linear part and non-linear part of the data set, respectively. The most significant descriptors for the linear part were Chemical Potential and Homo. However, the order of the descriptors was determined by CART model. While Chemical Potential with its value of greater than -10.31 was the first splitting variable to have impact on dependent variable KCAT, Homo was the second variable that showed an impact on dependent variable, with its value of less than and equal to -6.90. The importance of CART method was that it not only revealed and sorted out significant descriptors but it also pinpointed their critical values and orders. In result, CART Decision Tree turned out that Chemical Potential and Homo descriptors were statistically significant variables by conducting 5-fold cross-validation with 0.544 coefficient of determination. In a similar fashion, ANNs, employed for non-linear part in order to determine which descriptors have impact on dependent variable, revealed that Pz, Hyperpolarizability, Delta and Alpha were the most significant contributors, with 62 percent of determination of coefficient (R2=0.62).