Abstract
Experimental observations suggest that electronic characteristics play a role in the rates of substrate oxidation for cytochrome P450 enzymes. For example, the tendency for oxidation of a certain functional group generally follows the relative stability of the radicals that are formed (e.g., N-dealkylation >O-dealkylation > 2° carbon oxidation > 1° carbon oxidation). In addition, results show that useful correlations between the rates of product formation can be developed using electronic models. In this article, we attempt to determine whether a combined computational model for aromatic and aliphatic hydroxylation can be developed. Toward this goal, we used a combination of experimental data and semiempirical molecular orbital calculations to predicted activation energies for aromatic and aliphatic hydroxylation. The resulting model extends the predictive capacity of our previous aliphatic hydroxylation model to include the second most important group of oxidations, aromatic hydroxylation. The combined model can account for about 83% of the variance in the data for the 20 compounds in the training set and has an error of about 0.7 kcal/mol.
The P4501 enzymes are a superfamily of monooxygenases involved in the metabolism of both exogenous and endogenous compounds. Ironically, these enzymes play a central role in both the prevention and induction of chemical toxicities and carcinogenicity. Although most P450 oxidations of xenobiotics result in detoxification, occasionally a more toxic intermediate is formed. In fact, many ultimate toxins and carcinogens are formed by the bioactivation of less reactive compounds, and many bioactivation reactions are mediated by the P450 enzymes. Often bioactivation reactions are in competition with detoxification pathways for the same substrate. Since these enzymes play such a central role in both detoxification and bioactivation, predictive models for cytochrome P450 catalysis will be useful tools for evaluating of the potential risks of environmental exposures. One of the most pertinent but difficult problems in risk assessment is translating bench results and mechanistic information into a form that can be used. This article outlines steps toward the development of computational models from laboratory data into a form that can be used for risk assessments that accurately reflect experimental results. For example, these models now more completely describe all positions of metabolism for nitriles and should be more complete in predicting the toxicity related to nitrile metabolism. These semiempirical computational models blend experimental data and computational chemistry in such a way as to provide a consistent prediction of the bioactivation rates for a broad spectrum of compounds. In particular, these models can be used to predict xenobiotic metabolism by the P450 enzyme family, including the bioactivation of compounds to toxins and carcinogens.
Models such as those presented here for P450-mediated reactions can also play a role in drug design. Tools that predict regioselectivity can be used to assess the pharmacokinetics of drugs before synthesis, saving time and money in the drug development process. These types of tools, either alone or in combination with homology or pharmacophore models, can also provide for the design of better drugs, with higher compliance and fewer toxic side-effects. However, P450 enzymes are difficult to model by the standard methods used for most drug targets, which are based mainly on predicting binding affinities related to steric features, since accurate results depend upon the prediction of both the electronic and steric features of the enzymes. This is different from many other enzyme systems since P450 has the need to metabolize a vast array of xenobiotics, which makes it impractical to have one enzyme for each compound or even each class of compounds. Therefore, although most cellular functions tend to be very specific, xenobiotic metabolism requires enzymes with diverse substrate specificities. The cytochromes P450 have apparently assumed much of this role. The required diversity is accomplished by families and subfamilies of enzymes with generally broad substrate specificities, a very reactive oxygenating species and a broad regioselectivity. Thus, for many reactions, the electronic features of the substrate are all that are required to predict regioselectivity (Grogan et al., 1992; Harris et al., 1992; de Groot et al., 1995, 1999;Yin et al., 1995).
We have reported a rapid electronic model for the prediction of regioselectivity in P450-mediated hydrogen atom abstraction mechanisms (Korzekwa et al., 1990a). The methods depend only on the calculation of the AM1 ground-state energies for the parent compound and the product radicals. The relative activation energy is predicted by the use of eq.1.
Materials and Methods
Data analysis and graphing were done with SigmaPlot version 5 (SPSS, Inc. Chicago, IL). The energy difference for regioselectivity was determined by taking the log of the ratio of metabolites and multiplying by 0.616, which gives the energetic difference at 310°K.
Semiempirical calculations were performed with Gaussian 98 (Gaussian, Inc., Pittsburgh, PA.). The semiempirical activation energies for hydrogen atom abstraction for the reactions outlined in the text were estimated with our quantum mechanical model [thep-nitrosophenoxy radical (PNR) model] based on hydrogen abstraction reactions using PNR (Korzekwa et al., 1990b) and the semiempirical AM1 Hamiltonian (Dewar et al., 1985). The model uses the calculated (AM1) enthalpies of reaction and the ionization potentials of the resultant radicals to predict the AM1 activation enthalpies (Hact). All open shell systems were treated with an unrestricted Hartree-Fock Hamiltonian. Each of the reactants and the transition states for the aromatic hydroxylation reactions were minimized using the default procedures in Gaussian 98, and each transition state was confirmed by frequency calculations and had only one negative eigenvalue. All transition states were found by following the most negative eigenvalue after the methoxy radical oxygen-aromatic carbon bond in the tetrahedral intermediate was lengthened to 1.95 Å.
The aromatics compounds listed in Table 1were selected to cover a range of electron-donating and electron-withdrawing groups based on Hammett ς values. We emphasized single-ring compounds at the expense of multiple-ring compounds since all of the experimental data were for single-ring-containing compounds. It is possible that a different model will be required for multiple-ring-containing compounds, five-membered rings, and other types of aromatic compounds for which no experimental is available.
Results
Ideally, we would be able to use the same oxidant (PNR) to model P450 aromatic addition that we used for hydrogen atom abstraction. We have shown previously that aromatic oxidation is likely to occur by addition of a triplet-like oxygen to form an tetrahedral intermediate (Korzekwa et al., 1989). This tetrahedral intermediate can then rearrange to epoxides, ketones, and phenols. Although addition of the PNR to an aromatic carbon progresses smoothly through a transition state to a tetrahedral intermediate, the electronic state of the tetrahedral intermediate is not the ground state. Thus, the reactions from reactants to products and products to reactants occur on two different potential energy surfaces. In addition to complicating any transition state studies, the value of any Brønsted correlations will be in doubt since the transition states are not necessarily intermediate between products and reactants. Thus, thep-nitrosophenoxy radical alone is not likely to serve as a versatile model for other P450-mediated oxidations, and other small oxygen radicals were tested within the AM1 formalism.
To circumvent the problems associated with PNR additions to aromatic systems, methoxy radical was used for aromatic addition reactions. Using the AM1 formalism, the methoxy radical adds smoothly to aromatic compounds and, in contrast to the p-nitrosophenoxy radical, remains on a single potential energy surface. Methoxy radical was added to the aromatic compounds shown in Table 1, and the heats of reaction and transition state were characterized for each reaction. A good linear correlation was observed for the heats of reaction and the activation energies reported in Table 1. A plot of the data is shown in Fig. 1. Equation 2 is the equation that describes the linear correlation between activation energy and heats of reaction. This linear correlation has aR2 value of 0.92 and a standard error approximately 0.35 kcal/mol. In eq. 2, ΔHact is the enthalpy of activation, and ΔHreac is the enthalpy of reaction. Since this equation depends only on the enthalpy of reaction for predicting rate, even the activation energy of a large compound can be predicted extremely rapidly.
The activation energies of our aliphatic model obtained from eq.1 has never been corrected to reflect experimental activation energies. Thus, we need to scale eq. 1 relative to experimentally measured values. Literature values for regioselectivity that appear to reflect intrinsic reactivity differences were found for octane (Jones et al., 1986, 1990), hexane (Morohashi et al., 1983), diphenylpropane (Hjelmeland et al., 1977a), substituted diphenylpropanes (Hjelmeland et al., 1977b), 2-methylanisole (Higgins et al., 2001), 4-methyanisole (Higgins et al., 2001), α-chloro-p-xylene (Higgins et al., 2001), and ethylbenzene (White et al., 1986). The values for octane, 2-methylanisole, 4-methyanisole, α-chloro-p-xylene, and ethylbenzene have been shown to undergo rapid interchange between the two sites of interest by isotope effect experiments. The octane and hexane values are for C-1 versus C-2 hydroxylation, whereas ethylbenzene and diphenylpropane provide an estimate of the effect of a phenyl substituent on C-1 versus C-2 hydroxylation and benzylic versus an internal secondary position. The remaining compounds give values for substituent effect on benzylic hydroxylation and aromaticO-dealkylation. The computationally predicted energies are given in Table 2. A plot of the predicted versus the observed regioselectivity is given in Fig.2. We performed a linear regression on the experimental versus theoretical data to correct for the differences in predicted versus theoretical regioselectivities, forcing they-intercept through the origin. The results are shown in eq.3.
The metabolism of toluene was determined to give around 70% benzyl alcohol and 30% aromatic hydroxylation in phenobarbital-induced rat microsomes (Hanzlik et al., 1984; Hanzlik and Ling, 1990) and to give 95% benzyl alcohol and 5% aromatic hydroxylation in human microsomes (Tassaneeyakul et al., 1996). These results give differences in energy of activation for the aromatic to benzylic hydroxylation of 0.7 and 1.8 kcal/mol for the two systems, respectively. Since the work done in rat microsomes gave a range of ortho, meta, and para products, we decided to take the mean of the upper and lower limit of the range to fit to the computational results. For ethylbenzene, the reported ratio of para hydroxylation to benzylic hydroxylation in purified CYP2B1 was around 0.0013 with an energy difference of 4.1 kcal/mol, reflecting the faster reaction for the secondary benzylic position relative to the primary benzylic position of toluene (White et al., 1986). Foro-xylene and p-xylene the ratios of benzylic to aromatic were measured to be 11.5 and 33, which correspond to energy differences of 1.5 and 2.2 kcal/mol (Iyer et al., 1997). When a similar experiment was conducted by Lindsey-Smith for anisole metabolism, aromatic oxidation was found to be faster thanO-dealkylation by about 3 times (Lindsay-Smith and Sleath, 1983), corresponding to a difference in energy of 0.67 kcal/mol. This is reduced to a ratio of around 2 for ortho hydroxylation of 2-methylanisole, when the average value of single-expressed enzymes shown to give information about energetics is taken into account (Higgins et al., 2001). Of these experiments, only the work of Higgins et al. (2001) was conducted with the expressed purpose of determining the energy difference between two pathways, although ideally all of the data would be generated in the same enzyme preparation and with the goal of accurately determining the ratios of interest.
With these data, the model for hydrogen atom abstraction and the new model for aromatic addition can be combined. Figure3 is a plot of the predicted versus measured difference in energy for the oxidations in Table 3. Obviously, a linear relationship exists. The equation for the line is shown in eq.4. In eq. 3, ΔΔGmeasured is the measured difference in energies based on regioselectivity, ΔHact(arom) is the predicted activation energy for aromatic hydroxylation based from eq. 2, and ΔHact(Habs) is the uncorrected AM1 activation energy for hydrogen atom abstraction. The fit eq. 3 gives anR2 value of 0.85, with a standard error approximately 0.65 kcal/mol.
Discussion
Experimental observations suggest that electronic characteristics play a role in the rates of substrate oxidation for cytochrome P450 enzymes. For example, the tendency for oxidation of a certain functional group generally follows the relative stability of the radicals that are formed (e.g., N-dealkylation >O-dealkylation > 2° carbon oxidation > 1° carbon oxidation). In addition, results show that useful correlations between the rates of product formation can be developed using electronic models (Grogan et al., 1992; Yin et al., 1995). These results are apparently contradicted by the results of kinetic experiments, which show the steps before substrate oxidation (electron reduction) to be rate limiting in the catalytic cycle (Higgins et al., 1998). This apparent contradiction can be resolved by an understanding of the kinetics of branched pathways, which was described by Jones et al. (1986). In brief, if product formation is not rate limiting, the reactivity of the substrate will not affect the rate of product formation. However, if an alternate product can be formed, differences in reactivity will be observed. For example, if norcamphor is substituted with deuterium, less product is observed than if it has hydrogen, even though the rate-limiting step is not norcamphor hydroxylation. Atkins and Sligar (1987) found that the isotope effect was observed because the build-up in the enzyme-substrate complex due to deuterium substitution was prevented by alternate product formation. In this case, the alternate product was water. We have seen similar results for the C-1 hydroxylation of octane; however in this case, the alternate products were C-2 and C-3 oxidation products (Jones et al., 1986). Therefore, slowing the rate of oxidation of one position by deuterium substitution causes an increase in the rate of metabolism of another position or an increase in decoupling to water formation (Higgins et al., 1998). This can only occur if the rate of exchange of the different positions within the active site is as fast or faster than the rate of the oxidation step. This means that for a substrate that can rotate freely in the active site the regioselectivity will be primarily determined by the electronic reactivity of the various positions on the molecule (Higgins et al., 2001). For these molecules, product regioselectivities can be predicted by the electronic characteristics of the substrate. Conversely, regioselectivity can provide an important tool for testing computational models.
We previously reported the development of an electronic model for hydrogen atom abstraction with can predict aliphatic hydroxylation, amine dealkylation, and O-dealkylation (Korzekwa et al., 1990a). To our knowledge, this article is the first time anyone has combined predictive models for P450-mediated aromatic and aliphatic oxidation so that a continuous prediction can be made of either aromatic, aliphatic, or a combination of these mechanisms (Fig. 4). However, qualitative models that approach these models have been described for CYP2D6 (de Groot et al., 1999). The model described here was generated by correcting the computational predictions with experimental results for microsomes and single expressed enzymes. Every effort was made to use only experimental data that would be expected to be free of specific enzyme orientation effects, as described by Higgins et al. (2001), and thus should be a general electronic model for cytochrome P450 enzymes. However, we did use a number of data points derived from microsomal preparations. It is possible that the two metabolites in these cases result from two distinct enzymes in the microsomal preparation, and when possible, we chose the phenobarbital-induced microsomal preparation since the major isoform is CYP2B1, an enzyme that has shown a lack of binding preference and a dominance of electronic factors for the type of compounds use in this study (White et al., 1984; Higgins et al., 2001). However, to exclude this data would decrease an already small dataset. In general, we have attempted to construct our electronic models with data for compounds that are small and do not contain strong binding features (Higgins et al., 2001). This means that the best data will not have orienting effects associated with the enzyme tertiary structure. Orienting effects have been shown to be important for a number of P450 enzymes and need to be added to the electronic models for accurate prediction of metabolism by these enzymes (Szklarz and Halpert, 1997; de Groot et al., 1999; Lightfoot et al., 2000; Rao et al., 2000; Szklarz et al., 2000; Xue et al., 2001).
Thus, although this model is general to the P450 superfamily, it is not going to work to any significant extent on enzymes such as those in the 4B family that have a large contribution from substrate-protein interactions determining regioselectivity (Fisher et al., 1998). One might expect the models to work better with enzymes and substrates that show little preference in orientation, such as CYP1A2 (Higgins et al., 2001), CYP2B1 (Higgins et al., 2001), CYP2B4 (White et al., 1984), CYP2E1 (Higgins et al., 2001), and CYP3A4. It should be noted that, although these enzymes have been shown to be free of excessive orientation effects for some substrates, it is possible that other substrates will show strong orientation effects in any one of these enzymes. When binding/orientation effects become large, models that account for binding effects on regioselectivity can be combined with the electronic models to give a more complete description. It is important to note that, although for certain enzymes the electronic model can stand alone to predict regioselectivity, it is doubtful that binding models alone, without an electronic model, can predict regioselectivity. Even strongly orienting P450 enzymes can give multiple products, and the electronic nature of each position must be known.
Other considerations that need to be made when using this model are the general steric accessibility of the positions of interest. For example, octane is metabolized to 1-octanol, 2-octanol, and 3-octanol in the ratio of 1:23:7 (Jones et al., 1990). The model has been shown to predict the correct regioselectivity for the C-1 and C-2 positions; however, the electronics of the C-2 and C-3 position are essentially the same so the 23:7 ratio would not be well predicted. In this case, the C-3 position appears to be less accessible since it is buried in the center of the molecule. Similarly, C-4 hydroxylation is not seen with octane to any appreciable extent, whereas the electronic features are not significantly different from those at the C-2 position. Another feature of this type is ortho hydroxylation. For many compounds, the group adjacent to the ortho position will hinder metabolism at the ortho position. These types of steric shielding are independent of the P450 enzyme and could be added to the models presented here.
Finally, it should be noted that no attempt has been made to externally validate these models. Although this would obviously be preferred, we felt that the datasets were too small to remove data for external validation. We are currently working on developing new datasets to validate these models. We did look at the internal consistency of the models by leave-one-out cross validation and foundq2 values of 0.79 for the combined model (data in Fig. 4). The cross-validatedq2 for leaving out 20% of the data ranged from 0.76 to 0.81 over 10 runs with different randomly chosen groups.
In conclusion, computational models are presented that can predict the two major P450 pathways of metabolism, aliphatic and aromatic oxidation reactions. The models depend on experimental data to correct for the errors associated with the fast semiempirical methods used to predict each reaction mechanism. The computational models corrected by experimentally measured regioselectivities give the predictive models described by eqs. 3 and 4. These models do a reasonable job at predicting the regioselectivity energetics and can be used in conjunction with models for the active site of the enzyme. Studies are underway in the application of these models to predicting human drug metabolism.
Footnotes
-
This work was supported by National Institute of Environmental Health Sciences Grant ES09122.
- Abbreviations used are::
- P450
- cytochrome P450
- AM1
- Austin model-1
- PNR
- p-nitrosophenoxy radical
- Received August 20, 2001.
- Accepted October 1, 2001.
- The American Society for Pharmacology and Experimental Therapeutics