Statistical Methods for Analysis of Time-Dependent Inhibition of Cytochrome P450 Enzymes

Phillip Yates; Heather Eng; Li Di; R. Scott Obach

doi:10.1124/dmd.112.047233

Abstract

Time-dependent inhibition (TDI) of cytochrome P450 (P450) enzymes, especially CYP3A4, is an important attribute of drugs in evaluating the potential for pharmacokinetic drug-drug interactions. The analysis of TDI data for P450 enzymes can be challenging, yet it is important to be able to reliably evaluate whether a drug is a TDI or not, and if so, how best to derive the inactivation kinetic parameters K_I and k_inact. In the present investigation a two-step statistical evaluation was developed to evaluate CYP3A4 TDI data. In the first step, a two-sided two-sample z-test is used to compare the k_obs values measured in the absence and presence of the test compound to answer the question of whether the test compound is a TDI or not. In the second step, k_obs values are plotted versus both [I] and ln[I] to determine whether a significant correlation exists, which can then inform the investigator of whether the inactivation kinetic parameters, K_I and k_inact, can be reliably estimated. Use of this two-step statistical evaluation is illustrated with the examination of five drugs of varying capabilities to inactivate CYP3A4: ketoconazole, erythromycin, raloxifene, rosiglitazone, and pioglitazone. The use of a set statistical algorithm offers a more robust and objective approach to the analysis of P450 TDI data than frequently employed empirically derived or heuristic approaches.

Introduction

Time-dependent inhibition (TDI) represents a more severe form of inhibition for cytochrome P450 (P450) enzymes relative to reversible inhibition because inactivated enzymes have to be regenerated to restore their function by de novo protein synthesis (Kalgutkar et al., 2007; Obach et al., 2007; Venkatakrishnan and Obach, 2007). TDI inactivates P450 enzymes that are critical for metabolizing many drugs and it often causes prolonged effects, even after the inactivator(s) has been removed from the body. TDI can lead to clinically relevant drug-drug interactions (DDI), the effects of which can result in potential toxicity or loss of efficacy concerns (Zhou et al., 2007). For example, the failure of several late-stage clinical candidates was attributed to TDI (Zimmerlin et al., 2011), drugs have been withdrawn from the market because of TDI (e.g., mibefradil), and severe restrictions have been imposed on clinical usage for concomitant medications due to TDI effects (Wienkers and Heath, 2005). For these reasons, regulatory agencies (Food and Drug Administration, European Medicines Agency) have issued strict guidelines pertaining to the conduct of TDI studies, and these data are required for regulatory submission of a new drug application (U.S. Department of Health and Human Services Food and Drug Administration Center for Drug Evaluation and Research, 2012).

Currently, measuring TDI is one of the most challenging enzymology experiments to conduct in the area of drug metabolism. The assay includes complex experimental steps with multiple concentrations and time points, preincubation, secondary incubation, multiple transfer steps, and intricate timing. Automation, using robotic systems, can help alleviate some of the complexity of the assay (Zimmerlin et al., 2011). The analysis of TDI experimental data is also quite involved because of the need to fit and interpret multiple kinetic parameters. To address these TDI challenges, Pharmaceutical Research and Manufacturers of America (Washington DC) provided guidance on best practices for conducting in vitro TDI studies (Grimm et al., 2009). This perspective provided a clear roadmap on how to evaluate TDI in drug discovery. However, two questions remained unresolved: 1) when is TDI detectable for a drug; and 2) how to best estimate K_I and k_inact? In this study, we apply a statistical approach to objectively evaluate whether a compound is a TDI [current standard practices are somewhat arbitrary in our opinion (Grimm et al., 2009; Zimmerlin et al., 2011; U.S. Department of Health and Human Services Food and Drug Administration Center for Drug Evaluation and Research, 2012)] and if we can determine K_I and k_inact with confidence using CYP3A4 TDI experimental data.

Materials and Methods

Materials.

Pooled human liver microsomes from 50 male and female donors were purchased from BD Biosciences (Woburn, MA). Solvents, β-NADPH, monobasic and dibasic potassium phosphate, and magnesium chloride were purchased from Sigma-Aldrich (St. Louis, MO). Probe substrate, internal standard, and inhibitors were obtained from the following sources: midazolam (Cerilliant Corporation, Round Rock, TX), rosiglitazone and pioglitazone (Sequoia Research Products, Pangbourne, UK), erythromycin and raloxifene (Sigma-Aldrich), and ketoconazole (Enzo Life Sciences, Farmingdale, NY);[²H₄]1′-hydroxymidazolam was prepared at Pfizer (Groton, CT).

Time-Dependent Inhibition Assay.

Phosphate buffer (100 mM, pH 7.4) was prepared from 100 mM monobasic and dibasic potassium phosphate solutions. MgCl₂ stock solution was prepared in water (200 mM). NADPH solution (13 mM) was prepared in potassium phosphate buffer immediately before use. Inhibitor stock solutions were prepared at 100× the final preincubation concentration. Inhibitor concentration ranges were selected to surround the expected K_I. Concentrations were approximately evenly spaced after applying a log transformation.

Inactivation kinetic experiments have been described previously (Ghanbari et al., 2006; Obach et al., 2007; Polasek and Miners, 2007; Zimmerlin et al., 2011). Experiments were conducted in two steps, an inactivation preincubation, followed by a secondary incubation to assess remaining CYP3A4 activity, both conducted at 37°C. In the inactivation preincubation, human liver microsomes (0.3 mg/ml), MgCl₂ (3.3 mM), and NADPH (1.3 mM) in potassium phosphate buffer (100 mM, pH 7.4) were prewarmed at 37°C for 5 min. Preincubation was commenced with the addition of inhibitor or vehicle (2 μl). The final preincubation solvent concentration was 0.9% (v/v) acetonitrile and 0.1% (v/v) DMSO (ketoconazole, pioglitazone, raloxifene, erythromycin), or 1% (v/v) DMSO (rosiglitazone) (Chauret et al., 1998). At each time point (0, 2, 4, 7, 15, and 30 min), an aliquot of the preincubation mixture (10 μl) was transferred to a prewarmed secondary incubation mixture, consisting of midazolam (23 μM, approximately 10-fold K_m), MgCl₂ (3.3 mM), and NADPH (1.3 mM) in potassium phosphate buffer (100 mM, pH 7.4) at a final volume of 200 μl (20-fold dilution). After 6 min, the secondary incubation was terminated by transferring 100 μl of the solution from the second incubation to 200 μl of acetonitrile containing internal standard ([²H₄]1′-hydroxymidazolam, 0.1 μg/ml). Incubations were conducted in duplicate. Samples were then vortexed and centrifuged for 10 min at 2000g. Supernatant was transferred, mixed with an equal volume of water containing formic acid (0.2%), and analyzed via liquid chromatography/tandem mass spectrometry (LC-MS/MS). The assay was conducted at six nonzero concentrations and six time points for estimating K_I and k_inact.

LC-MS/MS Quantification.

A Sciex API4000 QTRAP LC-MS/MS (Foster City, CA) triple quadrupole mass spectrometer fitted with a turbo ion-spray interface operated in positive ion mode was used to monitor for 1′-hydroxymidazolam and [²H₄]1′-hydroxymidazolam. A Shimadzu (Columbia, MD) LC-20AD HPLC system with a CTC LEAP autosampler (LEAP Technology, Carrboro, NC) was programmed to inject 10 μl of sample on a Halo 2.7 μm C18 3.0 × 30 mm column (Advanced Materials Technology, Wilmington, DE). Analytes were eluted using a binary gradient mixture consisting of 0.1% (v/v) formic acid in water (solvent A) and 0.1% (v/v) formic acid in acetonitrile (solvent B), at a flow rate of 1.2 ml/min and monitored using the multiple reaction monitoring mode for the mass-to-charge (m/z) transitions 342.0 → 203.3 (1′-hydroxymidazolam) and 346.0 → 207.3 ([²H₄]1′-hydroxymidazolam). Average difference between all replicates reported was 4.1%.

Parameter Estimates.

Resulting analyte-to-internal standard peak area ratios from MS were normalized to averaged analyte-to-internal standard peak area ratios observed at 0 min in the solvent controls (hereafter referred to as activity); individual replicate activity was then used to estimate k_obs. The linear regressions, Spearman's correlation coefficient calculations, and various slope tests were performed using Excel (Microsoft Corporation, Redmond, WA). The two- and three-parameter nonlinear fits were obtained using JMP 8.0.1 (SAS Institute Inc., Cary, NC).

Statistical Analysis.

Estimating the k_inact and K_I kinetic parameters for an inactivation experiment is a two-step process (Shou and Dai, 2008; Obach et al., 2010). Statistical tests are applied here in an augmented procedure to test for TDI. First, for each compound concentration j the inactivation rate, k_obs,j, is estimated using a linear regression of the natural logarithm of activity, using k individual observations, versus time, that is, ln(Activity)_k = β₁Time_k + β₀ + ε_k, and ε_k is the experimental variance N(0,σ₁²). By definition, for each concentration j we estimate k_obs,j using the estimate for −β₁. A set of k_obs,j rates is generated across the concentrations tested using the various individual regression β₁ slope estimates. If appropriate, nonlinear regression is then used to estimate a three-parameter Michaelis-Menten equation, that is, where ε_j is the combined experimental and modeling variance N(0,σ₂²). Two two-parameter models were also tested: one in which all k_obs data were uncorrected for activity decreases observed at [I] = 0 μM, and the resulting k_obs versus [I] relationship was forced through the origin (i.e., “2p_U”-uncorrected), and a second wherein k_obs[0 _μM] is set to zero and k_obs values are normalized against k_obs[0 _μM] at each time point (“2p_C”-corrected). For each linear regression three useful estimates are produced: root mean squared error (RMSE), an estimate for σ₁, slope, and intercept. Using standard normal theory assumptions one can form hypothesis tests for both β₀ and β₁. Of particular interest to a test for TDI is the set of hypotheses H₀: −β_1[I] + β_{1[0 μM]} = k_obs[I]− k_obs[0 _μM] = 0 versus H₁: −β_1[I] + β_{1[0 μM]} = k_obs[I] − k_obs[0 _μM] ≠ 0 for each nonzero concentration. Despite the traditional use of a t test here, we evaluate these contrasts using a large sample normal theory test statistic, that is, where the S.E. is based on the individual RMSE estimates. Each test is conducted using a nominal level, for example, α = 0.05. If any of these hypothesis tests yields a significant finding we conclude that the compound exhibits TDI. Although not formally discussed here, similar tests for β₀ may reflect specific compound behavior or be used to better understand the experimental protocol.

The previous step implements a statistical test for TDI. Because weak inhibitors or suboptimal designs (e.g., number or spacing of concentrations tested) affect the subsequent nonlinear model, we propose a statistical test before estimating the kinetic equation. The two tests proposed here are analogous to a structured comparison of means for a gradient treatment with general spacing (Lentner and Bishop, 1993) and allow us to qualify the kinetic estimates. Qualifying estimate quality may be needed because correlated (or unstable) kinetic estimates are common for the Michaelis-Menten model (Raaijmakers, 1987). In practice, preincubation compound concentrations are often diluted in a logarithmic-like manner, for example, two-fold dilutions. Comparable to bioassay or dose-response models (DeLean et al., 1978), we first consider a relationship between the observed k_obs rates and ln[I]. To help mitigate the inferential complexities for a small-sample nonlinear model, we test a linear regression of k_obs,j = γ₁ln[I]_j + γ₀ + ε_j before fitting the three-parameter Michaelis-Menten equation. Here, we are specifically interested in a test of H₀: γ₁ = 0 versus H₁: γ₁ > 0. If we reject this one-sided t test at a predetermined level α we have likely improved our ability to estimate the kinetic parameters via nonlinear regression. Failing to reject this parametric test may be because of an inadequate k_obs−ln[I] linear approximation for a TDI compound, that is, γ₁ was biased toward 0 for some inhibitors investigated. To further control for possible false negatives we apply a second test, the rank-based Spearman's correlation coefficient r_s, using all of the untransformed ([I]_j, k_obs,j) pairs as an additional inferential gate before calculating the kinetic parameters. r_s is well-suited for detecting a monotonically increasing relationship between k_obs and [I], an intrinsic feature of a TDI compound. A standard test of H₀: r_s = 0 versus H₁: r_s > 0 is then performed. Rejecting H₀ suggests that meaningful kinetic parameters are likely to be estimable; failing to reject H₀ suggests additional scientific and statistical considerations.

Standard least-squares nonlinear regression is used to estimate k_obs[0 _μM], k_inact and K_I. The initial starting value for each parameter was the following: k_obs[0 _μM] = 0, k_inact = max(k_obs,j), and K_I = median[I]_j. We suggest caution in the use and interpretation of inferential procedures for the various Michaelis-Menten kinetic parameters. It is well-understood that fit summaries, for example, R², may suggest a satisfactory nonlinear model, and yet the model's estimated kinetic parameters are very imprecise (Seber and Wild, 2003). Although approximate S.E.s can be used to generate large-sample confidence intervals for k_obs[0 _μM], k_inact, and K_I, we recommend and report here profile likelihood confidence intervals, which are more robust and possess better coverage properties for small sample estimates, for characterizing the uncertainty in the kinetic estimates. We advocate the supplemental use of regression-based tests for TDI compounds here due to the prevalence of small sample TDI experiments and a reduction in the number of parameters to test. A single intermediate test for γ₁ or r_s may be preferred to a formal test for, at a minimum, k_inact and K_I.

As a final aide to qualify kinetic estimate quality, we propose a heuristic based on the empirical estimate for K_I. We suggest that the estimate for K_I be more than three times the smallest experimental concentration and less than roughly one-third the largest concentration for a TDI compound. Estimates for K_I that are outside of this region may suggest a suboptimal experimental design or an unstable nonlinear regression. Refer to Fig. 1 for a summary of the TDI decision tree containing the various statistical tests proposed here.

Fig. 1.

TDI decision tree.

Results and Discussion

Importance of Statistical Methods for Assessing TDI.

Considerable attention has been given to the TDI of P450 enzymes and its role in clinical DDI. Much of this attention has focused on the use of in vitro inactivation data, specifically K_I and k_inact, in the prediction of clinical DDI using various static and dynamic models (Mayhew et al., 2000; Wang et al., 2004; Obach et al., 2007; Yang et al., 2008). Despite wide interest in methodologies for accurate TDI measurement and prediction, there has not been an investigation of the statistical rigor that needs to be employed to appropriately interpret in vitro TDI data. One of the most important questions is how to decide whether a test compound is, in fact, a TDI or not. Statistical methods may help provide a nonbiased assessment of TDI and offer a quantitative interpretation of the U.S. Food and Drug Administration draft guidance statement pertaining to TDI “any time-dependent loss of initial product formation rate may indicate time-dependent inhibition” (U.S. Department of Health and Human Services Food and Drug Administration Center for Drug Evaluation and Research, 2012).

Statistical tests, despite their inherent limitations as decision instruments, offer some general advantages for testing for TDI. Apart from declaring the level of the test (e.g., α = 0.05), these tests are data-driven and incorporate the observed variability. This lessens the need for ad hoc tests or investigator-specific heuristics. Statistical uncertainty estimates may protect against data over-interpretation, especially for weak TDI compounds. Continuous-valued test statistics are more informative than discrete decisions. Various statistical estimates, for example, the RMSE or S.E. of a given k_obs fit, provide an investigator with valuable data. Such information, apart from qualifying the interpretation of a single experiment, can inform the planning and conduct of future TDI experiments. An increased understanding of a putative TDI compound is also possible, for example, a correlation between the RMSE and concentration for a set of k_obs. In contrast to simulation- or numerical-estimation procedures, linear models (or an assumed linear approximation) are more workable for evaluating design proposals.

Two-Step Statistical Analysis of TDI Data.

In this study we use a two-step process in the statistical analysis of experimental TDI data (Fig. 1) to determine 1) whether a compound exhibits TDI, and 2) whether K_I and k_inact can be estimated. Statistical analysis is applied to TDI data generated by using an experimental design of 6 nonzero test concentrations at 6 time points of preincubation. Five drugs (ketoconazole, pioglitazone, rosiglitazone, raloxifene, and erythromycin) with varying degrees of TDI for CYP3A4 are used to illustrate the statistical approach.

Step I: Is a Compound a TDI?

In the first step, the natural logarithm of activity is plotted at each time point for each concentration (Fig. 2A). Linear regressions are used to estimate the inactivation rate constants (k_obs) at each of the concentrations. This plot makes it easier to identify potential data edits needed to improve k_obs accuracy. For example, to better estimate k_obs for the potent inactivator raloxifene, several later time points were removed at multiple inhibitor concentration levels. Each k_obs is assessed in reference to the vehicle control (typically DMSO, acetonitrile, or methanol) using a two-sided two-sample z-test (see under Materials and Methods). If an estimated p value is less than 0.05, this result suggests that the compound is a TDI at that concentration. Other choices for the level of the test, for example, α equal to 0.01, are possible and may safeguard the reproducibility of the observed finding. A qualitative assessment of the RMSE estimates may be warranted because this quantity is critical in estimating the various k_obs S.E.s. If a compound does not demonstrate a significant difference in k_obs between the vehicle and compound at one or more concentrations (p ≥ 0.05), this suggests, within the confines of the experimental approach used, that the test compound is not a TDI. Hence, if at least one significant k_obs difference is observed, then the conclusion should be made that TDI has occurred. Using this rationale, the statistical tests for TDI of five drugs are summarized in Table 1. These data suggest that all of the compounds are TDI at all concentrations (p < 0.05), with the exception of pioglitazone at low concentrations (3, 6, and 10 μM) and ketoconazole at all concentrations (potent, reversible inhibition was observed for ketoconazole).

Fig. 2.

Inactivation of CYP3A4-mediated midazolam hydroxylation by ketoconazole, pioglitazone, rosiglitazone, erythromycin, and raloxifene. Inactivation plots to determine k_obs (A) and nonlinear regression to determine K_I and k_inact (B). Incubations were done in duplicate, and individual activity values were plotted and analyzed. In B, solid lines and hollow circles represent the 3p fit; red dashed lines represent the 2p_U fit to the hollow circles; blue dotted lines and filled triangles represent the 2p_C fit. Experimental observations for raloxifene that were excluded from the k_obs estimates are also indicated.

View this table:

TABLE 1

Summary of k_obs and the statistical tests for k_obs[I] vs. k_{obs[0 μM]}

A compound is declared to be TDI when at least one p < 0.05.

Compounds with low empirical k_obs values (e.g., 0.004–0.01 min⁻¹) can be declared as TDI, albeit very weak, should k_obs test significantly different from vehicle. Based on an empirical median S.E. estimate of 0.0012 min⁻¹ (Table 1), the least significant difference that could be detected for a k_obs[I] versus k_obs[0 _μM] comparison for our laboratory is 0.0033 min⁻¹ at a p = 0.05 level. The clinical relevance of weak inactivators, for example, their potential for DDI in the clinic, and our ability to accurately predict DDI in humans is beyond the scope of this article. In a recent report it was suggested that k_obs values less than 0.02 min⁻¹ at 10 μM serve as a boundary between compounds that show a DDI for CYP3A and those that do not (Zimmerlin et al., 2011). Establishing a boundary on the limit of detection for k_obs is currently subjective. If an artificially high limit is set the potential impact of weak inactivators will be overlooked. In our view, it is more important to determine whether k_obs is significant based on statistical analysis. The estimated kinetic parameters can be used to model and predict in vivo DDI. It has been previously reported that even a weak TDI can cause significant DDI in the clinic (Obach et al., 2007). Because TDI is a safety endpoint we advocate a conservative approach toward evaluating and declaring TDI.

Step II: Can K_I and k_inact Be Determined?

The second step tests whether reliable K_I and k_inact values can be determined for a TDI compound. A statistical test for this is partially motivated via a plot of k_obs versus ln[I] (Fig. 3) or [I] (Fig. 2B). Ketoconazole is not included in Fig. 3 because it is not a TDI; this compound is furthermore excluded from potential subsequent nonlinear modeling efforts. A significant positive correlation (p < 0.05) suggests that reliable kinetic parameter estimates may be achievable. One-sided tests, based on both a linear regression (ln[I]) and a nonparametric correlation estimator ([I]), are used. The suitability of the k_obs versus ln[I] linear approximation may help gauge design quality and inactivator potency. The nonparametric test, although not as powerful as its parametric counterpart for small samples, provides a robust intermediate test for a monotonic increasing association. If both correlations are not significant, K_I and k_inact may not be reliably determined even though the compound is a TDI. If either test is significant, K_I and k_inact are calculated using a three-parameter nonlinear regression model (Fig. 2B).

Fig. 3.

Plots of estimated inactivation rates (k_obs) versus the natural log of inactivator concentration (ln[I]) for four compounds. A linear fit (no nonzero concentration points were excluded) is used to evaluate the relationship between inactivation rate and concentration. Results using Spearman's correlation, a supplementary test, are listed in Table 2.

Reliable estimates for K_I and k_inact will, at a minimum, depend on the relationship between the estimate of K_I and the range of [I] used in the experiment. Because this is true for any enzyme kinetic or binding experiment, we added an additional provision to guard against unreliable kinetic estimates (e.g., 3 [I]_min < K_I < 1/3 [I]_max for a TDI compound likely to generate reliable kinetic estimates). A poor positioning of the K_I estimate relative to the range of [I] may affect the S.E. for K_I, reflect an intrinsically ill-conditioned regression, or a suboptimal experimental design. For example, the K_I and k_inact estimates for pioglitazone are not reported due to K_I > 1/3 [I]_max (Table 2). However, it should be noted that the K_I for rosiglitazone was only two-fold greater than the lowest tested concentration, but nevertheless offered a quality fit for the k_obs versus [I] relationship. This is evident in the small confidence interval in the parameter estimates (Table 2), and illustrates the notion that even when the K_I approaches the maximum or minimum concentration tested good estimates of parameters are attainable.

View this table:

TABLE 2

Summary of pioglitazone, rosiglitazone, raloxifene, and erythromycin kinetic parameter estimates for two- (corrected and uncorrected k_obs) and three-parameter models, Spearman's rank correlation coefficient (r_s), and a test for a positive linear regression of k_obs versus ln[I]

Ninety-five percent profile-likelihood confidence intervals for each kinetic estimate are listed in parentheses. The rank correlation and linear regression tests do not depend on the nonlinear model fit and are reported on the same row as the three-parameter model estimates. k_{obs[0 μM]} is not estimated in the 2p model.

For the remaining four compounds examined in this article significant associations, for both the linear regression and Spearman's rank coefficient (r_s) tests (Table 2), suggest that the kinetic parameters can be calculated via the Michaelis-Menten equation. In Fig. 3, we see a V shape in a plot of k_obs versus ln[I] for the weak inactivator pioglitazone, an inverted V shape for the potent inactivator raloxifene, and a near linear relationship for the intermediate inactivators rosiglitazone and erythromycin. Such plots can serve as an ad hoc diagnostic for inactivator potency or concentration spacing and magnitude. We did not attempt to improve the quality of the linear approximation by removing selected k_obs values; for some compounds and designs such an effort may be contraindicated. Applying the final provision to pioglitazone's estimate for K_I suggested that the kinetic estimates for this weak inactivator are unreliable (K_I > 1/3 [I]_max; see Table 2). Because there are other experimental designs for TDI, such as the IC₅₀ shift assay or simpler experiments that use one concentration and one preincubation time (Grimm et al., 2009), it is recommended that statistically based decision points be derived for these experimental designs as well.

Two- Versus Three-Parameter Michaelis-Menten Models.

TDI kinetic parameters are commonly estimated using either a two-parameter Michaelis-Menten model (k_inact and K_I; here, we either set k_obs[0 _μM] at time 0 to zero and normalize all of the k_obs values against k_obs[0 _μM] for each time point, or did not perform this adjustment to any of the data; other “normalization” strategies are possible) or a three-parameter model (k_inact, K_I, and k_obs[0 _μM]). Depending on the number of observations, experimental design and strength of the nonlinear model, including k_obs[0 _μM] in the fitting of the kinetic parameters can be important to generate reliable estimates for k_inact and K_I. Assuming k_obs[0 _μM] equals zero in a two-parameter fit can bias estimates for k_inact and K_I for some inactivators and is influenced by strategies to “normalize” or “correct” the set of k_obs relative to the vehicle k_obs. Two- and three-parameter Michaelis-Menten kinetic estimates for the four inactivators are listed in Table 2. K_I estimates for rosiglitazone and erythromycin, using uncorrected k_obs estimates in a two-parameter model, differed noticeably from the three-parameter estimates for K_I. Apart from rosiglitazone, the k_inact estimates were comparable across the two- and three-parameter models. A graphical comparison of the models (Fig. 2B) was inconclusive in its ability to identify the most suitable model form. At a minimum, this makes apparent the challenges associated with generating precise kinetic estimates based on small samples. As evident in Table 2, assuming a two-parameter model also affects the width of the confidence intervals due to a more parsimonious model parameterization. Historical estimates for k_obs[0 _μM] from our laboratory ranges from 0.0002 to 0.0081 min⁻¹ (mean ± S.D. = 0.0036 ± 0.0016, n = 60). This measureable uncertainty is explicitly included in a three-parameter model, whereas a two-parameter model may discount (or disguise) this fact via intermediate adjustments to the observed data or the estimates for k_obs.

The correlation estimates between the various kinetic parameters using both corrected and uncorrected two- and three-parameter Michaelis-Menten fits are summarized in Table 3. Here, a two-parameter model more strongly affects the correlation between k_inact and K_I as compared with a three-parameter model. This illustrates the coupling of design-modeling complexities for producing estimates for k_obs[0 _μM], K_I, and k_inact using a limited number of observations/concentration range. Large correlation differences between the corrected and uncorrected two-parameter kinetic estimates are not evident. For pioglitazone, the evident lack of nonlinearity in k_obs (Fig. 2B) suggests that both the two- and three-parameter models are over-parameterized. The near-perfect correlation between K_I and k_inact reflects this result. Despite observable curvature in k_obs for the remaining three compounds (Fig. 2B), the kinetic correlations for the two-parameter model are high. Transitioning to a three-parameter model reduced the correlation between K_I and k_inact while making evident the correlation of these parameters with k_obs[0 _μM]. The three-parameter correlation structure reflects the observable experimental interdependencies between the three kinetic estimates and, although not formally presented here, can be used to diagnose the individual kinetic estimates or the design. Seber and Wild (2003) discuss common problems pertinent to nonlinear regression, for example, inherently ill-conditioned models or very imprecise/highly correlated estimates. Our experimental data indicates that the stochastic behavior of k_obs[0 _μM] has a nonzero mean, and we capture this uncertainty in our statistical model definition. It is clear that k_obs[0 _μM] possesses a nonzero value, and that attempts to force the k_obs versus [I] relationship through the origin without correcting for this factor is inappropriate and yields estimates of K_I that are incorrect (Table 2). Although extreme differences between the kinetic estimates under the two model parameterizations are not grossly evident in Table 2, other methods to adjust for nonzero k_obs[0 _μM] or more variable experimental data can result in large kinetic model discrepancies. For estimating TDI kinetic parameters, we advocate the use of the three-parameter form of the Michaelis-Menten equation.

View this table:

TABLE 3

Nontrivial correlation estimates for the Michaelis-Menten kinetic parameters using both two- and three-parameter nonlinear models

Others Methods Considerations.

Various k_obs comparisons, for example, k_obs[60 _μM] versus k_obs[30 _μM] or k_obs[0 _μM], could be tested using a combined regression-ANOVA statistical model (Myers, 1990). We did not apply such a model for several reasons: specifying/fitting a suitable statistical model combines aspects of the experimental protocol with more complex estimation algorithms, (unplanned) tests of various k_obs combinations increases the need for more careful Type I error control (an incorrect rejection of the null hypothesis), and the slope-based inactivation rate estimate at one or more concentrations may be poor in the log-linear approximation for strong TDI compounds [i.e., more pronounced activity attenuation can occur at long time points (Obach et al., 2010)]. Here, we removed several experimental observations to improve the k_obs estimates for raloxifene. Removing select observations to create a more defensible model creates imbalances in these data, affects estimate S.E.s (only the lowest raloxifene concentration S.E.s resembled those obtained for the other compounds), and affects degrees of freedom calculations. Concerns over how best to count the “right” number of degrees of freedom, in the presence of a potentially unbalanced complex experimental structure, combined with a conservative testing approach, motivated our use of a z-test in place of a t test. The large sample-based test statistic calculations proposed here, although open to statistical criticism, are straightforward and easily calculated in widely available software. A closed-form z-test easily supports the possibility of plug-in S.E. estimators should severe under- or overestimates for one or more RMSEs be observed. Aliquot extraction at various time intervals suggests that the distinction between experimental and sampling units may need careful consideration or induce a nontrivial correlation structure between observations. More complex statistical models or experimental designs may be employed here. Bayesian methods may also be suitable for experimentalists who routinely conduct TDI experiments.

We are not recommending that multiplicity corrections, for example, Dunnett's test, be performed across the j-1 k_obs[I] − k_obs[0 _μM] comparisons. In practice, only a small number of compound concentrations are evaluated across a limited (empirically determined) potency range. We consider an (in)ability to differentiate or test pairs of k_obs estimates using unadjusted p values more relevant to the TDI decision process than controlling for an overall family-wise Type I error or false discovery rate. As suggested earlier, a careful consideration of the precise α level used for testing can mitigate false alarm concerns. The interplay between aspects of testing and the magnitudes of the observed experimental variances across one or more experiments is nontrivial. Furthermore, for TDI compounds the k_obs values must be monotonic (k_obs,j > k_obs,i for some concentrations [I]_i and [I]_j) in relation to increasing concentration. The tests for the regression-based slope γ₁ and rank correlation-based r_s proposed here, although not providing rigorous Type I control for the overall test for TDI, should reflect this pattern in addition to providing a post hoc measure of protection against subsequent attempts to fit an ill-conditioned kinetic equation. Potential TDI false positives pose less of a concern relative to allowing a false negative TDI compound to reach the later stages of drug development due to TDI's relation to a safety endpoint. At present, we have adopted a conservative stance and expect that an evolving standard for declaring TDI may emerge in the literature over time.

The use of traditional statistical estimators allows us to exploit well-understood large sample theory. We acknowledge the inherent risks in such an approach, but numerical routines, such as resample/bootstrap procedures, also pose a concern for investigators (Berger, 2000). One- or two-sided tests may be performed here; one's choice influences the interpretation of the results and the power of the test. Here, we elect to use a two-sided hypothesis test for a pair of k_obs (H₀: k_obs[I] − k_obs[0 _μM] = 0 versus H₁: k_obs[I] − k_obs[0 _μM] ≠ 0), but chose a one-sided test for γ₁ and r_s (e.g., H₀: r_s = 0 versus H₁: r_s > 0). The first case may gauge empirical precision, whereas the latter test may be preferred for further TDI screening efforts. Calibrating the procedure against known positive or negative controls can also be performed. Spearman's rank correlation coefficient r_s was preferred to other nonparametric estimators, for example, Kendall's τ or Hoeffding's D (Hollander and Wolfe, 1999), due to its comparability with the widely used Pearson product-moment correlation coefficient and its use as a specific test for a positive association between k_obs and [I].

We are not recommending a novel approach to estimating k_inact or K_I; standard least-squares nonlinear regression was performed here. Unlike other linear-based Michaelis-Menten approximations, such as Lineweaver-Burk or Eadie-Hofstee fits, which are known to be problematic (Ritchie and Prvan, 1996), both the nonparametric and slope-based tests are not used to estimate kinetic parameters. Wald-type or profile-likelihood methods can be used to generate confidence intervals and test the model estimates. An ordinary least-squares test of the k_obs-ln[I] slope is understood to be imperfect. But, fitting and diagnosing model quality for an (intercept-adjusted) Michaelis-Menten curve based on a small sample is also challenging. Seber and Wild (2003) document the use of transformations to circumvent nonlinear modeling challenges. Reducing the number of parameters to estimate and test can provide benefit as a preliminary screening measure. A linear fit simplifies model diagnostics, for example, to identify outliers or detect abrupt transitions in k_obs. The use of Spearman's coefficient r_s helped mitigate concerns regarding inadequate linear approximations for some TDI compounds. A large number of samples for analysis may negate the need for the linear test.

Conclusions

In this report, we have proposed a two-step statistical approach to the analysis of CYP3A4 TDI data. This is particularly important in the research of new drugs because the U.S. Food and Drug Administration recently issued a draft guidance stating that “any time-dependent loss of activity” (sic) caused by a P450 inhibitor merits further investigation. The first step, a two-sided two-sample z-test, offers a simple and effective manner of evaluating TDI data and determining whether a new drug is a TDI, without resorting to empirically derived or heuristic cutoff values. The second step shows whether a set of k_obs versus [I] or ln[I] data can be used to estimate K_I and k_inact, ultimately using a three-parameter Michaelis-Menten model. An important attribute to the successful application of this approach requires that the experimental technique be precise and reproducible; if individual data points of the natural logarithm of activity versus preincubation time relationships show a high degree of scatter, TDI could be missed because the z-test p values will exceed 0.05. In our research, k_obs values of approximately 0.005 min⁻¹ can be reliably discerned and yield p < 0.05 when tested for known TDI compounds. Although drugs yielding such low rates of inactivation may not result in meaningful in vivo DDI, it is still nevertheless important to identify such compounds in vitro so that appropriate clinical follow-up can be done. Further improvements in physiologically based pharmacokinetic modeling of CYP3A4 TDI data are needed to better use high quality in vitro data in generating reliable predictions of DDI.

Authorship Contributions

Participated in research design: Yates, Eng, Di, and Obach.

Conducted experiments: Eng.

Performed data analysis: Yates, Eng, Di, and Obach.

Wrote or contributed to the writing of the manuscript: Yates, Eng, Di, and Obach.

Acknowledgments

We thank Mark Niosi for generating some of the experimental data and Dr. Larry Tremaine for leadership and support.

Footnotes

Article, publication date, and citation information can be found at http://dmd.aspetjournals.org.

http://dx.doi.org/10.1124/dmd.112.047233.
ABBREVIATIONS:
TDI
time-dependent inhibition
DDI
drug-drug interaction
DMSO
dimethyl sulfoxide
LC-MS/MS
liquid chromatography/tandem mass spectrometry
P450
cytochrome P450
RMSE
root mean square error.

Received June 13, 2012.
Accepted August 31, 2012.

References

↵
1. Berger VW
(2000) Pros and cons of permutation tests in clinical trials. Stat Med 19:1319–1328.
OpenUrl CrossRef PubMed
↵
1. Chauret N,
2. Gauthier A,
3. Nicoll-Griffith DA
(1998) Effect of common organic solvents on in vitro cytochrome P450-mediated metabolic activities in human liver microsomes. Drug Metab Dispos 26:1–4.
OpenUrl Abstract/FREE Full Text
↵
1. DeLean A,
2. Munson PJ,
3. Rodbard D
(1978) Simultaneous analysis of families of sigmoidal curves: application to bioassay, radioligand assay, and physiological dose-response curves. Am J Physiol 235:E97–E102.
OpenUrl PubMed
↵
1. Ghanbari F,
2. Rowland-Yeo K,
3. Bloomer JC,
4. Clarke SE,
5. Lennard MS,
6. Tucker GT,
7. Rostami-Hodjegan A
(2006) A critical evaluation of the experimental design of studies of mechanism based enzyme inhibition, with implications for in vitro-in vivo extrapolation. Curr Drug Metab 7:315–334.
OpenUrl CrossRef PubMed
↵
1. Grimm SW,
2. Einolf HJ,
3. Hall SD,
4. He K,
5. Lim HK,
6. Ling KH,
7. Lu C,
8. Nomeir AA,
9. Seibert E,
10. Skordos KW,
11. et al
. (2009) The conduct of in vitro studies to address time-dependent inhibition of drug-metabolizing enzymes: a perspective of the Pharmaceutical Research and Manufacturers of America. Drug Metab Dispos 37:1355–1370.
OpenUrl Abstract/FREE Full Text
↵
1. Hollander M,
2. Wolfe DA
(1999) Nonparametric Statistical Methods, 2nd ed, John Wiley & Sons, Inc., New York.
↵
1. Kalgutkar AS,
2. Obach RS,
3. Maurer TS
(2007) Mechanism-based inactivation of cytochrome P450 enzymes: chemical mechanisms, structure-activity relationships and relationship to clinical drug-drug interactions and idiosyncratic adverse drug reactions. Curr Drug Metab 8:407–447.
OpenUrl CrossRef PubMed
↵
1. Lentner M,
2. Bishop T
(1993) Experimental Design and Analysis, 2nd ed, Valley Book Company, Blacksburg.
↵
1. Mayhew BS,
2. Jones DR,
3. Hall SD
(2000) An in vitro model for predicting in vivo inhibition of cytochrome P450 3A4 by metabolic intermediate complex formation. Drug Metab Dispos 28:1031–1037.
OpenUrl Abstract/FREE Full Text
↵
1. Myers RH
(1990) Classical and Modern Regression with Applications, 2nd ed, Duxbury, Pacific Grove.
↵
1. Pang KS,
2. Rodrigues AD,
3. Peter RM
1. Obach RS,
2. Fahmi OA,
3. Walsky RL
(2010) Inactivation of human cytochrome P450 enzymes and drug-drug interactions, in Enzyme- and Transporter-Based Drug-Drug Interactions Progress and Future Challenges ( Pang KS, Rodrigues AD, Peter RM eds), pp 473–495, Springer, New York.
↵
1. Obach RS,
2. Walsky RL,
3. Venkatakrishnan K
(2007) Mechanism-based inactivation of human cytochrome p450 enzymes and the prediction of drug-drug interactions. Drug Metab Dispos 35:246–255.
OpenUrl Abstract/FREE Full Text
↵
1. Polasek TM,
2. Miners JO
(2007) In vitro approaches to investigate mechanism-based inactivation of CYP enzymes. Expert Opin Drug Metab Toxicol 3:321–329.
OpenUrl CrossRef PubMed
↵
1. Raaijmakers JG
(1987) Statistical analysis of the Michaelis-Menten equation. Biometrics 43:793–803.
OpenUrl CrossRef PubMed
↵
1. Ritchie RJ,
2. Prvan T
(1996) Current statistical methods for estimating the Km and Vmax of Michaelis-Menten kinetics. Biochem Educ 24:196–206.
OpenUrl CrossRef
↵
1. Seber GAF,
2. Wild CJ
(2003) Nonlinear Regression. Wiley-Interscience, Hoboken.
↵
1. Zhang D,
2. Zhu M,
3. Humphreys WG
1. Shou M,
2. Dai R
(2008) Analysis of in vitro cytochrome P450 inhibition in drug discovery and development, in Drug Metabolism in Drug Design and Development: Basic Concepts and Practice ( Zhang D, Zhu M, Humphreys WG eds), pp 513–544, John Wiley & Sons, Inc., Hoboken.
↵
U.S. Department of Health and Human Services Food and Drug Administration Center for Drug Evaluation and Research (2012) Guidance for Industry. Drug Interaction Studies- Study Design, Data Analysis, Implications for Dosing, and Labeling Recommendations (FDA ed.), Rockville, MD.
↵
1. Venkatakrishnan K,
2. Obach RS
(2007) Drug-drug interactions via mechanism-based cytochrome P450 inactivation: points to consider for risk assessment from in vitro data and clinical pharmacologic evaluation. Curr Drug Metab 8:449–462.
OpenUrl CrossRef PubMed
↵
1. Wang YH,
2. Jones DR,
3. Hall SD
(2004) Prediction of cytochrome P450 3A inhibition by verapamil enantiomers and their metabolites. Drug Metab Dispos 32:259–266.
OpenUrl Abstract/FREE Full Text
↵
1. Wienkers LC,
2. Heath TG
(2005) Predicting in vivo drug interactions from in vitro drug discovery data. Nat Rev Drug Discov 4:825–833.
OpenUrl CrossRef PubMed
↵
1. Yang J,
2. Liao M,
3. Shou M,
4. Jamei M,
5. Yeo KR,
6. Tucker GT,
7. Rostami-Hodjegan A
(2008) Cytochrome p450 turnover: regulation of synthesis and degradation, methods for determining rates, and implications for the prediction of drug interactions. Curr Drug Metab 9:384–394.
OpenUrl CrossRef PubMed
↵
1. Zhou SF,
2. Xue CC,
3. Yu XQ,
4. Li C,
5. Wang G
(2007) Clinically important drug interactions potentially involving mechanism-based inhibition of cytochrome P450 3A4 and the role of therapeutic drug monitoring. Ther Drug Monit 29:687–710.
OpenUrl CrossRef PubMed
↵
1. Zimmerlin A,
2. Trunzer M,
3. Faller B
(2011) CYP3A time-dependent inhibition risk assessment validated with 400 reference drugs. Drug Metab Dispos 39:1039–1046.
OpenUrl Abstract/FREE Full Text

In this issue

Download PDF

Article Alerts

Email Article

Citation Tools

Cited By...

More in this TOC Section

Show more Articles

[1] ↵
Berger VW
(2000) Pros and cons of permutation tests in clinical trials. Stat Med 19:1319–1328.
OpenUrl CrossRef PubMed

[2] Berger VW

[3] ↵
Chauret N,
Gauthier A,
Nicoll-Griffith DA
(1998) Effect of common organic solvents on in vitro cytochrome P450-mediated metabolic activities in human liver microsomes. Drug Metab Dispos 26:1–4.
OpenUrl Abstract/FREE Full Text

[4] Chauret N,

[5] Gauthier A,

[6] Nicoll-Griffith DA

[7] ↵
DeLean A,
Munson PJ,
Rodbard D
(1978) Simultaneous analysis of families of sigmoidal curves: application to bioassay, radioligand assay, and physiological dose-response curves. Am J Physiol 235:E97–E102.
OpenUrl PubMed

[8] DeLean A,

[9] Munson PJ,

[10] Rodbard D

[11] ↵
Ghanbari F,
Rowland-Yeo K,
Bloomer JC,
Clarke SE,
Lennard MS,
Tucker GT,
Rostami-Hodjegan A
(2006) A critical evaluation of the experimental design of studies of mechanism based enzyme inhibition, with implications for in vitro-in vivo extrapolation. Curr Drug Metab 7:315–334.
OpenUrl CrossRef PubMed

[12] Ghanbari F,

[13] Rowland-Yeo K,

[14] Bloomer JC,

[15] Clarke SE,

[16] Lennard MS,

[17] Tucker GT,

[18] Rostami-Hodjegan A

[19] ↵
Grimm SW,
Einolf HJ,
Hall SD,
He K,
Lim HK,
Ling KH,
Lu C,
Nomeir AA,
Seibert E,
Skordos KW,
et al
. (2009) The conduct of in vitro studies to address time-dependent inhibition of drug-metabolizing enzymes: a perspective of the Pharmaceutical Research and Manufacturers of America. Drug Metab Dispos 37:1355–1370.
OpenUrl Abstract/FREE Full Text

[20] Grimm SW,

[21] Einolf HJ,

[22] Hall SD,

[23] He K,

[24] Lim HK,

[25] Ling KH,

[26] Lu C,

[27] Nomeir AA,

[28] Seibert E,

[29] Skordos KW,

[30] et al

[31] ↵
Hollander M,
Wolfe DA
(1999) Nonparametric Statistical Methods, 2nd ed, John Wiley & Sons, Inc., New York.

[32] Hollander M,

[33] Wolfe DA

[34] ↵
Kalgutkar AS,
Obach RS,
Maurer TS
(2007) Mechanism-based inactivation of cytochrome P450 enzymes: chemical mechanisms, structure-activity relationships and relationship to clinical drug-drug interactions and idiosyncratic adverse drug reactions. Curr Drug Metab 8:407–447.
OpenUrl CrossRef PubMed

[35] Kalgutkar AS,

[36] Obach RS,

[37] Maurer TS

[38] ↵
Lentner M,
Bishop T
(1993) Experimental Design and Analysis, 2nd ed, Valley Book Company, Blacksburg.

[39] Lentner M,

[40] Bishop T

[41] ↵
Mayhew BS,
Jones DR,
Hall SD
(2000) An in vitro model for predicting in vivo inhibition of cytochrome P450 3A4 by metabolic intermediate complex formation. Drug Metab Dispos 28:1031–1037.
OpenUrl Abstract/FREE Full Text

[42] Mayhew BS,

[43] Jones DR,

[44] Hall SD

[45] ↵
Myers RH
(1990) Classical and Modern Regression with Applications, 2nd ed, Duxbury, Pacific Grove.

[46] Myers RH

[47] ↵
Pang KS,
Rodrigues AD,
Peter RM
Obach RS,
Fahmi OA,
Walsky RL
(2010) Inactivation of human cytochrome P450 enzymes and drug-drug interactions, in Enzyme- and Transporter-Based Drug-Drug Interactions Progress and Future Challenges ( Pang KS, Rodrigues AD, Peter RM eds), pp 473–495, Springer, New York.

[48] Pang KS,

[49] Rodrigues AD,

[50] Peter RM

[51] Obach RS,

[52] Fahmi OA,

[53] Walsky RL

[54] ↵
Obach RS,
Walsky RL,
Venkatakrishnan K
(2007) Mechanism-based inactivation of human cytochrome p450 enzymes and the prediction of drug-drug interactions. Drug Metab Dispos 35:246–255.
OpenUrl Abstract/FREE Full Text

[55] Obach RS,

[56] Walsky RL,

[57] Venkatakrishnan K

[58] ↵
Polasek TM,
Miners JO
(2007) In vitro approaches to investigate mechanism-based inactivation of CYP enzymes. Expert Opin Drug Metab Toxicol 3:321–329.
OpenUrl CrossRef PubMed

[59] Polasek TM,

[60] Miners JO

[61] ↵
Raaijmakers JG
(1987) Statistical analysis of the Michaelis-Menten equation. Biometrics 43:793–803.
OpenUrl CrossRef PubMed

[62] Raaijmakers JG

[63] ↵
Ritchie RJ,
Prvan T
(1996) Current statistical methods for estimating the Km and Vmax of Michaelis-Menten kinetics. Biochem Educ 24:196–206.
OpenUrl CrossRef

[64] Ritchie RJ,

[65] Prvan T

[66] ↵
Seber GAF,
Wild CJ
(2003) Nonlinear Regression. Wiley-Interscience, Hoboken.

[67] Seber GAF,

[68] Wild CJ

[69] ↵
Zhang D,
Zhu M,
Humphreys WG
Shou M,
Dai R
(2008) Analysis of in vitro cytochrome P450 inhibition in drug discovery and development, in Drug Metabolism in Drug Design and Development: Basic Concepts and Practice ( Zhang D, Zhu M, Humphreys WG eds), pp 513–544, John Wiley & Sons, Inc., Hoboken.

[70] Zhang D,

[71] Zhu M,

[72] Humphreys WG

[73] Shou M,

[74] Dai R

[75] ↵
U.S. Department of Health and Human Services Food and Drug Administration Center for Drug Evaluation and Research (2012) Guidance for Industry. Drug Interaction Studies- Study Design, Data Analysis, Implications for Dosing, and Labeling Recommendations (FDA ed.), Rockville, MD.

[76] ↵
Venkatakrishnan K,
Obach RS
(2007) Drug-drug interactions via mechanism-based cytochrome P450 inactivation: points to consider for risk assessment from in vitro data and clinical pharmacologic evaluation. Curr Drug Metab 8:449–462.
OpenUrl CrossRef PubMed

[77] Venkatakrishnan K,

[78] Obach RS

[79] ↵
Wang YH,
Jones DR,
Hall SD
(2004) Prediction of cytochrome P450 3A inhibition by verapamil enantiomers and their metabolites. Drug Metab Dispos 32:259–266.
OpenUrl Abstract/FREE Full Text

[80] Wang YH,

[81] Jones DR,

[82] Hall SD

[83] ↵
Wienkers LC,
Heath TG
(2005) Predicting in vivo drug interactions from in vitro drug discovery data. Nat Rev Drug Discov 4:825–833.
OpenUrl CrossRef PubMed

[84] Wienkers LC,

[85] Heath TG

[86] ↵
Yang J,
Liao M,
Shou M,
Jamei M,
Yeo KR,
Tucker GT,
Rostami-Hodjegan A
(2008) Cytochrome p450 turnover: regulation of synthesis and degradation, methods for determining rates, and implications for the prediction of drug interactions. Curr Drug Metab 9:384–394.
OpenUrl CrossRef PubMed

[87] Yang J,

[88] Liao M,

[89] Shou M,

[90] Jamei M,

[91] Yeo KR,

[92] Tucker GT,

[93] Rostami-Hodjegan A

[94] ↵
Zhou SF,
Xue CC,
Yu XQ,
Li C,
Wang G
(2007) Clinically important drug interactions potentially involving mechanism-based inhibition of cytochrome P450 3A4 and the role of therapeutic drug monitoring. Ther Drug Monit 29:687–710.
OpenUrl CrossRef PubMed

[95] Zhou SF,

[96] Xue CC,

[97] Yu XQ,

[98] Li C,

[99] Wang G

[100] ↵
Zimmerlin A,
Trunzer M,
Faller B
(2011) CYP3A time-dependent inhibition risk assessment validated with 400 reference drugs. Drug Metab Dispos 39:1039–1046.
OpenUrl Abstract/FREE Full Text

[101] Zimmerlin A,

[102] Trunzer M,

[103] Faller B

Main menu

User menu

Search

Statistical Methods for Analysis of Time-Dependent Inhibition of Cytochrome P450 Enzymes

Abstract

Introduction