Abstract
Human and rat hepatocytes have a strong tendency to underpredict hepatic intrinsic clearance (CLint) and the extent of underprediction increases with increasing observed CLint. In this study, application of the log average rat hepatocyte–rat in vivo empirical scaling factor (ESF) of 4.2 to human hepatocyte prediction successfully removed bias but did not improve precision. An analogous method using individual drug rat ESFs only achieved marginal improvement in accuracy but not precision. A novel approach to resolve clearance-dependent prediction, involving rat ESFs calculated for particular (order of magnitude) ranges of observed CLint (log average range, 0.12−2.1) improved human prediction precision but only modestly reduced bias. However, rat in vivo CLint was several-fold greater than human in vivo CLint and this was reflected in greater rat hepatocyte and microsome CLint, suggesting that rat metabolic enzymes are more efficient than their human counterparts, by several-fold. By applying the segregated rat ESFs followed by the human/rat CLint ratio, which was consistent regardless of CLint (log average 3.5), both accuracy and precision were improved, providing both a means of mitigating clearance dependence and reaffirming the potential role of rat hepatocytes for prediction of human metabolic CLint. These cross-species observations indicate that underprediction from human in vitro systems may be predominantly consequential of an intrinsic property of the in vitro system rather than individual drug properties.
Introduction
A recent article by Wood et al. (2017) demonstrated that both rat and human hepatic in vitro systems (hepatocytes and microsomes) tend to underpredict in vivo intrinsic clearance (CLint) in a clearance-dependent way. The reason for this is not yet clear but in vitro factors including cofactor exhaustion (Swales and Utesch, 1998; Hengstler et al., 2000; Wang et al., 2005) and a rate-limiting unstirred water layer (Wood et al., 2018) may play significant roles. Underprediction of hepatic CLint using standard in vitro systems, particularly human, has been recognized for more than a decade but the bias has thus far been generally assumed to be imprecise and irrespective of in vivo CLint. Over this time, pragmatic empirical correction of bias has frequently been advocated and although this has not been limited to the use of a simple average scaling factor, it has not recognized the clearance dependence (Ito and Houston, 2005; Poulin et al., 2012; Sohlenius-Sternbeck et al., 2012; Yamagata et al., 2017).
In the early 2000s, it was considered that prediction of clearance from rat hepatocytes was relatively successful compared with the equivalent results from the then emerging use of human hepatocytes (Naritomi et al., 2001, 2003). From this perspective, Naritomi et al. (2003) found that scaling human hepatocyte CLint using the single drug observed/predicted CLint ratio (single drug empirical scaling factor [ESFsd]) from rat hepatocytes improved human predictions to mostly within 5-fold for nine model compounds. They speculated that each compound had an intrinsic scaling factor due to either in vitro nonspecific binding, a lack of equilibrium of blood binding (in vivo), involvement of bound drug in uptake, or heterogeneous distribution of transporters and metabolic enzymes in the liver. Since then, considerable evidence has accumulated confirming that human hepatocytes generally underpredict in vivo CLint (Shibata et al., 2002; Hallifax et al., 2005; Ito and Houston, 2005; Riley et al., 2005; Brown et al., 2007; Chiba et al., 2009), attributed to unavoidable donor phenotypic variability, potentially detrimental processing (preparation and storage), and in vitro lability of metabolizing enzymes and uptake transporters, all of which are potential sources of variation and possibly bias. The removal of this bias can be achieved by applying an average hepatocyte–in vivo empirical scaling factor (ESFav) but the lack of any mechanistic basis for this correction limits its use prospectively.
A recent study by De Bruyn et al. (2018) investigated the utility of monkey hepatocytes to improve prediction of human clearance for a range of drugs cleared by hepatic uptake transporters. The authors found that individual drug scaling factors from monkeys applied to human data improved prediction. However, they also found that use of a monkey ESFav gave similar improvements in prediction, which confounds the hypothesis that cross-species scaling facilitates common factors related to specific drug properties. This recent attempt at a cross-species approach to maximize human clearance prediction from human hepatocytes has prompted further investigation of the clearance dependence of prediction bias in humans and rats using the dataset of Wood et al. (2017). Our aim was to establish whether the clearance dependence in human hepatic CLint prediction from hepatocytes could be corrected by scaling from rat hepatocyte CLint based on common system performance; several scaling approaches, including conventional ESFs (ESFav, ESFsd) and a novel, multi-ESF approach, were compared (see Table 1).
The use of a cross-species ESF has the potential advantage of being an independent measure of isolated hepatocyte functionality with respect to in vivo activity and hence differs from the conventional ESF, which applies within the same species. Hence it might be used prospectively together with the capacity scalers (hepatocellularity and/or proteomic measurements) to boost confidence in human prediction from either rat or human hepatocytes by way of providing a measure of in vitro functional activity.
Materials and Methods
Data Source.
Datasets for human (n = 101 for hepatocytes and 83 for microsomes) and rat (n = 128 for hepatocytes and 71 for microsomes) in vitro CLint and complementary in vivo CLint were taken from Wood et al. (2017), in turn based on examination of published datasets, including both approved pharmaceuticals and investigatory proprietary compounds in which blood clearance or plasma clearance (CLp) was determined from intravenous dosing and where blood clearance did not exceed hepatic blood flow. This included in vitro CLint determined from either metabolite formation (over a range of substrate concentrations) or from substrate depletion from single substrate concentration depletion time profiles. All hepatocyte data were from suspended cell (fresh or cryopreserved) assays without the addition of serum albumin. Some data were mean values of several independent publications. The datasets were considered to predominantly comprise highly permeable drugs, of which the majority would be expected to be cleared by metabolism without rate limitation by transport. All CLint values had been corrected for unbound drug and physiologically scaled to the whole liver using standard methods. The in vivo CLint values taken for this study were those calculated from the well stirred liver model, after correction for renal clearance where applicable. The fraction unbound in blood (fub) was also taken from this dataset.
Correlation Analysis.
Predictions of in vivo CLint from human hepatocytes were compared with observed in vivo CLint for the whole datasets and for the subset of drugs common to both species, as in Wood et al. (2017). The human predicted CLint values were compared with corresponding in vivo reported values directly (approach 1) and then corrected using a series of ESFs based on rat predictions of CLint, as listed in Table 1. First, previously advocated ESFs calculated from rat predictions were applied (approaches 2 and 3 for ESFav and ESFsd, respectively).
To investigate the mechanistic origin of the clearance-dependent bias in prediction, direct comparison between species in vivo (approaches 4 or 5) or the in vitro system (approaches 6–9) was examined using subsets of common drugs within the whole dataset. In addition, approaches 10 and 11 explore correlations between human hepatocyte and human microsome CLint (common drugs in humans) and between human hepatocytes and human microsomes from the same donor livers from a specific study (Foster et al., 2011).
Subsequently, a novel prediction correction approach using ESFs based on prediction data segregated according to a particular range of CLint values (ESFseg) was examined as a potential means of correcting for clearance-dependent bias. Approaches 12–14 investigate human and rat predictions segregated according to each order of magnitude of observed CLint (five levels: 10–10,000 ml/min/kg).
Assessment of Accuracy and Precision of Predictions.
The bias in predictions was assessed by calculation of the geometric mean fold error (GMFE) (eq. 1). The root mean squared error (RMSE) (eq. 2) was used as a measure of precision.(1)(2)where n is the number of predictions.
Residuals (log 10 predicted/observed) of predictions were examined graphically and the percentage of CLint predictions within 2-fold of in vivo was used as an indicator of predictive accuracy, consistent with previous publications (Obach, 1999; Sohlenius-Sternbeck et al., 2012; Chan et al., 2013).
Results
Human Hepatocyte In Vitro−In Vivo Correlation.
Human in vivo CLint was predominantly underpredicted in hepatocytes (Fig. 1, A and B); the average fold-underprediction (GMFE) was 4.2. The overall prediction precision, represented by an RMSE value of 3550, provides a benchmark against which subsequent predictions, where correction factors are applied, can be compared (Table 2).
As noted previously, microsomes display a wider range of CLint predictions than hepatocytes (human and rat); human hepatocyte predicted CLint ranged from approximately 1 to 1000 ml/min/kg compared with approximately 0.1 to 10,000 ml/min/kg in microsomes (Wood et al., 2017). In rats, predicted CLint was limited to approximately 10,000 ml/min/kg in hepatocytes, whereas it reached 100,000 ml/min/kg in microsomes (Wood et al., 2017). Prediction of CLint was clearance dependent in both species (and systems), with negative bias increasing with increasing CLint. The intercept of the prediction trend line with the line of unity occurred at about 10 ml/min/kg for humans and about 100 ml/min/kg for rats, signifying generally greater rates in rats compared with humans.
Cross-Species Correction of Bias: Rat−Human Scaling Factors.
The similar GMFE prediction bias between humans (4.2; Table 2) and rats (4.7; Wood et al., 2017) indicated potential for consistent cross-species correction for hepatocyte prediction. By applying this simple factor (rat ESFav), bias in human prediction was effectively abolished; the rat-corrected human hepatocyte GMFE was 0.896 (Table 2). However, as expected, precision was virtually unaffected (RMSE = 3530); consequently, clearance-dependent bias remained (Fig. 1, C and D).
Alternatively, using the ESFs for individual drugs (ESFsd), bias was only marginally reduced (GMFE from 4.35 to 3.82). For this drug-specific approach, the precision remained about the same (RMSE = 1010) compared with the common drug dataset predictions when no ESF was applied (Fig. 1, E and F; Table 2).
Rat−Human Correlation In Vivo.
Comparison of in vivo CLint for common drugs between humans and rats revealed that human in vivo CLint is on average several-fold less than for rats (Fig. 2), similar to the average bias in human hepatocyte prediction of CLint (Table 2). By visual inspection, there was no clearance dependence in this bias, although the dataset was considerably smaller (n = 23) than the in vitro–in vivo datasets for all drugs (n = 101 for humans and 128 for rats).
There was some discrepancy between human and rat fub, although there was no evidence of bias between the species (Fig. 3). Where fub was less than 0.1, differences between rats and humans ranged several-fold, indicating potentially considerable differences in CLint for highly bound drugs.
Rat−Human Correlation In Vitro.
Predicted CLint from human hepatocytes was consistently lower than from rat hepatocytes and was clearance dependent (Fig. 4A). Prediction from human microsomes was also consistently lower than from rats; in contrast, there was no clear evidence of clearance dependence (Fig. 4B). More direct assessment of CLint between the species at the cellular and subcellular levels was enabled by comparison of unscaled CLint (Fig. 4, C and D); in this case, only marginal convergence due to the differences in physiologic scaling factors could be seen and hence the above observations were found to hold.
Human Hepatocyte−Microsome Correlation.
Human hepatocyte predictions of CLint were marginally lower on average than human microsome predictions, with a tendency for bias to increase with CLint (Fig. 5A). This trend was corroborated by a discrete dataset of CLint for a series of benzodiazepines in hepatocytes and microsomes from the same livers, from four donors (Foster et al., 2011) (Fig. 5B). In this more specific dataset, hepatocyte bias was less negative for all substrates in those donors showing overall lower CLint for all pathways.
Rat−Human Correction of Bias: Clearance-Dependent Scaling Factors.
When both the human and rat datasets were segregated into several discrete levels of in vivo CLint (Table 3), so that predictions within each human level (one order of magnitude of in vivo CLint) were corrected using the particular equivalent rat CLint level average ESF (ESFseg), clearance dependence was effectively eliminated (Fig. 6, A and B). However, considerable bias remained (GMFE = 2.82) although precision was increased (RMSE = 3030) compared with uncorrected predictions (Table 2). The ESFseg values are given in Table 3. To remove the consistent bias, each rat ESFseg was multiplied by the ratio of log average human ESF to log average rat ESF; this two-step scaling approach achieved effective elimination of bias (GMFE = 0.802; Fig. 6, E and F).
In comparison, using human ESFseg to correct human predictions completely eliminated the bias, as expected (GMFE = 1.00) (Fig. 6, C and D). The precision was greater than any other method applied to this dataset (RMSE = 1990), providing a representation of the optimum precision obtainable by this kind of empirical correction for this particular dataset. The precision achieved using the rat ESFseg (together with the ratio of human ESFav/rat ESFav, as above) was similar, with an RMSE of 2100.
Discussion
It has become clear in recent years that physiologic scaling of CLint obtained from suspended human hepatocytes not only underpredicts in vivo CLint on average (Ito and Houston, 2005; Riley et al., 2005; Brown et al., 2007; Bowman and Benet, 2016) but does so in a clearance-dependent way (Hallifax et al., 2010; Wood et al., 2017). Although the reasons for this dependence remain unclear but could involve effects of the unstirred water layer and/or cofactor depletion in vitro (Hengstler et al., 2000; Hewitt et al., 2000; Hewitt and Utesch, 2004; Hallifax et al., 2010; Foster et al., 2011; Wood et al., 2017, 2018), achieving resolution of this clearance-dependent bias from the considerable prediction uncertainty has provided an opportunity to apply better targeted empirical correction. Because prediction from rat hepatocytes has now been shown to be clearance dependent in the same manner as human hepatocytes (Wood et al., 2017), there appears to exist a basis for renewed potential in the utility of rat hepatocytes for prediction of human in vivo CLint.
The main purpose of this study was to compare conventional ESFs between rat and human hepatocyte CLint with a novel ESF scaling approach to deal with clearance dependence. At the same time, we examined the relationships between rat and human CLint both in vivo and in vitro and between human hepatocytes and microsomes for further mechanistic insight into species-dependent and species-independent performance in vitro. This might offer a means of empirical enhancement of conventional scaling that would take account of factors affecting inherent functional activity in vitro, in addition to physiologic capacity.
For these datasets comprising drugs primarily cleared by metabolism without permeability limitation, use of the rat average ESF clearly removes the average bias for human prediction but without increasing precision, leaving unsatisfactory prediction uncertainty. Despite the considerable imprecision among the predictions, clearance dependence was previously resolved (Hallifax et al., 2010; Wood et al., 2017); as such, it is clear that a single scaling factor is inappropriate. Scaling factors for individual drugs is a concept that gained some traction after studies by Naritomi et al. (2001, 2003) showed prediction improvement (from 12- to 199-fold to within about 5-fold for hepatocytes), which was attributed to drug-specific factors. Yet according to the literature, this approach does not appear to have become widely adopted; rather, various groups have pursued a variety of alternative, empirical, and semi-mechanistic approaches to the problem of underprediction of clearance (Sohlenius-Sternbeck et al., 2012; Yamagata et al., 2017). Wood et al. (2017) examined literature prediction datasets for common drugs between humans and rats (as used in our study) and found no correlation of individual ESFs between rats and humans, which does not support a drug-specific premise. Using the same data, we examined the relationship between human and rat hepatocyte predictions rather than individual ESFs; although there was a correlation, which was biased in favor of the rat, it was clearance dependent. Thus there remains little evidence that drug-specific ESFs offer an improvement in prediction uncertainty. This agrees with recent findings for prediction of transporter-mediated clearance, in which the application of individual drug ESFs from monkey hepatocytes to human hepatocyte CLint did not markedly improve prediction of in vivo clearance compared with the use of an average monkey hepatocyte ESF (De Bruyn et al., 2018). It is apparent that other factors in the in vitro–scaling processes need to be considered and resolved.
The subset of predictions for drugs common to both human and rat hepatocyte predictions showed that in vitro CLint tends to be greater in rats by several-fold, although with considerable imprecision, similar to in vitro–in vivo predictions. From a broad interspecies scaling perspective, it is expected that rat hepatic clearance exceeds that of humans due to greater liver relative volume and greater relative hepatic blood flow. However, when focusing specifically on the CLint parameter, which is normalized to hepatic blood flow, the remaining interspecies discrepancy would signify that the rat liver is inherently more efficient at clearing drugs than its human counterpart. As apparent in this study, human hepatocyte CLint is less than that of rats in a clearance-dependent way; at high CLint (>100 ml/min/kg), there is a tendency toward a more than 10-fold difference between the species. However, while the generally lower CLint in human hepatocytes is reflected in the equivalent comparison for microsomes, there does not appear to be a clearance-dependent relationship in this subcellular system. Therefore, a consistent difference between rat and human hepatic metabolic CLint (of several-fold) is indicated. This appears to reflect the general difference in in vivo CLint between the species, suggesting a generally greater capacity and/or affinity of drug-metabolizing enzymes in rat hepatocytes. As hepatocytes from both species give clearance-dependent prediction (Wood et al., 2017), the consistency in microsomes indicates the existence of both a species-dependent factor (difference in metabolic efficiency) and a species-independent factor underlying a relatively greater inability of human hepatocytes to process high-clearance drugs compared with rat hepatocytes. Thus, the interspecies examination described appears to resolve factors underlying the performance of hepatocytes in vitro, despite the uncertainty recognized among predictions, and this may support attempts at a general, empirical approach to address clearance dependence in prediction.
Despite the complication of clearance-dependent prediction from human and rat hepatocytes, this whole-cell in vitro system should remain the preferred choice compared to hepatic microsomes. In addition to accommodating a more comprehensive set of metabolic pathways than microsomes (Engtrakul et al., 2005; Riley et al., 2005; Hallifax et al., 2010), hepatocytes allow potentially influential uptake as well as the overall disposition processes to be assessed, which is increasingly important given trends in drug discovery toward larger molecules with limited permeability. To address the clearance dependence in prediction (at least empirically), it is clear that ESFs need to be clearance dependent as well. In addition, the possibility emerges that such scaling factors from the rat, as a key preclinical species, can be applied to the human situation prospectively for improved prediction among sets of test compounds. To this end, a panel of segregated rat ESFs (ESFseg) was calculated based on the separation of paired predicted and observed values, according to different ranges of observed CLint. A relatively simple segregation, by order of magnitude of observed CLint, was found adequate to achieve effective removal of clearance dependence. Although bias between the species remained, this was now relatively consistent, reflecting some improvement in precision. Having removed the clearance dependence, comparison of ESFseg between rats and humans by CLint level revealed a relatively constant cross-species factor for hepatocytes, as anticipated in the earlier species comparisons. Therefore, a second ESF (human ESFav/rat ESFav) was introduced to correct this apparently inherent difference in metabolic rate between the two species. By applying these ESFs in sequence, human hepatocyte predictions became largely free of bias with improved precision—virtually identical to the human internally corrected predictions. Hence, a simple basis for cross-species scaling was established that offers an overall improvement of predictions of human CLint from rat hepatocyte CLint. For these drugs (metabolically cleared without permeability limitation), it can be seen that a human ESF at one particular level of CLint is approximately matched by a rat ESF at the CLint level one order of magnitude greater. This displacement appears to equal an interspecies factor of about 3-fold (as discussed above), which when factored by the clearance-dependent ESF (ESFseg) allows consistent correction between these two species. Therefore, sets of predictions of human CLint via rat hepatocytes might be more confidently undertaken using the rat scaling factors described above, in addition to the direct use of human ESFseg to human hepatocyte CLint. While it is tempting to assume that CLint for any individual drug in rats can be successfully scaled to the human situation by this factor, the observation that specific drug hepatocyte CLint appears not to be linearly related between humans and rats means that a species differential may not hold consistently for individual drugs.
When CLint was directly compared between hepatocytes and microsomes from the same human liver donors, there was a tendency for CLint in both systems to be relatively low or high between individuals (Foster et al., 2011). Because this phenomenon holds for low and high CLint pathways for the same drug, a consistent liver factor may be invoked that feeds through to a difference between hepatocyte and microsome CLint according to the level of CLint. For individuals with relatively high hepatic activity compared with others, there is thus a tendency for the highest CLint pathways to give lower CLint in hepatocytes compared with microsomes (and vice versa). Therefore, it appears that the CLint level, rather than the individual drug, is the driver of this scaling phenomenon. Notwithstanding inevitable contributions from drug-specific factors, a predominantly system-specific phenomenon would explain why drug-specific ESFs (ESFsd) are not particularly successful, as alluded to above. It is also notable that this relationship prevails at low CLint where for human prediction (<10 ml/min per kilogram), the incidence of underprediction is more or less negated by the incidence of overprediction (in terms of bias). The reasons for this system-specific scaling phenomenon are not clear, although some difference in metabolic capacity between rat and human hepatocytes is indicated in this study.
Beyond the system-specific factor, there are species differences between hepatocytes whose impact can be dependent on the drug, such as the shaking of hepatocyte incubations (Wood et al., 2018). Other drug-specific species differences in hepatic clearance include metabolic and transporter pathways, not least due to species differences in cytochrome P450 enzymes or transporters resulting in differences in capacity and/or affinity. In addition, differences in the extent of binding to plasma proteins could reflect significant species differences in protein. Hence, there is potential for recategorization of a drug’s clearance level between rats and humans and factors specific between drug- and species-specific proteins may further explain why drug-specific ESFs are not particularly successful. In addition, experimental methodology may confound species comparison, including failure to identify high-affinity clearance pathways due to inadequately low substrate concentration (Klopf and Worboys, 2010; Wood et al., 2018).
The relationship between human and rat CLint both in vitro and in vivo has been explored for historical datasets of drugs comprising drugs predominantly cleared by metabolism without permeability limitation and, based on the species-dependent and species-independent factors revealed, viable empirical improvement of human CLint prediction from either human or rat hepatocytes is indicated. Nevertheless, further experimental examination of the sources of uncertainty are advocated, including potentially influential drug-specific uncertainties, due to the need for more accurate in vitro pharmacokinetic parameter values for confident physiologically based pharmacokinetic modeling.
Authorship Contributions
Participated in research design: Hallifax, Houston.
Performed data analysis: Hallifax, Houston.
Wrote or contributed to the writing of the manuscript: Hallifax, Houston.
Footnotes
- Received October 29, 2018.
- Accepted December 20, 2018.
This work was supported by the Centre for Applied Pharmacokinetic Research consortium membership, which included Certara, Genentech, Janssen Pharmaceutica, and Eli Lilly & Co.
Abbreviations
- CLint
- intrinsic clearance
- ESF
- empirical scaling factor
- ESFav
- average empirical scaling factor
- ESFsd
- single drug empirical scaling factor
- ESFseg
- segregated empirical scaling factor
- fub
- fraction unbound in blood
- GMFE
- geometric mean fold error
- RMSE
- root mean squared error
- Copyright © 2019 by The American Society for Pharmacology and Experimental Therapeutics