Predicting Volume of Distribution in Humans: Performance of In Silico Methods for a Large Set of Structurally Diverse Clinical Compounds

Neha Murad; Kishore K. Pasikanti; Benjamin D. Madej; Amanda Minnich; Juliet M. McComas; Sabrinia Crouch; Joseph W. Polli; Andrew D. Weber

doi:10.1124/dmd.120.000202

Abstract

Volume of distribution at steady state (V_D,ss) is one of the key pharmacokinetic parameters estimated during the drug discovery process. Despite considerable efforts to predict V_D,ss, accuracy and choice of prediction methods remain a challenge, with evaluations constrained to a small set (<150) of compounds. To address these issues, a series of in silico methods for predicting human V_D,ss directly from structure were evaluated using a large set of clinical compounds. Machine learning (ML) models were built to predict V_D,ss directly and to predict input parameters required for mechanistic and empirical V_D,ss predictions. In addition, log D, fraction unbound in plasma (fup), and blood-to-plasma partition ratio (BPR) were measured on 254 compounds to estimate the impact of measured data on predictive performance of mechanistic models. Furthermore, the impact of novel methodologies such as measuring partition (Kp) in adipocytes and myocytes (n = 189) on V_D,ss predictions was also investigated. In predicting V_D,ss directly from chemical structures, both mechanistic and empirical scaling using a combination of predicted rat and dog V_D,ss demonstrated comparable performance (62%–71% within 3-fold). The direct ML model outperformed other in silico methods (75% within 3-fold, r² = 0.5, AAFE = 2.2) when built from a larger data set. Scaling to human from predicted V_D,ss of either rat or dog yielded poor results (<47% within 3-fold). Measured fup and BPR improved performance of mechanistic V_D,ss predictions significantly (81% within 3-fold, r² = 0.6, AAFE = 2.0). Adipocyte intracellular Kp showed good correlation to the V_D,ss but was limited in estimating the compounds with low V_D,ss.

SIGNIFICANCE STATEMENT This work advances the in silico prediction of V_D,ss directly from structure and with the aid of in vitro data. Rigorous and comprehensive evaluation of various methods using a large set of clinical compounds (n = 956) is presented. The scale of techniques evaluated is far beyond any previously presented. The novel data set (n = 254) generated using a single protocol for each in vitro assay reported in this study could further aid in advancing V_D,ss prediction methodologies.

Introduction

The current drug discovery path is a sequential, time-consuming process with a high attrition rate (Hinkson et al., 2020). Attrition of small-molecule drug candidates due to poor pharmacokinetic (PK) profiles has diminished significantly in recent years (Waring et al., 2015). This advancement can partly be attributed to the unprecedented emphasis on screening compounds based on PK parameters in the drug discovery phase (Ferreira and Andricopulo, 2019). PK is a well recognized and fundamental property that influences drug concentrations at target, which ultimately determines a drug’s efficacy and safety (Ferreira and Andricopulo, 2019). Volume of distribution at steady state (V_D,ss) is a key PK parameter that describes the relationship between drug concentration measured in plasma or blood to the amount of drug in the body at equilibrium (Smith et al., 2015). Estimation of apparent V_D,ss is of utmost importance because it influences C_max and half-life in plasma and target tissues, which in turn determines dose and dosing regimen in the clinic (del Amo et al., 2013). Toward this end, V_D,ss in humans is commonly predicted using preclinical in vivo and in vitro data in conjunction with various allometric scaling methods such as the Oie and Tozer method (Jones et al., 2011). Alternatively, V_D,ss can be extrapolated from tissue-to-plasma partition coefficients (Kp) from preclinical species (generally rat) (Nigade et al., 2019). These experiments are resource-intensive and require the synthesis of compounds; these limitations further hinder the ability to predict human V_D,ss early in drug discovery or during lead optimization. Thus, considerable effort has been undertaken to develop predictive in silico models to accelerate and reduce the cost of drug discovery processes (Wenzel et al., 2019). 
As V_D,ss is dependent on the tissue partitioning of compounds, numerous studies have focused on developing in silico approaches to predict tissue partitioning based on physicochemical properties such as pKa and log P, plasma protein binding, and blood-to-plasma partition ratio (BPR) (Graham et al., 2012; del Amo et al., 2013). Poulin and Theil were among the first to propose a mechanistic Kp prediction method (Poulin and Krishnan, 1995; Poulin and Theil, 2002). This method incorporates several important mechanisms, such as albumin binding, neutral lipid binding, and phospholipid binding. The Berezhkovskiy (2004) method is similar to that of Poulin and Theil. The Rodgers and Rowland method (Rodgers et al., 2005; Rodgers and Rowland, 2006) is by far the most comprehensive Kp prediction method in terms of mechanisms captured. It includes all the mechanisms captured in previously published methods and adds acidic phospholipid and cytosolic ion partitioning. A drawback of the Rodgers and Rowland method is that it uses two sets of equations depending on the dissociation constant (pKa) of the compound, with the switch between them set at a pKa of 7. This results in a discontinuous relationship between the dissociation constant and plasma-tissue partitioning. The method is also heavily dependent on accurate pKa predictions. To address these issues, a modified Rodgers and Rowland method was developed (Lukacova et al., 2008) that employs a single continuous combined equation for compounds regardless of pKa. Ion partitioning into acidic or basic intracellular compartments (lysosomes and mitochondria) was described by Trapp et al. (2008) and can be used as an aid to Kp prediction for compounds for which ion trapping is expected. The key mechanisms governing partitioning between plasma and specific organ tissue that each prediction method implements are summarized in Table 1.

TABLE 1

Comparison of mechanistic tissue partitioning (Kp) prediction methods

Accurately predicting V_D,ss remains a challenge that has not been adequately solved (Smith et al., 2015). Few studies have evaluated the performance of various V_D,ss prediction methods; however, these reports were either in preclinical species (Graham et al., 2012) or used a small set (<150) of clinical compounds (Jones et al., 2011; Korzekwa and Nagar, 2017; Chan et al., 2018; Nigade et al., 2019; Mayumi et al., 2020). Recently, Lombardo et al. (2018) published a manually curated data set of V_D,ss for 1352 drugs after intravenous dosing, which presented an opportunity to evaluate the predictive performance of various V_D,ss methodologies in determining human V_D,ss. Therefore, we investigated the 1) performance of the most common V_D,ss prediction strategies, 2) sensitivity of input parameters that influence V_D,ss predictions, 3) impact of experimental data on mechanistic V_D,ss predictions, and 4) whether novel methodologies such as using adipocyte and myocyte cell partitioning could improve V_D,ss predictions.

Materials and Methods

Experimental Approaches

The V_D,ss prediction strategies investigated are broadly categorized into two approaches based on the starting data for the analysis, which is either fully in silico (e.g., structural) or in vitro (experimental). Based on the compound availability, an initial in vitro experimental data set of 331 compounds (Lombardo et al., 2018) was identified. Predictive performances were assessed using 956 compounds for the in silico and 254 compounds for the in vitro experimental approaches, respectively.

For the in silico approach, V_D,ss was predicted directly from chemical structure [using compound Simplified Molecular Input Line Entry System (SMILES) as input] by the following four approaches: 1) mechanistic V_D,ss prediction using physicochemical properties predicted by commercial software (ADMET Predictor 9.0), 2) mechanistic V_D,ss prediction using physicochemical properties predicted by machine learning (ML) models generated by the Accelerating Therapeutics for Opportunities in Medicine (ATOM) consortium, 3) allometric scaling from V_D,ss predicted for preclinical species (rat and dog) by ML models, and 4) direct human V_D,ss prediction using an ML model built on clinical compounds (see schematic shown in Fig. 1).

Fig. 1.

Overview of human V_D,ss prediction methods and input parameters (in silico and in vitro data) evaluated in this study.

In the Experimental Data approach, two distinct experimental data sets were generated. The first experimental data set included measurement of physicochemical properties under a single protocol for each in vitro experiment, which included log D, fraction unbound in plasma (fup), and BPR for 331 clinical compounds (Lombardo et al., 2018). These experimental data were used as input parameters individually or in combination to predict mechanistic V_D,ss (Lukacova et al., 2008). In addition, novel experiments were conducted to determine partition of compounds in human adipocytes and myocytes for 200 compounds that were a subset of the 331 compounds selected above. In silico and experimental methodologies are described in detail below. The percentage of compounds with predicted V_D,ss within 2-, 3-, or 10-fold of the observed value; r² (square of the Pearson correlation coefficient); and absolute average fold error (AAFE) were used as key criteria for comparing the predictive performance of each method.
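The three comparison criteria can be illustrated with a minimal sketch. The source does not spell out the formulas, so the definitions below are assumptions: fold error is taken as max(predicted/observed, observed/predicted), AAFE as 10 raised to the mean absolute log10 ratio (a commonly used definition), and r² is computed on log10-transformed values. All function names and the example values are hypothetical.

```python
import math

def fold_errors(predicted, observed):
    """Symmetric fold error per compound: always >= 1."""
    return [max(p / o, o / p) for p, o in zip(predicted, observed)]

def percent_within_fold(predicted, observed, fold):
    """Percentage of compounds whose fold error is at most `fold`."""
    errors = fold_errors(predicted, observed)
    return 100.0 * sum(e <= fold for e in errors) / len(errors)

def aafe(predicted, observed):
    """Assumed definition: 10 ** mean(|log10(predicted / observed)|)."""
    logs = [abs(math.log10(p / o)) for p, o in zip(predicted, observed)]
    return 10 ** (sum(logs) / len(logs))

def pearson_r2(x, y):
    """Square of the Pearson correlation coefficient."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy ** 2 / (sxx * syy)

# Hypothetical V_D,ss values (L/kg), for illustration only.
observed = [1.0, 2.0, 10.0, 0.5]
predicted = [2.0, 2.0, 2.5, 0.5]

within_2 = percent_within_fold(predicted, observed, 2)
within_10 = percent_within_fold(predicted, observed, 10)
aafe_value = aafe(predicted, observed)
# Assumption: correlation is evaluated on the log10 scale.
r2_log = pearson_r2([math.log10(v) for v in predicted],
                    [math.log10(v) for v in observed])
```

Because fold error is symmetric, a 4-fold underprediction and a 4-fold overprediction contribute equally to both the percentage-within-fold criteria and AAFE.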

In Silico Methods

V_D,ss of the clinical compounds data set (Lombardo et al., 2018) was subdivided based on whether experimental data were directly measured (331 compounds) or not (970 compounds). Evaluation of in silico methods was performed on both data sets. It is important to note that all the evaluations were performed on a complete hold-out set. For example, when predicting V_D,ss for the experimental data set, none of the compounds in the experimental data set were a part of any of the ML model building data sets.

ADMET Mechanistic V_D,ss Prediction.

ADMET Predictor (version 9.0) was used to predict pKa (S + Acidic_pKa, S + Basic_pKa), fraction unbound in plasma (hum_fup%, converted to fup), BPR, and log P/D (S + log D, S + log P) from chemical structure. These parameters were subsequently used as input parameters to predict mechanistic Kp and human V_D,ss predictions (Lukacova et al., 2008). Predicted values of the input parameters were limited to typical assay limits for each of the input parameters (hum_fup%: 0.1%–100%, BPR: 0–200, log P and log D: −3 to 10).
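As a minimal sketch of the limiting step described above, predicted inputs can be clipped to the stated assay ranges before use, with hum_fup% converted to a fraction. The dictionary keys and function names are hypothetical and do not correspond to ADMET Predictor output fields.

```python
# Assay limits as stated in the text; the key names are hypothetical.
ASSAY_LIMITS = {
    "hum_fup_percent": (0.1, 100.0),
    "bpr": (0.0, 200.0),
    "log_p": (-3.0, 10.0),
    "log_d": (-3.0, 10.0),
}

def clip(value, low, high):
    """Restrict value to the closed interval [low, high]."""
    return max(low, min(high, value))

def prepare_inputs(predicted):
    """Clip each predicted parameter, then convert hum_fup% to fup (0-1)."""
    clipped = {name: clip(predicted[name], *limits)
               for name, limits in ASSAY_LIMITS.items()}
    clipped["fup"] = clipped.pop("hum_fup_percent") / 100.0
    return clipped

# Hypothetical out-of-range predictions for a single compound.
example = prepare_inputs(
    {"hum_fup_percent": 0.01, "bpr": 250.0, "log_p": 12.0, "log_d": -5.0}
)
```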

ATOM Mechanistic, Allometry, and Direct ML Predictions

ATOM Mechanistic V_D,ss Prediction.

Data sets generated by GlaxoSmithKline (Supplemental Table 1) containing molecular structure information and physicochemical parameters (log D, fup, BPR) were split into train, validation, and test subsets. Model training and evaluation were generally performed as previously described (Minnich et al., 2020). Briefly, a grid search hyperparameter optimization technique was employed to train several machine learning models (neural networks and random forests) with different hyperparameter combinations (learning rate, layer sizes, number of nodes, and dropout rates for neural networks; maximum depth and number of trees for random forests), splitting strategies (random and scaffold), and featurization techniques [graph convolution, extended connectivity fingerprint (ECFP), molecular operating environment (MOE) descriptors, and Mordred descriptors]. Additional details related to data sets and model performances are described in Supplemental Table 1. Models with the highest validation-set R² (coefficient of determination, calculated using scikit-learn's r2_score function) were selected to predict fup, BPR, and log D from chemical structures. These parameters were subsequently used to predict mechanistic Kp and human V_D,ss by the Lukacova method (Lukacova et al., 2008) as described in the ADMET Mechanistic V_D,ss Prediction section above.
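The selection rule (keep the candidate with the highest validation-set R²) can be sketched as follows. This is an illustration only: the coefficient of determination is written out by hand as a stand-in for the scikit-learn function named in the text, and the candidate names and values are hypothetical. Note that this R² (1 minus residual over total sum of squares) differs from the squared Pearson correlation used elsewhere to compare V_D,ss methods, and it can be negative for poor models.

```python
def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

def select_best_model(validation_truth, candidate_predictions):
    """Score each candidate on the validation set and return the best one."""
    scores = {name: r2_score(validation_truth, preds)
              for name, preds in candidate_predictions.items()}
    best_name = max(scores, key=scores.get)
    return best_name, scores

# Hypothetical validation values and predictions from two candidates.
truth = [0.1, 0.5, 1.0, 1.5]
candidates = {
    "random_forest_ecfp": [0.2, 0.6, 1.1, 1.6],         # small constant bias
    "neural_net_graphconv": [0.775, 0.775, 0.775, 0.775],  # predicts the mean
}
best, scores = select_best_model(truth, candidates)
```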

Allometric Scaling.

Rat fup, rat V_D,ss, dog fup, dog V_D,ss, and human fup values were predicted using ATOM ML models built on GlaxoSmithKline proprietary data sets as described in the ATOM Mechanistic V_D,ss Prediction section (Supplemental Table 1). Subsequently, human V_D,ss was predicted using the following three methods:

Single-species allometry scaling from rat (Jones et al., 2011)
Single-species allometry scaling from dog (Jones et al., 2011)
Predicted from rat and dog V_D,ss using two species (Wajima et al., 2003)
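The scaling equations are not reproduced in this section. As an illustrative sketch only, one commonly used form of single-species scaling corrects the animal V_D,ss (per kilogram of body weight) by the ratio of unbound fractions in plasma. Whether this is the exact variant applied here is an assumption, and the two-species method of Wajima et al. (2003) is not shown because its regression coefficients are not given in the text. The function name and values are hypothetical.

```python
def single_species_vdss(vdss_animal, fup_animal, fup_human):
    """Assumed form: V_D,ss,human = V_D,ss,animal * (fup_human / fup_animal).

    Volumes are per kilogram of body weight (L/kg) in both species.
    """
    if fup_animal <= 0:
        raise ValueError("fup_animal must be positive")
    return vdss_animal * (fup_human / fup_animal)

# Hypothetical predicted rat values for one compound.
human_from_rat = single_species_vdss(vdss_animal=2.0, fup_animal=0.1, fup_human=0.05)
```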

Direct ML Models.

An alternative approach to mechanistic prediction of human V_D,ss is to build ML models to predict volumes of distribution directly from chemical structures. For this approach, regression models based on molecular structure were fit to directly predict the log base 10 experimental human V_D,ss values of clinical compounds (Lombardo et al., 2018). Compounds were clustered by Bemis-Murcko scaffold and subsequently divided into training, validation, and test sets, starting with the largest cluster size to the smallest cluster size. A train/validation/test split of 70%/10%/20% was used to train and evaluate random forest and neural network models as described for the in vitro parameter models (Minnich et al., 2020). Neural network models sampled different combinations of learning rates, layer sizes, and number of nodes. Random forest models sampled different maximum tree depth and number of trees. Several featurization approaches were used including DeepChem’s (https://github.com/deepchem/deepchem) graph convolution model, ECFP, and calculated MOE and Mordred descriptors. Models were selected by picking the model with the maximum validation set R². Clinical compounds were grouped into two sets. The first set of compounds was the 287 compounds that were selected for experimental measurements (BPR, fup, and log D). The second set of compounds was the 970 additional compounds described in Lombardo et al. (2018) without further experimental measurements. These sets were used in two ways for fitting and prediction. 1) To compare predictive performance of the direct ML models against the other in vitro approaches, models were trained using the 970 human V_D,ss of compounds without further experimental measurements. The V_D,ss ML model was then used to predict V_D,ss for the 287 compounds with new experimental measurements for comparison with in vitro methods. 
2) A very challenging (due to the small size of the training set) external test set was used by inverting the previous approach. Models were developed using 287 compounds with new experimental measurements. Then, the fit model was used to predict V_D,ss for the 970 compounds without further experimental measurements. In both approaches, the set of compounds used for model development was further split into training, validation, and internal test sets as previously described.
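The scaffold-based split can be sketched as follows, assuming that a Bemis-Murcko scaffold string has already been computed for each compound (for example with a cheminformatics toolkit, not shown). Clusters are assigned whole, from largest to smallest, so that no scaffold is shared between subsets. The greedy fill rule and the function name are assumptions for illustration, not the published implementation.

```python
from collections import defaultdict

def scaffold_split(scaffolds, frac_train=0.7, frac_valid=0.1):
    """Assign whole scaffold clusters to train/validation/test, largest first.

    `scaffolds` holds one precomputed scaffold string per compound; the
    returned lists contain compound indices.
    """
    clusters = defaultdict(list)
    for index, scaffold in enumerate(scaffolds):
        clusters[scaffold].append(index)

    # Largest cluster first; ties keep first-seen order (stable sort).
    ordered = sorted(clusters.values(), key=len, reverse=True)

    n = len(scaffolds)
    train_cap = round(frac_train * n)
    valid_cap = round(frac_valid * n)

    train, valid, test = [], [], []
    for cluster in ordered:
        if len(train) + len(cluster) <= train_cap:
            train.extend(cluster)
        elif len(valid) + len(cluster) <= valid_cap:
            valid.extend(cluster)
        else:
            test.extend(cluster)
    return train, valid, test

# Hypothetical scaffold labels for ten compounds.
labels = ["A"] * 4 + ["B"] * 2 + ["C"] * 2 + ["D"] + ["E"]
train_idx, valid_idx, test_idx = scaffold_split(labels)
```

Because clusters are never divided, the realized proportions only approximate 70%/10%/20% when cluster sizes are uneven.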

Experimental Data

Log D.

The chromatographic hydrophobicity index (CHI) (Valkó et al., 1997) values were measured using a reversed-phase high-performance liquid chromatography (HPLC) column (50 × 2 mm, 3 µm Gemini NX C18; Phenomenex, UK) with a fast acetonitrile gradient at starting mobile phase pH values of 2, 7.4, and 10.5. CHI values are derived directly from the gradient retention times using calibration parameters for standard compounds. The CHI value approximates the volume percent organic concentration at which the compound elutes. CHI is linearly transformed into ChromlogD (Young et al., 2011) by least-squares fitting of experimental CHI values to calculated ClogP values for over 20,000 research compounds using the following formula: ChromlogD (pH 7.4) = 0.0857 × CHI − 2.00.
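As a worked example of the linear transformation, using the slope and intercept stated above (the function name and the CHI value are hypothetical):

```python
def chromlogd_ph74(chi):
    """ChromlogD at pH 7.4 from a CHI value: 0.0857 * CHI - 2.00."""
    return 0.0857 * chi - 2.00

# A compound eluting at roughly 50% acetonitrile (CHI = 50).
example_chromlogd = chromlogd_ph74(50.0)
```

With these coefficients, a CHI of 50 corresponds to a ChromlogD of about 2.3.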

Blood-to-Plasma Partition Ratio.

In vitro measurement of blood-to-plasma partition was conducted in human blood (K₂EDTA as anticoagulant) obtained from a commercial source (BioReclamation IVT, Liverpool, NY). Hematocrit (the ratio of volume of red blood cells to total blood) was measured by centrifugation of the whole blood at 3000 rpm for 10 minutes using microhematocrit capillary tubes. Control plasma was prepared from a portion of the whole blood by centrifugation at 3000g for 10 minutes. Both whole blood and control plasma samples were warmed at 37°C in a water bath for 30 minutes. Subsequently, the test compounds (1 µM in the final concentration) and controls [methazolamide (BPR ∼1) and metoprolol (BPR ∼40)] were spiked into blood and incubated at 37°C (5% CO₂) with shaking at 200 rpm for 60 minutes along with control samples. After incubation for 60 minutes, the incubated whole blood was removed from the water bath, and the plasma was separated by centrifugation at 1000g for 10 minutes. Aliquots of the control plasma were also removed. All plasma samples (50 µl) were treated with 400 µl of ice-cold acetonitrile containing an internal standard (100 ng/ml tolbutamide in acetonitrile). After the removal of protein by centrifugation at 1640g (3000 rpm) for 10 minutes at 4°C, the supernatants were transferred to HPLC autosampler plate. Test compounds and internal standard response (or peak area) ratio in whole blood and its resulting plasma were measured using liquid chromatography with tandem mass spectrometry (LC/MS/MS). Blood-to-plasma partition was calculated by ratio of mass spectrometric response of compounds in blood samples after 60 minutes of incubation to mass spectrometric response in plasma samples.

Fraction Unbound in Plasma.

In vitro measurement of fup was conducted using a rapid equilibrium dialysis (RED) device. The fup values of test compounds and a positive control (warfarin) were determined at a single time point of 4 hours postincubation. Considering high surface-to-volume ratio of the membrane compartment in a RED device, equilibrium is expected to be achieved within 4 hours of incubation (Waters et al., 2008). Stock solutions of test compounds and warfarin were prepared in DMSO at concentrations of 5 mM and subsequently diluted to a final concentration of 0.5 mM in DMSO:water (1:1, v/v). Incubation mixtures were prepared by diluting the stock solution into human plasma obtained from a commercial source (BioReclamation IVT). Final concentrations of compounds in incubation mixture were 5 µM. Human plasma was prewarmed in a water bath at 37°C prior to the experiment. In total, 400 µl of the stopping solution (100 ng/ml tolbutamide in acetonitrile) was added to a 96-well deep well sample collection plate on ice. In a RED device, 500 µl of PBS was added to the white chambers (receiver side), and aliquots (300 µl) of each incubation mixture were spiked into the red wells (donor side). A sample (40 µl) of the incubation mixture was transferred into the 0-minute wells on the sample collection plate. The device and remaining spiked plasma samples were incubated at 37°C for 4 hours with shaking at 150 rpm. After the incubation period, 40 µl of the remaining spiked plasma was transferred to the sample collection plate. All samples in the RED device were mixed by pipetting prior to aliquoting (40 µl) from each donor well into a well containing 160 µl of PBS buffer. A sample (160 µl) of each receiver well was aliquoted into a tube containing 40 µl of blank plasma. PBS (160 µl) was added to the 0-minute and 240-minute stability wells. Analysis of samples was performed using LC/MS/MS. For all samples, peak area ratios were used to determine percent unbound. 
Plasma proteins were precipitated with 400 µl of acetonitrile containing 100 ng/ml tolbutamide as a mass spectral internal standard. The resulting mixtures were vortex-mixed, followed by centrifugation for 15 minutes at >3500 rpm. A sample (100 µl) of the supernatant from each well was transferred to a clean 96-well plate containing 100 µl of ultrapure water per well. The plate was vortexed for 1 minute at >1700 rpm. Aliquots (4 µl) of the resulting supernatant were injected onto the LC/MS/MS system to obtain peak area ratios for each compound to determine fraction unbound in plasma. The equilibrium dialysis method for measuring fup is amenable to automation and is generally accepted as the gold standard (Trainor, 2007).

Adipocyte and Myocyte Partition.

Intracellular partition of compounds in adipocytes and myocytes was determined using a protocol described previously (Treyer et al., 2018). Primary human adipocytes and myocytes were obtained from commercial sources (Lonza, MD). The test compounds and controls at a final concentration of 0.5 µM were incubated with fully differentiated myocytes and adipocytes plated in culture in triplicate at 37°C (5% CO₂) with shaking at 100 rpm for 45 minutes. After the end of the incubation, the medium was transferred to a stop solution containing acetonitrile and internal standard (100 ng/ml tolbutamide in acetonitrile). The cell layer was washed with 200 µl of cold Hanks’ buffered salt solution and extracted with stop solution (100 ng/ml tolbutamide in acetonitrile). Both the intracellular and extracellular compound concentrations were analyzed using LC/MS/MS. The cell protein concentration was determined by the bicinchoninic acid assay. Intracellular drug accumulation (Kp) was calculated from the peak area ratios of the analyte to internal standard in the medium and cells and from the protein concentration using the following equation: $Kp = A_{cell} / (V_{cell} \times C_{medium})$. Protein content was quantified using the bicinchoninic acid assay in representative wells to calculate the cellular volume ($V_{cell}$), assuming 6.5 µl/mg protein (Treyer et al., 2018). The amount of drug in the cells ($A_{cell}$) was estimated using peak area ratio and volume of cell lysate (area ratio × volume of cell lysate). $C_{medium}$ refers to the corrected medium concentration. Intracellular accumulation was determined using cell lysate concentration × volume of cell lysate (150 µl). Subsequently, $Kp_{fat}$ or $Kp_{muscle}$ was calculated accounting for protein binding in plasma.
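The cellular Kp calculation can be sketched from the quantities named above: cellular volume from protein content (6.5 µl/mg), amount in cells from the lysate peak area ratio times the lysate volume (150 µl), and the medium peak area ratio as the concentration term. The form Kp = amount in cells / (cellular volume × medium concentration) is an assumption based on the cited protocol. The medium correction and the subsequent adjustment for plasma protein binding are not specified in the text and are not implemented here. Names and values are hypothetical.

```python
UL_CELL_VOLUME_PER_MG_PROTEIN = 6.5  # stated assumption in the text
LYSATE_VOLUME_UL = 150.0             # volume of cell lysate

def cellular_kp(lysate_area_ratio, medium_area_ratio, protein_mg):
    """Intracellular accumulation ratio from peak area ratios.

    Peak area ratios (analyte / internal standard) serve as concentration
    surrogates, so the result is dimensionless.
    """
    cell_volume_ul = protein_mg * UL_CELL_VOLUME_PER_MG_PROTEIN
    amount_in_cells = lysate_area_ratio * LYSATE_VOLUME_UL
    return amount_in_cells / (cell_volume_ul * medium_area_ratio)

# Hypothetical well: 0.1 mg protein, lysate ratio 0.2, medium ratio 1.0.
kp_example = cellular_kp(lysate_area_ratio=0.2, medium_area_ratio=1.0, protein_mg=0.1)
```

In this hypothetical well the small cellular volume (0.65 µl) relative to the lysate volume gives a Kp of roughly 46.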

Predictions Based on Experimental Data

Mechanistic Models for Kp Prediction.

Experimental data (log D, fup, BPR) were used as input parameters individually or in combination to predict Kp (Lukacova et al., 2008) and subsequently were used to calculate V_D,ss using the following relationship: $V_{D,ss} = V_p + V_e \times (E/P) + \sum_t Kp_t \times V_t$, where $V_p$ is the volume of plasma; $V_e$ is the volume of erythrocytes; E/P is the erythrocyte-to-plasma ratio, which is derived by the equation (BPR + hematocrit − 1)/hematocrit; and $Kp_t$ and $V_t$ are the plasma tissue partition ratio and volume, respectively, for tissue $t$ (Nigade et al., 2019).
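The relationship described above (plasma volume, plus erythrocyte volume scaled by E/P, plus the sum over tissues of Kp times tissue volume) can be sketched as below. The tissue names, volumes, and Kp values are hypothetical placeholders, not physiological parameters from the study.

```python
def erythrocyte_to_plasma_ratio(bpr, hematocrit):
    """E/P = (BPR + hematocrit - 1) / hematocrit."""
    return (bpr + hematocrit - 1.0) / hematocrit

def mechanistic_vdss(kp_by_tissue, volume_by_tissue, v_plasma, v_erythrocytes,
                     bpr, hematocrit):
    """V_D,ss = V_p + V_e * (E/P) + sum over tissues of Kp_t * V_t."""
    e_to_p = erythrocyte_to_plasma_ratio(bpr, hematocrit)
    tissue_term = sum(kp_by_tissue[t] * volume_by_tissue[t]
                      for t in volume_by_tissue)
    return v_plasma + v_erythrocytes * e_to_p + tissue_term

# Hypothetical two-tissue example (volumes in liters).
volumes = {"fat": 10.0, "muscle": 30.0}
kps = {"fat": 5.0, "muscle": 2.0}
vdss_example = mechanistic_vdss(kps, volumes, v_plasma=3.0, v_erythrocytes=2.0,
                                bpr=1.0, hematocrit=0.45)
```

When BPR equals 1, E/P reduces to 1, so the erythrocyte compartment contributes exactly its physical volume.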

Tissue-Level Kp Prediction.

We used five strategies for predicting V_D,ss using adipocytes and myocyte Kp values:

Adipocyte-only method: Adipocyte Kp values were used to calculate partitioning into fat ($Kp_{fat}$). Kp for other organs was assumed to be 1 to predict V_D,ss using the following equation: $V_{D,ss} = V_p + V_e \times (E/P) + Kp_{fat} \times V_{fat} + \sum_{t \neq fat} V_t$
Myocyte-only method: Myocyte Kp values were used to calculate partitioning into muscle tissue ($Kp_{muscle}$), and Kp for other organs was assumed to be 1 to predict V_D,ss using the following equation: $V_{D,ss} = V_p + V_e \times (E/P) + Kp_{muscle} \times V_{muscle} + \sum_{t \neq muscle} V_t$
Combined method: Both adipocyte and myocyte Kp values were used to calculate fat and muscle volumes, respectively. Kp for all nonfat and nonmuscle organs was assumed to be 1 to predict V_D,ss.
Average method: The average of adipocyte and myocyte Kp values was used as Kp for all nonfat and nonmuscle tissues. Both adipocyte and myocyte Kp values were used to calculate fat and muscle volumes, respectively, to predict V_D,ss.
Separate method: Mechanistic Kp (Lukacova et al., 2008) calculations were used for nonfat or nonmuscle organs. Both adipocyte and myocyte Kp values were used to calculate fat and muscle volumes, respectively. Both of the volumes were subsequently added to predict V_D,ss as follows: $V_{D,ss} = V_p + V_e \times (E/P) + \sum_{t \notin \{fat,\, muscle\}} Kp_t \times V_t + Kp_{fat} \times V_{fat} + Kp_{muscle} \times V_{muscle}$
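The first four strategies differ only in which Kp value is assigned to each tissue, which can be sketched as follows. The sketch assumes that the plasma and erythrocyte terms are retained in every strategy and that E/P is supplied by the caller. Tissue names, volumes, and Kp values are hypothetical. The separate method is omitted because it requires mechanistic Kp values for the remaining tissues.

```python
def assign_tissue_kp(tissues, kp_fat, kp_muscle, strategy):
    """Return a Kp value per tissue for the chosen strategy."""
    if strategy == "average":
        default_kp = (kp_fat + kp_muscle) / 2.0  # mean of the two cell Kp values
    else:
        default_kp = 1.0                          # Kp assumed to be 1 elsewhere
    kps = {tissue: default_kp for tissue in tissues}
    if strategy in ("adipocyte_only", "combined", "average"):
        kps["fat"] = kp_fat
    if strategy in ("myocyte_only", "combined", "average"):
        kps["muscle"] = kp_muscle
    return kps

def vdss_from_cell_kp(volumes, v_plasma, v_erythrocytes, e_to_p,
                      kp_fat, kp_muscle, strategy):
    """V_D,ss = V_p + V_e * (E/P) + sum of Kp_t * V_t for the strategy."""
    kps = assign_tissue_kp(volumes, kp_fat, kp_muscle, strategy)
    tissue_term = sum(kps[t] * volumes[t] for t in volumes)
    return v_plasma + v_erythrocytes * e_to_p + tissue_term

# Hypothetical volumes (liters); "other" lumps all remaining tissues.
volumes = {"fat": 10.0, "muscle": 30.0, "other": 20.0}
results = {
    name: vdss_from_cell_kp(volumes, 3.0, 2.0, 1.0,
                            kp_fat=5.0, kp_muscle=2.0, strategy=name)
    for name in ("adipocyte_only", "myocyte_only", "combined", "average")
}
```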

Results

As summarized in Fig. 1, we investigated the performance of the most common V_D,ss prediction strategies, sensitivity of input parameters that influence V_D,ss predictions, impact of experimental data on mechanistic V_D,ss predictions, and whether adipocyte and myocyte cell partitioning could improve predictive performance by using a large compound data set. An in silico–only approach was applied using a set of 956 compounds (the ATOM in silico set) related to the Lombardo intravenous dosing drug set (n = 1352 drugs) in which V_D,ss values were reported (Lombardo et al., 2018). A separate set of compounds, the ATOM experimental set (n = 254 compounds), had additional in vitro data collected under uniform experimental conditions (see Materials and Methods; Supplemental Table 2) and was used as a comparator against the purely in silico methods. Although the ATOM experimental data set was selected based on the compound availability from an initial set of 331 drugs, it represented chemical diversity of the clinical data set (Supplemental Fig. 1).

The comparative assessments of the in silico approaches evaluated to predict human V_D,ss for two discrete sets of compounds are summarized in Fig. 2 and Table 2. Details of ATOM ML models used to predict input parameters for mechanistic V_D,ss predictions are shown in Supplemental Table 1. The model/featurization combination that produced the best model varied by data set. MOE or graph convolution featurization with random forest or neural network models most frequently outperformed the other featurizations and models investigated in this study. Relative to other in silico methods, mechanistic V_D,ss predictions (using both ATOM and ADMET ML models) and two-species allometry demonstrated superior predictive performance, with 62%–71% of compounds within 3-fold of observed V_D,ss for both data sets (Table 2). In contrast, scaling from a single species using allometric methods performed poorly, with only 38%–47% of compounds within 3-fold (Table 2). Trends in predictive performance (such as percentage within 2-, 3-, and 10-fold; AAFE; and Pearson’s r²) across the in silico models were comparable using either the smaller or larger data sets (Table 2, 283 and 956 compounds), with the exception of the direct ML model. Predictive performance of the direct ML model increased significantly when the model was built using a larger data set (Fig. 3; Table 2). The percentage of compounds within 2-, 3-, and 10-fold increased to 58%, 75%, and 98% from 36%, 55%, and 88%, respectively (Fig. 2; Table 2). Similarly, there was significant improvement in r² (from 0.14 to 0.52) and AAFE (decreased from 3.3 to 2.2). The scatter plots of direct ML model predictions are shown in Fig. 3. Additional scatter plots of predicted V_D,ss compared with reported (Lombardo et al., 2018) values across both data sets and the in silico methods are presented in Supplemental Fig. 2.

Fig. 2.

Summary of model performance of in silico V_D,ss prediction methodologies: (A) ATOM in silico set (n = 956 compounds) and (B) ATOM experimental set (n = 254 compounds).

TABLE 2

Summary of model performance of in silico V_D,ss prediction methodologies for Lombardo intravenous dosing drug set (n = 1352 drugs) divided into two subsets: 1) ATOM in silico set (>940 compounds) and 2) ATOM experimental set (n > 280 compounds)

Fig. 3.

Predicted vs. observed V_D,ss using direct ML models: (A) the ML model was built using a smaller data set (287 compounds), and predictions were tested on a large in silico set (956 compounds) and (B) vice versa. Crosslines indicate 2-, 3-, and 10-fold limits.

Experimentally measured log D, fup, and BPR values from in vitro assays for 254 compounds are summarized in Supplemental Table 2. Although 331 compounds were originally included, some of the compounds showed analytical or recovery issues in different assays and were removed from the data sets. Figure 4 and Table 3 summarize predictive performance of various combinations of experimental data (Supplemental Table 2) as input parameters. Scatter/kernel density estimation plots of mechanistic V_D,ss predictions using various combinations of experimental data (fup, BPR, and log D) as input parameters are shown in Supplemental Fig. 4. The highest percentage of compounds within 3-fold was observed when experimentally determined fup and BPR were used as input parameters, with 81% of the compounds within 3-fold of Lombardo reference values; a good correlation between predicted and observed values (r² = 0.58) was seen.

Fig. 4.

Predictive performance of mechanistic Kp prediction methods using various combinations of experimental (Exp.) data.

TABLE 3

Summary of mechanistic V_D,ss predictive performance using experimental data (fup, BPR, and log D) as input parameters

Correlation between observed and predicted V_D,ss for 254 compounds using experimental fup and BPR data as input parameters is shown in Fig. 5. Among the experimental parameters investigated, V_D,ss predictions were sensitive to BPR. V_D,ss predictions within 3-fold dropped to 73% from 81%, and r² reduced from 0.58 to 0.42 when only fup was used instead of fup and BPR. In absence of experimental data, assuming BPR as 1 could be recommended, as better performance was observed when the BPR value was assumed to be 1 instead of inputting ML-predicted values (Table 3); 63% of the compounds were predicted within 2-fold when BPR was assumed to be 1, compared with 56% when BPR was predicted from ML models in combination with measured fup. This highlights that V_D,ss predictions are sensitive to errors in BPR predictions from ML models and that the best performance across all the methods is with measured fup and BPR values. In contrast, complementing measured log D to mechanistic predictions with fup and BPR measured data did not improve predictive performance any further (Table 3). Since predicted values from log D ML models (both ADMET and ATOM) were in close agreement with measured values (Supplemental Fig. 3), it is not surprising to see that measurement of log D values did not improve V_D,ss predictions. Figure 5A displays the correlation of predicted-to-observed V_D,ss classified by ionization class (Lombardo et al., 2018). Anionic and zwitterionic compounds are the best-predicted classes compared with neutral compounds. The kernel density estimation (Seaborn Python library: https://seaborn.pydata.org/tutorial/distributions.html) plot in Fig. 5B demonstrates underlying distribution of the points in the Fig. 5A scatter plot. 
Figure 5B suggests that mechanistic predictions using measured fup and BPR correlate directly with observed values and that a majority of the predictions lie on the unity line, highlighting that there is no overall trend of overpredicting or underpredicting V_D,ss.

Fig. 5.

(A) Scatter plot [colored by ionic state reported in Lombardo et al. (2018)]. (B) Kernel density plot showing correlation between observed and predicted V_D,ss for 254 compounds using experimental (exp) fup and BPR data as input parameters. Crosslines indicate 2-, 3-, and 10-fold limits.

As fat and muscle contribute to 60% of body volume, the impact of experimental adipocyte and myocyte cell partition in improving V_D,ss prediction was investigated. Measured intracellular partitioning of 189 compounds in adipocytes and myocytes is presented in Supplemental Table 3. The impact of adipocyte and myocyte cell partition on predictive performance for the same set of compounds was compared with that from the best predictive model (fup and BPR experimental data as input parameters; Fig. 6; Table 4). Good correlation between observed and predicted V_D,ss was noted when adipocyte Kp values, myocyte Kp values, or both were used (r² of 0.41–0.48, Table 4). Although the percentage of compounds within 3-fold, r², and AAFE were not significantly different using either adipocyte or myocyte partitioning, the percentage of compounds within 2-fold was significantly higher when V_D,ss was predicted using adipocyte Kp values (54% vs. 41%, Table 4). The combination of both adipocyte and myocyte partitioning with different strategies did not improve predictive performance any further (Table 4). For the same set of compounds, V_D,ss predicted using only fup and BPR experimental data demonstrated a higher percentage of compounds within 2- and 3-fold compared with predictions based on adipocyte or myocyte data (Fig. 6; Table 4).

Fig. 6.

Predictive performance using adipocyte (Kp fat) and myocyte (Kp muscle) partitioning experimental (exp) data.

TABLE 4

Performance of V_D,ss prediction methods utilizing adipocyte and myocyte Kp experimental data

Across all the prediction methods evaluated using different data sets, there was a good correlation between AAFE and percentage of compounds within 2- or 3-fold of observed. As anticipated, prediction methods with lower AAFEs showed higher percentages of compounds within 3-fold. Among all the methods investigated, mechanistic V_D,ss predictions utilizing measured fup and BPR as input parameters demonstrated superior performance, with the lowest AAFE, the highest r², and the highest percentage of compounds within 3-fold.
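The fold-based statistics compared above can be computed as in the following minimal sketch. It assumes the common definition of AAFE as 10 raised to the mean absolute log10 ratio of predicted to observed values; the function names and the four example V_D,ss values are hypothetical and are not taken from the study data.

```python
import math


def fold_errors(observed, predicted):
    """Fold error per compound: the larger of pred/obs and obs/pred."""
    return [max(p / o, o / p) for o, p in zip(observed, predicted)]


def aafe(observed, predicted):
    """Absolute average fold error (assumed definition):
    10 ** mean(|log10(predicted / observed)|)."""
    logs = [abs(math.log10(p / o)) for o, p in zip(observed, predicted)]
    return 10 ** (sum(logs) / len(logs))


def pct_within_fold(observed, predicted, fold):
    """Percentage of compounds whose fold error does not exceed `fold`."""
    errors = fold_errors(observed, predicted)
    return 100.0 * sum(e <= fold for e in errors) / len(errors)


# Hypothetical observed and predicted V_D,ss values (l/kg), illustration only.
observed = [0.2, 1.0, 5.0, 10.0]
predicted = [0.3, 1.0, 1.0, 10.0]

print("AAFE:", round(aafe(observed, predicted), 2))
for fold in (2, 3, 10):
    print(f"within {fold}-fold:", pct_within_fold(observed, predicted, fold), "%")
```

Because AAFE averages errors on a log scale, a single compound with a large fold error raises AAFE and lowers the within-fold percentages together, which is consistent with the two statistics tracking each other across methods.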

Discussion

Mechanistic V_D,ss Predictions.

Kp calculations use physiologic parameters of the tissue and physicochemical properties of the drug to ascertain how compounds partition between plasma and tissue. Based on preliminary evaluations and other reports in the literature (Graham et al., 2012), the Lukacova method (Lukacova et al., 2008) was used as the method of choice for mechanistic V_D,ss predictions. The key input parameters required to predict mechanistic V_D,ss are pKa, log D, log P, fup, and BPR. Therefore, estimation of these input parameters, either by in silico methods or by experimental measurement, and the impact of measured parameters on mechanistic V_D,ss predictions were explored.

Mechanistic V_D,ss predictions using input parameters predicted by either ATOM ML models or ADMET Predictor demonstrated similar performance across data sets (Table 2). Therefore, either set of ML models (ATOM or ADMET Predictor) can be used to predict mechanistic V_D,ss in silico. It is important to note that the ML models for BPR [ATOM ML or ADMET Predictor (from user manual)] were built using very small data sets (Supplemental Table 1), and their predictive performance is questionable. When predicted BPR values were replaced with experimental data, significant improvement in mechanistic V_D,ss predictive performance was observed; r² increased from 0.38 to 0.51 and percentage within 3-fold increased from 66% to 79%, highlighting the sensitivity of V_D,ss predictions to BPR values (Fig. 4; Table 3). As BPR is a key parameter, particularly for calculation of intracellular acidic phospholipid binding of strongly basic drugs, measured BPR could be anticipated to improve the predictions. However, the impact of BPR measurement was not definitively demonstrated in the literature until recently (Yau et al., 2020). The current evaluations (Table 3) clearly demonstrate the importance of measuring BPR in predicting V_D,ss and the need to fill the existing gaps in BPR data sets used to build predictive ML models. It is noteworthy that with only two in vitro measurements (fup and BPR), 81% of compounds are within 3-fold of observed V_D,ss (Table 3), with AAFE of 2.0.

Because it can impact both the pharmacokinetics and pharmacodynamics of a drug, fup is measured routinely in drug discovery (Smith et al., 2010). On the other hand, BPR is measured less routinely in the early discovery phase, which might lead to missed opportunities not only in predicting V_D,ss (as observed in this study) but also in predicting the impact on overall pharmacokinetics of a compound (Kalamaridis and DiLoreto, 2014). Comparable predictive performance was noted by Chan et al. (2018) using a smaller data set of 152 clinical compounds. They demonstrated that mechanistic V_D,ss predictions were as accurate as or superior to empirical approaches based on the extrapolation of V_D,ss from preclinical species (Chan et al., 2018). In addition to the superior performance of mechanistic V_D,ss prediction methods (using either ML-predicted or experimental input parameters), a mechanistic approach uniquely offers the ability to calculate partitioning (Kp) of compounds into various tissues.

Allometric Scaling.

Traditionally, prediction of human V_D,ss has relied on scaling of V_D,ss obtained from preclinical species using allometric equations (Jones et al., 2011). Although allometry has some limitations in predicting distribution of highly protein-bound drugs, it has been a valuable technique to predict human PK parameters to determine first-time-in-human dose (Choi et al., 2019). To leverage existing data from animal studies during early drug discovery, use of ML-predicted V_D,ss employing allometric scaling from preclinical species was explored. Although there continue to be translational questions about interspecies scaling, it was hypothesized that deployment of this technique could allow for much wider chemical space coverage relative to models trained on human V_D,ss, as well as provide insight into mechanisms not captured by mechanistic models, such as transporter-driven tissue uptake. Although ML models to predict V_D,ss and fup values in preclinical species have demonstrated good performance (Supplemental Table 1), single-species scaling performed poorly in predicting human V_D,ss (Table 2, <50% were within 3-fold). This poor performance could be due to magnification of errors in predictions of V_D,ss and/or fup values in addition to limitations of single-species scaling. Several studies have shown that plasma protein binding corrections significantly enhanced predictive performance of allometric scaling from preclinical V_D,ss (Zou et al., 2012). As the V_D,ss predictions are inversely proportional to fup in preclinical species (see Materials and Methods for equations), errors in the predictions of fup values will have a significant impact on V_D,ss predictions. Therefore, we investigated V_D,ss comparisons without fup corrections. Direct correlation of predicted dog V_D,ss (without fup corrections) with human V_D,ss demonstrated improved performance, with 48%, 65%, and 97% of compounds within 2-, 3-, and 10-fold of observed human V_D,ss, respectively, compared with scaling that used fup to account for the difference between dog and human (23%, 37%, and 75%, Table 2). This supports the view that the poor predictive accuracy of the dog fup model magnified the prediction errors. However, similar improvements in performance or correlation were not observed when extrapolating from rat V_D,ss predictions. In contrast, human V_D,ss scaled using both rat and dog by the Wajima method demonstrated predictive performance similar to mechanistic models (Table 2). Although overall predictive performance is not significantly different between the two methods, it is noteworthy that mechanistic models were relatively better at predicting anionic compounds within 2-fold compared with the Wajima method (Supplemental Fig. 7). V_D,ss predictions classified by ionization class across various methods can be found in Supplemental Fig. 6.
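The error magnification described above can be illustrated with a short sketch. It assumes a commonly used single-species scaling form in which unbound volume is taken as equal across species, so that human V_D,ss = animal V_D,ss × (human fup / animal fup); the equations actually used in the study are those in Materials and Methods and are not reproduced here. All function names and numerical values are hypothetical.

```python
def scale_with_fup(vss_animal, fup_animal, fup_human):
    """Scale animal V_D,ss (l/kg) to human, assuming equal unbound volume.

    Assumed form: V_human = V_animal * fup_human / fup_animal.
    """
    if fup_animal <= 0 or fup_human <= 0:
        raise ValueError("fup values must be positive")
    return vss_animal * fup_human / fup_animal


def scale_without_fup(vss_animal):
    """Direct comparison: animal V_D,ss (l/kg) taken as the human estimate."""
    return vss_animal


# Hypothetical compound, for illustration only.
vss_dog = 2.0             # predicted dog V_D,ss, l/kg
fup_human = 0.10          # human fraction unbound
fup_dog_true = 0.10       # dog fraction unbound if it were known exactly
fup_dog_predicted = 0.03  # an underpredicted dog fup from a weak model

v_reference = scale_with_fup(vss_dog, fup_dog_true, fup_human)
v_with_error = scale_with_fup(vss_dog, fup_dog_predicted, fup_human)
fold_error = v_with_error / v_reference

print("fold error introduced by the fup prediction:", round(fold_error, 2))
print("estimate without fup correction:", scale_without_fup(vss_dog))
```

Under this assumed form, a k-fold error in the predicted animal fup passes straight through as a k-fold error in the scaled human V_D,ss, on top of any error already present in the predicted animal V_D,ss. Dropping the correction removes that term, at the cost of ignoring genuine species differences in protein binding.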

Direct ML Models.

Previously, we observed that the data set size has a direct impact on model predictivity for several pharmacokinetic related data sets (Minnich et al., 2020). As anticipated, ML models built using smaller data sets, such as that for BPR, showed lower model performance statistics compared with models built using a larger data set (Supplemental Table 1). Furthermore, the direct ML model built on a larger data set (using 970 clinical compounds) outperformed other in silico methods, including the mechanistic V_D,ss method (Table 2). When utilizing direct ML models built on a larger data set, 75% of compounds (Table 2) were predicted within 3-fold of observed V_D,ss, with excellent correlation (Fig. 3B). It is important to highlight that the clinical data set is highly diverse across physicochemical, in vitro ADME, and in vivo PK properties (Lombardo et al., 2018). Models built on diverse data sets of chemical space have a greater applicability domain and generalizability (Simeon et al., 2019). Therefore, direct ML predictions of V_D,ss might be the most computationally efficient and predictive way to process in silico predictions of V_D,ss for de novo compounds. One limitation of the current model is the relatively small training set, possibly restricting the application of the model to certain chemotypes. In such cases, models that are limited to structurally related analogs may prove more predictive than global models built on a diverse set of compounds (Simeon et al., 2019). Despite some differences in hyperparameters and data set splits used relative to our study, Simeon et al. (2019) demonstrated similar predictive performance for a direct ML model built using a data set of 941 compounds. These independent studies provide promising evidence of improved performances of direct ML models with enhanced data sets of clinical compounds.

Predictions Using Adipocyte and Myocyte Cell Partitioning.

Muscle and fat are tissues with larger physiologic volumes (60% of tissue volume), and distribution of compounds to these tissues has a major impact on the V_D,ss of compounds in humans (Davies and Morris, 1993). Björkman (2002) evaluated relative contributions of various tissue partition coefficients (Kp, tissues) in predicting V_D,ss in rat and observed an excellent linear correlation (>0.99) between V_D,ss when calculated using only Kp values from muscle and fat. In this study, we hypothesized that intracellular partitioning of compounds into human adipocytes and myocytes in vitro could be used as a surrogate to determine fat and muscle Kp values and subsequently be used to estimate human V_D,ss. In addition, measuring Kp values directly in human cells could improve translation to human tissues. Higher predictive performance was observed, but only when either the adipocyte or the myocyte partition value alone was included to predict V_D,ss (Table 4). Adipocyte and myocyte partition values and predicted V_D,ss were highly correlated (r² > 0.7), suggesting that measurement of partition in only one cell type is adequate. Between the two measurements, adipocyte partition (only) showed better performance, particularly with respect to the percentage of compounds within 2-fold, compared with myocyte partition (Kp muscle only). Using adipocyte and myocyte partition together in various combinations did not provide significant improvement in V_D,ss predictions (Table 4). Although adipocyte Kp showed good correlation to human V_D,ss, it failed to predict compounds with low V_D,ss (<1 l/kg) because of volume contributions from other tissues (assumption of Kp = 1) (Supplemental Fig. 5a). Surprisingly, predictive performance was lower when fat and muscle volumes were predicted using both adipocyte and myocyte measured data and the volume of the remaining tissues was predicted using the mechanistic Kp prediction method. Only 56% of the compounds were within 3-fold, compared with 63% when Kp was assumed to be 1 for other tissues (Table 4). However, this approach improved prediction of compounds with low V_D,ss. Measured adipocyte and myocyte partition data provided in Supplemental Table 3 enable further exploration of V_D,ss prediction methods.
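The effect of assuming Kp = 1 for the remaining tissues can be seen in a simple volume-weighted sum. The sketch below assumes the generic form V_D,ss = V_plasma + Σ(V_tissue × Kp_tissue) and treats the measured cell partition values directly as tissue Kp values; both are simplifying assumptions rather than the study's exact procedure. The fractional volumes are hypothetical round numbers chosen so that fat and muscle together make up 60% of the total.

```python
def vss_from_fat_muscle(kp_fat, kp_muscle, kp_other=1.0,
                        v_plasma=0.04, v_fat=0.20, v_muscle=0.40, v_other=0.36):
    """Volume-weighted V_D,ss estimate in l/kg.

    Assumed form: V_plasma + V_fat*Kp_fat + V_muscle*Kp_muscle + V_other*Kp_other.
    All volumes are hypothetical fractions of body volume (l/kg).
    """
    return (v_plasma
            + v_fat * kp_fat
            + v_muscle * kp_muscle
            + v_other * kp_other)


# Lowest value this estimate can return when other tissues are fixed at Kp = 1:
# even with no partitioning into fat or muscle, plasma and "other" still count.
floor = vss_from_fat_muscle(kp_fat=0.0, kp_muscle=0.0)

# A lipophilic hypothetical compound with high fat partitioning.
high = vss_from_fat_muscle(kp_fat=10.0, kp_muscle=2.0)

print("minimum achievable estimate (l/kg):", round(floor, 2))
print("estimate for the lipophilic example (l/kg):", round(high, 2))
```

With these hypothetical volumes the estimate cannot fall below the plasma volume plus the volume of the remaining tissues, which gives one way to see why compounds with observed V_D,ss well below 1 l/kg are overpredicted under the Kp = 1 assumption.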

Conclusions

One of the purposes of comparing various in silico V_D,ss prediction methods was to establish the best in silico approaches to predict V_D,ss for de novo compounds. Based on the extensive comparisons of results across the in silico methods (Table 2), we conclude that 1) the mechanistic V_D,ss prediction methods using a combination of ML models for predicting physicochemical properties paired with mechanistic equations for Kp or 2) the Wajima method employing predicted rat and dog V_D,ss are our recommended in silico approaches to predict human V_D,ss. If a larger training data set of chemically diverse V_D,ss experimental values is available, then direct ML predictions of V_D,ss might be the most computationally efficient and predictive way to process in silico predictions of V_D,ss for de novo compounds. Once these de novo compounds have been synthesized in discovery, it is most useful to measure BPR and fup experimentally to obtain a more accurate estimate of human V_D,ss. Based on our analysis, BPR is the input property to which in silico V_D,ss predictions are most sensitive. Further, we investigated the utility of adipocyte and myocyte partitioning in predicting V_D,ss. If fat or muscle partition coefficients are being considered as part of the model, adipocyte Kp measurements may provide more predictive power than either myocyte Kp alone or adipocyte and myocyte combined. In summary, the scale of prediction strategies evaluated and the size of data sets used in this study are novel and significantly larger than those presented in the literature thus far. In addition, we investigated novel methodologies such as adipocyte and myocyte partitioning in predicting V_D,ss. Finally, we have provided several novel in vitro data sets (e.g., BPR, adipocyte Kp, myocyte Kp) generated using a single protocol for 254 clinical compounds that will enable the research community to further enhance V_D,ss prediction methods.

Authorship Contributions

Participated in research design: Polli, Pasikanti, Weber, Murad, Crouch.

Conducted experiments: McComas, Pasikanti.

Performed data analysis: Murad, Madej, Pasikanti, Minnich.

Wrote or contributed to the writing of the manuscript: Pasikanti, Murad, Polli.

Footnotes

Received September 30, 2020.
Accepted November 3, 2020.

1 N.M. and K.K.P. contributed equally as co-first authors.
This work represents a multi-institutional effort. Funding sources include the following: Lawrence Livermore National Laboratory internal funds; the National Nuclear Security Administration; GlaxoSmithKline, LLC; and federal funds from National Institutes of Health National Cancer Institute and the Department of Health and Human Services [Contract No. 75N91019D00024]. This work was performed under the auspices of the US Department of Energy by Lawrence Livermore National Laboratory [Contract No. DE-AC52-07NA27344].
https://doi.org/10.1124/dmd.120.000202.
This article has supplemental material available at dmd.aspetjournals.org.
This document was prepared as an account of work sponsored by an agency of the United States government. Neither the United States government nor Lawrence Livermore National Security, LLC, nor any of their employees makes any warranty, expressed or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise does not necessarily constitute or imply its endorsement, recommendation, or favoring by the United States government or Lawrence Livermore National Security, LLC. The views and opinions of authors expressed herein do not necessarily state or reflect those of the United States government or Lawrence Livermore National Security, LLC, and shall not be used for advertising or product endorsement purposes. The authors declare no competing financial interest.

Abbreviations

AAFE: absolute average fold error
ATOM: Accelerating Therapeutics for Opportunities in Medicine
BPR: blood-to-plasma partition ratio
CHI: chromatographic hydrophobicity index
ECFP: extended connectivity fingerprint
fup: fraction unbound in plasma
Kp: tissue-to-plasma partition coefficient
LC/MS/MS: liquid chromatography with tandem mass spectrometry
ML: machine learning
MOE: molecular operating environment
PK: pharmacokinetic
RED: rapid equilibrium dialysis
V_D,ss: volume of distribution at steady state

This is an open access article distributed under the CC BY Attribution 4.0 International license.

References

Berezhkovskiy LM (2004) Volume of distribution at steady state for a linear pharmacokinetic system with peripheral elimination. J Pharm Sci 93:1628–1640.
Björkman S (2002) Prediction of the volume of distribution of a drug: which tissue-plasma partition coefficients are needed? J Pharm Pharmacol 54:1237–1245.
Chan R, De Bruyn T, Wright M, and Broccatelli F (2018) Comparing mechanistic and preclinical predictions of volume of distribution on a large set of drugs. Pharm Res 35:87.
Choi GW, Lee YB, and Cho HY (2019) Interpretation of non-clinical data for prediction of human pharmacokinetic parameters: in vitro-in vivo extrapolation and allometric scaling. Pharmaceutics 11:168.
Davies B and Morris T (1993) Physiological parameters in laboratory animals and humans. Pharm Res 10:1093–1095.
del Amo EM, Ghemtio L, Xhaard H, Yliperttula M, Urtti A, and Kidron H (2013) Applying linear and non-linear methods for parallel prediction of volume of distribution and fraction of unbound drug. PLoS One 8:e74758.
Ferreira LLG and Andricopulo AD (2019) ADMET modeling approaches in drug discovery. Drug Discov Today 24:1157–1165.
Graham H, Walker M, Jones O, Yates J, Galetin A, and Aarons L (2012) Comparison of in-vivo and in-silico methods used for prediction of tissue: plasma partition coefficients in rat. J Pharm Pharmacol 64:383–396.
Hinkson IV, Madej B, and Stahlberg EA (2020) Accelerating therapeutics for opportunities in medicine: a paradigm shift in drug discovery. Front Pharmacol 11:770.
Jones RD, Jones HM, Rowland M, Gibson CR, Yates JW, Chien JY, Ring BJ, Adkison KK, Ku MS, He H, et al. (2011) PhRMA CPCDC initiative on predictive models of human pharmacokinetics, part 2: comparative assessment of prediction methods of human volume of distribution. J Pharm Sci 100:4074–4089.
Kalamaridis D and DiLoreto K (2014) Drug partition in red blood cells, in Optimization in Drug Discovery: In Vitro Methods (Caldwell GW and Yan Z eds), pp 39–47, Humana Press, Totowa, NJ.
Korzekwa K and Nagar S (2017) Drug distribution Part 2. Predicting volume of distribution from plasma protein binding and membrane partitioning. Pharm Res 34:544–551.
Lombardo F, Berellini G, and Obach RS (2018) Trend analysis of a database of intravenous pharmacokinetic parameters in humans for 1352 drug compounds. Drug Metab Dispos 46:1466–1477.
Lukacova V, Parrott N, Lave T, Fraczkiewicz G, Bolger M, and Woltosz W (2008) General approach to calculation of tissue:plasma partition coefficients for physiologically based pharmacokinetic (PBPK) modeling, in: AAPS National Annual Meeting and Exposition; 2008 November 17–19; Atlanta, GA.
Mayumi K, Tachibana M, Yoshida M, Ohnishi S, Kanazu T, and Hasegawa H (2020) The novel in vitro method to calculate tissue-to-plasma partition coefficient in humans for predicting pharmacokinetic profiles by physiologically-based pharmacokinetic model with high predictability. J Pharm Sci 109:2345–2355.
Minnich AJ, McLoughlin K, Tse M, Deng J, Weber A, Murad N, Madej BD, Ramsundar B, Rush T, Calad-Thomson S, et al. (2020) AMPL: a data-driven modeling pipeline for drug discovery. J Chem Inf Model 60:1955–1968.
Nigade PB, Gundu J, Pai KS, Nemmani KVS, and Talwar R (2019) Prediction of volume of distribution in preclinical species and humans: application of simplified physiologically based algorithms. Xenobiotica 49:528–539.
Poulin P and Krishnan K (1995) An algorithm for predicting tissue: blood partition coefficients of organic chemicals from n-octanol: water partition coefficient data. J Toxicol Environ Health 46:117–129.
Poulin P and Theil FP (2002) Prediction of pharmacokinetics prior to in vivo studies. 1. Mechanism-based prediction of volume of distribution. J Pharm Sci 91:129–156.
Rodgers T, Leahy D, and Rowland M (2005) Physiologically based pharmacokinetic modeling 1: predicting the tissue distribution of moderate-to-strong bases. J Pharm Sci 94:1259–1276.
Rodgers T and Rowland M (2006) Physiologically based pharmacokinetic modelling 2: predicting the tissue distribution of acids, very weak bases, neutrals and zwitterions. J Pharm Sci 95:1238–1257.
Simeon S, Montanari D, and Gleeson MP (2019) Investigation of factors affecting the performance of in silico volume distribution QSAR models for human, rat, mouse, dog & monkey. Mol Inform 38:e1900059.
Smith DA, Beaumont K, Maurer TS, and Di L (2015) Volume of distribution in drug design. J Med Chem 58:5691–5698.
Smith DA, Di L, and Kerns EH (2010) The effect of plasma protein binding on in vivo efficacy: misconceptions in drug discovery. Nat Rev Drug Discov 9:929–939.
Trainor GL (2007) The importance of plasma protein binding in drug discovery. Expert Opin Drug Discov 2:51–64.
Trapp S, Rosania GR, Horobin RW, and Kornhuber J (2008) Quantitative modeling of selective lysosomal targeting for drug design. Eur Biophys J 37:1317–1328.
Treyer A, Mateus A, Wiśniewski JR, Boriss H, Matsson P, and Artursson P (2018) Intracellular drug bioavailability: effect of neutral lipids and phospholipids. Mol Pharm 15:2224–2233.
Valkó K, Bevan C, and Reynolds D (1997) Chromatographic hydrophobicity index by fast-gradient RP-HPLC: a high-throughput alternative to log P/log D. Anal Chem 69:2022–2029.
Wajima T, Fukumura K, Yano Y, and Oguma T (2003) Prediction of human pharmacokinetics from animal data and molecular structural parameters using multivariate regression analysis: volume of distribution at steady state. J Pharm Pharmacol 55:939–949.
Waring MJ, Arrowsmith J, Leach AR, Leeson PD, Mandrell S, Owen RM, Pairaudeau G, Pennie WD, Pickett SD, Wang J, et al. (2015) An analysis of the attrition of drug candidates from four major pharmaceutical companies. Nat Rev Drug Discov 14:475–486.
Waters NJ, Jones R, Williams G, and Sohal B (2008) Validation of a rapid equilibrium dialysis approach for the measurement of plasma protein binding. J Pharm Sci 97:4586–4595.
Wenzel J, Matter H, and Schmidt F (2019) Predictive multitask deep neural network models for ADME-tox properties: learning from large data sets. J Chem Inf Model 59:1253–1268.
Yau E, Olivares-Morales A, Gertz M, Parrott N, Darwich AS, Aarons L, and Ogungbenro K (2020) Global sensitivity analysis of the Rodgers and Rowland model for prediction of tissue: plasma partitioning coefficients: assessment of the key physiological and physicochemical factors that determine small-molecule tissue distribution. AAPS J 22:41.
Young RJ, Green DV, Luscombe CN, and Hill AP (2011) Getting physical in drug discovery II: the impact of chromatographic hydrophobicity measurements and aromaticity. Drug Discov Today 16:822–830.
Zou P, Zheng N, Yang Y, Yu LX, and Sun D (2012) Prediction of volume of distribution at steady state in humans: comparison of different approaches. Expert Opin Drug Metab Toxicol 8:855–872.


[1] ↵
Berezhkovskiy LM
(2004) Volume of distribution at steady state for a linear pharmacokinetic system with peripheral elimination. J Pharm Sci 93:1628–1640.
OpenUrl CrossRef PubMed

[2] Berezhkovskiy LM

[3] ↵
Björkman S
(2002) Prediction of the volume of distribution of a drug: which tissue-plasma partition coefficients are needed? J Pharm Pharmacol 54:1237–1245.
OpenUrl CrossRef PubMed

[4] Björkman S

[5] ↵
Chan R,
De Bruyn T,
Wright M, and
Broccatelli F
(2018) Comparing mechanistic and preclinical predictions of volume of distribution on a large set of drugs. Pharm Res 35:87.
OpenUrl

[6] Chan R,

[7] De Bruyn T,

[8] Wright M, and

[9] Broccatelli F

[10] ↵
Choi GW,
Lee YB, and
Cho HY
(2019) Interpretation of non-clinical data for prediction of human pharmacokinetic parameters: in vitro-in vivo extrapolation and allometric scaling. Pharmaceutics 11:168.
OpenUrl

[11] Choi GW,

[12] Lee YB, and

[13] Cho HY

[14] ↵
Davies B and
Morris T
(1993) Physiological parameters in laboratory animals and humans. Pharm Res 10:1093–1095.
OpenUrl CrossRef PubMed

[15] Davies B and

[16] Morris T

[17] ↵
del Amo EM,
Ghemtio L,
Xhaard H,
Yliperttula M,
Urtti A, and
Kidron H
(2013) Applying linear and non-linear methods for parallel prediction of volume of distribution and fraction of unbound drug. PLoS One 8:e74758.
OpenUrl

[18] del Amo EM,

[19] Ghemtio L,

[20] Xhaard H,

[21] Yliperttula M,

[22] Urtti A, and

[23] Kidron H

[24] ↵
Ferreira LLG and
Andricopulo AD
(2019) ADMET modeling approaches in drug discovery. Drug Discov Today 24:1157–1165.
OpenUrl CrossRef

[25] Ferreira LLG and

[26] Andricopulo AD

[27] ↵
Graham H,
Walker M,
Jones O,
Yates J,
Galetin A, and
Aarons L
(2012) Comparison of in-vivo and in-silico methods used for prediction of tissue: plasma partition coefficients in rat. J Pharm Pharmacol 64:383–396.
OpenUrl CrossRef PubMed

[28] Graham H,

[29] Walker M,

[30] Jones O,

[31] Yates J,

[32] Galetin A, and

[33] Aarons L

[34] ↵
Hinkson IV,
Madej B, and
Stahlberg EA
(2020) Accelerating therapeutics for opportunities in medicine: a paradigm shift in drug discovery. Front Pharmacol 11:770.
OpenUrl PubMed

[35] Hinkson IV,

[36] Madej B, and

[37] Stahlberg EA

[38] ↵
Jones RD,
Jones HM,
Rowland M,
Gibson CR,
Yates JW,
Chien JY,
Ring BJ,
Adkison KK,
Ku MS,
He H, et al.
(2011) PhRMA CPCDC initiative on predictive models of human pharmacokinetics, part 2: comparative assessment of prediction methods of human volume of distribution. J Pharm Sci 100:4074–4089.
OpenUrl CrossRef

[39] Jones RD,

[40] Jones HM,

[41] Rowland M,

[42] Gibson CR,

[43] Yates JW,

[44] Chien JY,

[45] Ring BJ,

[46] Adkison KK,

[47] Ku MS,

[48] He H, et al.

[49] ↵
Caldwell GW and
Yan Z
Kalamaridis D and
DiLoreto K
(2014) Drug partition in red blood cells, in Optimization in Drug Discovery: In Vitro Methods (Caldwell GW and Yan Z eds), pp 39–47, Humana Press, Totowa, NJ.

[50] Caldwell GW and

[51] Yan Z

[52] Kalamaridis D and

[53] DiLoreto K

[54] ↵
Korzekwa K and
Nagar S
(2017) Drug distribution Part 2. Predicting volume of distribution from plasma protein binding and membrane partitioning. Pharm Res 34:544–551.
OpenUrl

[55] Korzekwa K and

[56] Nagar S

[57] ↵
Lombardo F,
Berellini G, and
Obach RS
(2018) Trend analysis of a database of intravenous pharmacokinetic parameters in humans for 1352 drug compounds. Drug Metab Dispos 46:1466–1477.
OpenUrl Abstract/FREE Full Text

[58] Lombardo F,

[59] Berellini G, and

[60] Obach RS

[61] ↵
Lukacova V,
Parrott N,
Lave T,
Fraczkiewicz G,
Bolger M, and
Woltosz W
(2008) General approach to calculation of tissue:plasma partition coefficients for physiologically based pharmacokinetic (PBPK) modeling, in: AAPS National Annual Meeting and Exposition; 2008 November 17–19; Atlanta, GA.

[62] Lukacova V,

[63] Parrott N,

[64] Lave T,

[65] Fraczkiewicz G,

[66] Bolger M, and

[67] Woltosz W

[68] ↵
Mayumi K,
Tachibana M,
Yoshida M,
Ohnishi S,
Kanazu T, and
Hasegawa H
(2020) The novel in vitro method to calculate tissue-to-plasma partition coefficient in humans for predicting pharmacokinetic profiles by physiologically-based pharmacokinetic model with high predictability. J Pharm Sci 109:2345–2355.
OpenUrl

[69] Mayumi K,

[70] Tachibana M,

[71] Yoshida M,

[72] Ohnishi S,

[73] Kanazu T, and

[74] Hasegawa H

[75] ↵
Minnich AJ,
McLoughlin K,
Tse M,
Deng J,
Weber A,
Murad N,
Madej BD,
Ramsundar B,
Rush T,
Calad-Thomson S, et al.
(2020) AMPL: a data-driven modeling pipeline for drug discovery. J Chem Inf Model 60:1955–1968.
OpenUrl

[76] Minnich AJ,

[77] McLoughlin K,

[78] Tse M,

[79] Deng J,

[80] Weber A,

[81] Murad N,

[82] Madej BD,

[83] Ramsundar B,

[84] Rush T,

[85] Calad-Thomson S, et al.

[86] ↵
Nigade PB,
Gundu J,
Pai KS,
Nemmani KVS, and
Talwar R
(2019) Prediction of volume of distribution in preclinical species and humans: application of simplified physiologically based algorithms. Xenobiotica 49:528–539.
OpenUrl

[87] Nigade PB,

[88] Gundu J,

[89] Pai KS,

[90] Nemmani KVS, and

[91] Talwar R

[92] ↵
Poulin P and
Krishnan K
(1995) An algorithm for predicting tissue: blood partition coefficients of organic chemicals from n-octanol: water partition coefficient data. J Toxicol Environ Health 46:117–129.
OpenUrl CrossRef PubMed

[93] Poulin P and

[94] Krishnan K

[95] ↵
Poulin P and
Theil FP
(2002) Prediction of pharmacokinetics prior to in vivo studies. 1. Mechanism-based prediction of volume of distribution. J Pharm Sci 91:129–156.
OpenUrl CrossRef PubMed

[96] Poulin P and

[97] Theil FP

[98] ↵
Rodgers T,
Leahy D, and
Rowland M
(2005) Physiologically based pharmacokinetic modeling 1: predicting the tissue distribution of moderate-to-strong bases. J Pharm Sci 94:1259–1276.
OpenUrl CrossRef PubMed

[99] Rodgers T,

[100] Leahy D, and

[101] Rowland M

[102] ↵
Rodgers T and
Rowland M
(2006) Physiologically based pharmacokinetic modelling 2: predicting the tissue distribution of acids, very weak bases, neutrals and zwitterions. J Pharm Sci 95:1238–1257.
OpenUrl CrossRef PubMed

[103] Rodgers T and

[104] Rowland M

[105] ↵
Simeon S,
Montanari D, and
Gleeson MP
(2019) Investigation of factors affecting the performance of in silico volume distribution QSAR models for human, rat, mouse, dog & monkey. Mol Inform 38:e1900059.
OpenUrl

[106] Simeon S,

[107] Montanari D, and

[108] Gleeson MP

[109] ↵
Smith DA,
Beaumont K,
Maurer TS, and
Di L
(2015) Volume of distribution in drug design. J Med Chem 58:5691–5698.
OpenUrl CrossRef

[110] Smith DA,

[111] Beaumont K,

[112] Maurer TS, and

[113] Di L

[114] ↵
Smith DA,
Di L, and
Kerns EH
(2010) The effect of plasma protein binding on in vivo efficacy: misconceptions in drug discovery. Nat Rev Drug Discov 9:929–939.
OpenUrl CrossRef PubMed

[115] Smith DA,

[116] Di L, and

[117] Kerns EH

[118] ↵
Trainor GL
(2007) The importance of plasma protein binding in drug discovery. Expert Opin Drug Discov 2:51–64.
OpenUrl CrossRef PubMed

[119] Trainor GL

[120] ↵
Trapp S,
Rosania GR,
Horobin RW, and
Kornhuber J
(2008) Quantitative modeling of selective lysosomal targeting for drug design. Eur Biophys J 37:1317–1328.
OpenUrl CrossRef PubMed

[121] Trapp S,

[122] Rosania GR,

[123] Horobin RW, and

[124] Kornhuber J

[125] ↵
Treyer A,
Mateus A,
Wiśniewski JR,
Boriss H,
Matsson P, and
Artursson P
(2018) Intracellular drug bioavailability: effect of neutral lipids and phospholipids. Mol Pharm 15:2224–2233.
OpenUrl

[126] Treyer A,

[127] Mateus A,

[128] Wiśniewski JR,

[129] Boriss H,

[130] Matsson P, and

[131] Artursson P

[132] ↵
Valkó K,
Bevan C, and
Reynolds D
(1997) Chromatographic hydrophobicity index by fast-gradient RP-HPLC: a high-throughput alternative to log P/log D. Anal Chem 69:2022–2029.
OpenUrl CrossRef PubMed

[133] Valkó K,

[134] Bevan C, and

[135] Reynolds D

[136] ↵
Wajima T,
Fukumura K,
Yano Y, and
Oguma T
(2003) Prediction of human pharmacokinetics from animal data and molecular structural parameters using multivariate regression analysis: volume of distribution at steady state. J Pharm Pharmacol 55:939–949.
OpenUrl PubMed

[137] Wajima T,

[138] Fukumura K,

[139] Yano Y, and

[140] Oguma T

Waring MJ, Arrowsmith J, Leach AR, Leeson PD, Mandrell S, Owen RM, Pairaudeau G, Pennie WD, Pickett SD, Wang J, et al. (2015) An analysis of the attrition of drug candidates from four major pharmaceutical companies. Nat Rev Drug Discov 14:475–486.

Waters NJ, Jones R, Williams G, and Sohal B (2008) Validation of a rapid equilibrium dialysis approach for the measurement of plasma protein binding. J Pharm Sci 97:4586–4595.

Wenzel J, Matter H, and Schmidt F (2019) Predictive multitask deep neural network models for ADME-tox properties: learning from large data sets. J Chem Inf Model 59:1253–1268.

Yau E, Olivares-Morales A, Gertz M, Parrott N, Darwich AS, Aarons L, and Ogungbenro K (2020) Global sensitivity analysis of the Rodgers and Rowland model for prediction of tissue: plasma partitioning coefficients: assessment of the key physiological and physicochemical factors that determine small-molecule tissue distribution. AAPS J 22:41.

Young RJ, Green DV, Luscombe CN, and Hill AP (2011) Getting physical in drug discovery II: the impact of chromatographic hydrophobicity measurements and aromaticity. Drug Discov Today 16:822–830.

Zou P, Zheng N, Yang Y, Yu LX, and Sun D (2012) Prediction of volume of distribution at steady state in humans: comparison of different approaches. Expert Opin Drug Metab Toxicol 8:855–872.
