Abstract
Cytochrome P450 2D6 (CYP2D6) is responsible for the metabolism and elimination of approximately 25% of clinically used drugs, including antidepressants and antipsychotics, and its activity varies considerably on a population basis, primarily because of genetic variation. CYP2D6 phenotype can be assessed in vivo following administration of an exogenous probe compound, such as dextromethorphan or debrisoquine, but use of a biomarker that does not require administration of an exogenous compound (i.e., drug) has considerable appeal for assessing CYP2D6 activity in vulnerable populations, such as children. The goal of this study was to isolate, purify, and identify an “endogenous” urinary biomarker (M1; m/z 444.3102) of CYP2D6 activity reported previously. Several chromatographic separation techniques (reverse phase HPLC, cation exchange, and analytical reverse phase UPLC) were used to isolate and purify 96 μg of M1 from 40 l of urine. Subsequently, 1D and 2D NMR, and functional group modification reactions were used to elucidate its structure. Analysis of mass spectrometry and NMR data revealed M1 to have similar spectroscopic features to the nitrogen-containing steroidal alkaloid, solanidine. 2D NMR characterization by HMBC, COSY, TOCSY, and HSQC-TOCSY proved to be invaluable in the structural elucidation of M1; derivatization of M1 revealed the presence of two carboxylic acid moieties. M1 was determined to be a steroidal alkaloid with a solanidine backbone that had undergone C-C bond scission to yield 3,4-seco-solanidine-3,4-dioic acid (SSDA). SSDA may have value as a dietary biomarker of CYP2D6 activity in populations where potato consumption is common.
SIGNIFICANCE STATEMENT Endogenous or dietary biomarkers of processes involved in drug disposition and response may allow improved individualization of drug treatment, especially in vulnerable populations, such as children. Given that several CYP2D6 substrates are commonly used in pediatrics and the ubiquitous nature of potato consumption in Western diets, SSDA has considerable appeal as a noninvasive biomarker of CYP2D6 activity to guide treatment with CYP2D6 substrates in children and adults.
Introduction
One goal of precision therapeutics is to individualize drug doses for patients based on their unique characteristics, thereby optimizing safety and efficacy. The cytochrome P450 2D6 (CYP2D6) enzyme is responsible for metabolizing up to 25% of clinically used drugs (Zhou, 2009a,b). CYP2D6 is highly polymorphic with more than 120 allelic variants identified and classified by the Pharmacogene Variation Consortium (PharmVar) at https://www.pharmvar.org/gene/CYP2D6 (Gaedigk et al., 2008). Inheritance of two recessive loss-of-function alleles results in a “poor-metabolizer phenotype,” whereas combinations of full- or reduced-function alleles result in a wide spectrum of phenotypes, including intermediate, extensive, and ultra-rapid metabolizers. As a result, significant interindividual variability in CYP2D6 phenotype has been observed across multiple patient populations, including within groups of nominally the same genotype (Gaedigk et al., 2008; Jones et al., 2010; Llerena et al., 2012; Brown et al., 2016). Given that phenotype prediction from genotype data can be confounded by environmental factors, such as drug-drug interactions, direct assessment of CYP2D6 phenotype may offer a more accurate assessment of CYP2D6 activity at a given point in time relative to genotyping (Wu, 2011; Kantae et al., 2017).
Traditional phenotyping studies require the administration of a probe compound that is a substrate of the drug metabolizing enzyme of interest, collection of urine for a defined period of time, followed by determination of the molar ratio of parent compound and metabolite (Streetman et al., 2000). The over-the-counter cough suppressant, dextromethorphan, is a relatively safe probe for assessment of CYP2D6 phenotype in vivo, and the molar ratio of dextromethorphan to dextrorphan in a 4-hour urine collection has been used to phenotype CYP2D6 activity in infants (Blake et al., 2007), children (Evans et al., 1989), pregnant women (Wadelius et al., 1997; Tracy et al., 2005), and adults (Gaedigk et al., 2008). Although widely used as a phenotyping probe, the dextromethorphan/dextrorphan metabolite ratio can be influenced by factors like urinary pH (Labbe et al., 2000), confounding the genotype-phenotype relationship. Other commonly used phenotyping probes, such as debrisoquine, sparteine, and bufurolol, are logistically more difficult to use because of prescription status or they are not approved for clinical use in the United States. Thus, there is considerable appeal for biomarkers reflecting CYP2D6 activity that do not require administration of a drug or other exogenous compound.
Several endogenous compounds have been identified as CYP2D6 substrates (Yu et al., 2003a,b), but none has been explored as a potential phenotyping probe. In contrast, our collaborators applied an untargeted, global metabolomics approach using urine collections from patients previously genotyped and phenotyped for CYP2D6 activity with dextromethorphan to identify an “endogenous” biomarker of CYP2D6 activity (Tay-Sontheimer et al., 2014). This effort identified a compound with parent ion mass of m/z 444.3102 (designated M1) and product ions m/z 98.0964, 370.2731, 206.1883, 56.0494, 55.0550 and 81.0692 (in order of decreasing abundance) that were absent in urine from CYP2D6 poor-metabolizers. M1 was further validated as a product of CYP2D6 activity by observing a decrease in its formation following inhibition of CYP2D6 activity with fluoxetine, a potent CYP2D6 inhibitor. Queries in the common metabolomic databases for the identity of M1 and product ions resulted in no matches, and no further details about the identity of M1 were provided, precluding any further assessment of its clinical utility as a biomarker of CYP2D6 activity.
The goal of this project was to isolate, purify, and elucidate the structure of M1. Through a series of chromatographic methods and 1D and 2D NMR experiments and derivatization reactions, we describe the purification and structural identification of a metabolic product of the nitrogen-containing steroidal alkaloid, solanidine, 3,4-seco-solanidine-3,4-dioic-acid (SSDA), as a potential biomarker CYP2D6 activity.
Materials and Methods
Materials and Reagents
Liquid chromatography-mass spectrometry Optima grade methanol and acetonitrile were purchased from Thermo Fisher Scientific (Waltham, MA). House water was purified using BARNSTEAD E-pure 4-model system to produce 18.1 MΩ-cm water and used without further purification. Liquid chromatography-mass spectrometry grade formic acid, trimethylamine (NEt3), ethanolamine, ammonium hydroxide solution (28.0%–30.0% NH3), acetic acid, sodium acetate (NaOAc), ammonium acetate (NH4OAc), ammonium bicarbonate [(NH4)(HCO3)], spherical C18 flash silica (45-75 μm, 70 Å), Syringe Loading injection Rheodyne Model 7125 and Supelclean LC-SCX SPE tubes 3 ml were obtained from Millipore-Sigma (St. Louis, MO). 10 cm Omnifit chromatography column was purchased from Diba Industries (msscientific Chromatographie-Handel, Berlin, Germany). Waters SunFire C18 (100 Ä, 5.0 μm, 4.6 × 150 mm) and Acquity UPLC BEH C18 (130 Ä, 1.7 μm, 2.1 × 100 mm) columns were purchased from Waters (Milford, MA). Methanol-D4 was purchased from Cambridge Isotope Laboratories, Inc (Tewksbury, MA). Imject EDC mcKLH Spin Kit was purchased from Thermo Fisher Scientific. A total of 40 l of urine was collected from a single individual, pooled in 4 l aliquots with the pH adjusted of each pooled aliquot to pH = 4.0–4.5 using acetic acid. The collection of urine for unknown compound purification was declared nonhuman subjects research by Children’s Mercy Kansas City Pediatric Institutional Review Board under 45 CFR 46.102(f) (not involving human subjects).
Chromatographic Purification of M1
A comprehensive series of preparative procedures were conducted to isolate and purify M1 for subsequent identification. Briefly, these steps included an initial purification using a 10 cm Omnifit C18 flash silica column, a 3 ml Supelclean LC-SCX solid phase extraction cartridge, a second 10 cm Omnifit C18 flash silica column purification step, followed by purification using a Waters C18 Sunfire column purification, and a final purification procedure using a Waters Acquity UPLC BEH C18 column. The various steps and procedures are summarized in Table 1, and a complete description can be found in the Supplemental Methods.
Analytical methods used in the isolation and purification of M1 from human urine
Chromatography and Mass Spectrometry
Fractions collected during preparative chromatography were analyzed on a Waters TQ-XS triple quadrupole step-wave mass spectrometer (Waters, Manchester, UK) that was connected to an I-Class Acquity UPLC with a flow-through needle sample manager. The instrument was operated in ESI+ mode. The probe capillary was optimized at 3.0 kV and desolvation temperature was set to 400°C. The source offset was set to 60 V. The following gas flows were used: desolvation gas, 800 l/h; collision gas flow, 0.15 ml/min; and nebulizer, 7.0 bar. Argon was used for collision-induced dissociation (CID). Analytes were separated on a Waters Acquity UPLC C18 reversed-phase column (BEH 1.7 μm, 2.1 × 50 mm) that was preceded by a Waters Acquity UPLC C18 VanGuard Precolumn (1.7 μm, 2.1 × 5 mm). The following gradient was used on the UPLC system: initial conditions of 1% B, increasing linearly to 99% B at 4.0 minutes, and held for 1 minute. At 5.10 minutes, the column was stepped down to initial conditions. Mobile phases A and B were H2O and MeOH, respectively, both containing 0.1% formic acid. The flow rate was 0.4 ml per min and column temperature was set to 25°C.
Nuclear Magnetic Resonance (NMR) Conditions
NMR samples were prepared by dissolving the compound in approximately 500 or 300 μl of CD3OD (Cambridge Isotopes, Tewksbury, MA), then transferring to a 5 mm conventional tube (Wilmad LabGlass, Vineland, NJ) or Shigemi tube (Shigemi Co. LTD., Tokyo, Japan), respectively. All NMR spectra were acquired using a Bruker AVIII 500 MHz spectrometer equipped with a cryogenically cooled broadband observe probe or 800 MHz Bruker AV spectrometer equipped with a cryogenically cooled triple resonance inverse probe. NMR experiments were acquired and processed using Bruker Topspin and MestreNova software, respectively, with conventional parameters, unless otherwise noted. 1H NMR shifts given were referenced internally to residual protonated solvent resonance at δ 3.31 ppm (CD2HOD). 13C NMR shifts provided were referenced internally to solvent resonance at δ 49.0 ppm (CD3OD). All values in parentheses are coupling constants (Hz) where 2J and 3J denote germinal and vicinal couplings, respectively.
Derivatization of SSDA
A 2 ml glass auto sample vial was charged with 10 μl of a 1:1000 diluted solution of purified unknown M1 and evaporated to dryness and redissolved with 1 ml of Imject EDC conjugation buffer. Then 10 μl of a 1M ethanolamine solution was added to the solution and [H+] was checked to ensure pH = 5. Next, 1-ethyl-3-(2-dimethylaminopropyl) carbodiimide HCl (EDC, 10 mg, 0.522 mmol) was added to the solution and vortexed. The mixture was placed in a 60°C oven. Reaction progress was analyzed by pulling 20 μl samples at time intervals of 30, 60, 90, and 240 minutes and diluted to final volume of 200 μl of using 90:10 10 mM NH4OAc buffer, pH = 7.0/MeOH solution. Samples were analyzed on Xevo G2−XS quadropole time-of-flight mass spectrometer that was connected to a M-Class Acquity UPLC with a flow-through needle sample manager (complete details of the chromatography conditions are provided in the Supplemental Methods).
Results
Elemental Composition By High Resolution Mass Spectrometry
Using high resolution mass spectrometry, the monoisotopic mass (M+H)+ of M1 was determined to be 444.3115 Da, and based on this information, the isotopic distribution and the nitrogen rule, an elemental composition of C27H41NO4 was proposed. This elemental composition deviated from the theoretical value of 444.3114 by 0.1 mDa (0.2 ppm) and matched the predicted isotopic pattern (I-Fit confidence of 99.52%). The double bond equivalent (DBE) of this elemental composition was 8. Querying the elemental composition against major databases yielded no identification, as previously reported (Tay-Sontheimer et al., 2014), and a relatively high collision energy (50 eV) was required to fragment M1 into the previously reported fragments. We performed a search of the fragments against the HMDB database (using an in-house developed tool) and found that the fragment at 98.0964, and the couplet of 204.1743 and 206.1906 matched the in-silico generated fragmentation pattern of the steroidal alkaloid solanidine. A literature search revealed that the low molecular weight CID fragments of M1 have been commonly observed in various steroidal alkaloids (Cahill et al., 2010; Zhou et al., 2010); however, the 444.3→370.3 transition, a loss of 74 Da, is unique for this molecule and could not be adequately explained. A search of the steroidal alkaloidal literature for a better match than a solandine-derived structure was unsuccessful, and because of the limited utility of CID spectra for de novo identification of steroidal alkaloids, the decision was made to isolate the compound.
Isolation of M1 From Human Urine
Comparison of peak areas between a known concentration of solanidine and an unknown concentration of M1 resulted in an estimated M1 concentration of 20 nM or 8.9 μg per l in human urine. Based on NMR sensitivity considerations (e.g., https://www.chem.wisc.edu/∼cic/nmr/Guides/Other/sensitivity-NMR.pdf) we estimated that approximately 100–300 μg of M1 would be required to obtain an interpretable 13C{1H} NMR spectrum (Supplemental Fig. 15). Therefore, M1 isolation and purification was initiated using 40 l of human urine, using the series of chromatographic steps described in Methods. The challenges associated with each step and the approaches taken to resolve those issues are described in more detail in Supplemental Results. The final product of the isolation and purification process was a solution that appeared pure by quadropole time-of-flight mass spectrometry and served as the source material for interrogation by NMR and determination of M1 structure.
NMR Analysis of M1
Multiple NMR experiments were performed to characterize purified M1, including 1H, 13C{1H}, DEPT135-13C{1H}, DEPT90-13C{1H}, HSQC, HMBC, COSY, 2D TOCSY, and HSQC-TOCSY; raw spectra are provided in the Supplemental Material (Supplemental Figs. 1–37). The interpreted NMR spectra are summarized in Table 2 and Table 3.
1H and 13C{1H} data for M1 and solanidine
NMR data for M1
13C{1H}, DEPT135-13C{1H} and DEPT90-13C{1H} (Supplemental Figs. 12–15) experiments confirmed the elemental composition of M1 and documented the presence of twenty-seven unique carbons, including four methyl (CH3), nine methylene (CH2), nine methine (CH), and five quaternary (C) carbons. The 1H NMR experiment showed 39 protons, two less than the proposed elemental composition; however, exchangeable protons are not observed in 1H NMR experiments acquired in protic solvents (e.g., CD3OD).
1H and 13C{1H} NMR spectra of solanidine in CD3OD were prepared so that the NMR spectra of M1 could be compared against that of solanidine (Supplemental Fig. 1 and Table 2). Solanidine contains several structural features that result in diagnostic signals in 1H and 13C{1H} NMR experiments because of 1) the 4 methyl groups, 2) the heterocyclic structure (because of the nitrogen), 3) the unsaturation in the B-ring, and 4) the single hydroxyl group. Analogous to solanidine, four methyl groups were observed for M1, and the splitting patterns of the four methyl groups (C-18, C-19, C-21, and C-27) matched the expected pattern of singlet, singlet, doublet and doublet, for each methyl group respectively. Zooming in on the aliphatic region of the 1H NMR spectrum (Supplemental Fig. 2) of M1 highlights the chemical resonances of four methyl groups (Supplemental Fig. 3). Additionally, 13C{1H} chemical shifts of the carbons adjacent to the nitrogen (C-16, C-17, and C-22) are highly characteristic for solanidine and have values of 71.1, 62.9 and 76.3 ppm for M1 (Table 2). The unsaturation in the B-ring of solanidine on C5 and C6 could also be observed in M1, although C6 resonated 13.6 ppm downfield in M1, relative to solanidine (135.9 versus 122.3 ppm). Interestingly the last diagnostic signal in solanidine, is related to the hydroxyl group on carbon 3, which was not observed in M1.
Stepwise connectivity models used to identify M1 as SSDA are summarized in Fig. 1. Our strategy hinged on using the HSQC, HMBC, 2D TOCSY, and HSQC-TOCSY experiments to build these connectivity models, which were initiated at resonances that were separated from the overlapping aliphatic regions of the spectra, namely methyl groups and the olefin. We started with the resonance at 1.27 ppm in the 1H NMR spectrum. In the HMBC experiment, this proton (C19-H) displayed correlations with four carbon atoms, C-5, C-9, C-10, and ambiguous cross peak with either C-1, C-2, C-7, or C-24. C-10 and C-5 are both quaternary carbons because they had no correlations in the HSQC experiment. Given the chemical shift of C-5 (δ 142.2), it is highly suggestive that it is a sp2-hybridized carbon. Likewise, in the HMBC experiment C6-H displayed correlations with C-8, C-10, C-4, and ambiguous correlation with either C-1, C-2, C-7, or C-24. The 1H and 13C{1H} chemical shifts of C-6 suggest it is an olefin group (-HC=). The resonance assigned to C4 has a chemical shift of 174.1 ppm, implying the presence of a carbonyl moiety. C6-H in M1 is significantly shifted downfield from that of solanidine (Table 2), suggesting the presence of an electron withdrawing group near C6-H. Finally, the C3-H of solanidine is missing in the 1H spectrum of M1, indicating possible structural modification of the A-ring.
Stepwise connectivity models of SSDA structure as described in Results with hydrogen atoms omitted for clarity. (A) Z = ambiguous assignment of C-1, C-2, and C-24. X = C or heteroatom; (B) Z = ambiguous assignment of carbons 1, 2, and 24. X = C or heteroatom; (C) X = C or heteroatom; (D) X = C or heteroatom; (E) X = C or heteroatom; R = C or heteroatom; R’ = C or heteroatom; (F) X = heteroatom.
The HSQC-TOCSY experiment revealed that C6-H had correlations with only C8-H, and ambiguous correlations with either C-1, C-2, C-7, or C-24 further supported the assignment of C-6 as adjacent to a quaternary carbon, C-5 (Fig. 1A). Analysis of the 2D COSY spectrum revealed correlation of C6-H with C7-H' (Supplemental Fig. 30). C-7 contained diastereotopic protons (H', δ 2.17) and (H, δ 1.73–1.69) (Table 2). C7-H' had correlations with C-5, C-6, C-8, C-9, and C-14 in the HMBC experiment. The HSQC-TOSCY C7-H' showed only cross peaks with C-6 and C-8. Unfortunately, C7-H overlapped with C2-H and C8-H proton resonances. The HMBC experiment for C8-H had cross peaks with C-5, C-6, C-9, C-14, and C-19. Incorporation of this information expanded the stepwise connectivity model to include C-8, C-9, and C-14 (Fig. 1B).
HSQC data revealed that C-2 contained diastereotopic protons, C2-H' (δ 2.33) and C2-H (δ 1.79–1.71), and the HMBC experiment demonstrated that C2-H' had correlations with C-3, C-9, C-10, C-19, where C-3 had a chemical shift of 181.5 ppm, suggesting it could be a carbonyl group. The HSQC-TOCSY experiment showed a single correlation that could not be resolved unambiguously, however, the observation of only one correlation in the HSQC-TOCSY implies that C-2 is adjacent to at least one quaternary carbon. Since the C2-H' resonance at 2.33 ppm was well separated, we could use the 2D TOCSY experiment to determine that this resonance is coupled with diastereotopic protons at δ 2.07 and δ 2.03–1.99 ppm, along with geminal coupling with C2-H at δ 1.79–1.71 (Supplemental Fig. 6 and 7). Resonances C1-H and C15-H' (δ 2.03–1.99) are overlapped with each other (Supplemental Fig. 6 and 7). Both C-1 and C-15 have diastereotopic protons, C1-H' (δ 2.07) and C1-H (δ 2.03–1.99) and C15-H' (δ 2.03–1.99) and C15-H (δ 1.32–1.29). Since C2-H' did not have a correlation with C15-H in the TOCSY experiment, the correlation at δ 2.07 and δ 2.03–1.99 represents C1-H' and C1-H, respectively (Supplemental Fig. 6 and 7). This is further supported by the HMBC experimental results wherein C1-H' had correlations with C-3, C-10 and ambiguous cross peaks with C-2, C-7, or C-24. The TOCSY for C1-H'/C15-H and C1-H also had only two correlations, C2-H' (δ 2.33) and C2-H (δ 1.79), indicating it was also adjacent a quaternary carbon (Supplemental Fig. 7). We also noted there was no correlation between C15-H and C2-H' in the 2D TOCSY (Supplemental Fig. 32). With these results, the stepwise connectivity model was broadened to include C-1 through C-3 (Fig. 1C).
The HSQC experiment revealed C-11 to have diastereotopic protons, C11-H' (δ 1.66, dd, 2JH-H = 13.0 Hz, 3JH-H = 3.8 Hz) and C11-H (δ 1.54, qd, 2JH-H = 13.0 Hz, 3JH-H = 3.8 Hz). The HMBC experiment showed C11-H' correlations to C-9 and C-12. C-12 has diastereotopic protons C12-H' (δ 1.88–1.78) and C12-H (δ 1.34–1.28). Focusing on the other methyl resonance that is a singlet, C18-H (δ 0.90), the HMBC experiment revealed that C18-H correlates to C-12, C-13, C-14, C-17, and C-20. HSQC-TOCSY demonstrated that C18-H had no correlations because it was adjacent to a quaternary carbon, C-13. The HMBC experiment for C14-H revealed correlations to C-8, C-13, C-14, C-16, C-17, and C-18, and C-11, C-12, C-13, C-16, C-17, C-18, and C-20 could now be added to the connectivity model (Fig. 1D).
D-ring Assignment
The third methyl group (δ 1.03 ppm, d, 3JH-H = 6.2 Hz), which is a doublet and hence, must be adjacent to a carbon with only one unique hydrogen, making it a methine (-CH) carbon, is assigned to C21-H. The HMBC for C21-H had correlations to C-17, C-20, and C-22. The HSQC-TOCSY for C21-H showed correlations to C-16, C-17, C-20, C-22, C-23, and ambiguous cross peaks to C-1, C-2, C-7, or C-24 (Table 3). Unfortunately, C22-H had no correlations in the HMBC and HSQC-TOCSY experiments. We examined the COSY experiment for C21-H to see if any correlations could be found. C21-H correlated to one multiplet (δ 1.88–1.79) that represented overlapped resonances from C12-H', C17-H, C20-H, C24-H', and C25-H.
Given the downfield shift of the C-16 (δ = 71.1 ppm; Table 2), we reasoned that C-16 is adjacent to an electronegative atom. C-16 in solanidine has a resonance at δ 70.9 ppm and can explain the downfield shift caused by the presence of the tertiary alkylamine. In the M1 1H NMR spectrum C16-H and C26-H are overlapped (δ 3.26–3.20). The HMBC experiments for C16-H and C26-H showed no correlations; however, the HSQC-TOCSY displayed correlations with C-15, C-25, C-26, and C-27, and ambiguous cross peaks to either C-1, C-2, C-7, or C-24. Based on the DEPT90 and DEPT135 experiments, C-16 is a methine carbon and C-26 is a methylene. The 2D TOCSY experiment of C16-H had correlations with C15-H' (δ 2.03–1.99) and C21-H. C16-H also had correlations with overlapped two regions δ 1.88–1.80 ppm (C12-H', C17-H, C20-H, C24-H' and C25-H) and δ 1.32–1.29 ppm (C9-H, C12-H, C14-H and C15-H). The COSY experiment for C16-H had two correlations with C15-H' and overlapped resonances of C12-H', C17-H, C20-H, C24-H and C25-H. There are two points of interest here. First, both HSQC-TOCSY experiments for C14-H and C16-H had correlations with C15 and C17. Second, the COSY for C16-H had correlations with C15-H' and C17-H, whereas the 2D TOCSY for C16-H had correlations with C14-H, C15-H', and C17-H. One drawback to an HMBC experiment is that pulse sequences use a single fixed delay based on the average value of nJCH (Reynolds and Enríquez, 2002). Complex molecules, such as steroidal-alkaloids, have a range of nJCH values (2–15 Hz) and correlations with nJCH values that differ significantly from the average will be filtered. Although C-15 did not have correlations with C16-H or C17-H, the HMBC experiments for C15-H' (δ 2.03–1.99) had correlations with C-13, C-14, and C-17. Additionally, the HMBC experiments for C15-H (δ 1.32–1.29) had correlations with C-13, C-14, C-16, and C-17, and C-18 (Table 2). Based on these results, the D-ring assignment was completed and included in the stepwise connectivity model (Fig. 1E).
Proposed M1 Structure
Identification of the remaining carbons was completed by examining the final methyl group, C-27. HMBC experiments for C27-H showed correlations with C-25 and C-26, and ambiguous cross peak to either C-1, C-2, C-7, or C-24. Like C-21, C-27 must be directly adjacent to a methine carbon, in this case C-25. C-26 was significantly downfield shifted compared with C-24 (Δ = 26.7 ppm) and is directly bonded to the nitrogen atom. The HSQC-TOCSY experiment of C27-H resulted in correlations with C-23, C-25, and C-26, and ambiguous cross peaks to C-1, C-2, C-7, or C-24. Assignment of C-23 was completed using the COSY experiment and was found to have diastereotopic protons, C23-H' (δ 1.93) and C23-H (δ 1.41–1.34). Analysis of C23-H' resulted in correlations with C24-H' (δ 1.83–1.78), the multiplet at δ 1.41–1.34 (geminal coupling), and C24-H (δ 1.08–1.04). The HSQC-TOCSY results for C23-H' had correlations with C27, C24, C25, and C26. HSQC-TOCSY results for C23-H had correlations with only C-24. The completed connectivity model for M1 is presented in Fig. 1F.
Heteroatom Identification and Final Structure of SSDA
For heteroatom identification we reviewed the elemental composition of M1, C27H41NO4 and noted that the proposed structure in Fig. 1F has a molecular formula of C27H39NO2 and is missing two oxygen atoms and two hydrogen atoms. The total sum of the integration values for the resonances in the 1H spectrum had a value of 39. Given that all carbon atoms were identified, the remaining oxygen and hydrogen atoms must be part of the carbonyl moieties at C-3 and C-4. Therefore, we propose that M1 contains two carboxylic acid functional groups and is identified as 3,4-seco-solanidine-3,4-dioic acid (SSDA; Fig. 2).
Structures of solanidine (A) and final proposed structure of M1 as 3,4-seco-solanidine-3,4-dioic acid (SSDA) (B). For solanidine, rings are labeled with letters and carbon atoms are numbered; hydrogen atoms are omitted for clarity. For SSDA, the numbering of carbon atoms has been omitted for clarity, except for A-ring carbons.
Confirmation of SSDA Dicarboxylic Structure
Orthogonal confirmation of the carboxylic acid moieties was acquired by activating the carboxylate groups with carbodiimide, followed by reaction with ethanolamine (Supplemental Fig. 32). Reaction progress was monitored using mass spectrometry. The first amidation reaction was facile, with the disappearance of the parent ion of m/z 444.31 and appearance of m/z 487.35, indicating the addition of 1 unit of ethanolamine. The second amidation required an increase in temperature to 60°C and was confirmed with the appearance of the m/z 530.39 (Supplemental Fig. 38 and 39), consistent with the proposed SSDA structure.
Determination of Parent and Product Ion Structures
Support for the proposed structure of SSDA is further corroborated by the product ion assignments from the MS/MS fragmentation experiment (Fig. 3). Collision induced dissociation (CID) of the parent ion of m/z 444.3123 resulted in six major product ions: m/z 370.2744, m/z 356.2582, m/z 326.2851, m/z 206.1908, m/z 150.1270, m/z 136.1116, and m/z 98.0983, and each of the major fragments can now be explained by simple bond breaking. The product ion of m/z 370.2744 had a mass loss of m/z 74.0379 with an elemental composition of C3H6O2. This fragmentation represents the loss of the propionic acid chain. The product ion of m/z 356.2582 had an elemental composition of C23H36NO2 and represents a mass loss of m/z 88.0541 from the parent ion, which would be consistent with propionic acid and a methyl group. The product ion 326.2851 m/z had an elemental composition of C23H36N. The mass loss difference between product ions m/z 370.2744 and m/z 326.2851 of 43.9893 represents loss of CO2, or the second carboxylic acid moiety. The product ion of m/z 206.1908 had an elemental composition of C14H24N and represents the loss of B and C rings. The product ion m/z of 150.1270 had an elemental composition of C10H16N and can be described as the fragment composed of a hydro-indolizine ring, a diagnostic feature of solanidine-based alkaloids (Lawson et al., 1997; Shou et al., 2010). The product ion of m/z 98.0983 had an elemental composition of C6H12N and is described as the six-member amino-alkane ring.
MS/MS fragmentation chromatogram of SSDA (A), with labeled product ions. The MS/MS fragmentation chromatogram for the parent compound, solanidine, is provided for reference (B).
Quantitative NMR was performed using an internal calibrant, and a quantity of 96 μg of SSDA was determined with a purity of 99.8% with the assumption that salts and other inert NMR impurities have been removed from the overall purification methods (Supplemental Tables 1–3).
Discussion
In this investigation, we report the structure of an “endogenous” biomarker of CYP2D6 previously discovered by an untargeted, global metabolomics approach using urine collections from patients previously genotyped and phenotyped for CYP2D6 activity with dextromethorphan (Tay-Sontheimer et al., 2014). The compound, designated “M1” was characterized by a parent ion mass of m/z 444.3102, was absent in urine of CYP2D6 poor metabolizers, and its formation was reduced in vivo by coadministration of the CYP2D6 inhibitor, fluoxetine. Present at relatively low concentrations in urine, M1 was isolated and purified from ∼40 l of urine and determined to be a metabolite of solanidine, 3,4-seco-solanidine-3,4-dioic acid. As solanidine is a toxic glycoalkaloid produced by members of the Solanaceae family and found at particularly high concentrations in the metabolically active parts of plants, such as leaves, flowers, and sprouts of potatoes (Omayio et al., 2016; Saranya et al., 2018), SSDA is dietary in origin and not an “endogenous” biomarker of CYP2D6 activity.
The structure of SSDA was somewhat unusual and unexpected, but the final structure is plausible for several reasons. First, the 1H NMR experiments were conducted in CD3OD and acidic protons are known to undergo H/D exchange in protic deuterated solvents like methanol. Secondly, the 13C{1H} chemical shifts for C-3 and C-4 are δ 181.5 and 174.1, respectively, and correspond very well to reported chemical shifts of carboxylic acid moieties in C27 products. For example, 3,4-seco-12α-hydroxy-5β-cholan-3,4,24-trioic acid, a bile acid produced from cholesterol metabolism, has been synthesized and characterized using NMR techniques (Sekiguchi et al., 2017). The reported 13C{1H} chemical shifts for C-3 and C-4 are δ 176.4 and δ 177.2, respectively. Furthermore, scission of the A-ring is not uncommon. Exploration of the literature reveals that A-ring opening of steroidal compounds is not unprecedented as there are multiple examples that proceed by a mechanism involving oxidation of the alcohol followed by a Baeyer-Villiger oxidation and, subsequently, hydrolysis. For example, trans-cyclohexa-1,2-diol has been reported to be oxidized to the dicarboxylic acid adipate by the fungus Acinetobacter TD63 via a mechanism that involves an initial dehydrogenation to 2-hydroxycyclohexane-1-one, ring expansion to form a lactone, 1-oxa-2-oxo-7-hydroxycycloheptane, rearrangement to 6-oxohexanoate, and subsequently adipate (Davey and Trudgill, 1977). Similarly, 3β-acetoxy-5α-androstan-3-one is reported to undergo A-ring opening when incubated in the presence of Myceliophthora thermophile (Hunter et al., 2009), and the A-ring of testosterone can be cleaved under anaerobic conditions by Steroidobacter denitrificans (Wang et al., 2013). Perhaps of most relevance to SSDA formation, the structurally similar steroidal alkaloid, tomatidine, has been shown to undergo degradation via A-ring opening to form 4-hydroxy-3,4-seco-tomatidine-3-oic acid in the presence of the fungus, Gymnoascus reesii (Gaberc-Porekar et al., 1983).
There are also several reports describing a role for mammalian oxidative enzymes catalyzing Baeyer-Villiger oxidation of steroidal substrates (Leisch et al., 2011). Verhoeven and coworkers reported that A-ring opening of a progestogen, (17α)-17-hydroxy-11-methylene-19-norpregna-4,15-dien-20-yn-3-one, was the major metabolic route after oral administration of the compound to rats (Verhoeven et al., 1998), and Lai et al. (2011) reported on the oxidation and opening of a keto-piperidine group of a poly(ADP-ribose) polymerase inhibitor; in the latter study, in vitro experiments revealed FMO5 to be responsible for the ring-opening via Baeyer-Villiger oxidation. Finally, a novel secondary bile acid in the common ring tail possum was recently described and mimics the exact A-ring opening that we observed (Sekiguchi et al., 2017); A-ring scission through a Baeyer-Villiger mechanism and oxidation of the resulting alcohol (following hydrolysis) to the carboxylate is a multistep process.
Given that minimal to no formation of SSDA is observed in CYP2D6 poor metabolizers and that formation is reduced in the presence of CYP2D6 inhibitors in vivo, it is likely that CYP2D6 catalyzes the rate-limiting step in SSDA formation. In fact, incubation of solanidine with heterologously expressed CYP2D6 produced a hydroxylated metabolite that has been identified as 4-hydroxysolanidine by an NMR analysis approach analogous to that described for SSDA (manuscript in preparation), a significant finding in that 3-keto-4-hydroxy compounds are potential substrates for Baeyer-Villiger oxidations, as discussed above. Based on the reported trans-cyclohexa-1,2-diol to adipate pathway (Davey and Trudgill, 1977) (Fig. 4A) and the reports of 3,4-seco-12α-hydroxy-5β-cholan-3,4,24-trioic acid formed from a secondary bile acid (Sekiguchi et al., 2017) (Fig. 4B) and 4-hydroxy-3,4-seco-tomatidine-3-oic acid from tomatidine (Gaberc-Porekar et al., 1983) (Fig. 4C), we propose a pathway for formation of SSDA from solanidine following CYP2D6-mediated formation of 4-hydroxysolanidine in Fig. 4D. Consistent with this pathway, we have also observed structures consistent with dehydrogenation of 4-hydroxysolanidine to form 4-keto-solanidine and 3-keto-4-hydroxysolanidine in patient urine samples, although the analytes and the enzymes involved in the processes have not yet been identified (manuscript in preparation).
Proposed pathway for SSDA formation involving Baeyer-Villiger chemistry based on literature precedents. (A) Proposed pathway for adipic acid formation from trans-cyclohexa-1,2-diol (Davey and Trudgill, 1977); (B) A-ring opening to form 3,4-seco-12α-hydroxy-5β-cholan-3,4,24-trioic acid formed from deoxycholic acid (Sekiguchi et al., 2017); (C) Formation of 4-hydroxy-3,4-seco-tomatidine-3-oic acid from tomatidine (Gaberc-Porekar et al., 1983); (D) Proposed pathway for formation of 3,4-seco-solanidine-3,4-dioic acid following an initial CYP2D6-mediated hydroxylation to form 4-hydroxysolanidine.
In summary, we have presented the isolation and structural elucidation of SSDA using supporting data from several 1D and 2D NMR experiments, functional group modification reactions, and mass spectrometry data. Collectively, the data all support the proposed structure for SSDA and increase confidence in the final assignments. Derived from solanidine, a steroidal alkaloid naturally occurring in plants of the Solanaceae family, such as potatoes, a urinary metabolite ratio of solanidine/SSDA would be analogous to the dextromethorphan/dextrorphan ratio used to assess CYP2D6 phenotyping following administration of cough suppressant, dextromethorphan. Indeed, during preparation of this manuscript, Magliocco et al. (2021) applied an elegant pharmacometabolomic approach to provide compelling evidence for solanidine-based metabolite ratios in urine and plasma as biomarkers of CYP2D6 activity in vivo. Identification of m/z 444.3102 as SSDA and the proposed pathway presented in Fig. 4D provides insight into the identities for the major CYP2D6 metabolites m/z 414.3366 and m/z 412.3210 reported in that study. Given that potatoes or potato-derived products are relatively common in North American and European diets, and the relatively long half-life of solanidine [t1/2 = 5 days; (Harvey et al., 1985)], we concur with Magliocca et al. (2021) that solanidine-based metabolite ratios may have broad utility as a measure of CYP2D6 activity that does not require exogenous drug administration. Clearly, multiple metabolic steps are involved in the biotransformation of solanidine to form SSDA and it is possible that a ratio of solanidine to various combinations of metabolites along the path from 4-hydroxysolanidine to SSDA may ultimately prove superior to administration of an exogenous compound, such as dextromethorphan, as a convenient biomarker of CYP2D6 activity for application in precision therapeutic settings.
Authorship Contributions
Participated in research design: Behrle, Leeder, van Haandel
Conducted experiments: Behrle, Douglas, van Haandel
Performed data analysis and interpretation: Behrle, Douglas, van Haandel
Wrote or contributed to the writing of the manuscript: Behrle, Douglas, Leeder, van Haandel
Footnotes
- Received May 26, 2022.
- Accepted June 29, 2022.
Funding support for this article was provided by the National Institutes of Health (NIH) and National Science Foundation (NSF) (R01HD058556, S10RR024664, P30GM110761, NSF #1625923)
The longitudinal CYP2D6 phenotyping study in which SSDA was discovered and originally described as an “endogenous” biomarker of CYP2D6 activity was funded by [Grant R01-HD058556], Exogenous and Endogenous Biomarkers of CYP2D6 Variability in Pediatrics (Leeder and Lin, co-PIs). Isolation, purification, and identification of M1 as SSDA was supported by internal funding, Children’s Mercy Kansas City (van Haandel, PI). Support for the 500 MHz NMR was provided by National Institutes of Health Shared Instrumentation [Grant S10-RR024664] and NSF Major Research Instrumentation Grant # 1625923. The 800 MHz instrument is supported by an Institutional Development Award (IDeA) from the NIH National Institute of General Medical Sciences [Grant P30-GM1110761].
↵1 Current affiliation: Department of Bioanalytical Chemistry, Charles River Laboratories, Mattawan, Michigan.
↵2 Current affiliation: Analytical Research and Development, NCE Project Team, Abbvie, North Chicago, Illinois.
Part of this work was presented as follows: (2017) Metabolomics in Translational Research Summit. Vanderbilt University, October 30, 2017
(2019) Found in Translation: Adaptive CMPK Strategies for Emerging Technologies. 22nd Annual Land O’ Lakes Drug Metabolism and Applied Pharmacokinetics Conference, September 9–12, 2019.
↵
This article has supplemental material available at dmd.aspetjournals.org.
Abbreviations
- 1D
- one dimensional
- 2D
- two dimensional
- COSY
- 1H-1H correlation spectroscopy
- DEPT90
- distortionless enhancement by polarization transfer with a final proton pulse angle of 90°
- DEPT135
- distortionless enhancement by polarization transfer with a final proton pulse angle of 135°
- HMBC
- heteronuclear multiple-bond correlation
- HSQC
- heteronuclear single quantum correlation
- HSQC-TOCSY
- heteronuclear single quantum correlation with additional total correlation spectroscopy transfer
- MS/MS
- tandem mass spectroscopy
- SSDA
- 3,4-seco-solanidine-3,4-dioic acid
- TOCSY
- total correlation spectroscopy
- UPLC
- ultra-performance liquid chromatography
- Copyright © 2022 by The American Society for Pharmacology and Experimental Therapeutics