![]() |
|
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Department of Medicine, Section of Hematology/Oncology (A.L.H., H.-J.K., Q.N., S.A.C., O.I.O.), Department of Health Studies (D.H.), and Department of Human Genetics (D.L.F., S.D., O.I.O.), University of Chicago, Chicago, Illinois, Northern California Cancer Center, Fremont, California (E.M.J., D.W.W.); and Stanford University School of Medicine, Stanford, California (D.W.W., A.S.W.)
(Received January 3, 2007; accepted April 30, 2007)
| Abstract |
|---|
|
|
|---|
The unifying premise of integrative epidemiology suggested by Spitz et al. (2005
) is that the same genes that are implicated in cancer risk may also be involved in a person's propensity to carcinogenic exposure and/or to modulation of therapeutic outcome. Therefore, constructing genetic profiles that could be used to individualize therapy may also increase our understanding of cancer risk genes and may be applied to cancer development and prediction of outcome. To this end, differences in TA repeats of UGT1A1 have been shown to be responsible for the toxic effects of irinotecan, an anticancer drug (Iyer et al., 2002
), and have also been linked with cancer susceptibility (Guillemette et al., 2000
; Adegoke et al., 2004
). Of interest are studies suggesting an association of UGT1A1 TA repeat polymorphisms with breast cancer among African Americans but not Caucasians (Guillemette et al., 2000
, 2001
).
UGT1A1 is the main isoform that glucuronidates bilirubin (Tukey and Strassburg, 2000
), and variants in the TATA promoter and phenobarbital-responsive enhancer module (PBREM) region have been studied extensively, but variation in serum bilirubin levels is not explained solely by TA repeats. Few studies have explored the polymorphisms throughout the UGT1A1 gene and assigned functions to these polymorphisms (Sai et al., 2004
; Kaniwa et al., 2005
). In addition, few studies have examined variants of UGT1A1 related to total serum bilirubin within the normal range (<1.0 mg/dL) (Bosma et al., 1995
; Sai et al., 2004
; Lin et al., 2006
). Nevertheless, it is important to evaluate these to understand whether variants causing hyperbilirubinemia are similar to or different from those involving variation of total bilirubin levels within the normal range. Identifying common alleles of UGT1A1 related to bilirubin have important implications in both cancer treatment and prevention. A recent clinical trial has shown that total bilirubin before treatment was a positive predictor of grade 4 neutropenia in cancer patients receiving irinotecan (Innocenti et al., 2004
). Additionally, recent works have shown that the risk of cardiovascular disease is lower in those with higher bilirubin levels (Lin et al., 2006
). It also has been hypothesized that bilirubin itself may be a potent antioxidant and its slight increase within the normal range may be beneficial in reducing risk of cancer (Grant and Bell, 2000
).
Furthermore, population-based studies have shown that serum bilirubin levels vary with gender, race, and smoking status and are 10 to 20% lower in African Americans than in Caucasians (Zucker et al., 2004
). Although there are well established differences in UGT1A1 TA repeat allele frequencies between Caucasians and African Americans, with (TA)5 and (TA)8 being absent in Caucasians (Beutler et al., 1998
; Guillemette et al., 2000
, 2001
), it remains unclear whether there are additional differences in other UGT1A1 single nucleotide polymorphisms (SNPs) between the two racial groups, which may explain some of the observed racial difference in serum bilirubin levels.
In this study, we aimed to 1) compare the variations in (TA)n and SNPs from the PBREM region to the 3'-conserved region of UGT1A1 between African-American and Caucasian women; 2) describe the linkage disequilibrium (LD) and construct haplotypes for these two populations; and 3) examine the relationship between UGT1A1 polymorphisms and total bilirubin among individuals with total bilirubin levels within the normal range. To our knowledge, this is the first study to examine the entire region of the UGT1A1 gene in relation to total bilirubin level in Caucasian and African-American women.
| Materials and Methods |
|---|
|
|
|---|
|
SNP Selection. Novel SNP identification was based on resequencing of the UGT1A gene cluster (Maitland et al., 2006
). Briefly, optimal sets of mapped based tagging SNPs (tSNPs) were generated from LDSelect (Carlson et al., 2004
) for African-American and Caucasian populations (unpublished data). As expected, the number of tSNPs for Caucasians (n = 10) was smaller than that for African Americans (n = 18) because the number of genetic variations is generally smaller in the former. The criteria for selecting these tSNPs were setting the SNPs with a relative MAF >10% and a r2 threshold of 0.8. An additional 6 SNPs were identified as well as (TA)n. In total, there were 26 unique SNPs based on the resequencing of the UGT1A1 genomic region. Of these, 14 SNPs in addition to (TA)n were selected for the expanded genotyping in the current study, whereas other SNPs were not pursued because of high repetitive sequences within primer sets.
PCR for (TA)n. Variants were genotyped as described previously (Te et al., 2000
). Briefly, approximately 5 ng of DNA was amplified by PCR in 10 µl with primers flanking the polymorphic TA locus in the promoter region of the UGT1A1 gene. The only protocol modifications to cycling were a final extension at 72°C for 5 min compared with 10 min as described previously. Control DNAs from individuals known to have a 6/6, 6/7, or 7/7 genotype were included in the PCR analysis. The amplified product yielded a 93-, 95-, 97-, or 99-base pair fragment, which corresponded to (TA)5, (TA)6, (TA)7, or (TA)8, respectively. Samples were diluted 1:4, subjected to sequencing on an ABI 377 3700 Automated DNA Sequencer for (TA)n, and scored via Gene Mapper 3.7 (Applied Biosystems, Foster City, CA).
SNP Genotyping. Genotyping was performed using the SNPstream (Beckman Coulter, Fullerton, CA) and procedures followed the specification for the instrument. Primer design for PCR and single base extension (SBE) was performed with Beckman Coulter Autoprimer software. PCR reactions were organized by SNP type. Reactions were composed of PCR primers at a final concentration of 50 nM, 0.2 U of enzyme Hot Master Taq (Eppendorf, Hamburg, Germany), and 2 to 5 ng of genomic DNA per reaction. Amplification was performed according to the manufacturer's conditions. All postamplification steps were performed according to SNPstream specifications. Briefly, PCR cleanup was accomplished by treating reactions with 3 µlof Exo/SAP (USB, Cleveland, OH) with incubation for 30 min at 37°C followed by 100°C for 10 min. SBE reactions were performed using reagents and protocols specific to the SNPstream platform. SBE primers were present at a final concentration of 20 nM. Reactions were hybridized to Beckman array plates and scanned by the SNPstream. Genotyping of SNPs was done by DNAPrint Genomics (Sarasota, FL).
Bilirubin Levels. To explore the relationship between UGTIA1 genotypes and total bilirubin levels, data on serum total bilirubin levels were obtained from the University of Chicago Hospital medical records. Total bilirubin values for patients diagnosed with cancer were collected before the start of any treatments. If multiple values were available for a patient, the mean total bilirubin level was calculated. The standard deviation for these patients ranged from 0.05 to 0.35 mg/dl, with a median of 0.11 mg/dl. Bilirubin data for individuals who had known liver metastasis or significant hepatic dysfunction were excluded for the analysis.
Statistical Analysis. Hardy-Weinberg equilibrium (HWE) for all loci was examined among African Americans and Caucasians separately using a
2 test. Differences in allele frequencies between African Americans and Caucasians were compared using a
2 test or Fisher's exact test, as appropriate. Pairwise LD was measured by Lewontin's D' (Hedrick, 1987
), and the LD matrices were plotted using the GOLD software package (Abecasis and Cookson, 2000
). Haplotypes were reconstructed using a Bayesian statistical method implemented in phase 2.1 (Stephens et al., 2001
; Stephens and Donnelly, 2003
). To describe the haplotype diversity, we calculated the effective number of haplotypes separately for each racial group (Carlson et al., 2004
). The effective number of haplotypes, analogous to the effective number of alleles (Hartl and Clark, 1997
), was calculated as Ne = 1/
pi2, where pi is the frequency of the ith haplotype. To find DNA regions in which there was no apparent historical recombination, we first searched for haplotype blocks using pair-wise D' implemented in HaploBlockFinder (Zhang and Jin, 2003
). Then the minimum blocks were expanded by adding SNPs at the ends using the following criteria. If the calculated D' value between haplotypes contained in the minimum blocks and the SNP at the end was greater than the prespecified value (e.g., 0.8), we concluded that the added marker belonged to the same block. We proposed this haplotype block finding method because multilocus LD may not be adequately reflected by pairwise LD. Within each block, haplotype-tagging SNP (or microsatellite marker) was identified manually with help from HaploBlockFinder.
Because the distribution of total serum bilirubin was skewed to the right, a natural log transformation was done. For easier interpretation, geometric means and 95% confidence intervals are presented. One-way analysis of variance (ANOVA) was used to test whether the bilirubin level varied significantly by TA repeat or SNP genotypes, without assuming a genetic model. The coefficient of determination (R2) obtained from the ANOVA was used to indicate the predictive value. Multiple linear regression models included TA repeats and individual SNPs to examine the independent effect of SNPs after adjusting for TA repeats. Correspondingly, a partial R2 was calculated to indicate the additional predictive value beyond TA repeats. We also examined the independent effect of TA repeats, race, breast cancer status, and age on bilirubin levels using multiple linear regression models. Finally, we examined the relationship between the inferred haplotypes and bilirubin levels in the Caucasian sample using linear regression models. p < 0.05 is considered statistically significant.
| Results |
|---|
|
|
|---|
|
The allele frequencies of TA repeats and all SNPs genotyped were significantly different between Caucasian and African Americans (Table 2). Caucasians had only (TA)6 and (TA)7 repeats, whereas African Americans had broader variability in the number of TA repeats, including (TA)5 and (TA)8. Marker 6 appeared to be a unique SNP in African Americans. The MAFs of markers 13 and 15 in Caucasians and of marker 10 in African Americans were less than 5%.
Linkage Disequilibrium and Haplotype Structure of the UGT1A1 Gene. Figure 1 shows pairwise LD measured by Lewontin's D' for the two racial groups. In African Americans, there were two LD clusters: markers 1 to 5 (corresponding to the range from promoter to intron 1) and markers 9 to 15 (3'untranslated region of exon 5 and conserved regions). A historical recombination hot spot appears to exist between the two clusters. A similar LD pattern was observed in Caucasians, except that there was a long-range LD between markers 3 and 5 and markers 11 and 15.
|
|
Two haplotype blocks were observed for both Caucasians and African Americans, with one block consisting of markers 1 to 7 and the other block consisting of markers 9 to 15. Marker 8 did not fall into either block because it was in moderate LD with either block 1 or 2(D'
0.6). Although the gene was divided into two haplotype blocks, there was weak but nonignorable LD between the two blocks (D'= 0.52 in African Americans and D'= 0.45 in Caucasians). In Caucasians, the first block was defined by two major and one minor haplotype, and only two markers (TA repeats and any one of markers 1, 3, 5, and 7) were required to tag them. The second block in Caucasians was defined by four common haplotypes, requiring three haplotype-tagging SNPs. In African Americans, five common haplotypes accounted for 85% of the first block and four markers were required to tag them. The second block in African Americans consisted of five common haplotypes and required four haplotype-tagging SNPs. In both blocks, haplotypes were more diversified in African Americans than in Caucasians.
Bilirubin Level, UGT1A1 Polymorphism, and Race. Total serum bilirubin levels were available for 125 patients (89 Caucasians and 36 African Americans). In the one-way ANOVA analysis, bilirubin levels were significantly associated with TA repeats (marker 2) and several other SNPs (markers 1, 4, 5, 9, 12, 13, and 14). Figure 2A shows the percentage of the variability in serum bilirubin levels explained by each marker (R2). Specifically, the variability in bilirubin concentration explained by TA repeats was 27% (p < 0.0001). However, after adjusting for TA repeats, only marker 5 was marginally significant (p = 0.05). Figure 2B shows that 5% or less of variability in bilirubin level was explained by each SNP beyond TA repeats. The geometric means of bilirubin levels by TA repeat genotypes are presented in Table 4. Homozygous carriers of (TA)7 had the highest serum bilirubin level, followed by heterozygous carriers of (TA)6/(TA)7. Homozygous carriers of (TA)6 and heterozygous carriers of (TA)5/(TA)6 had low levels of bilirubin. There were only a few carriers of (TA)5/(TA)7 and (TA)6/(TA)8, and, thus, no inference was made for these categories. TA repeats was still a strong determinant of bilirubin levels (adjusted p < 0.0001) after adjusting for age, race, and breast cancer diagnosis. Interestingly, bilirubin levels were significantly lower (adjusted p = 0.02) in patients diagnosed with breast cancer (0.40 mg/dl) than in those without breast cancer (0.46 mg/dl).
|
|
Serum bilirubin levels were 22% higher (p = 0.005) in Caucasians (0.44 mg/dl) than in African Americans (0.36 mg/dl) (Table 4). This difference in bilirubin level was not explained by the distribution of TA repeats alone: after adjustment for TA repeats, Caucasians still had higher bilirubin level than African Americans (p = 0.018). Stratified by race, 29% of the variation in total bilirubin level was explained by TA repeats for Caucasians and 17% for African Americans (Fig. 3).
|
Of the 89 Caucasian patients with bilirubin data, 3 patients were excluded from the analysis of haplotype and bilirubin because their haplotypes were constructed with a high level of uncertainty (probability <80%). Table 5 shows the diplotype configuration (haplotype pairs) and the relationship with total serum bilirubin levels in Caucasians. Bilirubin levels varied significantly across the diplotypes in block 1 (R2 = 0.33, p < 0.0001) and the diplotypes in block 2 (R2 = 0.22, p = 0.01). However, this haplotype-phenotype correlation is mainly due to the effect of TA repeat polymorphisms. After removing the effect of TA repeat polymorphisms using linear regression models, neither block 1 (partial R2 = 0.06, p = 0.17) nor block 2 haplotypes (partial R2 = 0.14, p = 0.14) were significantly correlated with bilirubin. This is because (TA)n was one of the tagging markers for block 1 haplotypes, and it was also moderately linked with haplotypes in block 2 (Table 5). In the 36 African Americans with bilirubin data, 20 distinct diplotypes were found in block 1 and 19 diplotypes were found in block 2. Therefore, the study lacked statistical power to examine the relationship between UGT1A1 haplotypes and bilirubin levels in African Americans.
|
| Discussion |
|---|
|
|
|---|
The distribution of the number of TA repeats observed in this study is consistent with previous studies (Bosma et al., 1995
; Beutler et al., 1998
; Guillemette et al., 2001
; Innocenti et al., 2002
). (TA)5 and (TA)8 were each found in about 5% of African Americans but in none of the Caucasians. The allele frequency of (TA)7 was slightly higher in African Americans (0.375) than in Caucasians (0.323). UGT1A1 polymorphisms other than the TATA box have not been well studied, and several SNPs (M6, M8 and M10) identified in this study are novel. The allele frequencies and corresponding genotypes of these SNPs were quite different between African Americans and Caucasians. The MAFs of marker 9 (c.*211) observed in this study (0.276 for Caucasians and 0.382 for African Americans) were comparable to those in a previous study (labeled as 1813C>T), which reported a frequency of 0.220 for Caucasians and 0.350 for African Americans (Kaniwa et al., 2005
). Furthermore, with the exception of SNP M5, the MAFs are comparable to that of the CEPH and Yoruba groups from the Environmental Genome Project (2003) and HapMap Project (2006). For M5, the MAF for African Americans (0.431) is between the values from the EGP data set (Yoruba 0.292 and CEPH 0.455), which may be due to the larger sample size in this study or to an admixture of African Americans.
We described the haplotype structure of the UGT1A1 gene from its PBREM region to 3'-conserved region. We found that it was better to describe the haplotypes using two blocks for both Caucasians and African Americans with the advantage of low haplotype diversity within each block. For example, in African Americans, the effective number of haplotypes for blocks 1 and 2 was 4.6 and 5.2, respectively, less than that for the whole gene (Ne = 17.7). Sai et al. (2004
) also reported that UGT1A1 could be divided into two blocks among Japanese, and Kaniwa et al. (2005
) used the same two blocks to describe UGT1A1 genes in African Americans, Caucasians, and Japanese. The boundaries of the two blocks proposed by Sai et al. (2004
) are similar to those in this study. In our study, the first block ranges from PBREM and promoter regions to intron 2, instead of to exon 1 as in the previous study (Sai et al., 2004
). This discrepancy may be due to the arbitrary cutoff points of LD measures, as markers 6 to 7 in intron 2 were not in complete LD with markers 3 to 5 in intron 1. We believe it is better to include intron 1 in block 1 for African Americans and Caucasians as shown in Fig. 1 and Table 3. In Caucasians, marker 1 (at –1352 upstream of the transcription starting point) and TA repeats were sufficient to tag the common haplotypes in block 1, C7 0.323, A6 0.569, and C6 0.107. Interestingly, these haplotype frequencies were similar to those tagged by markers –3279T>G and TA repeats for Caucasians in previous studies, G7 0.36–0.39, T6 0.45–0.53, and G6 0.09–0 (Innocenti et al., 2002
; Kaniwa et al., 2005
), suggesting that these three haplotypes are the same although different markers were used. For African Americans, the haplotypes tagged by marker 1 and TA repeats in this study were also comparable to that tagged using markers –3279T>G and TA repeats in previous studies (Innocenti et al., 2002
; Kaniwa et al., 2005
). However, the haplotype diversity within block 1 was greater in African Americans than in Caucasians and thus more tagging markers are required (Table 3). Similarly, the haplotype diversity within block 2 was also greater in African Americans than in Caucasians. In this study, the haplotype block 2 extended from the 3' untranslated region of exon 5 to the conserved region. To our knowledge, no previous study had examined this. These ethnic differences in SNP and haplotype distribution suggest that the presence and strength of the association between UGT1A1 genotypes and phenotypes including drug toxicities and cancer susceptibility need confirmation in independent populations.
Consistent with a previous study conducted in Caucasians (Bosma et al., 1995
), we showed that TA repeats were strongly correlated with total bilirubin within the normal range, accounting for 29% of variations among Caucasians and 17% among African Americans. It is conceivable that the observed differences between the two groups is due to the diversity in the African Americans and the relatively small sample of African Americans patients with serum total bilirubin levels available for study. Nonetheless, our observation that serum total bilirubin levels were also lower among African American is consistent with results of previous population-based studies (Zucker et al., 2004
). A TA repeat polymorphism was also shown to be responsible for the variation in bilirubin in Asian individuals with a similar R2 (Ki et al., 2003
; Sai et al., 2004
). In addition, an Asian-specific variant, 211G>A (*6, G71R) in exon 1 has been associated with variations in bilirubin (Huang et al., 2000
; Ki et al., 2003
; Sai et al., 2004
). A study conducted in Japanese found that 1813C>T in the 3' untranslated region of exon 5 was weakly but significantly associated with bilirubin level after adjusting for TA repeats (p = 0.03) (Sai et al., 2004
). In this study, we showed that the same SNP, M9, was not significant after adjusting for TA repeats (p = 0.11). Among Caucasians, we did not find another variant that was associated with bilirubin conditional on TA repeats, as shown in both the genotype analysis (Fig. 2) and haplotype analysis (Table 5). Our sample size for African Americans was relatively small; thus, we cannot rule out that there may be other UGT1A1 variants accounting for the variation in bilirubin in African Americans. This possibility calls for further investigation of bilirubin levels in a larger study among African Americans as bilirubin can serve as an antioxidant and therefore may be a surrogate biomarker for risk.
A review of the UGT literature suggests that bilirubin is selectively glucuronidated by UGT1A1 (Tukey and Strassburg, 2000
). A linkage analysis of pedigree data from the Framingham Study estimated the heritability of serum bilirubin to be 49% ± 6%, and the peak multipoint logarithm of odds score is located 1 centimorgan away from the UGT1A1 gene. The authors concluded that UGT1A1 may be the major gene controlling serum bilirubin (Lin et al., 2003
), but almost all subjects in their study were Caucasians. Potentially, there are other factors such as other isoforms of UGT or other genes that moderate the variation in serum bilirubin because bilirubin levels also depend on the rate of bilirubin production, transportation, and elimination. The presence of Gilbert alleles is not necessarily linked to hyperbilirubinemia (Kadakol et al., 2000
). In addition, UGT genes in mice are regulated by factors such as CAAT-enhancer binding protein (C/EBP
) (Lee et al., 1997
). Recent studies suggested that these factors may influence human hepatic enzymes such as UGT2B7 (Ishii et al., 2000
). Thus, these factors may play a role by enhancing or suppressing UGT activity and thus bilirubin in humans.
With respect to the racial difference in bilirubin levels, we observed the paradox as pointed out previously (Beutler et al., 1998
): African Americans had lower bilirubin levels but had higher frequencies of (TA)7 and (TA)8 than Caucasians. In each stratum of TA repeat genotypes, (TA)6/(TA)6, (TA)6/(TA)7, and (TA)7/(TA)7, African Americans still had lower bilirubin levels than Caucasians. It has been documented that African Americans have lower hemoglobin levels than Caucasians (Perry et al., 1992
), and the difference in hemoglobin existed after excluding subjects with iron-deficient erythropoiesis and
-thalassemia mutations (Beutler and West, 2005
). Other genes such as G6PD and OATP2 or environmental factors may account for this racial difference. For example, OATP-C has been correlated with conjugated bilirubin in a pilot study of 23 patients (Ieiri et al., 2004
) This finding warrants further study because same total serum bilirubin levels in African American and Caucasian patients who are treated with glucuronidated drugs may indicate different toxicity profiles.
In conclusion, in this study we reconstructed the haplotypes of the UGT1A1 gene spanning from the transcription-regulating region to the 3'-conserved region with (TA)n repeats and 14 SNPs and showed wider variation of UGT1A1 haplotypes in African-American women than in Caucasian women. The racial differences in haplotype distribution may be responsible for different toxicity profiles for African-American and Caucasian patients treated with glucuronidated drugs and cancer susceptibility, suggesting that more than (TA) n profile is needed in the African American population. Regarding functions of UGT1A1 polymorphisms, it is clear that the number of TA repeats is an important factor for total bilirubin variation in the Caucasian population, and there may exist other variants related to bilirubin within the UGT1A1 gene, but these variants are less likely to be clinically significant. In African Americans, the role of TA repeats remains unclear but further studies with larger sample sizes are necessary to elucidate the function of other variants and haplotypes. Future studies are planned to examine the UGT1A1 gene with a larger sample of African Americans as well as a sample of Nigerians.
| Acknowledgments |
|---|
| Footnotes |
|---|
A.L.H., D.H., and H.-J.K. contributed equally to this work.
Article, publication date, and citation information can be found at http://dmd.aspetjournals.org.
ABBREVIATIONS: UGT, UDP-glucuronosyltransferase; PBREM, phenobarbital-responsive enhancer module; SNP, single nucleotide polymorphism; LD, linkage disequilibrium; tSNP, tagging SNP; MAF, minor allele frequency; SBE, single base extension; HWE, Hardy-Weinberg equilibrium; ANOVA, analysis of variance.
Address correspondence to: Dr. Olufunmilayo I. Olopade, Department of Medicine, Section of Hematology/Oncology, University of Chicago, 5841 S. Maryland Ave., MC 2115, Chicago, IL 60637. E-mail: folopade{at}medicine.bsd.uchicago.edu
| References |
|---|
|
|
|---|
-thalassemia on hemoglobin levels and mean corpuscular volume. Blood 106: 740–745.
-mediated activation of the human UDP glucuronosyltransferase 2B7 promoter. Mol Pharmacol 57: 940–947.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||