![]() |
|
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Departments of Pharmacology & Toxicology and Medicine, James Graham Brown Cancer Center, University of Louisville School of Medicine, Louisville, Kentucky
(Received January 31, 2007; accepted March 15, 2007)
| Abstract |
|---|
|
|
|---|
Expression of human NAT1 and NAT2 is widely distributed throughout body tissues (Barker et al., 2006
; Husain et al., 2007
). Although many xenobiotic NAT substrates have been discovered, only one potential endogenous substrate, p-aminobenzoylglutamate, has been discovered (Minchin, 1995
). Therefore, it is highly probable that other, yet unknown endogenous NAT substrates and physiological roles exist. Discovery of potential substrates and useful inhibitors is limited by our knowledge of mammalian NAT enzyme structure.
To date, no publication to our knowledge has described the crystal structure of mammalian NATs. For this reason, investigators have used homology modeling techniques to gain clues about the key structural characteristics of mammalian NATs, including the location and potential mechanism(s) of single-nucleotide polymorphism-induced changes and the shape of the active site for computational docking of NAT substrates (Rodrigues-Lima et al., 2001
; Rodrigues-Lima and Dupret, 2002
; Kawamura et al., 2005
; Lau et al., 2006
). Quality mammalian NAT structures could be used to rationalize experimental data and screen for potential substrates and inhibitors, potentially pointing to an endogenous physiological role. There are limitations to homology modeling techniques, however, because they sometimes rely on incomplete structural template information.
Bacterial NAT crystal structures were used as templates to create the aforementioned mammalian NAT homology models. Bacterial NAT crystal structures have been solved for Salmonella typhimurium, Pseudomonas aeruginosa, Mycobacterium smegmatis, and Mesorhizobium loti, and can be found with Protein Data Bank accession numbers 1E2T, 1W4T, 1GX3, and 2BSZ, respectively (Sinclair et al., 2000
; Sandy et al., 2002
; Holton et al., 2005
; Westwood et al., 2005
). Alignment scores between bacterial and mammalian NAT protein sequences, and among the bacterial NAT sequences, are relatively low (
30%). Aligning mammalian and bacterial NAT protein sequences reveals important differences between the bacterial and mammalian NAT protein structures that could limit the utility of the mammalian NAT homology models.
|
It is unknown whether this insert has defined secondary structure and is integrated into the second domain beta barrel, or whether it takes the form of an unstructured loop or coil. In the latter case, a loop adjacent to the active site pocket could affect the size, shape, and/or accessibility of the active site pocket. In addition, since the location of the loop in a homology model is determined by the placement of the insert in the alignment, as is demonstrated by the differences in previously reported alignments (Payton et al., 2001
; Rodrigues-Lima et al., 2001
; Savulescu et al., 2005
; Lau et al., 2006
), it is possible that a slightly different or more optimal alignment could place the insert in a different location.
Our homology modeling analyses yielded two different, and apparently equally valid, alignments that placed this insert in two different locations. We used secondary structure prediction programs and designed a series of human NAT2 deletion mutants to positively define the insert location and better understand its structure and function.
| Materials and Methods |
|---|
|
|
|---|
Secondary Structure Prediction. N-Acetyltransferase protein sequences from S. typhimurium, P. aeruginosa, M. loti, and human NAT2 were evaluated using 12 secondary structure prediction programs: APSSP2, PROF king, PROF sec, PORTER, JPRED, JUFO, SCRATCH, SABLE1, SABLE2, SAM-T99, PSIpred, and YASPIN (McClelland and Rumelhart, 1988
; Kneller et al., 1990
; Rost, 1996
; Karplus et al., 1998
; Cuff and Barton, 2000
; McGuffin et al., 2000
; Ouali and King, 2000
; Meiler et al., 2001
; Raghava, 2002
; Adamczak et al., 2005
; Cheng et al., 2005
; Lin et al., 2005
; Pollastri and McLysaght, 2005
). Secondary structure predictions were then aligned based on Modeler or 3DCoffee protein sequence alignments.
Human NAT2 Mutation. A 17-residue segment was deleted from human NAT2 at 157-173 or 167-183, based on Modeler and 3DCoffee alignment results, respectively. Residues immediately surrounding the deletion site were replaced by beta turn-facilitating residues to prevent disruption of the beta barrel secondary structure. Eight different deletion mutants were created. For the Modeler-based deletion, turns QEGST, PHAG, ANDH, and SDGSD were inserted on either side of the deletion in four different deletion mutants. For the 3DCoffee-based deletion, turns HHEH, GRSG, GG, and GGSG were inserted on either side of the deletion in four different deletion mutants. In both sets of mutants, the first three turns match turn residues found in the same location in structures 1E2T, 1W4T, and 2BSZ, respectively (Fig. 1), and the fourth turn was made entirely of "turn-promoting" residues (Paul and Rosenbusch, 1985
). Codons chosen for the turn residues were based on the codon usage table for Homo sapiens (http://www.kazusa.or.jp/codon).
The human NAT2 nucleotide sequence was mutated by overlap extension (Horton et al., 1989
). Human NAT2 coding region sequences on either side of the targeted deletion were amplified independently by polymerase chain reaction (PCR) from a prokaryotic expression vector pKK223-3 containing human NAT2 (Ferguson et al., 1994
), using plasmid-specific and human NAT2-specific primers (Table 1). Each set of NAT2-specific primers contained complementary nucleotide sequences coding for the replacement turn residues. For each mutant, the two separate PCR products were combined, denatured at 98°C, and the complementary sequences allowed to anneal at 30°C, followed by overlap extension using nested forward and reverse human NAT2-specific PCR primers. The forward and reverse nested primers introduced restriction sites EcoRI and HindIII, respectively. Independent duplicates of each mutant were cloned and expressed in parallel as cloning and expression controls.
|
A restriction fragment length analysis using restriction enzymes KpnI and ApoI was used to verify the successful removal of the 17-residue segment in the Modeler-based and 3DCoffee-based deletion mutants, respectively, before cloning. Successful deletion of the Modeler-based insert changed the KpnI digestion pattern from one to zero cuts. Successful deletion of the 3DCoffee-based insert changed the ApoI digestion pattern from two to one cut. Digestion products were resolved on a 1% agarose gel.
Cloning, Sequencing, and Expression. Independent duplicates of the reference human NAT2 coding region were cloned and expressed in parallel with the independent duplicates of each mutant. All were directionally cloned into pKK223-3 bacterial expression vector (Pharmacia-LKB Biotechnology, Piscataway, NJ) using EcoRI and HindIII restriction sites as described previously (Doll and Hein, 1995
).
The expected nucleotide sequence was confirmed by automated sequencing of sense and antisense strands as described previously (Walraven et al., 2006
) using human NAT2-specific sequencing primers (Table 1), before transformation and expression in JM105 strain Escherichia coli as described previously (Chung and Miller, 1988
; Chung et al., 1989
; Doll and Hein, 1995
). Bacteria were lysed by sonication in homogenization buffer as described previously (Doll and Hein 1995
), and total bacterial lysate protein concentration was determined for each expression (Bradford, 1976
).
Enzyme Activity and Stability. As a measure of NAT2 enzyme activity, sulfamethazine (SMZ) N-acetylation assays were performed by incubating bacterial lysate, 1 mM acetyl-coenzyme A, and 300 µM SMZ at 37°C for 10 min under linear rate conditions, and the reaction was stopped by adding 1/10 volume of 1 M acetic acid. Control reactions substituted water for acetyl-coenzyme A. Acetylated SMZ product was separated and measured using a Beckman Coulter System Gold high-performance liquid chromatography system (Beckman Coulter, Fullerton, CA) as described previously (Leff et al., 1999
). Sulfamethazine is a NAT2-selective substrate (Grant et al., 1991
). Thermostability was measured by incubating the bacterial lysate (1 mg/ml) at 30 -50°C for 10 min, immediately followed by assays for SMZ N-acetylation as described above. Results were normalized to activity of untreated lysate to give the percentage of activity remaining.
|
| Results |
|---|
|
|
|---|
The NAT2 homology models were superimposable with the bacterial template structures, with the Cys68-His107-Asp122 catalytic triad positioned as expected in the active site pocket. Due to a lack of structural template information, the 17-residue insert appeared as loops in the second domain of human NAT2. In the models, the Modeler alignment-based loop was located one beta strand away from the 3DCoffee-based loop and away from the active site, whereas the 3DCoffee alignment-based loop was adjacent to the active site pocket (Fig. 2).
Because these homology models were not used for detailed evaluation of specific residues or for substrate docking, an analysis of the structural quality was not necessary. Such detailed evaluations would require molecular dynamics simulations for structural optimization. The Modeler program generates models of sufficient quality for the immediate purposes of this study.
Secondary Structure Prediction. Bacterial NAT secondary structure predictions were very similar to their actual secondary structures as defined on the Research Collaboratory for Structural Bioinformatics (RCSB) Protein Data Bank website (www.rcsb.org), thus providing confidence that the human NAT2 predictions also were accurate. The predicted bacterial NAT and human NAT secondary structures were strikingly similar, with nearly all major secondary structures shared (Fig. 3). The individual prediction programs gave slightly different variations for each sequence, but the prediction programs were overall very consistent.
|
When the bacterial NAT and human NAT2 predictions were aligned based on the Modeler alignment, the 17-residue human NAT2 insert was located in the same place as a predicted beta strand. In addition, a bacterial NAT beta strand on the C-terminal side of the insert did not match up with a human NAT2 beta strand. This apparent mismatch was corrected when the prediction results were aligned based on the 3DCoffee alignment. In this case, the 17-residue insert did not have any predicted secondary structure and the surrounding predicted secondary structures matched with the bacterial NAT secondary structures.
Another important observation was made regarding the C-terminal tail. The bacterial NAT C-terminal helix found in all four known bacterial NAT crystal structures was not predicted for the human NAT2 sequence by any of the prediction programs.
Human NAT2 Deletion Mutants. Based on restriction fragment length analyses and complete sequencing results, the eight different human NAT2 deletion mutants were successfully constructed and cloned into bacterial expression vector pKK223-3. Duplicate clones of the reference and deletion mutant human NAT2 provided highly reproducible results. This validates the cloning process and expression system used in this study as highly consistent, as demonstrated previously (Walraven et al., 2006
).
Enzyme activity results (Fig. 4) clearly demonstrated that deletion of the Modeler alignment insert resulted in complete loss of enzyme activity, regardless of the turn used to replace the residues surrounding the deletion. On the other hand, all of the 3DCoffee deletion mutants had measurable N-acetylation activity. The activities for these mutants were highly variable, with one turn variant exhibiting SMZ NAT activity nearly equivalent to the reference activity.
|
|
Deleting the 17-residue insert resulted in decreased activity due, at least in part, to reduced stability. The choice of turn residues was apparently also critical to the mutant enzyme's stability. The mutant with the highest activity was also the mutant with the shortest turn substituted in for the segment, which was composed entirely of glycines (GG).
| Discussion |
|---|
|
|
|---|
Previous studies have generated alignments that placed the insert in different locations. Lau et al. (2006
) placed the insert residues at either 172-188 or 170-186 (in the alignment, QP is repeated on both sides of the insert), which is modeled as a loop in the second domain, adjacent to the active site pocket. The alignment published by Savulescu et al. (2005
) places the insert in the same location, and Kawamura et al. (2005
) reported models with a loop also in the second domain, adjacent to the active site. These three studies placed the loop close to the 3DCoffee-based insert (167-183) described in our study. Payton et al. (2001
) published an alignment with the insert located at 156-172, which is nearly identical to the Modeler-based insert (157-173) described in our study.
The results of the present study clearly identify the correct insert location and confirm that the insert is a loop. The loop is not integrated into the secondary structure of the protein and has no defined secondary structure. Because bacterial NATs function well without an insert, it is conceivable that deleting the loop in a way that facilitated the folding of the beta barrel at the deletion site could result in active protein, probably with slightly altered characteristics. On the other hand, it is much less likely that deleting the wrong insert location would yield an active protein, since it would remove a significant portion of the surrounding beta barrel secondary structure in the second domain. If the insert were not a loop, but were integrated into the secondary structure of the protein, both deletions would significantly alter the fold of the protein or prevent it from folding altogether.
When the 3DCoffee-assigned insert (167-183) was deleted, the enzyme remained active, although with altered characteristics. On the other hand, deletion of the Modeler-assigned insert (157-173) resulted in undetectable activity. These results support the conclusion that residues 167-183 are not only the correct insert location, but they form a loop that does not play a critical role in the integrity of the beta barrel secondary structure. Deleting residues 157-173 probably removed a portion of the second domain beta barrel and altered the folding and/or function of the protein. For the purposes of this study, it is not necessary to know whether the loss of activity was due to misfolding of the protein, altered protein function, or a combination of both.
Secondary structure predictions supported and confirmed these conclusions. According to the alignments in Fig. 3, deletion of the Modeler-assigned insert resulted also in the deletion of a beta strand in the second domain beta barrel. This is consistent with the loss of activity seen for the Modeler alignment-based mutants. In addition, no secondary structure was predicted for the 3DCoffee-assigned insert, which is consistent with the conclusion that the insert must be a loop or coil with no defined secondary structure, and not integrated into the surrounding beta barrel. These results are also consistent with our hypothesis that the mammalian NAT proteins can function without the loop.
The secondary structure prediction that the bacterial NAT C-terminal helix is not shared by human NAT2 suggests that the C-terminal tail is free to interact with the active site. The C-terminal helix is likewise not predicted for human NAT1 or any of the rat Nats (unpublished data). This would be consistent with previous reports that the 10 C-terminal residues are critical for control of acetyl-coenzyme A hydrolysis (Mushtaq et al., 2002
). Because acetyl-coenzyme A hydrolysis takes place in the active site, it is reasonable to assume that the C-terminal tail must interact with the active site pocket.
Although the loop is not required for NAT catalytic activity, the reduced stability of loop-less human NAT2 suggests that this loop may serve a role in stabilizing the protein's structure. How the loop might accomplish this task is beyond the scope of homology modeling. Our models suggest that the loop could also play a role in active site access since it is adjacent to the active site and flexible enough to extend over the bacterial NAT-like active site cavity. Combined with the possibility that the C-terminal tail interacts with the active site, the active site pocket could look much different from its representation in current homology models.
After our experiments were completed, the crystal structure for a mutant human NAT1 protein was deposited into the RCSB Protein Data Bank under the code 2IJA. Although the publication describing this structure has not yet been released, it substantiates the findings of our human NAT2 study. The overall structure of our NAT2 model compares favorably to 2IJA, with a root mean square deviation of 1.4 Å between the two structures. In addition, the NAT2 loop location as determined by our study (167-183) is nearly identical to the 21JA loop residues (165-185). Two structural differences between 2IJA and our model are the association of the loop with a beta strand in domain three, and the insertion of the C-terminal tail adjacent to the active site pocket. These interesting structural features have a dramatic impact on the size and shape of the active site pocket and rationalize the influence of the C-terminal tail on acetyl-coenzyme A hydrolysis. These two major differences also support our study's finding that the loop may have a stabilizing effect on the protein's structure and make sense of the predicted lack of secondary structure in the C-terminal NAT2 sequence. More information is needed about the flexibility of these important structural features in solution.
This study demonstrates the strength of combining computational and experimental approaches for determining protein structure and function. Based on the initial success of mammalian crystallization attempts (2IJA), it might be expected that the results of this study will be confirmed by the eventual crystallization of human NAT2. This study identified the location and potential function of the mammalian NAT loop and contributes to our understanding of arylamine N-acetyltransferase structure and function.
| Acknowledgments |
|---|
| Footnotes |
|---|
This work was partially supported by United States Public Health Service Grant CA34627 from the National Cancer Institute and Training Grant ES011564 from the National Institute of Environmental Health Sciences. This work constitutes partial fulfillment by Jason Walraven of the Ph.D. in Pharmacology and Toxicology at the University of Louisville School of Medicine.
ABBREVIATIONS: NAT, arylamine N-acetyltransferase; SMZ, sulfamethazine; PCR, polymerase chain reaction; RCSB, Research Collaboratory for Structural Bioinformatics.
The online version of this article (available at http://dmd.aspetjournals.org) contains supplemental material. ![]()
Address correspondence to: Dr. David W. Hein, Department of Pharmacology and Toxicology, University of Louisville School of Medicine, Louisville, KY 40292. E-mail: d.hein{at}louisville.edu
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
H. Wu, L. Dombrovsky, W. Tempel, F. Martin, P. Loppnau, G. H. Goodfellow, D. M. Grant, and A. N. Plotnikov Structural Basis of Substrate-binding Specificity of Human Arylamine N-Acetyltransferases J. Biol. Chem., October 12, 2007; 282(41): 30189 - 30197. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||