Assessing the relative importance of the biophysical properties of amino acid substitutions associated with human genetic disease

Hum Mutat. 2002 Aug;20(2):98-109. doi: 10.1002/humu.10095.

Abstract

The inclusion of a mutation in a pathology-based database such as the Human Gene Mutation Database (HGMD) is a two-stage process: first, the mutation must occur at the DNA level, then it must cause a clinically detectable disease state. The likelihood of the latter step, termed the relative clinical observation likelihood (RCOL), can be regarded as a function of the structural/functional consequences of a mutation at the protein level. Following this paradigm, we modeled in silico all amino acid replacements that could potentially have arisen from an inherited single base pair substitution in five human genes encoding arylsulphatase A (ARSA), antithrombin III (SERPINC1), protein C (PROC), phenylalanine hydroxylase (PAH), and transthyretin (TTR). These proteins were chosen on the basis of 1) the availability of a crystallographic structure, and 2) a sufficiently large number of amino acid replacements being logged in HGMD. A total of 9,795 possible mutant structures were modeled and 20 different biophysical parameters assessed. Together with the HGMD-derived spectra of clinically detected mutations, these data allowed maximum likelihood estimation of RCOL profiles for the 20 parameters studied. Nine parameters (including energy difference between wild-type and mutant structures, accessibility of the mutated residue, and distance from the binding/active site) exhibited statistically significant variability in their RCOL profiles, indicating that mutation-associated changes affected protein function. As yet, however, a biological meaning could only be attributed to the RCOL profiles of solvent accessibility and, for three proteins, local energy change, disturbed geometry, and distance from the active center. The limited ability of the biophysical properties of mutations to explain clinical consequences is probably due to our current lack of understanding as to which amino acid residues are critical for protein folding. However, since the proteins examined here were unrelated, and our findings consistent, it may nevertheless prove possible to extrapolate to other proteins whose dysfunction underlies inherited disease.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Substitution / genetics*
  • Amyloid Neuropathies, Familial / genetics
  • Amyloid Neuropathies, Familial / metabolism
  • Antithrombin III / chemistry
  • Antithrombin III / genetics
  • Antithrombin III / physiology
  • Antithrombin III Deficiency / genetics
  • Antithrombin III Deficiency / metabolism
  • Arylsulfatases / chemistry
  • Arylsulfatases / deficiency
  • Arylsulfatases / genetics
  • Arylsulfatases / physiology
  • Biophysics / methods*
  • Databases, Genetic
  • Databases, Protein
  • Genetic Diseases, Inborn / enzymology
  • Genetic Diseases, Inborn / genetics*
  • Genetic Diseases, Inborn / metabolism
  • Genotype
  • Humans
  • Leukodystrophy, Metachromatic / enzymology
  • Leukodystrophy, Metachromatic / genetics
  • Models, Genetic
  • Mutation, Missense / genetics
  • Phenotype
  • Phenylalanine Hydroxylase / chemistry
  • Phenylalanine Hydroxylase / deficiency
  • Phenylalanine Hydroxylase / genetics
  • Phenylalanine Hydroxylase / physiology
  • Phenylketonurias / enzymology
  • Phenylketonurias / genetics
  • Prealbumin / chemistry
  • Prealbumin / deficiency
  • Prealbumin / genetics
  • Prealbumin / physiology
  • Protein C / chemistry
  • Protein C / genetics
  • Protein C / physiology
  • Protein C Deficiency / genetics
  • Protein C Deficiency / metabolism
  • Protein Structure, Secondary / genetics

Substances

  • Prealbumin
  • Protein C
  • Antithrombin III
  • Phenylalanine Hydroxylase
  • Arylsulfatases