Mapping the genetic architecture of gene expression in human liver

PLoS Biol. 2008 May 6;6(5):e107. doi: 10.1371/journal.pbio.0060107.

Abstract

Genetic variants that are associated with common human diseases do not lead directly to disease, but instead act on intermediate, molecular phenotypes that in turn induce changes in higher-order disease traits. Therefore, identifying the molecular phenotypes that vary in response to changes in DNA and that also associate with changes in disease traits has the potential to provide the functional information required to not only identify and validate the susceptibility genes that are directly affected by changes in DNA, but also to understand the molecular networks in which such genes operate and how changes in these networks lead to changes in disease traits. Toward that end, we profiled more than 39,000 transcripts and we genotyped 782,476 unique single nucleotide polymorphisms (SNPs) in more than 400 human liver samples to characterize the genetic architecture of gene expression in the human liver, a metabolically active tissue that is important in a number of common human diseases, including obesity, diabetes, and atherosclerosis. This genome-wide association study of gene expression resulted in the detection of more than 6,000 associations between SNP genotypes and liver gene expression traits, where many of the corresponding genes identified have already been implicated in a number of human diseases. The utility of these data for elucidating the causes of common human diseases is demonstrated by integrating them with genotypic and expression data from other human and mouse populations. This provides much-needed functional support for the candidate susceptibility genes being identified at a growing number of genetic loci that have been identified as key drivers of disease from genome-wide association studies of disease. By using an integrative genomics approach, we highlight how the gene RPS26 and not ERBB3 is supported by our data as the most likely susceptibility gene for a novel type 1 diabetes locus recently identified in a large-scale, genome-wide association study. We also identify SORT1 and CELSR2 as candidate susceptibility genes for a locus recently associated with coronary artery disease and plasma low-density lipoprotein cholesterol levels in the process.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Aged, 80 and over
  • Animals
  • Child
  • Child, Preschool
  • Cholesterol, LDL / blood
  • Cholesterol, LDL / genetics
  • Coronary Artery Disease / genetics
  • Diabetes Mellitus, Type 1 / genetics
  • Female
  • Gene Expression Profiling*
  • Genes, MHC Class II / genetics
  • Genetic Predisposition to Disease / genetics*
  • Genome, Human
  • Genotype
  • Humans
  • Infant
  • Liver / metabolism*
  • Male
  • Mice
  • Middle Aged
  • Oligonucleotide Array Sequence Analysis
  • Polymorphism, Single Nucleotide / genetics*
  • Quantitative Trait Loci / genetics
  • RNA, Messenger / analysis
  • RNA, Messenger / genetics
  • Transcription, Genetic / genetics*

Substances

  • Cholesterol, LDL
  • RNA, Messenger