Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Letter
  • Published:

Reliable identification of large numbers of candidate SNPs from public EST data

Abstract

High-resolution genetic analysis of the human genome promises to provide insight into common disease susceptibility. To perform such analysis will require a collection of high-throughput, high-density analysis reagents. We have developed a polymorphism detection system that uses public-domain sequence data. This detection system is called the single nucleotide polymorphism pipeline (SNPpipeline). The analytic core of the SNPpipeline is composed of three components: PHRED, PHRAP and DEMIGLACE. PHRED and PHRAP are components of a sequence analysis suite developed to perform the semi-automated analysis required for large-scale genomes1,2 (provided courtesy of P. Green). Using these informatics tools, which examine redundant raw expressed sequence tag (EST) data, we have identified more than 3,000 candidate single-nucleotide polymorphisms (SNPs). Empiric validation studies of a set of 192 candidates indicate that 82% identify variation in a sample of ten Centre d'Etudes Polymorphism Humain (CEPH) individuals. Our results suggest that existing sequence resources may serve as a valuable source for identifying genetic variation.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1: RFLP confirmation of candidate SNP in UNIGENE set Hs.54515.
Figure 2: Candidate SNP within UNIGENE set Hs.83816.

Similar content being viewed by others

References

  1. Ewing, B., Hillier, L., Wendl, M.C. & Green, P. Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 8, 175–185 ( 1998).

    Article  CAS  Google Scholar 

  2. Ewing, B. & Green, P. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 8, 186–194 (1998).

    Article  CAS  Google Scholar 

  3. Schuler, G.D. Pieces of the puzzle: expressed sequence tags and the catalog of human genes. J. Mol. Med. 75, 694–698 (1997).

    Article  CAS  Google Scholar 

  4. Hillier, L. et al. Generation and analysis of 280,000 human expressed sequence tags. Genome Res. 6, 807– 828 (1996).

    Article  CAS  Google Scholar 

  5. Wang, D.G. et al. Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. Science 280, 1077–1082 (1998).

    Article  CAS  Google Scholar 

  6. Murray, J.C. et al. A comprehensive human linkage map with centimorgan density. Cooperative Human Linkage Center (CHLC). Science 265 , 2049–2054 (1994).

    Article  CAS  Google Scholar 

  7. Jin, L. & Nei, M. Limitations of the evolutionary parsimony method of phylogenetic analysis. Mol. Biol. Evol. 7 , 82–102 (1990).

    CAS  Google Scholar 

  8. Sokal, R.R. & Sneath, P.H.A. Principles of Numerical Taxonomy (W.H. Freeman, San Francisco, 1963).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kenneth H. Buetow.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Buetow, K., Edmonson, M. & Cassidy, A. Reliable identification of large numbers of candidate SNPs from public EST data. Nat Genet 21, 323–325 (1999). https://doi.org/10.1038/6851

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1038/6851

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing