MAKING SNPs MAKE SENSE


Scientists are identifying, cataloging and studying small genetic variations among humans that will lead to more specialized and effective medical treatments. What do these variations look like, and what exactly makes them informative?

Researchers look for one or more single nucleotide polymorphisms (SNPs; pronounced "snips"). SNPs are single-nucleotide substitutions of one base for another that occur in more than one percent of the general population.


The challenge for scientists is to identify SNPs that correlate with a particular effect in patients. Reliable SNPs could serve as predictive markers that inform our decisions about numerous aspects of medical care, including specific diseases, effectiveness of various drugs and adverse reactions to specific drugs.

This pharmacogenetic approach could save time, money, and discomfort for millions of patients through accurate diagnoses and matching patients with appropriate medicines.

Finding SNPs in the human genome

How do scientists identify SNPs?

Scientists approach the problem of identifying, cataloging and characterizing SNPs in two main ways:

  • Genomic approaches. This approach is used by scientists who want to see the big picture. Several large-scale projects have combined the efforts of many institutions to identify and catalog all of the SNPs in the 3-billion-base pair human genome. Each project involves hundreds of scientists, who compare the genomes of numerous individuals to identify the differences. These comparisons require a lot of computer-powered data analysis. As they work, scientists sort and catalog their results in databases that are available to anyone over the Internet, including other scientists and you. (See Additional Resources for links to these projects.)

  • Functional approaches. This approach is used by scientists who are interested in a particular disease or drug response. The biological processes involved in diseases and drug responses are controlled by the activities of many genes. Scientists interested in a particular process select genes known to be involved in the process and examine them in people who have a response or disease, as well as those who don't. By comparing people's DNA sequences, scientists can identify SNPs that correspond with a particular function or response.

SNP Quick Reference

SNP

SNP (pronounced "snip") stands for Single Nucleotide Polymorphism. SNPs are single-nucleotide substitutions of one base for another. Each SNP location in the genome can have up to four versions: one for each nucleotide, A, C, G and T. A SNP and its distribution in a population might look like the image at the left.

Not all single-nucleotide changes are SNPs, though. To be classified as a SNP, two or more versions of a sequence must each be present in at least one percent of the general population.

SNPs occur throughout the human genome - about one in every 300 nucleotide base pairs. This translates to about 10 million SNPs within the 3-billion-nucleotide human genome.


SNPs and disease-causing mutations: Not the same!

Differences in SNPs and disease-causing mutations

If you know what a point mutation is, then the description of a SNP might sound similar. True, both are single-nucleotide differences in a DNA sequence, but SNPs should not be confused with disease-causing mutations. The image to the left shows some tell-tale differences:

First, to be classified as a SNP, the change must be present in at least one percent of the general population. No known disease-causing mutation is this common.

Second, most disease-causing mutations occur within a gene's coding or regulatory regions and affect the function of the protein encoded by the gene. Unlike mutations, SNPs are not necessarily located within genes, and they do not always affect the way a protein functions. SNPs are divided into two main categories:

Linked SNPs (also called indicative SNPs) do not reside within genes and do not affect protein function. Nevertheless, they do correspond to a particular drug response or to the risk for getting a certain disease.

Causative SNPs affect the way a protein functions, correlating with a disease or influencing a person's response to medication. Causative SNPs come in two forms:

Coding SNPs, located within the coding region of a gene, change the amino acid sequence of the gene's protein product.

Non-coding SNPs, located within the gene's regulatory sequences, change the level of gene expression and, therefore, how much RNA and protein is produced.


NCRR/SEPA

Supported by a Science Education Partnership Award (SEPA) [No. 1 R25 RR16291-01] from the National Center for Research Resources, a component of the National Institutes of Health, Department of Health and Human Services. The contents provided here are solely the responsibility of the authors and do not necessarily represent the official views of NCRR or NIH.