Although most of the polymorphism described in the Guide is related to coding regions of genes, there are many well-known examples that affect gene regulation and splicing. One of the best described is related to persistence of lactase (LCT) expression in adults associated with milk tolerance. Many mutations affecting splicing are known for the globin genes. Several classes of polymorphism that directly or indirectly affect coding sequences are presented in this section.
The ABO locus, controlling the main blood groups (see Hematopoiesis and Erythrocytes), is a case not only of frequent polymorphism with a common null allele but of an enzyme where the natural variants provide extensive functional information.
Several common disease loci are associated with resistance to infection. Among these are G6PD (glucose-6-phosphate dehydrogenase; also isoform), CFTR (cystic fibrosis conductance regulator), and HBB (hemoglobin β). The alterations in iron storage seen with mutations in HFE (hemochromatosis; also many isoforms) might impact bacterial infections. Mutant alleles of MEFV (pyrin) might represent selection for enhanced response to certain pathogens. Also of note is the resistance to HIV associated with mutation at CCR5 (also alt mRNA).
Mutations of filaggrin (FLG) affect the granule layer of the skin and are common in certain populations. As shown in the following dot plot (word size 3), most of this relatively large protein consists of a large number of repeating regions that yield mature filaggrin after proteolytic cleavage. The reference sequence has 11 repeats and some variation in repeat number is known. Because the individual repeats have function, mutations are often frameshifts or produce nonsense codons.
Polymorphism in the N-acetyltransferases has been studied intensively because of the role of the enzymes in the metabolism of xenobiotics. A number of the cytochrome P450 enzymes have been studied for similar reasons.
Variation in the number of DAZ genes on the Y chromosome (see Testes and Sperm) has been investigated because of the association of variations at this locus with fertility. Variation in gene content is also seen for the receptors on NK cells mentioned above.
Human mutations are known in many proteins involved in the biogenesis of lysosome-related organelles. The text describes how mouse mutants supplement the information derived from known human variation.
Extraordinary levels of polymorphism are seen with the LPA gene (see Lipoproteins). In this case, the variation derives from the number of copies of a 114-aa repeating unit that makes up most of the protein. The reference allele has 16 copies of the repeat.
New mutations are frequently seen at the very large locus encoding DMD (dystrophin, with many alt mRNAs and isoforms). Alterations in many of the largest genes in the genome are observed in tumors.
DMD and HFE have many alternate products that can be accessed via the links in this section.
See also the additional reading for this chapter.
Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.