Human/chimp DNA similarity continues to decrease: counting indels
It is conventionally held that humans and chimps differ only very slightly in their DNA. However, new evidence suggests that the difference might be much more drastic. Mutations resulting in DNA insertions and deletions cause much of the genetic difference between the two species, but are typically not included in estimates of diversity. Moreover, areas of significant similarity are often affected by selective constraints. An increasing number of functions are also being discovered for so-called ‘junk DNA’, suggesting similarity in such DNA is not necessarily due to common descent. Additional research should aid the understanding of such important data in the debate over origins.
Creationists have long maintained that the similarity between human and chimp DNA is not all that it is touted to be. A new study in the Proceedings of the National Academy of Sciences could help confirm this.
It is widely held that ‘The common chimpanzee (Pan troglodytes) is our closest relative. Its genome sequence is about 98.8% identical to our own, and we shared a common ancestor some six million years ago.’1 The assumption that humans diverged from chimps roughly this long ago also forms the basis of the mitochondrial clock,2 which ‘continues to be widely used to “time” human evolution and population movements, both ancient and modern.’3 In the popular-level book Genome, Matt Ridley states that:
‘Apart from the fusion of chromosome 2, visible differences between chimp and human chromosomes are few and tiny. In thirteen chromosomes no visible differences of any kind exist. If you select at random any “paragraph” in the chimp genome and compare it with the comparable “paragraph” in the human genome, you will find very few “letters” are different: on average, less than two in every hundred. We are, to a ninety-eight per cent approximation, chimpanzees, and they are, with ninety-eight per cent confidence limits, human beings. If that does not dent your self-esteem, consider that chimpanzees are only ninety-seven per cent gorillas; and humans are also ninety-seven per cent gorillas. In other words we are more chimpanzee-like than gorillas are.’4
One creationist response to such arguments regarding human/chimp DNA similarity has been that ‘Chimp DNA has not been anywhere near fully sequenced so that a proper comparison can be made’,5 and that this evidence is just as easily explained (and predicted, for that matter) by the concept of a common designer:
‘Since DNA codes for structures and biochemical molecules, we should expect the most similar creatures to have the most similar DNA. Apes and humans are both mammals, with similar shapes, so both have similar DNA. We should expect humans to have more DNA similarities with another mammal like a pig than with a reptile like a rattlesnake. And this is so. Humans are very different from yeast but they have some biochemistry in common, so we should expect human DNA to differ more from yeast DNA than from ape DNA.’6
In a recent article,7 David A. DeWitt cited a study which found that the two species are only 95% identical when insertions and deletions are considered,8 showing that the estimate of divergence depends mainly on what type of DNA is being compared. A number of differences between humans and chimps were named that are difficult to quantify in an estimate of sequence divergence (that is, the differences in bases between the human and chimp genomes), including shorter telomeres in humans, a 10% larger chimp genome, and great differences in chromosomes 4, 9, 12 and the Y chromosome, for example. Indeed, DNA similarity estimates ‘do not adequately represent fine changes in genome organization.’9
Considering DNA gaps
Previous estimates of sequence divergence have focused exclusively on base substitutions in DNA—that is, one base (or one DNA ‘letter’—A, T, C or G) being replaced with another. The new calculation, resulting in much less sequence similarity, also includes insertions and deletions, or indels, (occurring when a base is added or removed, often resulting in what is known as a frameshift mutation), in addition to base substitutions. The author of the study, Roy J. Britten, stated:
‘It appears appropriate to me to consider the full length of the gaps in estimating the interspecies divergence. These stretches of DNA are actually absent from one and present in the other genome. In the past, indels have often simply been counted regardless of length and added to the base substitution count, because that is convenient for phylogenetics.’8
His findings lend support to the idea that much of the failure of DNA to hybridize between chimps and humans is the result of missing DNA due to indel events. Britten then became involved in a follow-up paper in which these initial results were confirmed; in fact, it was found that ‘the 5% human-chimp difference already published is likely to be an underestimate, possibly by more than a factor of 2.’10
Now, Anzai et al. have published a new report in the Proceedings of the National Academy of Sciences that confirms this statement. In the study, nearly one-half of the MHC (major histocompatibility complex) region was sequenced, ‘which to date represents the longest continuous sequence within this species [chimps], our closest evolutionary relative’, and has been described as a ‘rapidly evolving’ part of the genome.9 Although it has been held that human/chimp similarity in the MHC is ‘so great that the alleles must have originated before the supposed chimp/human evolutionary divergence’,11 the sequence results actually dropped the DNA similarity estimate down to 86.7%!12 Indeed, the actual difference between the two species (when counting indels) is greater than 5% by well more than a factor of two. Not only this, but ‘evolutionists now recognize that complex MHC genetic motifs can arise independently’ in primates—that is, at least some similarities that do exist are not attributable to common descent.13
The human genome contains two MHC Class I genes, the MICA and MICB, yet chimpanzees contain only one gene at this location, the Patr-MIC. According to evolutionary speculation, a 95-kb deletion occurred between the two human genes, forming the hybrid chimpanzee gene ~33–44 million years ago, by far predating the commonly held divergence date between the two species of 6 million years. Because the two ends of the chimpanzee gene seem to match up with the beginning of the human MICA and end of the human MICB genes, it may seem reasonable that common ancestry is feasible. However, even some humans contain a single gene at this location (called the HLA-B*4801 allele) very similar to the one found in chimps. The study notes that it ‘is quite intriguing that an equal-sized deletion involving this very same region and genes (MICA/B) has happened at distinct points in time in several different primate species’.12 Yet it is also claimed that other such similar changes in DNA structure cannot be attributed to convergence, but must be due to common ancestry! Clearly, similar ‘mistakes’ can arise independently in separate species (as expanded upon by Woodmorappe13). The hypothesis that a Designer would create the same structures for the same functions seems to explain the data much more easily. As noted by Woodmorappe,11 strong selective pressures must have existed in order to prevent the MHC similarities between primates from being scrambled over supposed millions of years, further weakening the evolutionary scenario.
The Anzai et al. study also mentions a number of differences between humans and chimps that may be a result of genetic changes in the MHC genes, including the difference in handling infectious agents such as HIV, hepatitis B and C, and susceptibility to Plasmodium falciparum. Therefore, the differences observed in these genes may portray the believed ‘true’ divergence between the two species much better than previous estimates.
Although these results are interesting, there has been debate over whether or not indels should be included in sequence divergence estimates. For example, a mutation called a translocation can occur, in which a segment of DNA breaks off from one chromosome and is inserted in another. The original Britten study discussed such rearrangement events briefly and found them to be frequent. Due to the fact that indel differences were defined as ‘the full length of the gaps’ in the genomes, the estimates would not be able to consider this kind of mutational change easily.14 New research will hopefully aid in the understanding of changes in genome organization, and give clues as to how these changes can be included in estimates of human/chimp similarity.
Difference between coding and noncoding DNA
Other studies have resulted in estimates of similarity higher than 98.6%, also. For instance, Wildman et al.15 compared ~90 kilobases of human DNA to chimps and found a similarity of 98.86%, even when counting indels. This is important evidence, considering that it is in direct opposition to the data presented by Britten and Anzai et al. However, it must be understood that the various estimates use different types of DNA. Wildman’s team examined only coding DNA from a number of genes. Here, non-synonymous changes (those affecting protein structure by changing the specific amino acid encoded) are subject to purifying selection. This means that they can be selected against if they have any effect on the function of the protein.
Similarly, a study of human chromosome 21 (the smallest chromosome in the human genome) found only 3,003 nucleotide differences in over 400 kilobases. It was shown that: ‘The differences in coding, promoter, and exon-intron junction regions were 0.51 ± 0.02%, 0.88 ± 0.03%, and 0.85 ± 0.02%, respectively, much lower than the previously reported 1.23% in genomic regions’,16 with an overall similarity of 99.3%. Within an evolutionary framework, these results would confirm chimps as our closest relatives. However, this finding seems to contradict the knowledge of a high substitution rate on chromosome 21, also leading to the conclusion
‘ … that the higher level of similarity observed in the transcript units in this study is attributable to the presence of purifying natural selection exerted on the most important functional portions of the genes, including promoters, coding regions, and intronic regions near the exon-intron boundary.’16
Therefore, high similarity estimates specifically involve regions of coding DNA that are functionally constrained. The studies by Britten et al. and Anzai et al. both consider non-coding DNA, which might be less constrained, and therefore more free to accumulate random mutations. This non-coding DNA thus serves as a more accurate portrayal of true divergence. Of course, it is very reasonable within the context of biblical creation that the most similarity should exist where protein function is vital, since the same proteins would be used for the same structures by a common Designer.6 It naturally follows that non-coding DNA, being less constrained, possibly contains more divergence.
Returning to the Anzai et al. study, which found chimps and humans to be 86.7% similar, a general trend may be noticed with higher similarity in coding regions. Whereas most ‘non-MHC genes are involved in basic (homeostatic) cellular functions that require interindividual as well as interspecies homogeneity’, the MHC genes ‘have to constantly adapt themselves to the microbiological habitat of every species.’ Therefore, purifying selection tends to maintain the structural conservation of non-MHC genes because of their specific functions. We can conclude that the 86.7% estimate ‘may be a better representation of whole-genome sequence similarity between the human and the chimpanzee’ than previous estimates of 98.6%. Since ‘the major difference between the human and chimpanzee sequences is overwhelmingly attributable to indels’,12 estimates not including these mutations ignore a huge source of potential differences. Recent studies have consistently found indels to be the main source of variation between humans and chimps.8,10,12 It should also be noted, in contrast to examples of high-sequence similarity, that sequence divergence in certain regions can exceed 20%.8 As noted by DeWitt, estimates can be ‘misleading because it depends on what is being compared.’7
Introns are regions of DNA in the genome that do not code for a protein product, and are therefore assumed to have no function. Because of this, ‘introns in a particular gene are often compared between organisms, with the base pair differences seen between their sequences supposedly indicating the degree and time of divergence since they last shared a common ancestor.’17 Indeed, functionless introns should be very different in humans and chimps, or even nonexistent, within the context of biblical creation. However, evidence is mounting that introns are not, after all, void of function, and the assumption that they were may ‘come to be a classic story of orthodoxy derailing objective analysis of the facts.’18 Other forms of ‘junk’ DNA, obviously said to lack function and thus able to mutate at random, actually contradict evolutionary phylogenies, such as pseudogenes shared by humans and gorillas but not chimps, the CYP pseudogene being present only in chimps, and a substitution in the Alpha-1,3GT pseudogene shared by cows, squirrel monkeys and gorillas. Many substitutions that are shared take place in a non-random manner, also weakening the explanatory power of common descent.13 Numerous articles have been published discussing the functions of various alleged forms of ‘junk’ DNA,13,17,19–22 and it is encouraging to actually see evolutionary journals awakening to this important fact. The preservation of introns
‘… suggests they do something indispensable. And indeed a large number are transcribed into varieties of RNA that perform a much wider range of functions than biologists had imagined possible. Some scientists now suspect that much of what makes one person, and one species, different from the next are variations in the gems hidden within our “junk” DNA.’23
Similarities in introns do, therefore, fit the creationist paradigm quite nicely.
DNA is not everything
I suggest that further research is required in order to sort through this evidence, research that will also find differences inherent within the chimp kind. Indels can easily be viewed as intrinsic differences between kinds. The DNA sequence is not all that distinguishes different kinds of organisms—as geneticist Steve Jones was quoted in Creation as saying, ‘We also share about 50% of our DNA with bananas and that doesn’t make us half bananas, either from the waist up or the waist down.’24 Evidence has certainly emerged that ‘DNA is not everything’; for example, mitochondria, ribosomes, the endoplasmic reticulum and the cytosol are passed unchanged from parent to offspring (save for possible mutations in mtDNA). In fact, gene expression is itself under the control of the cell.25 Some animals have undergone extremely dramatic genetic changes, and yet their phenotype has remained virtually identical.26 Such epigenetic marks ‘can dramatically affect the health and characteristics of an organism—some are even passed from parent to child—yet they do not alter the underlying DNA sequence.’27 This evidence lends great support to reproduction after kinds (Genesis 1:24–25; 1 Corinthians 15:39), as structures present within parents are preserved in their offspring.
This is an exciting time for creationists as estimates of human/chimp similarity continue to decrease when indels are considered. Although it is obvious that the two species are very much alike in the mere DNA sequences (many of the same structures are present in both, so this would be expected in a creation model), the previous estimate of ~98.6% sequence identity may have been dealt a significant blow. Upcoming research will likely shed new light on the many differences between humans and other animals, and continue to affirm the truth of Genesis.
I wish to thank Kim Risely for reviewing an earlier draft of this manuscript. Also, I recognize Reed Cartwright, David DeWitt and Carl Wieland for providing helpful information and criticism. I am also indebted to Randy Kim, for having supported, encouraged and taught me at the times when it was most needed.
About the author
J. Warren Nelson is a high school sophomore with a particular interest in biology as it relates to evolutionary theory. He has attended a number of Creation conferences, including Creation Conference 2003. He often gives talks at his school’s Fellowship of Christian Athletes, on topics ranging from Genesis to Christian love. His current research interests involve population genetics, chemical evolution and Christian friendship. Return to top.
- Cyranoski, D., Almost human …, Nature 418(6901):910–912, 2002.
- Gibbons, A., Calibrating the mitochondrial clock, Science 279(5347):28–29, 1998.
- Howell, N., Smejkal, C.B., Mackey, D.A., Chinnery, P.F., Turnbull, D.M. and Herrnstadt, C., The pedigree rate of sequence divergence in the human mitochondrial genome: there is a difference between phylogenetic and pedigree rates, American J. Human Genetics 72(3):659–670, 2003.
- Ridley, M., Genome, HarperCollins Publishers, New York, p. 28, 1999.
- Batten, D., Human/chimp DNA similarity: Evidence for evolutionary relationship? Creation 19(1):21–22, 1997.
- Sarfati, J., Refuting Evolution 2, Creation Ministries International, Brisbane, Australia, pp. 112–113, 2002.
- DeWitt, D.A., >98% Chimp/Human DNA similarity? Not any more, TJ 17(1):8–10, 2003.
- Britten, R.J., Divergence between samples of chimpanzee and human DNA sequences is 5%, counting indels, Proc. Nat. Acad. Sci. USA 99(21):13633–13635, 2002.
- Hacia, J.G., Genome of the apes, Trends in Genetics 17(11):637–645, 2001.
- Britten, R.J., Rowen, L., Williams, J. and Cameron, R.A., Majority of divergence between closely related DNA samples is due to indels, Proc. Nat. Acad. Sci. USA 100(8):4661–4665, 2003.
- Woodmorappe, J., Noah’s Ark: A Feasibility Study, Institute for Creation Research, El Cajon, p. 203, 1996.
- Anzai, T., Shiina, T., Kimura, N., Yanagiya, K., Kohara, S., Shigenari, A., Yamagata, T., Kulski, J.K., Naruse, T.K., Fujimori, Y., Fukuzumi, Y., Yamazaki, M., Tashiro, H., Iawmoto, C., Umehara, Y., Imanishi, T., Meyer, A., Ikeo, K., Gojobori, T., Bahram, S. and Inoko, H., Comparative sequencing of human and chimpanzee MHC class I regions unveils insertions/deletions as the major path to genomic divergence, Proc. Nat. Acad. Sci. USA 100(13):7708–7713, 2003.
- Woodmorappe, J., Are pseudogenes ‘shared mistakes’ between primate genomes? TJ 14(3):55–71, 2000.
- Cartwright, R.A., personal communication, 8 July 2003.
- Wildman, D.E., Uddin, M., Liu, G., Grossman, L.I. and Goodman, M., Implications of natural selection in shaping 99.4% nonsynonymous DNA identity between humans and chimpanzees: enlarging genus Homo, Proc. Nat. Acad. Sci. USA 100(12):7181–7188, 2003.
- Shi, J., Xi, H., Wang, Y., Zhang, C., Jiang, Z., Zhang, K., Shen, Y., Jin, L., Zhang, K., Yuan, W., Wang, Y., Lin, J., Hua, Q., Wang, F., Xu, S., Ren, S., Xu, S., Zhao, G., Chen, Z., Jin, L. and Huang, W., Divergence of the genes on human chromosome 21 between human and other hominoids and variation of substitution rates among transcription units, Proc. Nat. Acad. Sci. USA 100(14):8331–8336, 2003.
- Walkup, L.K., Junk DNA: evolutionary discards or God’s tools? TJ 14(2):18–30, 2000.
- Mattick J.S., quoted in: Gibbs, W.W., The unseen genome: gems among the junk, Scientific American 289(5):46–53, 2003.
- Batten, D., More junk reclaimed, TJ 16(2):8, 2002.
- Woodmorappe, J., Pseudogene function: regulation of gene expression, TJ 17(1):47–52, 2003.
- Woodmorappe, J., Pseudogene function: more evidence, TJ 17(2):15–18, 2003.
- Woodmorappe, J., Junk DNA indicted, TJ 18(1);27–33, 2004.
- Gibbs, W.W., The unseen genome: gems among the junk, Scientific American 289(5):46–53, November 2003.
- Jones J., quoted in: Wieland, C., Furry little humans? Creation 24(3):10–12, 2002.
- Williams, A.R., Jumping paradigms, TJ 17(1):19–21, 2003.
- Fox, D., Wallaby nations, New Scientist 175(2354):32–35, 2002.
- Gibbs, W.W., The unseen genome: beyond DNA, Scientific American 289(6):106–113, 2003.