Zoology 510, Class Notes for Ridley, Chapter 17
The Reconstruction of Phylogeny.
No written assignment, but you should try to answer the Study and Review Questions at the end of the chapter (pp. 506-7).
Brief Outline
510 index page
Introduction to Chapter 17.
- The basic logic of phylogenetic reconstruction is quite straightforward.
- Nature provides species. These species have a phylogenic history, but that history cannot be observed directly.
- There is only one true phylogeny.
- Inheritance from common ancestry produces similarity.
- Evolutionary divergence produces differences.
- We arrange the species according to similarities and differences.
- The result should reveal phylogeny
- Unfortunately, Nature does not cooperate. Nature provides misleading and contradictory clues.
- Three conceptual tools are available:
- Cladistic analysis based on reliable synapomorphy (derived homology).
- Cladistic analysis using parsimony and numerous similarities and differences.
- "Distance" analysis, encompassing a variety of effective methods (in addition to phenetics which is quite unreliable).
- The difficult job for phylogenetic reconstruction, like that for classification, is deciding which way of choosing characters is most reliable in any particular case.
CHECK LIST of important TERMS
- phylogeny / phylogenetic tree
- rooted tree
- unrooted tree
- informative vs. uninformative characters
- homology vs. analogy
- character conflict (homoplasy, symplesiomorphy)
- derived vs. ancestral homologies
(synapomorphy vs. symplesiomorphy)
- outgroup
- outgroup comparison
- von Baer's first law
- ancestral state vs. derived state
Chapter 17, Section-by-Section Comments
17.1. "Phylogenies are inferred from characters shared between species."
- This section introduces the idea of rooted and unrooted trees.
- An unrooted tree indicates relationships without specifying any direction for evolutionary change. It has no ancestral "root" (or stem), and so does not indicated which branches came earlier and which ones later.
- A rooted tree does include indication of ancestor-descendent relationships.
- This section also introduced the idea of informative and uninformative characters.
- Informative characters are those which are shared among some but not all members of a group in question.
- Uninformative characters are those which which are shared by all members of the group OR which are possessed only by single members. In neither case can they be used for inferring relationships.
17.2. "The parsimony principle works if evolutionary change is improbable."
- Parsimony is an ancient guide for inference.
- Occam's Razor is the classic assertion of parsimony:
Essentia non sunt multiplicanda praeter necessitatum.
Essentials shall not be multiplied beyond necessity.
Explanations should be kept as simple as possible
- When applied in the reconstruction of phylogeny, parsimony is usually taken to mean that the true tree will call for fewer evolutionary changes during phylogeny than would be needed to fit the same set of similarities and differences onto an incorrect tree.
- This would be fine if any evolutionary change occurred only once and was never reversed. Unfortunately, that is not the case. Any attempt at phylogenetic reconstruction soon reveals that some similarities must be convergent or parallel (i.e., occurring multiple times) while some differences must involve reversal.
- Therefore, rigid parsimony cannot be trusted absolutely.
- The principle of parsimony makes good sense, if it is used as fallible guide and not as an unimpeachable law.
17.3. "Phylogenetic inference uses two principles: parsimony and distance."
- The parsimony principle is based on counting and minimizing the number
of discrete transformations which must occur in a phylogenetic tree.
- The distance principle involves arranging tree branches of various
lengths so the distances between every species pair can be most closely matched
by summing distances along each branch of the tree. Distance, in turn,
is a quantitative measure of similarity; the greater the distance, the less
similarity. (The description in Ridley, p.468, is fairly useless.)
- In simple cases, examples of these two methods can look very similar. Ideally they should give the same results (but in the real world they often do not).
- Both methods are computationally intense for any but the smallest sets of species.
17.4. "In most real cases, not all characters suggest the same phylogeny."
- True synapomorphies (derived homologies, reflecting shared ancestry) cannot conflict.
- Similarities which are not true synapomorphies may be due homoplasy (convergence, parallel evolution, or reversal) or symplesiomorphy (shared, ancestral similarity).
- Homoplasy and symplesiomorphy are both common.
- In practice, the job of reconstructing phylogeny is largely one of distinguishing
true synapomorphy from homoplasy.
17.5. "Homologies are more reliable for phylogenetic inference than are analogies."
- This section contains painfully circular reasoning, beginning with the section heading itself. However, the underlying concept is sound. Once phylogeny is known with some confidence, one can determine which similarities are homologous and which are analogous. And one can then see how misleading the analogies might be.
- The next section suggests tools for breaking out of the circle.
17.6. "Homologies can be distinguished from analogies by several criteria."
- Several features often characterize reliable homologies:
- They share "fundamental structure" (whatever that might mean). In practice, "fundamental structure" is generally taken to mean composition from similar elements, similarly arranged, like the various bones and muscles of homologous vertebrate limbs.
- They share similar relations to surrounding characters, such as relative position in the body.
- They share similar embryonic development (and genetic specification).
- Unfortunately, none of these can be trusted absolutely. There
are well-established exceptions to them all.
- Homologies tend to keep these reliable similarities in spite of modification for different modes of life.
- In contrast, analogies are often similar specifically because of (convergent or parallel) adaptation for a common way of life. So, similarities that are similarly adapted for current needs are poor candidates for reliable homology.
17.7. "Derived homologies are more reliable indicators of phylogenetic relations than are ancestral homologies."
- Paraphyletic groups are similar because of ancestral similarities (symplesiomorphies) that have been lost (or highly modified) from some members of the complete monophyletic group.
- Recognizing that a highly-modified group is itself monophyletic is often easy; recognizing the remaining paraphyletic assemblage is NOT monophyletic can be much less obvious.
- The following section describe tools for (tentatively) distinguishing between synapomorphies and symplesiomorphies.
17.8. "The polarity of character states can be inferred by three main techniques."
- 17.8.1 "Outgroup comparison."
- An outgroup is a group that lies outside the group whose phylogeny
is being analyzed.
- Ideally, an outgroup should be closely related to the group in question.
- Traits shared among some members of the group in question which are
also present in the outgroup are likely to be ancestral homologies (symplesiomorphies)
and therefore not evidence of monophyletic relationship within the group.
- Identifying suitable outgroups requires some prior knowledge of phylogeny.
- Outgroup comparison can also be confounded by homoplasy, in which case
multiple outgroups can be helpful.
- 17.8.2"The embryological criterion."
- Von Baer's first law, that ancestral characters appear earlier in embryonic
development than derived characters, bears a distinct similarity to Haeckel's
infamous biogenetic law, that "Ontogeny recapitulates phylogeny."
- By limiting the application of his principle to the embryonic sequence
of developmental states, von Baer avoided the error of assuming
that embryonic development recapitulates the sequence of ancestral adult
states.
- Unfortunately, we know that von Baer's first law does have exceptions
(embryonic forms can and have changed evolutionarily, independently from
later life-stages, such that related but strikingly dissimilar embryos
can give rise to very similar adults.) Since we often know even
less about embryology than we do about phylogeny, applying the embryological
criterion to infer phylogeny runs a decided risk of error or circular
reasoning.
- 17.8.3 "The fossil record."
- To the extent that the fossil record can be trusted, ancestral traits
can be directly observed.
- However, here the problem lies in deciding which fossil creatures are
ancestral to which group. Even if the fossil is older than the first
known appearance of the group in question, it is generally difficult to
know for certain that the group didn't actually diverge earlier still
and simply fail to leave a record of its beginnings.
17.9. "Any residual character conflict can be resolved by parsimony."
- In spite of all the uncertainty, it is possible to know, for certain,
that all of the rules for inferring homology do have exceptions. This
is proven by character conflict, in which there is no single tree which is
consistent with all the evidence, and in which different rules for inference
yield different best trees.
- Faced with unresolved uncertainty as a result of conflicting characters,
there are four strategies:
- Gather more data (often impractical or excessively expensive).
- Intensify analysis of available data (often fruitless).
- Suspend judgment.
- Assert the overriding correctness of parsimony.
- Of these four, assertion of parsimony is chosen by many contemporary cladists.
This has the virtue of giving "definite" answers. Unfortunately,
such definite answers may well be wrong.
- Suspension of judgment is usually a sound choice, but deeply unsatisfying.
17.10. "Molecular sequences are becoming increasingly important in phylogenetic inference, and they have distinct properties."
- "The logic of phylogenetic inference is identical for molecular and morphological characters."
- Molecular characters are available in vast numbers.
- Every amino acid in every protein is a character.
- Similarly, every base-pair in every gene is a character.
- Each individual character is quite unreliable; character conflict is widespread.
- There are only possible four character states for a DNA base-pair.
- There are only 20 possible character states for an amino acid.
- With such low numbers, convergence and reversal become reasonably probable.
- So large numbers of characters are needed.
- Comparison of sequences is straightforward, using statistical techniques based on distance (using simple counts of differences) or parsimony (using counts of discrete character state-changes).
- These statistical techniques are limited by insurmountable computational difficulties to relatively small sets of taxa.
- Most commonly, molecular methods are used either with a group of species within a single taxon whose monophyly is already known (probably on the basis of morphology) or with representative species from several higher taxa whose separate monophyly is already known (again, most probably .from morphology)
17.11. "Molecular sequences can be used to infer an unrooted tree for a group of species."
- Steps involved in reconstruction:
- Acquire comparable molecular sequence data. (This is not always as straightforward as it sounds.)
- Align the sequences (also can be tricky).
- Find informative sites.
- Draw all possible trees (computationally impossible for large data sets) or sample from possible trees.
- Determine distance measure or count state-changes on each tree.
- Choose the optimal tree (based on optimality criterion of distance statistic or parsimony).
- Root the tree using outgroup comparison.
17.12. "Different molecules evolve at different rates, and molecular evidence can be tuned to solve particular phylogenetic problems."
- The job here is to find a molecule with the highest number of informative sites.
- Mitochondrial DNA is useful for recent lineages (diverging over the last few million years).
- Nuclear ribosomal RNA genes are useful for more ancient lineages (diverging over hundreds of millions of years).
17.13. "Molecular phylogenetic research encounters difficulties when the number of possible trees is large and not enough informative evidence exists."
- Note that with a practical upper limit of 25 taxa for which trees can be tested exhaustively, additional criteria or assumptions must be introduced.
- There are two distinct issues, "optimality criterion" and algorithm.
- The "optimality criterion" has already been discussed, as parsimony or distance (with several distinct "distance" methods available.
- "Algorithm" refers to the method the computing program uses to search for an optimal tree. Because exhaustive search is impossible for any but small sets of taxa (25 or fewer), some method of trial sampling must be used. All available algorithms may fail to find the best tree.
17.14. "Unrooted trees can be inferred from other kinds of evidence, such as chromosomal inversions in Hawaiian fruitflies or comparative anatomy in the mammal-like reptiles."
- This section celebrates some classical examples of phylogenetic inference. The Drosophila example is especially elegant. Flies have wonderfully large, banded chromosomes. Since each chromosomal inversion is a good, reliable synapomorphy, a precise phylogeny could be inferred at the species level.
17.15. "Some molecular evidence can only be used to infer phylogenetic relations with distance statistics."
- Direct sequence comparison is a powerful method with increasing popularity.
- The older molecular techniques, immunological cross-reactivity and DNA annealing temperatures, give measures of similarity that cannot be analyzed cladistically.
17.16. "Comparing molecular evidence and paleontological evidence."
- When evidence is properly understood, all lines of evidence should be congruent.
- However, when one line of evidence is scanty, or based on dubious assumptions, while another appears stronger, the stronger line will prevail.
- There is no automatic preference for morphological evidence or for molecular evidence or for paleontological evidence. Whatever works, works.
- Two examples show how the shifting weight of evidence can revise hypotheses about phylogeny.
- 17.16.1 "Molecular evidence successfully challenged paleontological evidence in the analysis of human phylogenetic relations."
- 17.16.2"Paleontological evidence may be correcting molecular evidence in the analysis of amniote relations."
17.17. "Conclusion."
- Please take note: "It is easy to be deceived by discussions of phylogenetic inference into thinking that it represents an exceptionally uncertain, shaky kind of science." It is unfortunately true, in some cases, that "'Paleontology is mute, comparative anatomy meaningless, and embryology lies.'" But it is easy to overlook the fact that current interest tends to focus "on unsolved problems--and these issues tend to be the difficult ones." That's true in many areas of evolutionary biology, not just phylogenetic analysis.
510 index page
Comments and questions:
dgking@siu.edu
Department of Zoology e-mail:
zoology@zoology.siu.edu
Comments and questions related to web server:
webmaster@science.siu.edu
SIUC / College of Science / Zoology / Faculty / David King / ZOOL 510
URL: http://www.science.siu.edu/zoology/king/510/mr17.htm
Last updated: 3-Jan-2001 / dgk