phylogenetics 1.2 Flashcards by Aida Rezaei

What are the differences between trees?

Branch lengths have different meanings.

How well did you know this?

Not at all

Perfectly

What is a Cladogram?

Branch length has no special meaning. Focuses only on the order of branching — who is related to whom. Used when we just care about the pattern of ancestry.

It’s like a simple family tree.

How well did you know this?

Not at all

Perfectly

What is a Phylogram?

Branch length represents the amount of evolutionary change. The longer the branch, the more mutations or character changes have occurred. Often built from genetic distance data.

The tree now shows both relationships and evolutionary differences.

How well did you know this?

Not at all

Perfectly

What is a Chronogram?

Branch length represents time — calibrated with fossil records or molecular clocks. Assumes a mutation rate per time unit (e.g., 1 mutation per million years). All species living today should line up evenly at the tips (same “present” time).

A chronogram shows both who descended from whom and how long ago they split.

How well did you know this?

Not at all

Perfectly

What are the 3 types of trees?

Cladogram, Phylogram, Chronogram.

How well did you know this?

Not at all

Perfectly

What does LUCA stand for?

The Last Universal Common Ancestor.

How well did you know this?

Not at all

Perfectly

What is LUCA?

The most recent organism from which all life on Earth descended.

How well did you know this?

Not at all

Perfectly

How was LUCA characterized?

Scientists believe LUCA lived around 3.5 to 3.8 billion years ago and was: A cellular organism with a lipid bilayer (a real cell structure), using DNA, RNA, and proteins. The ancestor of all three domains of life: Bacteria, Archaea, Eukarya. LUCA is not the first life form ever, but rather the last ancestor before life diversified into the major branches we see today.

How well did you know this?

Not at all

Perfectly

What are the shapes of trees?

Polytomy, Bifurcation.

How well did you know this?

Not at all

Perfectly

What is a Polytomy?

A node with more than two branches splitting out of it. Means we don’t know the exact order of branching — uncertain relationships.

Example: one ancestor gives rise to A, B, and C at once (but we can’t tell which came first).

How well did you know this?

Not at all

Perfectly

What is a Bifurcating tree?

Every node splits into exactly two branches. Called fully resolved because all relationships are clear and unambiguous. Scientists aim for bifurcating trees but often start with polytomies when data are uncertain.

How well did you know this?

Not at all

Perfectly

What is important to note about phylogenetic trees?

A phylogenetic tree is a hypothesis, not a confirmed fact. No loops allowed: Trees don’t model hybridization or horizontal gene transfer (those require “networks”).

How well did you know this?

Not at all

Perfectly

What are the main assumptions about phylogenetic trees?

All taxa share a common ancestor. Mutations accumulate gradually over time. If two species (A and B) are more similar to each other than to C, then: C likely split off earlier from the common ancestor.

How well did you know this?

Not at all

Perfectly

What does the tree interpretation indicate?

(root) | |—— C | └——┬—— A └—— B A and B share a recent ancestor, while C diverged earlier.

How well did you know this?

Not at all

Perfectly

What are the groups of trees based on how species relate to their common ancestors?

Monophyletic, Paraphyletic, Polyphyletic Groups.

How well did you know this?

Not at all

Perfectly

What is a Monophyletic Group?

Includes a common ancestor and all of its descendants. This is the “ideal” grouping in modern biology.

Example: All mammals (including humans, whales, bats) share a single ancestor → monophyletic.

How well did you know this?

Not at all

Perfectly

What is a Paraphyletic Group?

Study These Flashcards

Includes a common ancestor and some, but not all, descendants. Leaves out one or more subgroups.

Example: Reptiles are paraphyletic because birds evolved from reptiles but are excluded from “Reptilia” in the traditional sense.

What is a Polyphyletic Group?

Study These Flashcards

Includes species that don’t share an immediate common ancestor. Grouped together based on similar traits that evolved independently — a case of convergent evolution (homoplasy).

Example: Bats and birds both have wings, but their last common ancestor didn’t. So, “flying animals” as a group is polyphyletic.

What do phylogenetic trees depend on?

Study These Flashcards

Phylogenetic trees depend on which characters you analyze: Morphological features (e.g., color, shape), Genetic sequences (DNA), Behavioral traits.

Why can different datasets produce different trees?

Study These Flashcards

Some traits evolve faster than others. Some similarities arise independently (convergent evolution).

Example scenario: Organism A: uppercase black, Organism B: lowercase black, Organism C: lowercase red, Organism D: uppercase red. If you build a tree based on color, you’ll group black vs. red. If you build it based on letter case, you’ll group uppercase vs. lowercase. Thus, different features tell different evolutionary stories. Scientists usually combine multiple data types to get a more reliable tree.

Why might different characters give different trees?

Study These Flashcards

A phylogenetic tree is an inference: we use observed characters (DNA letters, protein differences, morphological traits) to reconstruct history. Different characters can tell different stories because: Homoplasy / convergent evolution, Multiple changes at the same site (double or back mutations), Mistakes in calling the ancestral state.

What is Homoplasy / convergent evolution?

Study These Flashcards

Two species may look or have sequences that are similar not because they inherited that feature from a recent common ancestor, but because they evolved the same feature independently.

Example: wings in bats and birds — both fly but their wing structures evolved separately.

What are multiple changes at the same site?

Study These Flashcards

A nucleotide can mutate A → G then later G → A (back mutation), or mutate A → G in two independent lineages (parallel mutation). These events hide true history and make unrelated taxa look similar or related taxa look different.

What are mistakes in calling the ancestral state?

Study These Flashcards

When we infer which character state was ancestral, we can be wrong—this leads to incorrect placement of splits.

What is the difference between gene trees and species trees?

Species tree: the actual historical branching of populations/species (who split from whom and when). Gene tree (or genealogy): the history of a single genetic locus (one small stretch of DNA). Gene copies have their own ancestry that traces back through individuals, not directly through species.

What is Incomplete Lineage Sorting (ILS)?

Imagine three species that split quickly one after the other: species X splits into A and the ancestor of B+C, then soon after the ancestor of B+C splits into B and C. If the splits happen fast and population sizes are large, gene copies sampled from A, B, and C may have gene genealogies that do not match the species branching order.

What are the key drivers of ILS?

Short time between speciation events (rapid radiation) → more chance of discordant gene trees. Large effective population size (Ne) → gene copies take longer to coalesce, increasing chance of mismatch.

What is the practical consequence of ILS?

If species A, B, C split rapidly, many loci may show X-B, X-C, or B-C groupings — not all will match the true species tree.

What is an example of ILS?

Species tree (true history): ┌─ A ───┤ └─┬─ B └─ C. Species split quickly (A splits off, then B and C soon after). Gene tree from locus 1 might show: ┌─ A ───┤ └─┬─ C └─ B. Gene tree from locus 2 might show: ┌─ B ───┤ └─┬─ A └─ C. Only some gene trees match the species tree.

Why does picking only one gene per species not solve the ILS problem?

You might say: “OK, pick one gene copy per species and build a tree.” That reduces the number of gene copies, but it does not remove ILS — it moves the uncertainty to deeper nodes.

How to handle gene/species discordance in practice?

Use many independent loci, Coalescent-aware methods / multispecies coalescent (MSC), Test for gene flow / introgression, Check branch lengths & timing.

Why use many independent loci?

The discordant signal will be diluted and the true species tree often emerges as the dominant pattern.

What are coalescent-aware methods?

Statistical methods that explicitly model how gene genealogies coalesce inside species trees (e.g., ASTRAL, SVDquartets, BUCKy).

Why test for gene flow / introgression?

If hybridization or horizontal gene transfer occurs, trees alone are not enough — network approaches or explicit introgression tests are needed.

What to check regarding branch lengths & timing?

Short internal branches are hotspots for ILS; long branches are safer (more time for gene coalescence to occur).

What is the population genetics perspective on genealogy & coalescence?

Each gene copy has a lineage; going backward in time, pairs of gene copies eventually coalesce into a common ancestor copy. The expected time for two gene copies to coalesce depends on the effective population size (Ne).

What are other causes of conflicting trees?

Migration, hybridization, horizontal gene transfer (HGT), Homoplasy, Sampling error and limited data.

What do migration, hybridization, and horizontal gene transfer (HGT) create?

These events create real reticulation (networks), not trees. A node may receive genetic material from another lineage.

What is Homoplasy?

Parallel mutations make unrelated taxa look alike.

What are the limitations of pairwise distances?

Saturation. Over long times, the same site may have changed multiple times. p-distance flattens out and loses information.

How to choose markers for pairwise distances?

Choosing markers by time scale: Very recent splits: use fast-evolving markers (microsatellites, rapidly evolving mtDNA regions). Deep divergences: use slowly evolving, conserved regions (housekeeping genes) to avoid saturation.

What are the clustering methods?

UPGMA, Neighbor-Joining (NJ).

phylogenetics 1.2 Flashcards

(42 cards)