What is the purpose of an alignment (4)?
What is the purpose of multiple sequence alignments (2)?
- Perform evolutionary analysis
In a pairwise alignment, what does positioning of two amino acids at the same point imply? What is the best way to determine the implication, and can this always be done?
They perform the same role in homologous proteins.
Can be determined by performing structural alignment, where the amino acids are aligned in 3D. May not be possible if no 3D structure is available.
What can be done to increase the accuracy of pairwise alignments? What does this reveal? What alignment to perform is usually worthwhile?
Adding more sequences to an alignment. Can reveal patterns that aren’t obvious in a pairwise alignment. Worthwhile to perform MSA.
Describe how an MSA is done (6).
How is scoring done in MSA (3)?
What are some disadvantages of MSA (2)?
- Difficult for 4 sequences, more than 20 is impossible.
How does the CLUSTAL algorithm work (4)?
What is an advantage and disadvantage of the CLUSTAL algorithm?
Advantage
- Results in near optimal alignment.
Disadvantage
- If an early error is made, it is preserved.
What is a major problem with the CLUSTAL algorithm?
Selection of an appropriate matrix for alignments consists of divergent and closely related sequences.
From an MSA, what do highly conserved residues suggest?
Correspondence to an active site.
From an MSA, where are insertions and deletions often found?
In surface loops.
From an MSA, what do conserved patterns of hydrophobicity with a spacing of 2 indicate? What about 4?
B-sheet.
4 indicates a-helix.
What is the terminal node?
End point.
What is an internal node?
Hypothetical ancestor.
What is a root?
Common ancestor.
What are the three basic assumptions of cladistics?
Can phylogenetic trees be rooted, or unrooted? What sequence is better to use? What can be done with this sequence, and what is it called?
They can be rooted, or unrooted.
It is better to use a sequence that is more divergent from all other sequences. The tree can be rooted at this sequence, called an outgroup.
What is characteristic of fully resolved trees?
They are binary, with no more than 2 branches at each node.
What is a problem with increases in taxa?
The number of possible trees increases exponentially, making it hard to know the one drawn is the true tree.
What are three ways to build trees?
Describe the distance matrix method.
It is a clustering method, for a set of species. You choose the two most similar, add a node, then add the next most similar.
What is a disadvantage of the distance matrix method?
Very simplistic, makes assumptions that may not be true.
Describe neighbour joining. What does it assume?
Stepwise build, doesnt asume all taxa have the same evolutionary rate. It can detect it, and corrects for this.