What is a haplotype?
A set of DNA variations that tend to be inherited together
- Chromosome has enormous anumber of variable sites
- So different chromosomes will have different combinations of alleles on those sites (loci)
How do haplotypes work in diploids?
What are ‘phases’ of haplotypes?
The haplotypic combinations present
What methods can we use to find out the haplotype phases?
It is challenging:
1. Allele specific PCR and Next Gen Sequencing
- Can get sequences for whole chromosome arms - ‘long read’ NGS
- Alleles yield PCR products of different sizes
- Correct combination of allele specifieic primers to amplify haplotypes
- Algorithmic analysis of sequences
What is recombination?
The shuffling of chromosomes segments to generate a new haplotype combination
What is the generation of new haplotypes caused by?
When / how does recombination occur?
During meiosis - when maternal and paternal chromosomes are aligned and cross over
- Humans - recombination rate ~ 1 to 10 events per chromosome
- Occurs throughout genome - but there are ‘recombination hotspots’ as well as ‘cold regions’
- Can cause gene conversion (non-reciprocal) recombination or conventional (reciprocal) recombination
- The further apart 2 sequences - the higher the probability of recombination between them
Why is recombination important?
Modulates / influences each of the 4 evolutionary forces dicussed in these lectures
- Fundamental part of sexual reproduction
- Creates novel combinations of genes
- Purges genome of deleterious mutations (removes them)
- Increases efficiency of natural selection - reduces interference between loci under different selection regimes
- Responsible for different sequences having different ancestral histories - increases the information available from the past but also increases the complexity of its analyses
- Can be exploited to infer population history - (e.g., new selection tests, admixture)
- Can be exploited to help locate genes of interest - (e.g., disease loci in humans)
Why is there variation in recombination in different parts of the chromosomes?
How do population processes and recombination interact with the strength of linkage on chromosomes?
What is linkage disequilibrium?
LD is the non-random association of variants of different polymorphic sites between alleles in the population - alleles at different loci
- E.g., When loci have LD = we can predict the variant at one site if we know the variant at another site
- So - sites close together are less likely to have recombination between them - so are more likely to be in LD
- E.g., if no linkage exists (linkage equilibrium) - frequency of each haplotype corresponds to product of allele frequencies
- Recombination decreases LD in each generation
What is linkage disequilibrium important in studying?
What processes increase LD?
How can you measure LD - quantify strength of LD?
How do linkage equilibrium/disequilibrium differ?
For Lewontins D - what do the D values indicate about LD?
D relates to the expected haplotype frequencies as a measure of the deviation from the pattern under random assortment and free recombination
- D = 0 - linkage equilibrium
- D > 0 - Association between alleles - occurs together more than expected - coupling phase
- D < 0 - Dissociation between alleles - occurs together less than expected - repulsion phase
- D usually varies between -0.25 - 0.25
For linked markers, how does LD change with chromosomal distance?
What is admixture?
Two genetically distinct (isolated) populations coming together
How does LD work in admixture?
What does the amount of LD in a population depend on?
Amount of LD in populations depends on recombination rate + when an admixture event may have occured in past
- Recombination high = LD reduces rapidly
- Recent admixture = high LD, ancient admixture = lower LD
Give an example of an admixture event in humans
How can you think about the coalescent with recombination?
How do selection forces affect diversity at linked loci and give an example?
Forces at one position affect the surrounding areas of the genome:
- Purifying selection: reduction in diversity at linked loci (background selection)
- Positive selection: hitchhiking effect leads to selective sweep
- e.g., selection on tb1 gene during domestication of maize - loss of nucleotide diversitiy (PI) in 5’ upstream area regulatory region of maize (caused by selective breeding) - localized loss of diversity is important indicator of where positive selection may have occurred
Explain the process of hitchhiking and selective sweeps?