Dye determination sequencing
- one reaction with all four ddNTPS
Single sequence to genome
BAC library
Individual BAC clones
‘Shotgun’ sequencing with Sanger
Overlap Layout Consensus Method
‘Next generation’ sequencing
Sanger
Illumina (Solexa)
High-throughput sequencing
Involves:
High-throughput sequencing
The HTS cycle
Illumina Hiseq
Chromosomes
have a single DNA molecule with specialised DNA sequences for the initiation of DNA replication, for spindle interactions in mitosis (centromeres), and for maintaining the integrity of the ends (telomeres)
Protein gene expression
occurs at open reading frames, from which RNA polymerase transcribes mRNAs that are translated to form polypeptides, which become functioning proteins. Genes contain DNA sequences for control of their expression
Protein coding genes
generally not repetitive but there are some exceptions, e.g. gillagrin and high copy number genes
Repetitive regions
microsatellites, telomeres, intron sequences
tRNA
very similar sequences (but very short)
rRNA
many copies of some ribosomal genes
Transposons
mobile genetic elements - sequence of a few kb that can move about the genome. Thousands of copies in eukaryotes
Size matters
Contig
a contiguous (continuous) consensus sequence from an assembly
Scaffold
a series of contigs where we have additional information to place them together in the right order and orientation but the sequence between the contigs is not complete
Assembly
the set of scaffolds for one genome
N50
the size of the largest contig/scaffold of which is 50% of the assembled data is in a contig/scaffold of that size or larger
Read length