what is human genome projecT?
1990 - goal to sequence al 3*10^9 base pairs of human DNA
- completed in 2003 - thought it would take 30 year.s
who did HGP?
public: US govt, Francis Collin
private: The institute Craig venter
first completed genome sequence was?
haemophilus influenzae
- respiratory disease. took 8 years thru private funding
venter style?
collins style?
simplistic approach
- sequence DNA. find end of gene. create probe that extended from end of piece to start of next piece
slow, systematic. had to create probe between each finding.
sequencing DNA
sequence thousands of clones at one. 7-10 times coverage of entire genome
what’s sanger sequencing?
DNA polymerase in soln copies after primer.
di-deoxynucleotide (no -OH) addd to soln. DNApol can’t copy if no -OH.
separate based on size. know based on dideoxy what original nucleotide is.
shorter strands at beginning of sequence
next gen sequencing
structure of ORF
open reading frame
- approx 300 bp before stop codon.
ORF =/= gene. may be, but doesnt need to be. if transcribed + translated = gene.
using ORF
ORF content to genome size
greater genome size = more ORF content
lifestyles of bacteria
endosymbiotic - live in other cell. use host DNA, but have their own
parasitic: may grow inside cell. can’t grow without host DNA
Free-living: independent. dont need other organisms. usually larger than others. fitness cost
gene annotation
compare ORF, if similar sequence annotated as similar function.
- gene annotations help reconstruct metabolic pathways + determine gene complement
problem with gene annotation?
2 sequences may have diff function
- if dont have sequence similar, maybe protein is still same function
pathogens + growth factors
from host cell.
no genes for amino acid biosynthesis
- no genes
what is URF
ORF with unknown function
Re-constructed Genome map
ORF’s in opposite directions.
analyze map of bacteria + determine what kind of bacteria 1
glycolysis, PPP
fermenter. lactate, no O2
analyze map of bacteria + determine what kind of bacteria 2
TCA cycle. has ETC flagella, pilli have NADH oxidizer
analyze map of bacteria + determine what kind of bacteria 3
calvin cycle, carbon fixation.
citric acid cycle
photosynthetic -
NH3 + O2 at 1st etc,
ammonia-oxidizing bacterium.
chemolithoautotroph
aerobe
ORFs in bacterial genome : genome vs function
DNA replication: same amount needed regardless of genome. proportionally decrease tho. same with translation
trancription + signal transduction : more regulation, more pathways when there’s bigger size.
energy: about stable - increase genome doesnt change energy much
what are homologous genes
from same ancestor
diff btw paralog and ortholog
paralog: gene duplication: gene stays within same species or organism (a and b-heme)
- 1 gene retains function, other may vary/pseudogene.
define synteny
similar gene order
- more closely related = more syntenous