Uniprot
protein
EMBL
gene
ENSEMBL
genome
NCBI
bacteria
methods for sequence comparison
FASTA
speeding up alignments with hash tables, heuristic algorithm, usage of K-tuples to search for matching sequence patterns of K-tuple hits
BLAST
an algorithm for comparing primary biological sequence information, optimized for speed use.
blastn
compares your nucleotide sequence with database nucleotide sequence
blastp
compares your query protein sequence with database of protein sequence that were derived from cDNA of interest
blastx
first translates your sequence into amino acids in 6 reading frames then compares the protein sequences with protein databases
tblastn
compares your query protein sequence with the database after translating each nucleotide sequence into protein using all 6 reading frames
tblastx
translates both query nucleotide sequence & the database sequence in all 6 reading frames & then compares the protein sequence. looks for protein coding regions. Good choice- less noise
PROSITE
protein database. Its uses includes identifying possible functions of newly discovered proteins and analysis of known proteins for previously undetermined activity
what is PSI-BLAST
(position specific iterated BLAST)- iterative search using protein BLAST algorithm.
how is PSI-BLAST used
HMMR
software for working with sequence HMM (hidden markov models= generalization of protein models).
Pfam
protein family database. looks at domains & protein family definitions & HMM
MFFT & Clustal Omega
MSA program for amino acids or nucleotide sequence
programs for phylogenetic tree constructions
PDB (protein data bank)
s a repository for the three-dimensional structural data of large biological molecules, such as proteins and nucleic acids
PDB contents
x-ray structures
NMR structures
EM models
Models from predictions/modelling
3D structures (proteins) databases - hierarchal fold classification
SCOP (annotaed)
CATH (automated)
= both have classes, fold, superfamily, family
PROSITE & Pfram
sequence based classification of protein domains, families
ENZYME
enzyme nomenclature