The extent to which two sequences are the same
Identity
Lining up two or more sequences to search for the maximal regions of identity in order to assess the extent of biological relatedness of homology
Alignment
The relatedness of sequences
Similarity
A fixed set of commands in a computer program
Algorithm
A space introduced in alignment to compensate for insertions or deletions in one of the sequences being
compared
Gap
Similarity attributed to descent from a common ancestor
Homology
The sequence presented for comparison with all other sequences in a selected database.
Query
The genetic sequence database sponsored by the National Institutes of Health.
GenBank
describes the number of matches
to the query by chance when searching a database of a
particular size.
E- value (Expect value)
study on evolutionary relatedness among species by comparing homologies and differences in gene
sequences
Phylogenetics
BIOINFORMATICS
CREATIO OF DATABASES
Determine relationships among members of large data users
DEVELOPMENT OF ALGORITHMS AND STATISTICS
BRANCHES OF BIOINFORMATICS
BIOINFORMATICS APPLICATIONS
BIOINFORMATICS APPLICATIONS IN VARIOUS FIELDS
THREE EARLIEST DNA SEQUENCE AND PROTEIN DATABASES
PRIMARY DATABASES
Contain information that has been
process and derived from the raw data available in primary
database
SECONDARY DATABASES
SEQUENCE ALIGNMENT
To understand functional, structural, or
evolutionary relationships between the sequences
identify regions of similarity
TYPES OF SEQUENCE ALIGNMENT
compare more than two sequences
o MUSCLE
o MAFFT
o CLUSTAL Omega
Multiple
compare two sequences
o EMBOSS WATER
o BLAST
Pairwise