Sequence and grammar
Sequences:
Grammar; adds hierarchical structure
We have been doing data-driven learning by counting observations in what contexts?
Vector space models
Classification
Supervised learning from labeled training data.
Given data annotated with predefinded class labels, learn to predict membership for new/unseen objects.
Cluster analysis
Unsupervised learning from unlabeled data.
Automatically forming groups of similar objects.
No predefined classes; we only specify the similarity measure.
Issues with categorization tasks
Examples of vector space classifiers
Rocchio vs. kNN
Differences between Rocchio and kNN
Evaluation of classifiers
Types of clustering
Flat clustering algorithm
Agglomerative clustering
Divisive clustering
Structured Probabilistic Models
What have we been modelling with Structured Probabilistic Models?
Linear
Hierarchical
Structured Probabilistic Models: Types
Linear
Hierarchical
n-gram language models
n-gram language models

Hidden Markov Model

PCFGs
(Probabilistic) Context-Free Grammars

Maximum Likelihood Estimation

HMMs can be used to:
CFGs can be used to:
n-gram models can be used to
n-gram models can be used to calculate the probability of a string
When to use the Viterbi algorithm?