Clustering
Classification
Some examples of classification tasks
Different ways of representing classes
Exemplar-based classification
Centroid-based representation of classes

In our vector space model, objects are represented as ___, so a class will correspond to a collection of ____; a region.
Vector space classification is based on the the _____ hypothesis.
The contiguity hypothesis
Classification amounts to computing the _____.
Classification amounts to computing the boundaries in the space that separate the classes; the decision boundaries.
Both centroids and medoids represent a group by a ____.
Both centroids and medoids represent a group by a single prototype.
While a medoid is an actual member of the group, a centroid is an ____.
While a medoid is an actual member of the group, a centroid is an abstract prototype; an average.
Typicality can be defined by a member’s distance to ____.
Typicality can be defined by a member’s distance to the prototype.
The centroid could also be ____:
Let each member’s contribution to the average be determined by its average _____ to the other members of the group.
The centroid could also be distance weighted:
Let each member’s contribution to the average be determined by its average pairwise similarity to the other members of the group.
Hard classes
Soft classes
Rocchio classification AKA
The decision boundary of the Rocchio classifier
Problems with the Rocchio classifier
Classes that are not linearly seperable in a given feature space may become linearly separable when the features are ____.
Classes that are not linearly seperable in a given feature space may become linearly separable when the features are mapped to a higher-dimensional space.
kNN-classification
k Nearest Neighbor classification
Voronoi tessellation
“Softened” kNN-classification
A probabilistic version
Distance weighted vote

For kNN, test time is _____ in the size of the training set, but independent of _____.
For kNN, test time is linear in the size of the training set, but independent of the number of classes.