What does P(X,Y) represent
The joint distribution that describes the joint statistics of 2 random variables
Define a joint cumulative distribution function
F(x,y) = P(X <= x, Y <= y)
What do the parameters of the univariate gaussian distribution control
μ controls the location of the peak
σ controls the width of the peak
Why is the euclidean distance not suitable for higher dimensions
It ignores covariance between features
Give the formula for the Mahalanobis distancce
Multiply inverse covariance matrix by (x-y)^T
Then dot with (x-y)
That answer then ^1/2
Describe the Mahalanobis distance in words
The distance between two vectors when we account for covariance
What are eigenvectors
The eigenvectors (e) of the covariance matrix describe the direction of the contours
What are eigenvalues
The eigenvalues (λ) describe the spread of the contours in these principal directions defined by the eigenvectors
What is the length of a contour of equal probability
The length will be proportional to the square root of the eigenvalue
How do you sketch the approximate contours for a guassian, given the mean column vector, the covariance matrix, the eigenvalues and the eigenvectors
The centre point is the mean’s co-ordinates
Draw ‘x and y’ arrows in the direction of the eigenvectors, starting at mean
Stretch the circle by the factors of the square root of the eigenvalues in their respective direction
Distance from mean to edge of ‘circle’ is the sqrt of eigenvalue in the direction
Sketch the contours for this guassian
Sketch the contours for this guassian
What are the 2 parameters of the multivariate guassian distribution
Mean vector and a covariance matrix
If there are L features, how many values do we have to estimate for the parameters
L values of the mean vector
L(L+1)/2 values for the covariance