Factor and Cluste Analysis
Factor Analysis method
– classification of similar variables measuring the same in factors/groups
Cluster Analysis method
– classification of similar objects in groups
Factor analysis
What the factor laoding?
the correlation coeff ajs between factor and variable
Purpose of factor analysis
1-4
two types of factor analysis
1.
2.
Exploratory Factor Analysis
Confirmatory Factor Analysis
3 steps of exploratory factor analysis
1-3
Assumptions!
Assumptions:
Factor extraction method
1-2
Key differences
differences:
- ass Principal c.f.a.: all variance can be explained –> use: data reduction
- Common f.a.: aims at latent constructs
How many factors?
3 criterias
Kaiser criterion:
Eigenvalue
Communality
Factor loadings
Eigenvalue = sum of suqared factor loadings of one factor over all variables / the amount of variance explained by a factor
Communality = sum of squared factor laodings of one variable / the proportion of common variance present in a variable (rest is random variance)
Factor loadings = correlation of factor and variable
Factor distinct
Factor rotation
Kaiser-Meyer-Olkin criteria
Exploratory Factor Analysis - Corrleation matrix
is this data set suitable for factor analysis?
–> if 1, than the patterns are relatively compact and f.a. should yield distinct and reliable factors
Bartlett’s test
tells us whether our correlation matrix is significantly different from an identity matrix
–> used to test if k samples are from populations with equal variances.
Equal variances across populations is called homoscedasticity or homogeneity of variances.
What to do with the results of factor rotation?
–> study the … …
Factor score
index scale
summated scale
Difference between factor and index score?
index score: add the values up
factor score: add them up with a weight accord. to their factor loading
Cluster Analysis
-
partly new algorithms exits in
– Social network analysis: usually uses connectivity-based logic
Steps in hierarchical cluster analysis
Wrap-up Cluster and Factor analysis
Factor analysis reduces the no. of variables
– allows to address issues of multicollinearity
– Key method to assess reliability and validity of constructs;
Cluster analysis reduces the number of observations
– More practical than research applications