What is factor analysis primarily used for?
Data reduction and summarisation.
Why do we need to reduce/summarise data using factor analysis?
Factor analysis allows us to ________________
Factor analysis allows to narrow down a large number of specific observable (and often highly correlated) items to a few composite underlying factors
What are the two types of “Factor analyses” in the module?
What is the difference between “exploratory factor analysis” and “confirmatory factor analysis”, and which do we use?
Exploratory = NO prior knowledge on how variables are related to each other
Confirmatory = there IS this knowledge..
== We use EXPLORATORY in this module
From an “Exploratory Factor Analysis”, which are the two relevant sub-analyses?
What is the difference between “Principal Component Analysis” and “Common Factor Analysis”?
BOTH are sub-analyses for “Exploratory Factor Analysis”..
BUT:
- PCA: considers total information (variance) in data
- CFA: considers only common variance of data
Show the diagram of an “Exploratory Factor Analysis”?
What is the goal of “Exploratory Factor Analysis”?
The goal of factor analysis is to group the variables that are most correlated to each other
== To divide variables into groups
The aim of EFA is to separate the variables that __________
The aim of EFA is to separate the variables that correlate highly from those that correlate less strongly
What is the difference between correlates within groups VS between group (EFA)
Within groups, variables should be as highly correlated as possible
… between groups, as low as possible!
What is the factor in “Exploratory Factor Analysis”?
The factor can be seen as an underlying/ hidden variable
-> that affects several observed variables (the ones we measure)
// Several variables are observable phenomena of less underlying factors
How does a Factor explain how observed variables are correlated?
Factor analysis groups/combines variables that are highly correlated with each other. It assumes those correlations happen because the variables share an underlying unmeasurable (latent) factor—a hidden construct that influences them (e.g., “anxiety,” “intelligence,” “satisfaction”).
–> THAT is the FACTOR
How do you determine what a FACTOR means/ what you should name it?
In factor analysis, you don’t “discover” a factor’s name from the math!
You discover a pattern of shared variance, and then humans decide what it means and what to call it.
The researcher/analyst ultimately names the factor.
What is an “EIGENVALUE”?
Total variance of all variables accounted for by ONE factor
What is “FACTOR LOADINGS”?
Correlations between the variables and factors (ranges from -1 to +1)
ie: How much does one variable load into a factor
What is “FACTOR SCORES”?
Relation between observations and factors
What is “COMMUNALITY”?
Proportion of one variable’s variance explained by all factors extracted
ie: How good one factors solution is, how much of the info in each variable is explained by all factors, (eg. 80% of the variable is being explained by each factor)
What is the OPPOSITE of “COMMUNALITY”?
EIGENVALUE
why?
They’re “opposites” in the sense that they answer two different directions of the same question:
- Communality: How much of this variable is explained by the factors?
- Eigenvalue: How much total variance is explained by this factor?
Show the PROCESS chain for an EFA