Describe the relationship between a dependent and an independent variable?
dependent variable is dependent of the independent one.
In an experiment the subjects are mostly the dependent variables and the treatment or the interventions are the independent ones. If the independent variable changes the dependent changes as well.
Define the four levels of measurement!
Variables can be measured
categorial so their category is important .
nominal: means that there are only a few categories you can put them into. pie chart / bar chart, central tendency - none
ordinal: the order of measured scores is important. central tendency: median / mean, histogram
quantitative variables:
ratio: every scale with an absolute zero point
mean , median ( with outliers)
interval: scale with a meaningful interval in between the numbers. mean / median ( with outliers)
What makes a graph skewed to the left / right?
skewed to the left : The skew ( tail) is on the left . So the peak is rather on the right side
skewed to the right: the skew is on the right. Peak on the left side.
What attributes do the mode, median and mean have?
If you see a symmetric graph / histogram, the mode is always on the left the median in the middle and the mean on the right.
the mode is the least informative and the mean is the most informative.
The mean is sensitive to outliers and not as stable as the median.
What is the summary of 5?
The minimum, Q1, Median , Q3 and the Maximum.
How do you calculate the Median? P.30
M= n+1/ 2 –> Location of M than add the values of the score location to the formular–> M= x1+x2/ 2
How do you calculate the mean? P. 28
sum of the values of the score divided by n.
How do you calculate the quartiles ?
To calculate the quartile you first need to find the median and then take the median out of the two halfs you identified through the median.
Define the boxplot! P.34
a graph of the five number summary. Not a real graph . A drawn box that visualizes the median, the two quartiles and the minimum and maximum on a scale.
How do you calculate the interquartile range IQR ?
IQR= (Q3-Q1)
How do you identify outliers? P.36
Multiply the IQR with 1,5 and
+ to Q3–> everything above this value is an outlier
What`s the standard deviation and what the difference towards the variance? P.38
the standard deviation looks @ how far the scores from their mean. And gives an average of a value like this.
The variance is simply the step before the standard deviation. From the variance you may also read out the difference to the mean, but the numbers are larger. This makes it easier to spot outliers.
The standard deviation is closer to the actual numbers.
How is it possible to define the correlation between two variables?
negative correlation: one goes up the other goes down.
Whats a scatterplot?
What do I need to consider when figuring out a relationship between data?
to identify the relationship you need to :
Whats a log transformation?
What does smoothing have to do with a scatterplot?
What does the little r stand for? P 101
What is the formula for r?
r = 1/ (n-1) * Z ( (xi-mean of x)/ sx) ((yi- mean of y)/sy) )
Tell me something about the character of r ?
How can you put two categorical variables in a table? P 137
What the hell is a joint distribution? P. 138
the collection ( so not the sum - just all of the proportions written out) = the joint distribution
How do you call a single comparison of a variable in a two-way table?
marginal distribution - of the column variable, consist out of two values of whom the sum is 1
How are the values for a met requirement - row variable called?
conditional distribution , both values are in sum 1