What do Descriptive Statistics do?
Summarise and describe the main features of a dataset
What are Descriptive Statistics used for?
-Identifying trends and patterns
-Detecting outliers
-Facilitates clinical decision-making
What are the 3 measures of central tendency?
-Mean
-Median
-Mode
Give the formula to calculate the mean
(∑xi) / n
xi - ith data item in the list of n data
What is the mean?
Average of all the data points within the data set
What is the median?
Middle value in an ordered dataset
How do you work out the median in an even number of data?
Average of the middle 2 numbers
What is the mode?
Most frequent value in the data set
Which is most influenced by outliers, median or mean?
Mean is most influenced by outliers, median is least influenced by outliers
Median is a better representation or summary of the data set than the mean is
What are the 3 measures of dispersion?
-Range
-Variance
-Standard Deviation
What is the range?
Difference between the highest and lowest value in the data set
What is the variance?
Average of the squared deviations from the mean
How do you calculate variance
(xi - mean)
(each data point - mean) and then square it, add all of the variances up and divide by the total number of points
Why should we square the deviations when calculating variance?
If we don’t square each deviation then the average will always be zero
What happens if we measure average of the squared deviations about a point other than the mean?
Why do we use the mean?
-Leads to greater variance
-Using the deviations about the mean will give us the lowest variance
How do you calculate Standard Deviation (SD)?
SD = √variance
Give the formula to calculate Standard Deviation (SD)
SD = √ [ ∑(xi - mean)^2 / n ]
What 2 types of variables can you have for Probability Density Functions (PDFs)?
-Discrete variable
-Continuous variable
What does a Probability Density Function (PDF) represent?
What does a Probability Density Function (PDF) describe?
A PDF represents the probability distribution of a continuous random variable
PDFs describe the probability of a variable being within a certain interval
Which part of a Probability Density Function (PDF) represents probability?
Area under the curve
Total area under the curve = 1
Will a Probability Density Function (PDF) ever by negative?
No, a PDF will never by negative (below the x-axis) as this would imply a negative probability
What does a PDF value at a single point (height) indicate?
Relative density of data around that point
Give some key principles of a Normal Distribution
-Type of PDF
-Bell shaped probability distribution
-Symmetric around the mean
-Mean = median = mode
-Determined by the mean and standard deviation
What can a Normal Distribution be useful for?
-Can define what is normal and help to identify deviations that might be pathological; usefulness in screening
-Statistical Inference
-Predictive Power (can help predict patient outcomes)
-Clinical Applications (screening and diagnosis, VA assessments)