What is an RCT?
What is their benefit?
A randomized controlled trial (RCT) is an intervention study where subjects are randomly allocated to treatment options.
Randomized controlled trials (RCTs) are the accepted ‘gold standard’ of individual research studies.
They provide sound evidence about treatment efficacy which is only bettered when several RCTs are pooled in a meta-analysis.
What is the benefit of randomizing?
What is an observational study?
In observational studies the subjects receive no additional intervention beyond what would normally constitute usual care.
Subjects are therefore observed in their natural state.
What is a case-control study?
This study investigates causes of disease, or factors associated with a condition.
It starts with the disease (or condition) of interest and selects patients with that disease for inclusion, the ‘cases’.
A comparison group without the disease is then selected, ‘controls’, and cases and controls are compared to identify possible causal factors.
Case-control studies are usually retrospective in that the data relating to risk factors are collected after the disease has been identified.
What are some limitations of a case-control study?
The choice of control group affects the comparisons between cases and controls.
Exposure to risk factor data is usually collected retrospectively and may be incomplete, inaccurate or biased.
What is a cohort study?
A cohort study is an observational study that aims to investigate causes of disease or factors related to a condition but, unlike a case-control study, it is longitudinal and starts with an unselected group of individuals who are followed up for a set period of time.
Cohort studies are sometimes used to confirm the findings of case-control studies such as happened when Doll and Hill observed a relationship between smoking and lung cancer in a case-control study and subsequently established the longitudinal study of doctors in the UK.
What are some limitations with a cohort study?
What is a cross-sectional study?
In a cross-sectional study a sample is chosen and data on each individual is collected at one point in time.
Note that this may not be exactly the same time point for each subject – for example a survey of primary care consultations may be conducted over a week - each patient will fill in the survey once but different subjects will fill out their survey on different days depending on when they came to the surgery.
When would you use a cross-sectional study?
Why would we summarise data?
What is the definition of quantitative data, and what are the two types?
Quantitative data are data which can be measured numerically and may be continuous or discrete:
What is ordinal data?
The data values can be arranged in a numerical order from the smallest to the largest.
What is categorical data?
Categorical data are data where individuals fall into a number of separate categories or classes.
What is dichotomous data?
This is where there are only 2 classes and all individuals fall into one or other of the classes.
These data are also known as binary data.
What is the problem with dichotomizing data?
Dichotomizing (re-categorizing data into two groups) is potentially very problematic because a great deal of information is discarded and statistical power is lost in the analysis.
In addition, the nature of any relationships may be masked.
For example, if the relationship was curved, this may be weaker if the data were categorized and if the relationship was U-shaped, categorization may totally obscure it
List the common statistical measures of center of data.
- median
List the common statistical measures of variability of data.
How would you calculate the mean?
This is the simple average of all the data: the sum of all values divided by the total number of values. This mean is known as the arithmetic mean.
How do you calculate the median?
This is the middle value when the data are arranged in ascending order of size.
If there are an odd number of values in the sample then the median will be the value with the same number of values both bigger than it and smaller than it. If there is an even number of values, there will be two middle values and the median will be the mean of the two.
How do you calculate the standard deviation, and what does it indicate?
It indicates how dispersed the data are and is a measure of the average difference between the mean and each data value.
It is calculated by taking the square root of the variance.
The variance is calculated by summing the squared differences between the overall mean and each value and then dividing by the number of values minus one.
What is the advantage of standard deviation over the variance?
The advantage of the standard deviation over the variance is that it is in the same units as the original data and so is easier to interpret.
How do you calculate the range?
This is the difference between the smallest and largest value and is usually expressed as the minimum and maximum.
How do you calculate the interquartile range?
This is the range of values that includes the middle 50% of values and is bounded by the lower and upper quartile.
The lower quartile is found by ranking the data as for the median and then taking the value below which 25% of the data sit.
The upper quartile is the value above which the top 25% of data points sit.
Describe how different centers of distributions of quantitative data help in choosing summary measure.