Statistics Flashcards

(25 cards)

1
Q

Name Frequency distributions

A

Frequency Counts and Percentages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Name measures of Central Tendency

A

Mean
Median
Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Name measures of Variability

A

Range
Variance
Standard Deviation
Interquartile Range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

___ tally the number of times each category appears, offering a straightforward way to identify the most and least common categories

A

Frequency counts (n) tally the number of times each category appears, offering a straightforward way to identify the most and least common categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

__ represent frequency counts as a part of the whole and can help in comparing the relative sizes of different categories, making it easier to interpret the data across groups of varying sizes.

A

Percentages (%) represent frequency counts as a part of the whole and can help in comparing the relative sizes of different categories, making it easier to interpret the data across groups of varying sizes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the purpose of frequency distributions?

A

Purpose: Identify initial patterns, imbalances, and the most/least common responses in a dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Average, Sum of all numbers ➗# of numbers (skewed by outlier)

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Middle. Rank order & middle number (not skewed by outlier)

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Most common & used with nominal data set (not skewed by outlier)

A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Interpretive relationships:

Mean < Median < Mode = __ skewed
Mode < Median < Mean = __ skewed
Mean = Median = Mode →___ skewed

A

Mean < Median < Mode = negatively skewed (right)
Mode < Median < Mean = positively skewed (left)
Mean = Median = Mode → symmetric (normal) distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Purpose of measures of central tendency

A

Measures of Central Tendency: typical value in the dataset/interpreting the dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Purpose of Measures of Vairability

A

Measures of Variability: describe overall spread & reliability of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Difference between the highest and lowest value; Max - Min =

A

Range: Difference between the highest and lowest value; Max - Min = Range.
Greater Range = wider spread.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

___ average of the squared differences from the mean ↑= more spread out

A

Variance: average of the squared differences from the mean ↑= more spread out

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

__ Square root of the variance, providing a measure of dispersion that is in the same units as the data. average distance of each data point from the mean.

A

Standard Deviation: Square root of the variance, providing a measure of dispersion that is in the same units as the data. average distance of each data point from the mean.
* Larger SD = Greater variability
* Smaller SD = data is more tightly clustered
* Mean ↔ Standard Deviation
Variance = 250 √250 = 15.8 = SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

___ spread of the middle half of a dataset, range between the 25th and 75th percentile. Median ↔___

A

Interquartile Range: spread of the middle half of a dataset, range between the 25th and 75th percentile. Median ↔ IQR
* variability of the data around the median, giving us a clearer picture of where most of the data lies.
* Ex: 60,70,80,90,100 = 70 - 90 = IQR

17
Q

____ make predictions or inferences about a larger population based on sample data. Draw conclusions & make predictions

A

Inferential Statistics: make predictions or inferences about a larger population based on sample data. Draw conclusions & make predictions

18
Q

The Normal Distribution

A

Bell shaped curve, symmetrical distribution aka unimodal
Mean = Median = Mode are all equal
* Must have Normal distribution to use empirical rule

19
Q

___ describes how data are distributed in a normal distribution

A

The Empirical Rule: describes how data are distributed in a normal distribution

20
Q

What is the Emprical rule

A

The Empirical Rule: describes how data are distributed in a normal distribution
Normal distribution only
When a distribution is normally distributed, the Empirical Rule states that:
* 68% of the observations fall within ± 1 standard deviation of the mean
* 95% fall within ± 2 standard deviation of the mean
* 99.7% fall within ± 3 standard deviation of the mean

Allows researchers to estimate how typical or unusual a value is
Helps identify outliers or abnormal values

21
Q

___ as sample size increases, the distribution of sample means becomes normally distributed

A

Central Limit Theorem: as sample size increases, the distribution of sample means becomes normally distributed
* If you take enough averages from a group, those averages will form a bell-shaped curve (normal distribution), even if the original group wasn’t bell-shaped.

22
Q

Key Points

1) The mean of the sampling distribution is the mean of the _____.
2) The standard deviation of a sampling distribution is called the ___ ____ ___ ___ ___ .
3) The sampling distribution follows a ___ distribution, allowing use of the __ ___

A

1) The mean of the sampling distribution is the mean of the population.
2) The standard deviation of a sampling distribution is called the standard error of the mean.
3) The sampling distribution follows a normal distribution, allowing use of the empirical rule

23
Q

What is confidence interval? 

A

Confidence interval provides a range of values that is believed with a stated level of confidence to contain the true population parameter (most often the population mean)

25
True or false A 95% confidence interval means there is a 95% probability that the population mean is inside one specific interval. 
**FALSE** A 95% confidence interval gives us a range of values that we believe with considerable certainty include the true population value. * It does not mean there is a 95% probability that the population mean is inside one specific interval.