Descriptive Statistics Flashcards

(60 cards)

1
Q

Branch of statistics that describes or summarizes data.

A

Descriptive Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Differentiate Exploratory Data Analysis (EDA) from Descriptive Statistics

A

Exploratory Data Analysis (EDA) helps you understand your data.

Descriptive statistics help you explain your data to others.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Ways of Describing Data

A

Frequency Distribution → shows values and how often they occur.

Bar Graph → for nominal/ordinal data.

Histogram → for interval/ratio data.

Frequency Polygon → plots points at class midpoints instead of bars.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A way of describing data that presents the score values and their frequency of occurrence.

A

Frequency Distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How frequency distributions of Nominal or Ordinal Data are customarily plotted

A

Bar Graph

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

used to represent frequency distributions composed of interval or ratio data using bars

A

Histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Used to represent interval or ratio data using a point that is plotted over the midpoint of each interval at a height corresponding to the frequency of the interval

A

Frequency Polygon

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

indicates the proportion of the total number of scores in each interval.

A

Relative Frequency Distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

indicates the number of scores that fall below the upper limit of each interval.

A

Cumulative Frequency Distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

indicates the percentage of scores that fall below the upper limit of each interval.

A

Cumulative Percentage Distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

f/N

A

Relative Frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

frequency of interval + frequencies of all class intervals below it.

A

Cumulative Frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

cumulative f / N × 100

A

Cumulative Percentage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

also known as the Gaussian Distribution

A

Normal Distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

symmetrical and bell shaped

curves outwards at the top and then inwards nearer the bottom, the tails getting thinner and thinner

A

Normal Distribution

Note: As long as the distribution is close to a normal distribution, it will not matter too much.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

a non-symmetrical distribution

A

skewed distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

the curve rises rapidly and then drops off slowly

A

Positive Skew

📌 In simpler terms:

The tail of the distribution is stretched out to the right (higher values).

Most of the scores are low, but a few very high scores pull the mean upward.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

the curve rises slowly and then decreases rapidly

A

Negative Skew

📌 In simpler terms:

The tail of the distribution is stretched out to the right (higher values).

Most of the scores are low, but a few very high scores pull the mean upward.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

occurs when there are either too many people at the extremes of the scale, or not enough people at the extremes.

A

Kurtosis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

when there are insufficient people in the tail (ends) of the scores to make the distribution normal.

A

Positive Kurtosis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

when there are too many people, too far away, in the tails of the distribution.

A

Negative Kurtosis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

small number of data points that lie outside the distribution when the distribution is approximately normal. Usually easily spotted in histograms.

A

Outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

the most central value of a data set with different interpretations of the sense of “central.”

A

Central Tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Measures of central tendency

A

Mean (x̄): sum of scores ÷ number of scores.

Median: middle score when ordered.

Mode: most frequent score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Measures of Dispersion
Range = highest – lowest. Interquartile Range (IQR) = Q3 – Q1. Variance = average of squared deviations. Standard Deviation (SD) = square root of variance.
26
∑x / N
Mean (arithmetic mean)
27
the middle score in a set of scores. Used when the mean is not valid, which might be because the data are not symmetrically or normally distributed, or because the data are measured in an ordinal level.
Median
28
the most frequent score in the distribution or the most common observation among a group of scores.
Mode
29
the simplest measure of dispersion. It is the distance between the highest score and the lowest score. Simple but distorted by outliers.
Range
30
measure of central tendency used with ordinal data or with non-normal distributions. Resistant to outliers, often used with median.
Inter-Quartile Range (IQR)
31
the distance between the upper and lower quartiles.
Inter-Quartile Range (IQR)
32
square root of variance.
Standard Deviation (SD)
33
average of squared deviations.
Variance
34
Standard Deviation (SD)
Tells how spread out data are relative to the mean.
35
Show median, quartiles, and outliers visually. Whiskers usually extend to 1.5 × IQR from the box.
Boxplots
36
37
describe/summarize the data a researcher has
Descriptive Statistics
38
helps a researcher understand the data that he has, while descriptive statistics help him explain to other people what is happening to his data
Exploratory data analysis
39
Different ways of Describing the Distribution
- Frequency Table - Charts (e.g., histograms, bar chart etc)
40
Highest frequency
Mode
41
used to present the pattern in the data
Charts
42
frequency distributions of Nominal or Ordinal Data are customarily plotted using a what?
Bar Graph
43
used to represent frequency distributions composed of interval or ratio data
Histogram
44
used to represent interval or ratio data.
Frequency polygon
45
46
presents the score values and their frequency of occurrence
Frequency distribution
47
indicates the proportion of the total number of scores in each interval.
Relative Frequency Distribution
48
indicates the number of scores that fall below the upper limit of each interval.
Cumulative Frequency Distribution
49
indicates the percentage of scores that fall below the upper limit of each interval.
Cumulative Percentage Distribution
50
f/N
Relative Frequency
51
frequency of interval + frequencies of all class intervals below it.
Cumulative Frequency
52
cum f / N x 100
Cumulative Percentage
53
54
Why does it matter if a distribution is normal or not?
There are mathematical equations that can be used to draw a normal distribution. These equations can be used in statistical tests. A lot of tests depend on the data being from a normal distribution. Many variables are normally distributed. It makes it easier to do inferential statistics accurately and without bias.
55
occurs when only few of the subjects are strong enough to get off the floor.
Floor Effect
56
causes negative skew and are much less common in Psychology
Ceiling Effect
57
Prerequisites for mean
1. Bell-shaped distribution 2. Continuous variable (ratio/interval)
58
the square of the Standard Deviation.
Variance
59
√ (Sum of ((x - x-bar)^2 / n))
Sample standard deviation
60
√ (Sum of ((x - x-bar)^2 / n-1))
Population standard deviation