Descriptive Statistics Flashcards

(67 cards)

1
Q

Statistics is a way of doing what?

A

summarizing/organizing and interpreting data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a statistic?

A

a number obtained from mathematical manipulation of the raw data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Descriptive statistics

A

describe characteristics of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

3 key goals of descriptive statistics

A

organize data into a usable form
describe key features
summarize data without making inferences about a larger population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

______ and _______ data are discrete

A

nominal
ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Grouping the nominal data is

A

counting the number/frequency of cases falling into each category

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Table conventions for nominal data

A

table must be clearly and fully labeled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Nominal data: f = ???

A

frequency of cases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Nominal data: n = ???

A

total number of cases or measurements in a study sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Nominal data: N = ???

A

number of cases in population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Visualization of discrete data: Discrete data often plots _________ distributions of ________ data as what?

A

frequency
qualitative
bar graphs or pie diagrams

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does a bar graph involve

A

plotting the frequency of each category and drawing a car

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Conventions in plotting bar graphs: y axis

A

used to plot frequencies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Conventions in plotting bar graphs: x axis

A

used to indicate the categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Conventions in plotting bar graphs: bars dont ___________ reflecting what?

A

touch each other
discontinuity of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Bar graphs can also be plotted as what instead of frequencies?

A

percent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Clustered bar graph

A

graphing experimental and control group on the same graph

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Visualization of discrete data: Pie diagram

A

% of each category as a piece of pie

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Organization and presentation of interval or ratio data: real numbers

A

can be processed mathematically

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Interval or ratio data often produce what?

A

continuous data (weight, length, time, IQ)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Step 1 of constructing a grouped frequency distribution

A

organize data into ordered array of score frequencies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Step 2 of constructing a grouped frequency distribution

A

find ranges of scores: highest - lowest +1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Step 3 of constructing a grouped frequency distribution

A

decide on width (i) of the class intervals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Step 4 of constructing a grouped frequency distribution

A

note limits of each class interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Step 5 of constructing a grouped frequency distribution
count frequency of scores in each class interval and tabulate
26
True or false: the final grouped data of frequency distributions is much easier to interpret than original raw data
true
27
Final grouped frequency distributions lose some what?
precision in the data
28
3 conventions for grouped frequency distributions
not too few or too many groups/classes equally sized class intervals midpoint of class interval represents the class interval
29
If there are too many groups/classes in grouped frequency distributions what happens
it is difficult to inspect
30
If there are too few groups/classes in grouped frequency distributions what happens
meaning of the varied data is lost
31
Qualitative data of frequency distributions are often plotted as
histograms or frequency polygons
32
Histograms
like bar graphs but bars touch each other to reflect continuity of data
33
In histograms what to the midpoint and width of the bar correspond to
the midpoint and width of the class interval
34
Frequency polygons
same as histogram (bars touch each other) but plot a line through midpoint of each class interval, at a height representing the frequency of the scores
35
What do frequency polygons allow the reader to do?
interpolate/estimate the frequency of values in between those actually measured or graphed
36
Frequency polygons can take on what 3 shapes
bell shaped - normal distribution negatively skewed - mean to left of normal positively skewed - mean to right of normal
37
After summarizing data in frequency distribution it is often useful to do what?
compare relative frequencies of different categories
38
Comparing relative frequencies of different categories is useful for
understanding comparative trends in the data
39
Comparing relative frequencies of different categories can be used on what?
any scale
40
Calculation of statistics allow what?
"crunching" raw data into single numbers that summarize the data
41
Ratios
relative frequency of one set of frequencies to another
42
Proportion
frequency of one category relative to total sample or population
43
Rates in epidemiology are useful for what?
representing level at which a disease is present in a given population
44
Incidence rate =
(number of new cases/total population at risk) x base (ex. 1,000 or 100,000)
45
Prevalence rate =
(number of existing cases/total population at risk) x base
46
2 useful statistics for representing sample data
measures of central tendency measures of dispersion/variability
47
Measures of central tendency
statistics or numbers expressing most typical or representative scores in a data distribution
48
Measures of dispersion/variability
statistics representing the extent to which scores are dispersed or spread out numerically
49
3 measures of central tendency
mean median mode
50
What is the mean and what is it used for
average of the data for interval or ratio data
51
What is the median and what is it used for
score that divides the distribution in half for ordinal, interval, or ratio scale data
52
What is the mode and what is it used for
most frequently occurring score used for nominal data
53
The mode can also be calculated for what data?
continuous data
54
Sample mean
sum of all the data points in the sample/the number of data points in the sample
55
Population mean
sum of all data points in the population/the total number of data points in the population
56
Formula to calculate median
(n + 1)/2 and count to that number in the ordered array
57
If n is odd the median is
the middle score
58
If n is even the median is
midway between (average of) two middle scores
59
For skewed continuous data ________ may be more suitable than ________ for representing the "________" score
median mean typical
60
Three statistics for dispersal are
range variance standard deviation
61
Range
difference between highest and lowest scores
62
Range is easy to calculate but
dependent on only two extreme scores, not typical dispersal
63
Average deviation is about the _______
mean
64
Average deviation would be a convenient measure of __________
variability
65
When you solve a problem with squaring the deviations this is called what?
sum of squares
66
When calculating the standard deviation, the larger the standard deviation the
more spread out the scores about the mean
67