E1: Descriptive Statistics Flashcards

(38 cards)

1
Q

Descriptive Statistics

A

Summarize raw data or simplify our understanding of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What to keep in mind with descriptive stats?

A
  1. What do we want to know about our data?
  2. What is the level of measurement?
  3. What patterns can be seen in the data?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Central Tendency

A

average or typical case in the given data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Dispersion

A

How spread out or concentrated the cases are around one or more values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Nominal Central Tendency

A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Mode

A

Most common value of variable (can have more than 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Nominal Dispersion

A

evaluated using percentages.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Ordinal Central Tendency

A

Mode or Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Median

A

Value of variable at 50% mark.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Ordinal Dispersion

A

Determined by how values are distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Interval Central Tendency

A

Mathematical Mean or Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Mathematical Mean

A

Mean = sum of individual cases / total # of cases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How to determine mean or median for interval data?

A

MEAN is vulnerable to outliers & skewness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Skewness

A

data trends on 1 side of mean rather than even distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Nonskew

A

even distribution on either side of mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Positive Skew

A

Data slopes TOWARD 0

17
Q

Negative Skew

A

Data slopes AWAY FROM o

18
Q

Standard Deviation

A

Average dispersion point of how far each data point is from the mean

19
Q

Normal Distribution

A

Roughly equal # cases fall on both sides of mean. (bell curve)

20
Q

How to determine Standard Deviation?

A
  1. determine Mean
  2. Calculate individual deviations (data - mean)
  3. Square each deviation
  4. Sum of squares
  5. Calculate Variance (divide by (n-1)
  6. Calculate square root
21
Q

Index

A

Creating new variable from a variety of different sources of information

22
Q

Validity

A

Does conceptual definition match data you are measuring?

23
Q

Reliability

A

Is how we are measuring consistent? Will all researchers come to same measurement?

24
Q

Types of Validity

A
  1. Face Validity
  2. Construct Validity
  3. External Validity
25
Face Validity
Does it make sense at first glance? Does it really measure what we want?
26
Construct Validity
Is the variable functioning as we expect?
27
External Validity
Is the way the data was collected allow it to be generalized to the broader population?
28
How can external validity be ensured in some surveys?
oversampling
29
Types of Reliability
1. Split-half 2. Test-retest 3. Cronbach's Alpha 4. Intercoder
30
Split-Half Reliability
Half of sample get asked 1 version of question, other half get other version.
31
Test-Retest Reliability
Repeated testing should show consistency
32
Cronbach's Alpha Reliability
Mathematical evaluation for interval data
33
Intercoder Reliability
For subjective data. Researchers trained to code data a certain way. Reliable if different researchers code same data the same way.
34
Additive Index
Easiest index to create. Variables that are measured same & of equal importance are added together to create new variable.
35
Importance of recoding data`
1. Simplify/consolidate values to create ordinal scale 2. Fix codes so that values across a set of variables match 3. Create dummy variables to transform nominal data into something else. 4. Visual binning to consolidate interval data into equal sized groups
36
Why run a frequency table before recoding?
To decide how to recode. To know what to expect after collapsing data.
37
Mean-Centered Data
creating a scale for variables where 0=mean
38
Why create Mean-Centered Data?
Converts confusing data points into more familiar data that is easier to read.