Lecture 13 Flashcards

(18 cards)

1
Q

What does the conditional standard deviation measure?

A

The variability of y values for all observations with the same X-value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What do we assume about sigma hat when estimating it?

A

that the
standard deviation σ of the conditional distribution of Y is
identical at the various values of X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

For the line to be held as the expectation of y, what assumptions do we need?

A
  1. the
    standard deviation σ of the conditional distribution of Y is
    identical at the various values of X
  2. each conditional distribution is normal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

“the
standard deviation σ of the conditional distribution of Y is
identical at the various values of X” is this usually true?

A

No, we are typically violating this

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Homoschidastic form

A

standard deviation σ of the conditional distribution of Y is
identical at the various values of X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Conditional distributions meaning

A

sets of incomes subsets of y conditional on given level of x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

“each conditional distribution is normal” assumption - what else does that entail?

A

most of the values clustered around the mean, the mean for each of those conditions being the point on the line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

“each conditional distribution is normal” does this assumption usually hold?

A

No

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why might we want to standardize coefficients with the Pearsan correlation?

A

The slope of the prediction equation depends on the units of measurement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Does slope of prediction equation say anything about whether association is strong or weak?

A

no - since we can make b as large or as small as we like by an
appropriate choice of units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Two methods to say how confident we are making predictions with our line, since the y-intercept might not tell us much?

A
  1. Take average of y, then see the error
  2. use ordinary least squares equation and find line of best fit
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Method 1 for gauging the line’s predictive power

A

take the average of y, then see the error (how much off is each of those points off from the average); square and add up the differnces to get E1, total error related to method 1/. Uses the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Method 2 for gauging the line’s predictive power

A

use ordinary least square equation anad find line of best fit, then find error associated with that. How much squared error is there with linear regression/BEST FITTING line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the proportional reduction in error (PRE) measure?

A

Another measure of association between two quantitative
variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Four elements of proportional reduction in error

A
  1. Rule 1 for predicting Y without using X
  2. Rule 2 for predicting Y using information on X
  3. A summary measure of prediction error for each rule
    -E1 for errors by rule 1
    -E2 for errors by rule 2
  4. The difference in the number of errors with the two rules is
    E1 – E2. Converting…
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Basic meaning of “proportional” reduction in error?

A

how much less, reduced proportional error does method two have from method one.

17
Q

Two parts about judging the relatedness

A

The strength of association between an explanatory variable X
and a response variable Y is judged by the goodness of X as a predictor of Y

If one can predict Y much better by substituting X-values into the
prediction equation than without knowing the X-values, the
variables are strongly related