What does the conditional standard deviation measure?
The variability of y values for all observations with the same X-value
What do we assume about sigma hat when estimating it?
that the
standard deviation σ of the conditional distribution of Y is
identical at the various values of X
For the line to be held as the expectation of y, what assumptions do we need?
“the
standard deviation σ of the conditional distribution of Y is
identical at the various values of X” is this usually true?
No, we are typically violating this
Homoschidastic form
standard deviation σ of the conditional distribution of Y is
identical at the various values of X
Conditional distributions meaning
sets of incomes subsets of y conditional on given level of x
“each conditional distribution is normal” assumption - what else does that entail?
most of the values clustered around the mean, the mean for each of those conditions being the point on the line
“each conditional distribution is normal” does this assumption usually hold?
No
Why might we want to standardize coefficients with the Pearsan correlation?
The slope of the prediction equation depends on the units of measurement
Does slope of prediction equation say anything about whether association is strong or weak?
no - since we can make b as large or as small as we like by an
appropriate choice of units
Two methods to say how confident we are making predictions with our line, since the y-intercept might not tell us much?
Method 1 for gauging the line’s predictive power
take the average of y, then see the error (how much off is each of those points off from the average); square and add up the differnces to get E1, total error related to method 1/. Uses the mean.
Method 2 for gauging the line’s predictive power
use ordinary least square equation anad find line of best fit, then find error associated with that. How much squared error is there with linear regression/BEST FITTING line.
What does the proportional reduction in error (PRE) measure?
Another measure of association between two quantitative
variables
Four elements of proportional reduction in error
Basic meaning of “proportional” reduction in error?
how much less, reduced proportional error does method two have from method one.
Two parts about judging the relatedness
The strength of association between an explanatory variable X
and a response variable Y is judged by the goodness of X as a predictor of Y
If one can predict Y much better by substituting X-values into the
prediction equation than without knowing the X-values, the
variables are strongly related