Week 7 - Linear pattern Flashcards by Isabel Donohoe

What relationship do we observe between Price and Mileage?

There is a negative relationship. As mileage increases, price decreases.

What does a negative relationship mean in this context (price and mileage)?

Cars with higher mileage tend to be cheaper, and cars with lower mileage tend to be more expensive.

Why is it important to understand the Price–Mileage relationship?

Because it:

Explains how car price changes with mileage

Helps us predict the average price given a mileage

Allows us to quantify how strong the relationship is

Helps measure the effect of mileage on price

What is the dependent variable in this Price–Mileage analysis?

Price — it is the value we want to describe and predict.
(Plotted on the y-axis.)

What is the explanatory variable in this Price–Mileage analysis?

Mileage — it may help explain Price.
(Plotted on the x-axis.)

What is the correlation coefficient used to measure?

The strength and direction of a linear relationship between two variables.

What is the formal name of the correlation coefficient?

The Pearson correlation coefficient.

What is the formula for the Pearson correlation coefficient?

$r_{xy} = \frac{\operatorname{cov}(x, y)}{\sigma_x \sigma_y}$

What does covariance measure?

The joint variability of two variables — how they move together.

What is the formula for covariance?

$\operatorname{cov}(x, y) = \frac{1}{n} \sum_{j=1}^{n} (x_j - \bar{x})(y_j - \bar{y})$
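
As a minimal sketch of this formula, the snippet below computes the covariance by hand. The mileage and price figures are hypothetical values made up for illustration (thousands of km and thousands of dollars), not data from the course.

```python
# Hypothetical data: mileage in thousands of km, price in thousands of dollars.
mileage = [10, 20, 30, 40, 50]
price = [20, 18, 15, 13, 9]


def covariance(x, y):
    """Population covariance: the average product of deviations from the means."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    return sum((xj - x_bar) * (yj - y_bar) for xj, yj in zip(x, y)) / n


cov_xy = covariance(mileage, price)
print(cov_xy)  # -54.0, negative because price falls as mileage rises
```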

Why do we divide covariance by the product of standard deviations in the correlation formula?

To scale the covariance so that correlation:

Is unit-free

Always lies between –1 and +1
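
A small sketch of why the scaling matters, using hypothetical mileage and price values: converting mileage from kilometres to miles changes the covariance but leaves the correlation untouched. The helper names are illustrative only.

```python
import math

# Hypothetical data: mileage in thousands of km, price in thousands of dollars.
mileage_km = [10, 20, 30, 40, 50]
price = [20, 18, 15, 13, 9]
mileage_miles = [km * 0.621371 for km in mileage_km]  # same cars, different unit


def covariance(x, y):
    """Population covariance (divides by n)."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    return sum((xj - x_bar) * (yj - y_bar) for xj, yj in zip(x, y)) / n


def correlation(x, y):
    """Covariance divided by the product of the two standard deviations."""
    return covariance(x, y) / math.sqrt(covariance(x, x) * covariance(y, y))


cov_km = covariance(mileage_km, price)
cov_miles = covariance(mileage_miles, price)
r_km = correlation(mileage_km, price)
r_miles = correlation(mileage_miles, price)

print(cov_km, cov_miles)  # the covariance depends on the unit of measurement
print(round(r_km, 4), round(r_miles, 4))  # the correlation does not
```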

What does a positive covariance mean?

When x is above its mean, y tends to be above its mean too (move together).

What does a negative covariance mean?

When x is above its mean, y tends to be below its mean (move in opposite directions).

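Example: If $x_j - \bar{x}$ is positive and $y_j - \bar{y}$ is negative, what can we say?

Their product is negative, so this observation contributes a negative term to the covariance sum. If most observations behave this way, the relationship is negative.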

What are the formulas for the variances of x and y?

$\sigma_x^2 = \frac{1}{n} \sum_{j=1}^{n} (x_j - \bar{x})^2$
$\sigma_y^2 = \frac{1}{n} \sum_{j=1}^{n} (y_j - \bar{y})^2$
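
Putting the covariance and variance formulas together gives the correlation coefficient. The sketch below does this by hand on hypothetical mileage and price values (made up for illustration); the function names are not from the course material.

```python
import math

# Hypothetical data: mileage in thousands of km, price in thousands of dollars.
mileage = [10, 20, 30, 40, 50]
price = [20, 18, 15, 13, 9]


def mean(values):
    return sum(values) / len(values)


def covariance(x, y):
    """Population covariance (divides by n, as in the formulas above)."""
    x_bar, y_bar = mean(x), mean(y)
    return sum((xj - x_bar) * (yj - y_bar) for xj, yj in zip(x, y)) / len(x)


def variance(x):
    """Population variance: the covariance of a variable with itself."""
    return covariance(x, x)


var_x = variance(mileage)  # 200.0
var_y = variance(price)    # 14.8
r = covariance(mileage, price) / math.sqrt(var_x * var_y)

print(round(r, 4))  # -0.9925, a strong negative linear relationship
```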

What do r = –1, 0, and +1 mean?

–1 → perfect negative linear relationship

0 → no linear relationship

+1 → perfect positive linear relationship

What is the range of the correlation coefficient r?

r ranges from –1 to +1.

What does a larger value of |r| indicate?

A stronger linear relationship between the variables.

What does r=0 tell us?

There is no linear relationship (but other, non-linear patterns may still exist).
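
A quick illustration with made-up numbers: here y is completely determined by x (y = x²), yet the correlation is zero because the pattern is a symmetric curve, not a line.

```python
import math

# Hypothetical data: a perfect but non-linear (quadratic) relationship.
x = [-2, -1, 0, 1, 2]
y = [v ** 2 for v in x]  # [4, 1, 0, 1, 4]


def covariance(a, b):
    """Population covariance (divides by n)."""
    n = len(a)
    a_bar = sum(a) / n
    b_bar = sum(b) / n
    return sum((aj - a_bar) * (bj - b_bar) for aj, bj in zip(a, b)) / n


r = covariance(x, y) / math.sqrt(covariance(x, x) * covariance(y, y))

print(abs(r) < 1e-12)  # True: no linear relationship despite the exact pattern
```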

What does r=+1 or r=−1 indicate?

A perfect linear relationship (very rare in real data).

What does the sign of r tell us?

Positive sign → positive relationship

Negative sign → negative relationship

What does the value (size) of r NOT tell us?

It tells us nothing about the steepness (slope) of the relationship.
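
To see this concretely, the sketch below builds two hypothetical datasets with very different slopes (2 and 0.1). Both lie exactly on a straight line, so both have the same correlation.

```python
import math

# Hypothetical data: two exact straight lines with different steepness.
x = [1, 2, 3, 4, 5]
y_steep = [2 * v for v in x]      # slope 2
y_shallow = [0.1 * v for v in x]  # slope 0.1


def covariance(a, b):
    """Population covariance (divides by n)."""
    n = len(a)
    a_bar = sum(a) / n
    b_bar = sum(b) / n
    return sum((aj - a_bar) * (bj - b_bar) for aj, bj in zip(a, b)) / n


def correlation(a, b):
    return covariance(a, b) / math.sqrt(covariance(a, a) * covariance(b, b))


r_steep = correlation(x, y_steep)
r_shallow = correlation(x, y_shallow)

# Both values are 1 up to floating-point rounding, despite the 20-fold
# difference in slope.
print(round(r_steep, 6), round(r_shallow, 6))
```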

State the hypotheses for testing correlation.

Null hypothesis: $H_0: \rho_{x,y} = 0$ (no population correlation)

Alternative hypothesis: $H_1: \rho_{x,y} \neq 0$

What is the difference between $\rho_{x,y}$ and $r_{x,y}$?

$\rho_{x,y}$: population correlation coefficient
$r_{x,y}$: sample correlation coefficient

What does "Sig. (2-tailed)" mean in SPSS/outputs?

It is the p-value from a two-sided test of the null hypothesis of no population correlation. If p < α, the correlation is statistically significant.
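
For reference, the standard test statistic for a Pearson correlation is shown below. The flashcards do not state which statistic is used, so treat this as an assumption about the usual textbook test: with sample size n and sample correlation r, the statistic follows a t distribution with n − 2 degrees of freedom when the population correlation is zero, and the two-tailed p-value is the probability of a value at least as extreme as |t|.

```latex
t = \frac{r \sqrt{n - 2}}{\sqrt{1 - r^2}}, \qquad \text{df} = n - 2
```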

Can we calculate a correlation coefficient for a non-linear pattern?

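Yes, the formula can still be computed, but r only measures linear association, so interpreting it for a non-linear pattern leads to incorrect or misleading analysis.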

What does Anscombe’s Quartet demonstrate?

All four datasets have almost exactly the same correlation (r ≈ 0.816), but they show very different patterns. Conclusion: always plot your data before interpreting correlation.
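
The sketch below checks this using the published values of Anscombe's quartet (Anscombe, 1973), typed in by hand. The helper functions are illustrative and not part of the course material.

```python
import math

# Anscombe's quartet: sets I-III share the same x values, set IV has its own.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
x4 = [8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8]
quartet = {
    "I": (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": (x4, [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def covariance(a, b):
    """Population covariance (divides by n)."""
    n = len(a)
    a_bar = sum(a) / n
    b_bar = sum(b) / n
    return sum((aj - a_bar) * (bj - b_bar) for aj, bj in zip(a, b)) / n


def correlation(a, b):
    return covariance(a, b) / math.sqrt(covariance(a, a) * covariance(b, b))


correlations = {name: correlation(x, y) for name, (x, y) in quartet.items()}
for name, r in correlations.items():
    print(name, round(r, 3))  # all four are close to 0.816
```

Only a scatterplot of each set reveals the differences: a roughly linear cloud, a curve, a line with one outlier, and a vertical cluster with one extreme point.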

What important warning must we remember about correlation?

CORRELATION IS NOT CAUSATION.

What do scatterplots help us observe?

The shape and direction of the relationship (linear or non-linear).

What does the correlation coefficient measure?

The direction (positive/negative) and strength (weak/moderate/strong) of a linear relationship.

Does a low correlation mean no pattern exists?

No — there may still be a non-linear pattern.

After identifying a linear pattern, what is the next question we ask?

How much does Mileage affect Price?

What statistical tool lets us measure the effect of Mileage on Price?

Simple Linear Regression.
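
As a preview, and only as a sketch on hypothetical data, the least-squares slope of a simple linear regression can be computed from the same building blocks as the correlation: slope = cov(x, y) / variance of x, and intercept = ȳ − slope · x̄. The numbers below are made up for illustration.

```python
# Hypothetical data: mileage in thousands of km, price in thousands of dollars.
mileage = [10, 20, 30, 40, 50]
price = [20, 18, 15, 13, 9]


def mean(values):
    return sum(values) / len(values)


def covariance(x, y):
    """Population covariance (divides by n)."""
    x_bar, y_bar = mean(x), mean(y)
    return sum((xj - x_bar) * (yj - y_bar) for xj, yj in zip(x, y)) / len(x)


# Least-squares estimates for the line: price = intercept + slope * mileage
slope = covariance(mileage, price) / covariance(mileage, mileage)
intercept = mean(price) - slope * mean(mileage)

print(round(slope, 2))      # -0.27: each extra 1,000 km lowers price by about 0.27 thousand
print(round(intercept, 2))  # 23.1
```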

(33 cards)