Module 10 - Probability and Statistics Flashcards

Question

What does it indicate if the **mean** is greater than the **median**?

Answer 1

The distribution is skewed right ## Footnote This means the tail of the distribution stretches to the right.

Answer 2

The most frequently occurring value ## Footnote The mode is the least used measure of central tendency.

Answer 3

Var(X)=E((X−μ)²)=∑(x_{i}−μ)²p(x_{i}) ## Footnote Variance measures how spread out the data is around the mean.

Answer 4

CV=σ/μ ## Footnote It expresses the standard deviation as a percent of the mean.

Answer 5

Symmetric about the mean and bell-shaped ## Footnote The normal distribution occurs in many natural phenomena.

Answer 6

* Mean (μ) * Standard deviation (σ) ## Footnote These parameters determine the shape and spread of the distribution.

Answer 7

X∼N(μ,σ²) ## Footnote This notation indicates that X follows a normal distribution with mean μ and variance σ².

Answer 8

X∼N(0,1) ## Footnote It has a mean of 0 and a standard deviation of 1.

Answer 9

Mean: 0 Standard Deviation: 1 ## Footnote A standard normal distribution is denoted as X∼N(0,1).

Answer 10

Subtract the mean and divide by the standard deviation ## Footnote This transformation helps analysts understand how many standard deviations a value is from the mean.

Answer 11

68.3% ## Footnote This is part of the empirical rule for normal distributions.

Answer 12

95.5% ## Footnote This indicates that 4.5% of the data falls outside of two standard deviations.

Answer 13

99.7% ## Footnote This means only 0.3% of the data falls outside of three standard deviations.

Answer 14

The sum of a large number of independent, identically distributed random variables approaches a normal distribution ## Footnote Generally true by n=30.

Answer 15

It approaches a normal distribution ## Footnote The t-distribution is used for small sample sizes.

Answer 16

Mean: 0 Variance: n/(n-2) ## Footnote The t-distribution is used in many statistical tests.

Answer 17

William Sealy Gossett, pseudonym: 'Student' ## Footnote He worked for the Guinness brewery.

Answer 18

Formed by raising e to the power of a normal random variable ## Footnote The mean is calculated as e^(μ + σ²/2).

Answer 19

x ∈ [0, ∞) ## Footnote This means it only takes positive real numbers.

Answer 20

Two chi-squared random variables ## Footnote It has parameters degrees of freedom m and n.

Answer 21

n/(n-2) where n > 2 ## Footnote This is important for statistical analysis.

Answer 22

2n²(m+n-2)/(m(n-2)²(n-4)) where n > 4 ## Footnote This is used in various statistical tests.

Answer 23

F = \frac{2n^{2}(m+n-2)}{m(n-2)^{2}(n-4)} ## Footnote The F-distribution is useful in hypothesis testing and regression analysis.

Answer 24

Normal distribution ## Footnote The F-distribution approaches the normal distribution under certain conditions.

Answer 25

* Hypothesis testing * Regression analysis ## Footnote The F-test denotes whether the regression model is statistically significant.

Answer 26

\frac{a+b+c}{3} ## Footnote The triangular distribution has three parameters: minimum (a), maximum (b), and mode (c).

Answer 27

\frac{a^{2} + b^{2} + c^{2} - ab - ac - bc}{18} ## Footnote Triangular distributions are often used in risk analysis.

Answer 28

Two outcomes ## Footnote The outcomes are typically represented as one (success) and zero (failure).

Answer 29

p ## Footnote The variance of a Bernoulli distribution is pq, where q = 1 - p.

Answer 30

Binomial distribution ## Footnote The Binomial distribution is denoted as Binomial(n, p).

Answer 31

\text{Cov}(X,Y) = E[(X-\mu_{x})(Y-\mu_{y})] = E[XY] - \mu_{x} \mu_{y} ## Footnote Covariance indicates how two variates fluctuate together.

Answer 32

TRUE ## Footnote Correlation is calculated by dividing the covariance of X and Y by the product of their standard deviations.

Answer 33

H_{0} ## Footnote The null hypothesis is assumed to be true unless proven otherwise.

Answer 34

t-test ## Footnote The null hypothesis for the t-test states that the means are the same.

Answer 35

F-test ## Footnote The null hypothesis for the F-test states that the standard deviations are the same.

Answer 36

0, 1 ## Footnote The standard normal distribution is a specific case of the normal distribution.

Answer 37

H₀: f(x) = g(x) ## Footnote This hypothesis states that the distributions of the two populations are the same.

Answer 38

H₁: f(x) ≠ g(x) ## Footnote This hypothesis states that the distributions of the two populations are different.

Answer 39

TRUE ## Footnote For example, testing if one population mean is greater than another.

Answer 40

α ## Footnote It represents the probability of rejecting the null hypothesis when it is true.

Answer 41

Rejecting H₀ when it is true ## Footnote This error occurs when a test incorrectly indicates a significant effect.

Answer 42

Accepting H₀ when it is false ## Footnote This error occurs when a test fails to detect a true effect.

Answer 43

A function of the sample data ## Footnote It is calculated under the assumption that the null hypothesis is true.

Answer 44

To determine whether to reject H₀ ## Footnote It is the threshold that the test statistic must exceed to reject the null hypothesis.

Answer 45

* H₀: μ₁ = μ₂ * H₁: μ₁ ≠ μ₂ ## Footnote This test checks for any difference without specifying a direction.

Answer 46

0.05 ## Footnote This level indicates a 5% chance of committing a Type I error.

Answer 47

Stronger evidence needed to reject H₀ ## Footnote It reduces the probability of committing a Type I error.

Answer 48

1 - β ## Footnote It is the probability of correctly rejecting a false null hypothesis.

Answer 49

Sₚ² = ((n-1)S₁² + (m-1)S₂²) / (n + m - 2) ## Footnote This formula combines the variances of two samples.

Answer 50

14 percentage points lower ## Footnote The data shows an average overrun of 19% for DoD programs and 33% for NAVAIR programs.

Answer 51

Choose the significance level α ## Footnote This choice should be made prior to testing and reported in results.

Answer 52

T-test ## Footnote This statistic is used to determine if there is a significant difference between the means of two groups.

Answer 53

T = (X̄ - Ȳ - (μX - μY)) / (S_P * √(1/n + 1/m)) ## Footnote This formula is used to determine whether to reject or accept the null hypothesis in hypothesis testing.

Answer 54

It compares the means of two groups ## Footnote The T-test is particularly useful for assessing whether the means of two populations are statistically different from each other.

Answer 55

n + m - 2 ## Footnote This calculation is essential for determining the critical values in hypothesis testing.

Answer 56

TRUE ## Footnote While testing for unequal means, the T-test assumes equal variances unless otherwise specified.

Answer 57

+/- 2.02 ## Footnote Since the test statistic is between -2.02 and 2.02, it fails to reject the null hypothesis.

Answer 58

The smallest α for which one could reject the null hypothesis ## Footnote The p-value indicates the probability of observing the test results under the null hypothesis.

Answer 59

0.194 ## Footnote This indicates a 19.4% probability of incorrectly rejecting the null hypothesis if it were true.

Answer 60

(1 - α) * 100% confidence that the true parameter value is contained within the calculated range ## Footnote CIs provide a range of values that likely contain the true population parameter.

Answer 61

α/2 chance ## Footnote For a 95% CI, there is a 2.5% chance of being too low or too high.

Answer 62

(X̄ - t(α/2, n-1) * (s/√n), μ, X̄ + t(α/2, n-1) * (s/√n)) ## Footnote This formula uses the sample mean, standard deviation, and sample size to estimate the CI.

Answer 63

(1.03, μ, 1.35) ## Footnote This interval suggests that the true mean DoD CGF is between 1.03 and 1.35 with 95% certainty.

Answer 64

Less than 95% certain that the two means are different ## Footnote This indicates that the difference in means is not statistically significant.

Answer 65

Checks if the mean of a population group is equal to a particular constant ## Footnote Null hypothesis: mean equals the constant; alternative hypothesis: mean does not equal the constant.

Answer 66

* H0: μ = μ0 * H1: μ ≠ μ0 (or μ > μ0 or μ < μ0) ## Footnote H0 is the null hypothesis and H1 is the alternative hypothesis.

Answer 67

t-distribution with n - 1 degrees of freedom ## Footnote n is the sample size.

Answer 68

T = (X̄ - μ0) / (s / √n) ## Footnote X̄ is the sample mean, μ0 is the population mean, s is the sample standard deviation, and n is the sample size.

Answer 69

Checks if the means of two population groups are equal ## Footnote Null hypothesis: means are equal; alternative hypothesis: means are not equal.

Answer 70

* H0: μx = μy * H1: μx ≠ μy (or μx > μy or μx < μy) ## Footnote H0 is the null hypothesis and H1 is the alternative hypothesis.

Answer 71

t-distribution with (n + m - 2) degrees of freedom ## Footnote n and m are the sample sizes of the two groups.

Answer 72

T = (X̄ - Ȳ - (μX - μY)) / (Sp√(1/n + 1/m)) ## Footnote Sp is the pooled standard deviation.

Answer 73

To test if the coefficients in the regression equation are statistically significantly different from zero ## Footnote Null hypothesis: bi = 0; alternative hypothesis: bi ≠ 0.

Answer 74

T = Estimated Coefficient / Standard Error ## Footnote T follows a t-distribution with n - k degrees of freedom.

Answer 75

Goodness of fit ## Footnote It can also be used to test for variance.

Answer 76

* H0: σ² = σ0² * H1: σ² ≠ σ0² (or σ² < σ0² or σ² > σ0²) ## Footnote H0 is the null hypothesis and H1 is the alternative hypothesis.

Answer 77

T = (n - 1)s² / σ0² ## Footnote s is the sample standard deviation and σ0² is the predetermined variance.

Answer 78

To check if the variance of one group is equal to the variance of another group ## Footnote Assumes both groups are normally distributed.

Answer 79

* H0: σx² = σy² * H1: σx² ≠ σy² (or σx² > σy² or σx² < σy²) ## Footnote H0 is the null hypothesis and H1 is the alternative hypothesis.

Answer 80

T = s1² / s2² ## Footnote s1² and s2² are the sample variances of the two groups.

Answer 81

H₀: b₁ = b₂ = b₃ ... = bₖ = 0 ## Footnote This hypothesis states that all coefficients in the regression equation are equal to zero.

Answer 82

H₁: at least one bᵢ ≠ 0 ## Footnote This indicates that at least one of the coefficients in the regression model is not equal to zero.

Answer 83

MSR / MSE ## Footnote This value is derived using ANOVA.

Answer 84

To determine whether sample data is representative of a distribution ## Footnote The K-S test applies only to continuous distributions.

Answer 85

* H₀: the data follow a specified distribution * H₁: the data do not follow a specified distribution ## Footnote These hypotheses help assess the fit of the sample data to the theoretical distribution.

Answer 86

D = max₁≤i≤n (F(Yᵢ) - (i-1)/n, (i/n) - F(Yᵢ)) ## Footnote This statistic measures the maximum difference between the empirical and theoretical cumulative distributions.

Answer 87

(x̄ - tₐ/₂,n₋₁ * (s/√n), μ, x̄ + tₐ/₂,n₋₁ * (s/√n)) ## Footnote This formula provides a range within which the true mean is expected to lie.

Answer 88

(x̄ - ȳ - tₐ/₂,n+m-₂ * Sₚ√(1/n + 1/m), x̄ - ȳ, x̄ - ȳ + tₐ/₂,n+m-₂ * Sₚ√(1/n + 1/m)) ## Footnote This CI estimates the range for the difference between two population means.

Answer 89

* Descriptive statistics * Inferential statistics ## Footnote Descriptive statistics summarize data, while inferential statistics draw conclusions about populations from samples.

Answer 90

* Mean * Median * Mode ## Footnote These measures provide insights into the average or typical values in a dataset.

Answer 91

* Variance * Standard deviation * Coefficient of variation ## Footnote These measures indicate the spread or variability of data points in a dataset.

Module 10 - Probability and Statistics Flashcards

(115 cards)