What is the mathematical study of the future that measures the chance of an event called?
Probability
Probability calculates the likelihood that an event may occur and assesses which outcomes are possible.
What are the two main areas of statistics?
Descriptive statistics summarize outcomes, while inferential statistics test probabilistic models and draw conclusions about populations.
The mean in probability is calculated by summing all possible values of X multiplied by their probabilities. What is the formula?
E(X) = ∑ x_i p(x_i)
This formula is used to calculate the expected value in discrete cases.
True or false: Statistics is used to describe historical outcomes and to determine underlying probability models.
TRUE
Statistics helps in understanding past data to make predictions about future events.
What are the two types of random variables?
Discrete variables take on specific values, while continuous variables can take any value within a range.
What does a Probability Density Function (PDF) define?
A distribution for continuous random variables
The PDF represents probabilities as areas under the curve.
What is the relationship between Cumulative Distribution Function (CDF) and Probability Density Function (PDF)?
CDF is the integral of PDF; PDF is the derivative of CDF
The CDF plots the probability that a random variable will take on a value less than a specified value.
What is the bell curve commonly associated with?
Normal distribution
The bell curve depicts the frequently-occurring normal distribution in statistics.
What are the two sides of the same coin in probability and statistics?
Both concepts are essential for understanding data analysis.
What is a population in statistical terms?
The set of all possible members of a specifically defined group
For example, all Navy aircraft designed after WWII can define a population for cost estimation.
What is a sample in statistics?
A subset of the population
Sample data is used to conduct analysis when the entire population cannot be gathered.
What is the difference between parametric and non-parametric statistics?
This module focuses mainly on parametric statistics.
What is the significance level in hypothesis testing?
The tolerance for error
It determines how confident one can be in the results of the statistical test.
What is the central tendency in statistics?
The middle or expected locations of distributions
It helps understand the center of the data.
What are the key ideas of this module?
These ideas form the foundation for understanding cost analysis.
What is the role of probability and statistics in cost estimating?
They are crucial for making informed cost estimates.
What is a random variable?
A variable that cannot be fully controlled or accurately predicted
It represents outcomes in the sample space.
What is the importance of sample statistics?
They estimate population parameters
Analysts strive to get larger and better samples to approximate the true population.
What is the formula for calculating the mean of a discrete random variable?
E(X)=∑ x_{i}p(x_{i})
The mean is calculated by summing all possible values of X multiplied by the probability of that sum occurring.
When rolling two dice, what is the probability of obtaining a sum of 2?
1/36
This is because both dice must show a 1, which has a probability of (1/6)*(1/6).
In the continuous case, how is the mean calculated?
E(X)=∫ x p(x) d x = μ
The integral is used to find the mean over an infinite number of possibilities.
What is the definition of the median in a data set?
The middle data point where half the data points are lower and half are higher
The median is not affected by outliers.
How is the median calculated when there is an even number of data points?
Average the two middle values
This ensures that the median accurately represents the center of the data.
True or false: The median of a normal distribution is equal to its mean.
TRUE
This holds true for symmetric distributions.