When using a particular estimating method, the analyst should
A. Use that method throughout the life of the program for consistency
B. Incorporate cost actuals only in the latter stage of the program because they are hard to obtain and often inaccurate
C. Re-evaluate the estimating method used at every milestone as the program matures
D. Always use parametric estimating because it applies at every stage (milestone) in the program
C. Re-evaluate the estimating method used at every milestone as the program matures
It is important to re-evaluate the cost estimating methodology at every milestone because
different methods may be better suited than others at certain points in the life cycle.
The cost analyst can identify cost drivers by:
A. Talking to subject matter experts
B. Reading requirements specification documents
C. Obtaining and understanding system architecture designs
D. Scatter plotting the data
E. All of the above
F. A and D Only
E. All of the above
All of the methods mentioned are ways to identify potential cost drivers. It is important to keep in
mind, however, that parameters that drive design (which will be discovered when exploring choices B
and C) are not always the parameters that drive cost.
CERs in parametric estimating are:
A. Cost Effectiveness Ratios
B. Cost Earned Relationships
C. Component Engineering Requests
D. Cost Estimating Relationships
E. Complete Engineering Releases
D. Cost Estimating Relationships
Parametric cost estimating uses Cost Estimaing Relationships (CERs), which are based on historical
data to predict the cost of a new project or system. Cost drivers, such as weight and size are used to estimate cost and production schedules.
Cost drivers are the parameters or
independent variables in the CERs which can be shown to drive cost: cost, the dependent variable
in the equation, changes as the input parameters change.
The term rate is best defined as which of the following:
A. Best Fit Equation
B. Cost on Cost
C. Cost on Parameter
D. Parameter on Parameter
C. Cost on Parameter
A rate uses a parameter to predict cost via a simple multiplicative relationship.
One of the most common rates is the labor rate, expressed in dollars per hour. Total labor cost
is then estimated labor hours times the project labor rate.
The term factor is best defined as which of the following:
A. Best Fit Equation
B. Cost on Cost
C. Cost on Parameter
D. Parameter on Parameter
B. Cost on Cost
A factor uses the cost of another element to predict cost via a simple multiplicative relationship.
Often “below-the-line” elements such as program management and systems engineering are
estimating as a factor of the prime mission equipment.
The term ratio is best defined as which of the following:
A. Best Fit Equation
B. Cost on Cost
C. Cost on Parameter
D. Parameter on Parameter
D. Parameter on Parameter
A regression-based CER is best defined as which of the following:
A. Best Fit Equation
B. Cost on Cost
C. Cost on Parameter
D. Parameter on Parameter
A. Best Fit Equation
A regression is the best fit equation of the data. The most common way of defining this “best fit”
is ordinary least squares (OLS) regression, wherein the sum squared error (SSE) is minimized.
True or False. Because parametric relationships are statistically verified for significance, the cost analyst can apply this relationship for all values of the cost driver.
False.
A relationship does not necessarily apply beyond a “reasonable” range. It is possible to apply CERs outside the range of the data, and the Prediction Interval (PI) captures appropriate.
uncertainty, but one would not estimate, for example, the cost of an object of zero weight.
Further discussion of this concern is addressed in Module 8 Regression Analysis and Module 9 Cost
and Schedule Risk Analysis.
Given the hypothesis “Weight is a significant cost driver at a significance level of 0.05,” which of the following statistics would you use to test for this?
A. R-squared = 0.867
B. cV = 15%
C. P-value = 0.022
D. All of the above
E. A and B only
F. A and C only
G. B and C only
C. P-value = 0.022
The test for significance of a parameter is the p-value corresponding to the t statistic.
The R-squared value is the ratio of the explained variation to the total variation in the data set.
The CV (coefficient of variation) is the ratio of the standard error to the mean, and is a measure of
variability. In this case, the cost driver would be statistically significant, since the p-value
of 0.022 is less than the alpha value of 0.05.
Parametric estimating is a valid approach when creating cost estimates because:
A. It includes a detailed build-up of all applicable costs
B. It models the current system on the model of a similar system or sub-system
C. It uses tested relationships to estimate costs using predefined parameters
D. It can be used early before detailed requirements are known
C. It uses tested relationships to estimate costs using predefined parameters
Parametric estimating uses relationships between costs and cost drivers (predefined parameters) to develop an estimate. Though the can be difficult to find, once developed,
CERs can be adjusted for requirements changes. While it is true that parametrics can be used early on in the life
cycle, this is not the basis for the validity of the technique.
True of False. If, when in the data collection stage in the parametric estimating process, the contractor provides a total estimate at complete for the program, the cost analyst can skip the steps of identifying cost drivers and developing CERs and go straight to building the parametric model.
False.
A parametric estimate is based on identified cost drivers and developed CERs. A contractor-provided estimate at complete does not serve as the basis for a parametric
estimating methodology.
If a CER for Site Development was developed giving the relationship, y (in $K) = 26.635x + 105.16 (where x is the number of workstations) for a data set cost driver that had a range minimum of 7 workstations to 47 workstations, and the independent variable has tested positively for significance, the predicted cost for a site that had 36 workstations would be:
A. $1,064.02
B. $1,064,020
C. $958.86
D. $958,860
E. CER may not be applicable. Further data collection would be advisable.
B. $1,064,020
y=26.635(36)+105.16= 1064.02 $K
Using the same example in question 12, what would be the predicted cost for a site that had 10 workstations?
Site Development: y (in $K) = 26.635x + 105.16 (where x is the number of workstations)
A. $371.51
B. $266.35
C. $371,510
D. $266,350
E. CER may not be applicable. Further data collection would be advisable.
C. $371,510
y=26.635(10)+105.16= 371.51 $K
Using the same example in question 12, what would be the predicted cost for a site that had 60 workstations?
Site Development: y (in $K) = 26.635x + 105.16 (where x is the number of workstations)
A. $1,598,100
B. $1,703.26
C. $1,742,350
D. $7,907.7
E. CER may not be applicable. Further data collection would be advisable.
E. CER may not be applicable. Further data collection would be advisable.
CER was developed using between 7 and 47 workstations. Since 60 is above our maximum,
the CER may not be applicable. There is probably not a problem with applying the CER for this value,
as long as the appropriate prediction interval (PI) is used to characterize the increased uncertainty.
We’d be much more nervous about applying the CER for a site with, say, a thousand workstations.
If you were developing a multivariate CER to predict the payroll of a Major League Baseball (MLB) team, which of the following would be good candidate cost drivers?
A. Population of the team’s home city
B. Number of players on the roster
C. Number of free agents signed at the beginning of the current season
D. Whether or not the team is owned by George Steinbrenner or Ted Turn
E. A and B
F. A and C
G. B and C
F. A and C
Bigger cities tend to have higher payrolls (due to larger fan bases).
If a team signs a large numebr of free agents before the season, they may have a higher payroll
than a team that has a lot of players from their farm system. Free agents are often won in
bidding wars. While it makes sense that the more players a team has, the higher its payroll
will be, the reason this is not a good cost driver is that all teams have the same number of players,
so you’d be trying to fit a sloping line through a vertical cloud of points! While teams owned by
George Steinbrenner (the New York Yankees) and Ted Turner (formerly the Atlanta Braves) may
indeed have higher payrolls, as might be shown with an appropriate dummy variable, this is too
restrictive to be of much value as a predictive variable. It might be better to try to develop an
objective (yet non-circular) method for characterizing ownership groups as extravagant or
parsimonious.
True or False. CERs are always linear.
False. Though CERs are certainly often linear, the relationship could be of a non-linear
functional form.
True or False. A good way to identify cost drivers is to use comb charts or Pareto charts to identify WBS elements with the highest cost.
False. The highest cost WBS items, or the “big ticket items” can be termed “Cost
Passengers.” These high cost WBS elements are not necessarily the elements with the greatest
potential for cost savings. Instead, it is important to look for the elements that drive the costs.
Which of the following statements is true regarding calibration of parametric CERs?
I. The calibration point must be a part of the original data set.
II. A calibrated CER is mathematically equivalent to an adjusted analogy.
III. Calibration of a CER changes the Y-intercept.
A. I
B. II
c. III
D. I and II only
E. I and III only
F. II and III only
G. All of the above.
F. II and III only
Statements II and III are correct. Statement I is false: the calibration point must not be part of the
original data set. While there are valid reasons for calibrating a CER, it is important to calibrate carefully
and with good reason, as incorrect and unjustified calibration can lead to suspicions that an analyst is
“cooking” the data.
True or False. A parametric model can be updated with program-specific actuals.
This statement is true, though the use of actuals depends on the situation. When new data is available,
the analyst has the option to update the CER, recalibrate the CER so that it passes through the new
data point, change the methodology altogether, or leave the model unchanged. Whatever the analyst
decides, it is important that the rationale for this decision is defensible.
Which of the following statements is accurate.
I. Forcing a CER through the origin is not possible.
II. Forcing a CER through the origin is necessary, because if something has 0 mass (for example), it should also cost nothing.
III. Forcing a CER through the origin is not advised.
A. I
B. II
С. III
С. III
The y-intercept of a CER should not artificially be forced through zero. General practice is to
accept the y-intercept, even if it is not statistically significant. Though it is possible to force the
y-intercept through zero in excel, this practice is not advised. Just like you would not force your
regression through any other data point, you should not artificially force your regression through
zero. The y-intercept should not, however, be interpreted as a fixed cost.
PARAMETRIC TECHNIQUES
Ratio
Estimates effort, defined as parameter on parameter
Factor
Uses the cost of one element to predict the cost of another with
a simple, multiplicative relationship
Rate
Predicts cost via a simple multiplicative relationship
Arithmetic Mean
Sum of all items divided by the number of items
ESTIMATING
THROUGHOUT
THE PROGRAM
LIFE CYCLE
Analysts rely heavily on analogy and parametrics during concept and design stages.
As more design detail becomes available, analysts begin to use build-up, although parametric estimating continues to play an important role.
Actual costs are incorporated into cost estimates as Low-Rate Initial Production (LRIP) or full-rate production begins. Actual cost experience on prototype units and early engineering development hardware can enlighten the estimate before production actuals become
available. Manufacturers directly incorporate their own costs.
Government analysts use data
available via the Contractor Cost Data Reporting (CCDR) system.
PARAMETRIC ESTIMATING PROCESS
1.Data Collection: Analysts search for data sources. Cost, schedule, and technical data are the raw
materials for parametric relationships.
2.Cost Drivers: Analysts identify cost drivers. A hypothesis based on analogy, organization history,
or expert judgment may provide insight to cost drivers with the highest confidence interval. Scatter
plots are a visualization tool for uncovering underlying relationships in the data. Cost drivers are
explored via correlation of each independent element with cost. Analysts use cost drivers to capture
the underlying engineering/physical causality in a complex system.
3.Cost Estimating Relationships (CERs): Analysts develop equations to capture CERs using linear
models with one parameter or with more complex mathematical functions.
4.Parametric Model: Analysts build an inclusive parametric model containing all the parametric
inputs and CERs for all the cost elements in an entirely parametric cost model approach. Analysts
use common software programs or more sophisticated tools. Methodologies will often change
throughout the life of the program; an exclusively parametric model may seem stochastic.
y= ax + b. Linear
y = ax^b Power
y= a + b*ln(x) Logarithmic
y=ae^(bx). exponential
y = a + b1x1 + b2x2 + … polynomial