What are the four types of data analytics?
What question does descriptive analytics answer?
What happened?
What is the purpose of diagnostic analytics?
To answer the question, ‘Why did it happen?’ by analyzing historical data.
What is the focus of predictive analytics?
It focuses on the future using correlative analysis to answer the question, ‘What is likely to happen?”
What question does prescriptive analytics answer?
What needs to happen?
What is a model in data analytics?
A representation of a relationship between variables in data.
What is statistical modeling?
The process of using statistical analysis on a dataset, usually a sample drawn from a population, to identify patterns and infer something about the population.
What is linear regression analysis used for?
To develop a mathematical equation modeling the extent to which one variable (called the dependent or response variable) has historically been affected by one or more other variables (called the independent or predictor variables).
What is a time series?
A sequence of measurements of the same variable taken at equally spaced, ordered points in time.
What are the four patterns in time series analysis?
What is a trend pattern in time series analysis?
The historical data exhibits a gradual shifting to a higher or lower level over time, useful for making predictions.
What is a cyclical pattern in time series analysis?
A cyclical pattern exhibits fluctuations that last longer than one year, often due to the cyclical nature of the economy.
What is a seasonal pattern in time series analysis?
Variability in a time series within a period such as a year due to seasonal influences, identified by regularly spaced peaks and troughs.
What is an irregular pattern in a time series?
Random variations not repeating in any regular pattern, caused by short-term, non-recurring factors.
What is the equation of a simple linear regression line?
ŷ = a + bx
What does the “a” represent in the linear regression equation
ŷ = a + bx?
The constant coefficient, or the y-intercept.
What does the “b” represent in the linear regression equation
ŷ = a + bx?
The variable coefficient, or the slope of the line.
The slope of the line is the amount of change in the predicted value of the dependent variable for each unit of increase in the independent variable.
What is the purpose of the regression line in time series analysis?
To make forecasts using historical data.
What is the purpose of linear regression analysis?
To calculate the location of the regression line mathematically and predict the value of y for any given value of x if the independent variable serves well as the predictor variable.
Linear regression minimizes the sum or mean of the squares of the residuals using the least squares method.
What must be determined before using linear regression analysis to develop a prediction?
Whether the dependent variable, y, has a linear relationship with the independent variable, x.
A scatterplot can be used to determine if a linear relationship exists between the variables.
What is correlation analysis used for in simple linear regression analysis?
To understand the relationship or absence of a relationship between the independent variable and the dependent variable and to determine the strength of the linear relationship between the two variables.
What does a positive correlation coefficient indicate about a regression?
Increasing measurements of the independent x-variable tend to be associated with increasing measurements of the dependent y-variable; and decreasing measurements of x tend to be associated with decreasing measurements of y.
What does a negative correlation coefficient indicate about a regression?
Increasing measurements of the independent x-variable tend to be associated with decreasing measurements of the dependent y-variable and vice versa.
What is the correlation coefficient (r) in simple linear regression and what does it measure?
A number between −1 and +1 that measures the strength and direction of the relationship between the independent and dependent variables.
A number close to either +1 or −1 means the two variables are correlated, and if there is a cause-and-effect relationship, a simple linear regression could be useful for forecasting.
A correlation coefficient of +1 indicates a perfectly positive linear relationship, while −1 indicates a perfectly negative linear relationship.