What are the two categories of data based on structure?
Structured data, unstructured data
What is structured data?
organised, often numeric, easy to store in databases (e.g., transaction data).
What is unstructured data?
not organised, harder to analyse (e.g., contracts, texts, voices, video).
What are the four categories of data based on characteristics?
Descriptive data
Behaviour data
Interaction data
Attitudinal data
What does descriptive data include?
attributes, characteristics, geo/demographics.
What is does behaviour data include?
orders, transactions, payments, credit history.
What does interaction data include?
emails, chat transcripts, click-streams.
What does attitudinal data include?
opinions, preferences, needs, desires.
What is Business Analytics (BA)?
The skills, technologies, and practices for continuous exploration of past business performance to gain insight and support fact-based decision making.
What are the 4 main components of Business Analytics?
(Big) Data
Statistical & Quantitative Analysis
Explanatory or Predictive Models
Decision Making & Actions
What types of questions does Business Analytics answer?
What happened? (Describe)
Why is this happening?
What if these trends continue?
What will happen next? (Predict)
What is the best that can happen? (Optimize)
What are the three categories of analytics?
Descriptive Analytics
Predictive Analytics
Prescriptive Analytics
What are descriptive analytics?
Using and visualising data (e.g., reports, scorecards, clustering) to understand performance.
What are predictive analytics?
Using statistical/machine learning techniques (time-series, regression) to find relationships and predict outcomes.
What are prescriptive analytics?
Using optimisation and simulation techniques to improve business performance, given objectives and constraints.
Example of Descriptive Analytics.
Reports showing past sales trends, dashboards, and performance scorecards.
Example of Predictive Analytics.
Using regression models to forecast future sales or customer churn.
Example of Prescriptive Analytics.
Supply chain optimisation to minimise costs under constraints or scheduling staff to maximise efficiency.
What are the two main types of structured data?
Categorical (qualitative)
Numerical (quantitative)
What are the two subtypes of categorical (qualitative) data?
Nominal: Categories without order (e.g., gender, colors, countries).
Ordinal: Categories with implied order (e.g., rankings, education levels).
What are the two subtypes of numerical (quantitative) data?
Interval: Numeric values where differences make sense, but no absolute zero (e.g., temperature in °C).
Ratio: Numeric values with a true zero, allowing all arithmetic operations, including ratios (e.g., weight, salary).
Formula for the mean (x~?
= (∑x_i)/n
Formula for standard deviation (s)?
s = √ (∑(x_i - x~)^2 /(n-1))
What does skewness describe?
The shape of a distribution – whether it is symmetric, left-skewed (negative), or right-skewed (positive).