Common Definations Flashcards

(70 cards)

1
Q

What is Data Integration?

A

The process of combining data from multiple sources into a single, unified view for analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Data Modeling?

A

Structuring data using relationships and logical tables to enable accurate analysis and reporting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Data Cleaning (Data Wrangling)?

A

The process of fixing, formatting, and preparing raw data by removing errors, duplicates, or inconsistencies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is ETL (Extract, Transform, Load)?

A

A process where data is extracted from sources, transformed (cleaned/reshaped), and loaded into a final system.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a KPI (Key Performance Indicator)?

A

A measurable value that indicates how effectively a company is achieving key business objectives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a Metric?

A

A quantitative measure used to track performance, not always tied to specific goals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a Dashboard?

A

A visual display of data summaries, KPIs, and charts used for decision-making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Aggregation?

A

Summarizing data using functions like SUM, AVERAGE, COUNT, MIN, MAX, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Data Visualization?

A

Representing data graphically using charts, graphs, maps, etc., to make patterns and trends clear.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a Filter?

A

A tool or condition used to display only specific rows or segments of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a Pivot Table?

A

A tool in Excel that allows summarizing, analyzing, and rearranging data by rows and columns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is DAX (Data Analysis Expressions)?

A

A formula language in Power BI used to create custom calculations and aggregations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are Data Types?

A

Categories of data like Number, Text, Date, Boolean that define how data is stored and processed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a Relationship (Join)?

A

A logical link between tables based on a common column (key).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is Time Intelligence?

A

The analysis of data across time (e.g., YTD, MTD, QoQ, YoY) for trends and performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Data Granularity?

A

The level of detail in the data, such as daily, monthly, or yearly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is a Data Source?

A

The original location where data is stored, such as Excel, SQL, APIs, or cloud services.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is Null/Blank Data?

A

Missing or empty values in a dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is EDA (Exploratory Data Analysis)?

A

The initial step in analysis where patterns, anomalies, and summaries are explored.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is Conditional Logic?

A

Logical expressions like IF/ELSE that assign values or perform calculations based on conditions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is Data Refresh?

A

Updating your reports or dashboards with the latest data from the source.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is Incremental Refresh?

A

Refreshing only the new or modified data instead of the entire dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is Append in Power Query?

A

Combining tables by stacking them row-wise (adding more records).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is Merge in Power Query?

A

Combining tables side-by-side based on a matching key (like a JOIN in SQL).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What is Power Query?
A data transformation and cleaning tool in Power BI and Excel.
26
What is a Fact Table?
A central table in a data model containing measurable, transactional data like sales or orders.
27
What is a Dimension Table?
A supporting table with descriptive data like Product Names or Customer Info.
28
What is a Star Schema?
A data modeling structure with a central fact table connected to multiple dimension tables.
29
What is a Slowly Changing Dimension (SCD)?
A dimension table where values change slowly over time (e.g., customer address history).
30
What is a Calculated Column in Power BI?
A new column added to a table using a DAX formula, stored in memory.
31
What is a Measure in Power BI?
A DAX calculation that is computed on the fly during visualization.
32
What is Row Context?
The context where DAX evaluates each row individually (used in calculated columns).
33
What is Filter Context?
The context created by filters, slicers, or visuals that influence a DAX calculation.
34
What is a Slicer in Power BI?
A visual control to filter data in a report interactively.
35
What are Hierarchies in Data Modeling?
Ordered levels of data (e.g., Year > Month > Day) used for drill-down analysis.
36
What is Drill Down in Power BI?
Viewing more detailed data by going deeper into a hierarchy.
37
What is Drill Through in Power BI?
Navigating to a detailed report page based on a selection from another report.
38
What is Target vs Actual Analysis?
Comparing planned values (targets) to real-world outcomes (actuals).
39
What is Variance in Data Analysis?
The difference between actual and target values.
40
What is RLS (Row-Level Security)?
A method to restrict access to data at the row level based on the user.
41
What is an API?
An interface that allows applications to communicate and retrieve data from other systems or web services.
42
What is a Data Pipeline?
A sequence of steps to extract, process, and load data from source to destination.
43
What is Time Series Analysis?
Analyzing data over time to identify trends, seasonality, or patterns.
44
What is CALCULATE in DAX?
A DAX function that modifies filter context to calculate values under specific conditions.
45
What is ALL in DAX?
A function that removes all filters from the specified column or table.
46
What is ALLSELECTED in DAX?
A function that keeps slicer filters but removes visual-level filters.
47
What is ALLEXCEPT in DAX?
A function that removes all filters except the ones you specify.
48
What is a CTE in SQL?
A Common Table Expression is a temporary named result set used in a SQL query.
49
What is a Subquery in SQL?
A query nested within another query to filter or calculate specific values.
50
What is Normalization in SQL?
Organizing data to reduce redundancy and improve efficiency, often using multiple normalized forms.
51
What is an Index in SQL?
A database object that improves the speed of data retrieval.
52
What is a Factless Fact Table?
A fact table with no numeric values, used to track events.
53
What is a Composite Key?
A primary key made up of two or more columns.
54
What is CAC (Customer Acquisition Cost)?
The cost to acquire a new customer, calculated as Total Marketing Cost ÷ Number of New Customers.
55
What is CLV (Customer Lifetime Value)?
The total profit expected from a customer over their entire relationship with a business.
56
What is Gross Margin (GM%)?
(Revenue - Cost of Goods Sold) ÷ Revenue × 100. Shows how much profit remains after direct costs.
57
What is ROI (Return on Investment)?
(Net Profit ÷ Investment Cost) × 100. Measures the efficiency of an investment.
58
What is Conversion Rate?
The percentage of users who take a desired action, e.g., purchases ÷ total visitors × 100.
59
What is Benchmarking?
Comparing performance metrics to industry standards or competitors.
60
What is Cohort Analysis?
Grouping users by shared traits (e.g., signup date) and analyzing behavior over time.
61
What is Churn Rate?
The percentage of customers who stop using a product or service within a time period.
62
What is Forecasting?
Predicting future values based on historical trends and patterns.
63
What is the difference between Data and Insight?
Data is raw numbers; insights are actionable conclusions drawn from data.
64
What is Power Pivot?
An Excel add-in for creating data models and using DAX for advanced analytics.
65
What is a Lambda Function in Python?
A short, anonymous function used for simple operations in one line.
66
What is Pandas in Python?
A library for data analysis and manipulation using tabular (DataFrame) structures.
67
What is Matplotlib/Seaborn in Python?
Libraries used for data visualization in Python.
68
What is a Data Lake?
A storage repository that holds raw, unstructured or semi-structured data.
69
What is a Data Warehouse?
A centralized repository for structured, cleaned, and processed data ready for reporting.
70
What is a Semantic Layer?
A layer that defines business logic (metrics, KPIs, relationships) in BI tools to standardize analysis.