π§ LEVEL 1 β POWER BI FOUNDATIONS (DETAILED MASTER CARDS)
π₯ CORE ARCHITECTURE
π Card 1 β Purpose of Power BI
Q: What fundamental problem does Power BI solve in the data ecosystem, and how does it differ from traditional reporting tools?
A:
Power BI solves the problem of self-service business intelligence:
Traditional reporting:
Static reports
IT-controlled
Slow updates
Limited interactivity
Power BI enables:
Interactive exploration
Real-time slicing/filtering
Integration of multiple sources
Democratized analytics
It shifts organizations from report consumption β data exploration.
Visual cue: Static PDF report vs interactive dashboard
π Card 2 β End-to-End Workflow
Q: What is the full lifecycle of data inside Power BI from source to user interaction?
A:
Data acquisition (connect to sources)
Transformation (Power Query)
Loading into model (VertiPaq engine)
Modeling (relationships, measures)
Visualization (reports)
Publishing (Service)
Consumption (dashboards/apps/mobile)
Key insight:
Power BI is not just visualization β it is a complete BI pipeline.
π Card 3 β Why Power BI Desktop Is Central
Q: Why is Power BI Desktop considered the core authoring environment rather than the Service?
A:
Because Desktop contains:
Full Power Query engine
Data modeling tools
DAX authoring
Report creation
The Service is primarily for:
Distribution
Collaboration
Governance
Think:
Desktop = Development
Service = Deployment
π₯ DATASET / REPORT / DASHBOARD β DEEP
π Card 4 β Dataset (Semantic Model)
Q: What is a dataset in Power BI from a semantic modeling perspective?
A:
A dataset is a semantic layer that translates raw data into business meaning.
It contains:
Tables
Relationships
Measures
Hierarchies
Metadata
It defines how users interpret data.
Important insight:
Reports do not analyze raw data β they query the semantic model.
π Card 5 β Report Mechanics
Q: How does a report interact with the dataset at query time?
Q: How does a report interact with the dataset at query time?
A:
Each visual generates a query against the dataset using filter context.
User interactions:
Slicers
Cross-filtering
Drill-down
modify the query dynamically.
π Card 6 β Dashboard Limitations
Q: Why are dashboards only available in the Service and not Desktop?
A:
Because dashboards are:
Aggregations of visuals from multiple reports
Designed for monitoring
Not for detailed analysis
They rely on cloud-based sharing features.
π₯ STORAGE MODES β INTERNAL LOGIC
π Card 7 β Import Mode Internals
Q: Why is Import mode typically faster than DirectQuery?
A:
Because data is stored in-memory using the VertiPaq columnar engine.
Benefits:
Compressed storage
Vectorized scans
No network latency
π Card 8 β DirectQuery Trade-offs
Q: What are the architectural trade-offs of DirectQuery?
A:
Advantages:
Real-time data
No duplication
Disadvantages:
Slower performance
Limited DAX functions
Dependency on source database
π Card 9 β Composite Models
Q: What problem do composite models solve?
A:
They allow combining:
Import tables
DirectQuery tables
in one model.
Useful when:
Some data needs real-time access
Other data benefits from in-memory speed
π₯ POWER QUERY β CONCEPTUAL FOUNDATION
π Card 10 β Role Separation
Q: Why should data cleaning be done in Power Query instead of DAX?
A:
Power Query operates before data enters the model, making transformations:
Reusable
Efficient
Applied once
DAX transformations happen at query time β expensive.
π Card 11 β Immutable Pipeline Concept
Q: Why are Power Query steps considered an immutable transformation pipeline?
A:
Each step produces a new table without altering previous steps.
Benefits:
Traceability
Reproducibility
Debugging
π₯ DATA MODEL β WHY IT EXISTS
π Card 12 β Need for Modeling Layer
Q: Why doesnβt Power BI simply visualize tables directly without modeling?
A:
Because raw tables:
Lack relationships
Lack business logic
Cause ambiguous aggregations
Modeling creates a coherent analytical structure.
π Card 13 β Semantic vs Physical Model
Q: What is the difference between the semantic model and the physical data source?
A:
Physical:
Raw tables in databases/files
Semantic:
Business interpretation layer:
Relationships
Measures
Hierarchies
π₯ VISUAL INTERACTION MECHANICS
π Card 14 β Cross-Filtering Logic
Q: How does selecting one visual affect others on a report page?
A:
It modifies filter context applied to the dataset.
Other visuals recompute results based on new context.
π Card 15 β Drill-Down Concept
Q: What analytical problem does drill-down solve?
A:
Navigating between aggregation levels:
Year β Quarter β Month β Day
Supports hierarchical analysis.
π§ LEVEL 2 β DATA MODELING MASTERY (ELITE CARDS)
π₯ WHY DATA MODELING MATTERS
π Card 1 β Modeling vs Visualization
Q: Why is data modeling considered more important than visualization in Power BI?
A:
Because the model determines:
Correctness of aggregations
Query performance
Filter behavior
DAX complexity
Poor model β impossible calculations
Good model β simple measures
Expert insight:
Visualization errors often originate in model design.
π Card 2 β Analytical Engine Dependency
Q: Why does the VertiPaq engine rely heavily on model structure?
A:
VertiPaq scans columns independently.
Efficient models:
Narrow fact tables
Low-cardinality dimensions
Clear relationships
Bad models:
Wide tables
High cardinality
Ambiguous joins
π₯ STAR SCHEMA β DEEP UNDERSTANDING
π Card 3 β Star Schema Purpose
Q: What fundamental analytical problem does the star schema solve?
A:
Separating:
Measurements (facts)
Context (dimensions)
This enables:
Flexible slicing
Efficient aggregation
Clear semantics
π Card 4 β Fact Table Properties
Q: What characteristics define a proper fact table?
A:
Contains:
Numeric measures
Foreign keys to dimensions
High row count
Granular events
Should NOT contain:
Descriptive attributes
Repeated text
π Card 5 β Grain Concept (Critical)
Q: What is the grain of a fact table and why must it be defined explicitly?
A:
Grain = level of detail of each row.
Example:
One row per:
Transaction
Order line
Daily summary
Undefined grain causes:
Double counting
Inconsistent aggregations
π Card 6 β Dimension Table Role
Q: Why are dimension tables essential for slicing data?
A:
They provide descriptive attributes used for filtering:
Customer demographics
Product categories
Geographic hierarchy
Dimensions define analysis perspectives.
π₯ RELATIONSHIPS β INTERNAL LOGIC
π Card 7 β One-to-Many Relationship
Q: Why should relationships typically flow from dimension (one) to fact (many)?
A:
Because:
Dimensions filter facts
Not the reverse
This mirrors real-world logic:
Category β Products β Sales
π Card 8 β Filter Propagation
Q: How does filter propagation work across relationships?
A:
Filters applied to a dimension:
β propagate to fact table
β affect aggregations
Direction matters.
π Card 9 β Bidirectional Relationships Risk
Q: Why can bidirectional filtering create ambiguity?
A:
It allows filters to travel multiple paths, causing:
Circular dependencies
Incorrect totals
Performance issues
Use sparingly.