Visual workflow tool that moves data from A to B (copy, schedule, orchestrate).
Data Factory Pipelines
Shortcut (OneLake Shortcut) - What is it?
A zero-copy reference to external data stored outside Fabric (e.g.
Shortcut (OneLake Shortcut) - Why use it?
Use when data already exists in a clean data lake and you want immediate access with no storage duplication.
Shortcut (OneLake Shortcut) - Example external data
ADLS Gen2 curated lake zones, Amazon S3 marketing data, Google Cloud Storage research datasets.
Database Mirroring - What is it?
A high availability feature for SQL Server that maintains a mirrored database for failover.
Database Mirroring - Why use it?
Use when you need redundancy and disaster recovery for an operational SQL Server—not for data ingestion.
Database Mirroring - Example use case
Primary ticketing database + secondary failover replica.
Dataflow (ETL Dataflow) - What is it?
Power Query-based GUI tool for ingesting and transforming data into a Lakehouse.
Dataflow (ETL Dataflow) - Why use it?
Use for business-analyst-friendly cleaning
Dataflow (ETL Dataflow) - Example external data
Salesforce Contacts, SurveyMonkey CSV exports, Excel finance report uploads, On-prem SQL via Gateway.
Data Pipeline (ETL Data Pipeline) - What is it?
An orchestrated and scheduled ETL workflow using Data Factory inside Fabric.
Data Pipeline (ETL Data Pipeline) - Why use it?
Use for recurring
Data Pipeline (ETL Data Pipeline) - Example external data
Azure SQL DB, Snowflake, AWS S3, Google BigQuery, Dynamics 365, Zendesk.
Notebook (ETL Notebook) - What is it?
A Spark-based code workspace for ingestion and transformation using Python
Notebook (ETL Notebook) - Why use it?
Use for complex logic
Notebook (ETL Notebook) - Example external data
IoT telemetry, website clickstream logs, multi-GB CSV archives, sensor data feeds.
Eventstream - What is it?
A real-time streaming ingestion pipeline that processes events continuously.
Eventstream - Why use it?
Use when dashboards or alerts must update instantly based on live event data.
Eventstream - Example external data
People counter sensors, POS transaction event messages, Kafka or Event Hubs streams, real-time user activity logs.
Front
Back
On-premise Data Gateway - What is it?
A secure connector that allows Fabric to access on-premises data sources inside your internal network.
On-premise Data Gateway - Why use it?
Use when the data source is on-prem and cannot have a public endpoint. Requires secure outbound-only access.
On-premise Data Gateway - Example external data
On-prem SQL Server, Oracle DB in data center, Shared drive with Excel/CSV files.
VNet Data Gateway - What is it?
A Microsoft-managed private network connection to Azure data sources that are secured with private endpoints/VNet integration.