Getting Started With ADF Flashcards

(36 cards)

1
Q

What is Azure Data Factory?

A

A cloud-based ETL/ELT and data orchestration service.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the main purpose of ADF?

A

To move, transform, and orchestrate data across systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Does ADF store data?

A

No - it only orchestrates and moves data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a pipeline?

A

A logical container that groups activities into a workflow.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is an activity?

A

A single step inside a pipeline.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a dataset?

A

A metadata reference pointing to actual data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a linked service?

A

The connection information to a data source or compute service.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is an Integration Runtime (IR)?

A

The compute engine ADF uses to run activities.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the 3 types of IR?

A

Azure IR, Self-hosted IR, Azure SSIS IR.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the ADF Studio?

A

The UI used to build, monitor, and manage ADF

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Author mode?

A

The workspace where you design pipelines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Monitor mode?

A

The area where you view pipeline runs, triggers, and logs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Git integration for ADF?

A

Allows version control and collaborative development

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Live mode?

A

The published version of your data factory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Do you edit directly in Live mode?

A

The published version of your data factory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Do you edit directly in Live mode?

A

No - you edit in Git mode and publish

17
Q

What is publishing in ADF?

A

Moving code from Git workspace > Live data factory

18
Q

What is an ARM template?

A

An infrastructure-as-code definition for ADF components.

19
Q

Can ADF work with on-prem data?

A

Yes - via Self-hosted IR.

20
Q

What is a trigger?

A

A scheduler that runs pipelines automatically.

21
Q

Does ADF support manual pipeline execution?

A

Yes - “Debug” and “Add Trigger > Now”

22
Q

Can ADF call APIs?

A

Yes - via Web Activity.

23
Q

Can ADF run SQL stored procedures?

A

Yes - via Stored Procedure Activity.

24
Q

Can ADF process files?

A

Yes - copy, delete, validate, metadata.

25
What is Copy Activity used for?
Moving data from source to destination.
26
Can ADF perform transformation?
Yes - using Mapping Data Flows or external compute.
27
What is a Data Flow?
A Spark-based visual transformation engine.
28
What is Debug mode in Data Flow?
Turns on a temporary Spark cluster for interactive testing.
29
What is a parameter?
A value passed at runtime for dynamic pipelines.
30
What is a variable?
A value kept in memory during pipeline execution.
31
What is dynamic content?
Expressions used to build flexible pipeline logic.
32
What is a pipeline run ID?
A unique ID for each pipeline execution.
33
What is pipeline concurrency?
Max number of simultaneous pipeline runs.
34
What is pipeline timeout?
Max runtime before forced stop (default 7 days).
35
Can ADF integrate with Synapse or Databricks?
Yes - via dedicated activities.
36
What languages does ADF use for logic?
ADF Expression Language (similar to JSON)