Data Roles and Services Flashcards

(30 cards)

1
Q

What does a DBA mainly do?

A

Manage databases, security, backups, and performance.

DBA stands for Database Administrator.

2
Q

What does a Data Engineer mainly do?

A

Build data pipelines (ETL/ELT), move/clean data, integrate systems.

ETL stands for Extract, Transform, Load. ELT loads the data first and transforms it in the target system.
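
As an illustration of the three ETL steps, here is a minimal sketch in plain Python. It assumes a small in-memory CSV as the source and an in-memory SQLite table as the target; the sample data, the function names, and the `sales` table are hypothetical and are not tied to any Azure service.

```python
import csv
import io
import sqlite3

# Hypothetical raw input: note the padded names and the missing amount.
RAW_CSV = """id,name,amount
1, Alice ,10.50
2,Bob,
3, Carol ,7.25
"""

def extract(text):
    """Extract: read rows from a CSV source into dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: trim names, drop rows with no amount, convert types."""
    cleaned = []
    for row in rows:
        amount = row["amount"].strip()
        if not amount:
            continue  # skip incomplete records
        cleaned.append((int(row["id"]), row["name"].strip(), float(amount)))
    return cleaned

def load(rows, conn):
    """Load: write the cleaned rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (id INTEGER, name TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()

conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), conn)
total = conn.execute("SELECT SUM(amount) FROM sales").fetchone()[0]
print(total)  # 17.75
```

In an ELT variant, the raw rows would be loaded first and the cleaning would run as SQL inside the target database.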

3
Q

What does a Data Analyst mainly do?

A

Analyse data, create dashboards, find insights.

Data Analysts focus on interpreting data to support decision-making.

4
Q

Why have data roles grown?

A

Because data volume exploded and businesses rely on data to compete.

The increase in digital data has created a demand for data professionals.

5
Q

What is Azure SQL Database?

A

A fully managed SQL database in the cloud.

It provides built-in high availability and security.

6
Q

What is Azure SQL Managed Instance?

A

A managed cloud SQL service with near-complete SQL Server compatibility.

It allows for easier migration from on-premises SQL Server.

7
Q

What is Azure SQL VM?

A

A virtual machine running SQL Server with full server control.

Users manage the operating system and the SQL Server instance themselves.

8
Q

Who mainly uses Azure SQL services?

A
  • DBAs
  • Data Engineers
  • Data Analysts

These roles utilize Azure SQL for database management and analysis.

9
Q

What does Azure Database for MySQL/PostgreSQL/MariaDB provide?

A

Managed open-source relational databases.

These services simplify database management for developers.

10
Q

What is Azure Cosmos DB?

A

A global-scale NoSQL database supporting multiple data models.

It is designed for high availability and low latency.

11
Q

Name four Cosmos DB data models.

A
  • Document
  • Key-Value
  • Column-Family
  • Graph

These models allow for flexible data storage and retrieval.
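
To make the four models concrete, the sketch below shapes the same hypothetical user record in each style using plain Python structures. It is an illustration of the data models only and does not use the Cosmos DB SDK; every name and value in it is made up.

```python
# Document: one self-contained, nested record per entity.
document = {
    "id": "u1",
    "name": "Ada",
    "orders": [{"sku": "A-100", "qty": 2}],
}

# Key-value: an opaque value looked up by a single key.
key_value = {"user:u1:name": "Ada"}

# Column-family: a row key maps to named groups of related columns.
column_family = {
    "u1": {
        "profile": {"name": "Ada"},
        "activity": {"last_login": "2024-01-01"},
    }
}

# Graph: entities are vertices, relationships are labelled edges.
vertices = {"u1": {"label": "user"}, "p1": {"label": "product"}}
edges = [("u1", "bought", "p1")]

def neighbours(vertex, relation):
    """Follow edges of one relation type out of a vertex."""
    return [dst for src, rel, dst in edges if src == vertex and rel == relation]

print(neighbours("u1", "bought"))  # ['p1']
```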

12
Q

Who uses Cosmos DB most?

A
  • Developers
  • Data engineers

They leverage its capabilities for building scalable applications.

13
Q

What is Azure Storage used for?

A

Storing raw files, blobs, and big data.

It provides scalable storage solutions for various data types.

14
Q

Name 3 Azure Storage types.

A
  • Blob storage
  • File shares
  • Data lake storage

Each type serves different storage needs.

15
Q

Who uses Azure Storage heavily?

A

Data Engineers.

They require storage solutions for data processing and analytics.

16
Q

What is Azure Data Factory (ADF)?

A

Azure’s main ETL tool for moving and transforming data.

ADF facilitates data integration from various sources.

17
Q

Who uses ADF?

A

Data Engineers.

They utilize ADF for data pipeline creation and management.

18
Q

What is Azure Synapse Analytics?

A

A unified platform including pipelines, SQL pools, Spark, and Data Explorer.

It integrates big data and data warehousing.

19
Q

Name Synapse’s 4 main components.

A
  • Pipelines
  • SQL Pools
  • Apache Spark
  • Data Explorer

These components work together for data analytics and processing.

20
Q

Who uses Synapse?

A
  • Data engineers
  • Data analysts
  • Data scientists

These roles benefit from its comprehensive analytics capabilities.

21
Q

What is Azure Databricks?

A

A platform combining Spark with notebooks for big data & ML.

It enhances collaborative data science and engineering.

22
Q

Who typically works in Databricks?

A
  • Data engineers
  • Data scientists

They use Databricks for data processing and machine learning tasks.

23
Q

What is Azure HDInsight?

A

Cloud-hosted clusters for open-source tools like Spark, Hive, Kafka.

It provides a managed environment for big data processing.

24
Q

What is Azure Stream Analytics used for?

A

Real-time data processing (IoT, logs, events).

It enables real-time insights from streaming data.
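
One common pattern in stream processing is counting events per fixed, non-overlapping time window (a tumbling window). The sketch below shows the idea in plain Python over a small list of hypothetical `(timestamp_seconds, payload)` events. It is not Stream Analytics query syntax, and the function name is an assumption made for illustration.

```python
from collections import defaultdict

def tumbling_window_counts(events, window_seconds):
    """Count events in fixed, non-overlapping windows keyed by window start."""
    counts = defaultdict(int)
    for timestamp, _payload in events:
        # Round the timestamp down to the start of its window.
        window_start = (timestamp // window_seconds) * window_seconds
        counts[window_start] += 1
    return dict(sorted(counts.items()))

# Hypothetical sensor events: (seconds since start, payload).
events = [(0, "a"), (3, "b"), (9, "c"), (10, "d"), (25, "e")]
result = tumbling_window_counts(events, 10)
print(result)  # {0: 3, 10: 1, 20: 1}
```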

25
Q

Who uses Azure Stream Analytics?

A

Data engineers building real-time dashboards.

They require tools for immediate data analysis.

26
Q

What is Azure Data Explorer (Kusto)?

A

A high-speed engine for time-series and log analytics.

It is optimized for fast querying of large datasets.

27
Q

Who uses Kusto most?

A

Data analysts for fast, real-time insights.

They leverage its speed for immediate data analysis.

28
Q

What is Microsoft Purview?

A

A data governance tool for tracking data, lineage, and compliance.

It helps organizations manage their data assets effectively.

29
Q

Who uses Microsoft Purview?

A
  • Data engineers (governance)
  • Analysts (finding trustworthy data)

Both roles benefit from improved data management and compliance.

30
Q

What is Microsoft Fabric?

A

A full unified analytics platform for ingestion, engineering, warehousing, BI, and ML.

It integrates various analytics processes into a single platform.