S5 - Storage & Data Processing Flashcards

(20 cards)

1
Q

List the main storage types in GCP.

A

Object (GCS), Block (PD), File (Filestore), Relational (Cloud SQL / Spanner), Analytical (BigQuery).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When to use Nearline vs Coldline?

A

Nearline for 30–90 day rare access; Coldline for ≥ 90 days archival.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is durability of Cloud Storage?

A

99.999999999% (11 nines).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Difference between Uniform and Fine-grained access?

A

Uniform = IAM-only; Fine = ACL per object.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When to use Filestore?

A

When VMs or containers need shared POSIX filesystem.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Cloud SQL vs Spanner vs BigQuery?

A

Cloud SQL = regional OLTP; Spanner = global OLTP; BigQuery = serverless OLAP.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What tool builds ETL pipelines serverlessly?

A

Dataflow.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Pub/Sub used for?

A

Asynchronous global messaging and ingestion layer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How to protect PII in streaming data?

A

Use DLP API for detection + masking within Dataflow.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How to import PB-scale on-prem data?

A

Transfer Appliance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Which service moves S3 data to GCS?

A

Storage Transfer Service.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How to ingest SaaS data into BigQuery?

A

BigQuery Data Transfer Service.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What reduces BigQuery query costs?

A

Partitioning, clustering, compressed formats, and byte limits.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When to choose CMEK for storage?

A

When regulations or audits require customer-managed keys.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Exam cue: “Scalable data warehouse with zero ops” → ?

A

BigQuery.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Exam cue: “Global relational transactions” → ?

17
Q

Exam cue: “Shared file system across VMs” → ?

18
Q

Exam cue: “ETL stream processing” → ?

A

Dataflow + Pub/Sub.

19
Q

Exam cue: “Mask PII in data lake” → ?

20
Q

Exam cue: “Automate archival after 1 year” → ?

A

Cloud Storage lifecycle policy to Archive class.