Chapter 1 Flashcards

(17 cards)

1
Q

What are the data lifecycle four stages?

A
  1. Ingest (acquiring data and bringing data into the GCP)
  2. Store (persisting data to a storage system)
  3. Process and analyze (transforming data into a usable format for analysis applications)
  4. Explore and visualize
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the three ingestion modes?

A
  1. Application data
  2. Streaming data
  3. Batch data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is nearline storage and coldline storage?

A

Nearline storage is used for data that is accessed less than once per
30 days. Coldline storage is used to store data accesses less than once
per year.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are 3 types of data structure how do they differ?

A
  1. Structured (Structured data has a fixed set of attributes that can be modeled in a table of rows and columns.)
  2. Semi-structured (Semi-structured data has attributes like structured data, but the set of attributes can vary from one instance to another.)
  3. Unstructured (Unstructured data does not fit into a tabular structure. Images and audio files are good examples of unstructured data.)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are types of structured data structure?

A
  1. Transactional (one row at time)
  2. Analytical (table aggregation)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are types of Semi-Structured data structure?

A
  1. Fully Indexed (index per each field)
  2. Row Key Access (just an id)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Which storage technology (for structured data) for:
1. Transactional - Regional
2. Transactional - Global
3. Analytical

A
  1. Cloud SQL
  2. Cloud Spanner
  3. BigQuery
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Which storage technology (for semi structured data) for:
1. Fully indexed
2. Row Key

A
  1. Cloud datastore
  2. Bigtable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which storage technology for unstructured data

A

Cloud storage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Expand the acronym: OLTP

A

Online transaction processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Expand the acronym: OLTP

A

Online analytical processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are NoSQL databases designs?

A
  1. Key-value
  2. Document
  3. Wide column
  4. Graph
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How key-value database design works?
Which storage tool works with this paradigm?

A

Key-value data stores are databases that use associative arrays or dictionaries as the basic datatype. Keys are data used to look up values.

Tool in gcp is: Cloud Memorystore

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How document database design works?
Which storage tool works with this paradigm?

A

Document stores allow complex data structures, called documents, to be used as values and accessed in more ways than simple key lookup.

Tool in gcp is: Cloud Datastore.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the tool for storing loging data?

A

Stackdriver Logging is a menaged service designed for storing logging data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the best storage service for a IoT data

17
Q

What is the best tool designed for transaction processing at a regional level