Midterm 2 Flashcards

(9 cards)

1
Q

True/False: In Spark, an action triggers the execution of the computation that has been defined by transformations.

A

True
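
The lazy behaviour behind this card can be sketched in plain Python. This is a toy model, not the Spark API: `LazyDataset`, `double`, and `calls` are hypothetical names introduced for illustration. Transformations only record a step, and the action `collect` is what finally runs the recorded plan.

```python
class LazyDataset:
    """Toy stand-in for an RDD: records transformations, runs them on an action."""

    def __init__(self, source, steps=()):
        self.source = source
        self.steps = tuple(steps)

    def map(self, fn):
        # Transformation: return a new dataset with one more recorded step.
        return LazyDataset(self.source, self.steps + (("map", fn),))

    def filter(self, pred):
        # Transformation: also just recorded, nothing is computed here.
        return LazyDataset(self.source, self.steps + (("filter", pred),))

    def collect(self):
        # Action: only now is the recorded plan executed.
        rows = list(self.source)
        for kind, fn in self.steps:
            if kind == "map":
                rows = [fn(r) for r in rows]
            else:
                rows = [r for r in rows if fn(r)]
        return rows


calls = []


def double(x):
    calls.append(x)  # side effect so we can observe when work really happens
    return x * 2


plan = LazyDataset([1, 2, 3, 4]).map(double).filter(lambda x: x > 4)
calls_before_action = len(calls)  # still 0: defining the plan ran nothing
result = plan.collect()           # the action triggers execution
```

In this sketch `double` is never called until `collect()` runs, which mirrors the transformation/action split the card describes.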

2
Q

Lineage tracking in Spark is essential for:

A

Recovering lost data partitions.
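
A minimal sketch of the idea, assuming a toy model rather than Spark itself: the hypothetical `Partitioned` class keeps the source partitions plus the ordered list of functions applied to them (its lineage), so a lost partition can be rebuilt by replaying that lineage on the matching source partition only.

```python
class Partitioned:
    """Toy dataset that remembers how each partition was derived."""

    def __init__(self, source_partitions, lineage=()):
        self.source_partitions = source_partitions
        self.lineage = tuple(lineage)  # ordered functions applied so far
        self.cache = {}                # materialized partitions by index

    def map(self, fn):
        # Record the function in the lineage instead of applying it now.
        return Partitioned(self.source_partitions, self.lineage + (fn,))

    def compute(self, idx):
        # Replay the lineage on a single source partition.
        rows = self.source_partitions[idx]
        for fn in self.lineage:
            rows = [fn(r) for r in rows]
        return rows

    def materialize(self):
        self.cache = {i: self.compute(i) for i in range(len(self.source_partitions))}

    def get(self, idx):
        if idx not in self.cache:
            # Partition was lost: rebuild just this one from its lineage.
            self.cache[idx] = self.compute(idx)
        return self.cache[idx]


data = Partitioned([[1, 2], [3, 4], [5, 6]]).map(lambda x: x + 1).map(lambda x: x * 10)
data.materialize()
del data.cache[1]        # simulate losing partition 1
recovered = data.get(1)  # recomputed from the recorded lineage
```

Only the lost partition is recomputed; the other cached partitions are left untouched.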

3
Q

True/False: Columnar storage can significantly reduce the amount of data read from disk for queries accessing a subset of columns.

A

True

4
Q

How does a columnar format benefit big data analytics compared to a row format?

A

By avoiding reading attributes (columns) from disk that the query does not need.
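
The difference can be illustrated with a small in-memory sketch. It is a simplification under stated assumptions: the data, the helper names `scan_row_layout` and `scan_column_layout`, and the "values read" counter are hypothetical, and real formats read in blocks or pages rather than single values.

```python
# Three records with three attributes each: (user_id, country, bio).
rows = [
    (1, "US", "long bio text A"),
    (2, "DE", "long bio text B"),
    (3, "US", "long bio text C"),
]

# The same data laid out column by column.
columns = {
    "user_id": [r[0] for r in rows],
    "country": [r[1] for r in rows],
    "bio": [r[2] for r in rows],
}


def scan_row_layout(rows, col_index):
    """Row layout: every whole record is read to extract one attribute."""
    values_read = 0
    out = []
    for record in rows:
        values_read += len(record)  # all attributes of the record are read
        out.append(record[col_index])
    return out, values_read


def scan_column_layout(columns, name):
    """Column layout: only the requested column is read."""
    col = columns[name]
    return list(col), len(col)


row_result, row_cost = scan_row_layout(rows, 1)
col_result, col_cost = scan_column_layout(columns, "country")
```

Both scans return the same `country` values, but the row scan touches all 9 stored values while the column scan touches only 3.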

5
Q

True/False: Columnar formats can apply more efficient compression techniques because of the uniformity of data within each column.

A

True
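
One way to see why uniformity helps is run-length encoding, used here only as an illustrative technique; the card does not name a specific scheme. The helper `rle_encode` and the sample data are hypothetical. Values stored column by column form long runs, while the same values interleaved with other attributes in a row layout do not.

```python
from itertools import groupby


def rle_encode(values):
    """Run-length encode a sequence as (value, run_length) pairs."""
    return [(v, sum(1 for _ in grp)) for v, grp in groupby(values)]


# Column layout: all values of one attribute are stored together.
country_column = ["US", "US", "US", "DE", "DE", "US"]
column_encoded = rle_encode(country_column)

# Row layout: the same countries interleaved with a user id per record.
row_layout = [1, "US", 2, "US", 3, "US", 4, "DE", 5, "DE", 6, "US"]
row_encoded = rle_encode(row_layout)
```

In this sketch the column collapses to 3 runs, while the interleaved row layout yields 12 runs of length 1, so the encoding saves nothing there.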

6
Q

Which application scenario is more suitable for MongoDB than Parquet?

A

Managing real-time user interactions and profiles on a social networking site.

7
Q

Which application scenario is more suitable for MongoDB than Parquet?

A

Tracking and updating inventory levels in real-time across global warehouses for an international logistics company.

8
Q

Consider the following records in a MongoDB collection:

Record 1: { "name": "Alice", "numbers": [3, 5, 8] }

Record 2: { "name": "Bob", "numbers": [2, 4] }

Record 3: { "name": "Charlie", "numbers": [5, 2, 3] }

Record 4: { "name": "Dana", "numbers": [1, 9] }

Which record would appear first in a MongoDB collection sorted in descending order of the numbers field?

A

Record 4
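
The reasoning can be checked with a plain-Python sketch. It emulates the ordering rather than calling MongoDB, and relies on the documented behaviour that a descending sort on an array field compares the largest element of each array. The names `records` and `descending` are introduced here for illustration.

```python
records = [
    {"name": "Alice", "numbers": [3, 5, 8]},
    {"name": "Bob", "numbers": [2, 4]},
    {"name": "Charlie", "numbers": [5, 2, 3]},
    {"name": "Dana", "numbers": [1, 9]},
]

# Descending sort on an array field: each document is ranked by its
# largest array element (8, 4, 5, and 9 respectively).
descending = sorted(records, key=lambda doc: max(doc["numbers"]), reverse=True)
order = [doc["name"] for doc in descending]
first = order[0]
```

Dana's array contains 9, the largest value in the collection, so Record 4 comes first.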

9
Q
A