True/False: In Spark, an action triggers the execution of the computation that has been defined by transformations.
True
Lineage tracking in Spark is essential for:
Recovering lost data partitions.
True/False: Columnar storage can significantly reduce the amount of data read from disk for queries accessing a subset of columns.
True
How does column format benefit big data analytics compared to row format?
By avoiding unnecessary attribute reading from disk.
True/False: Columnar formats can apply more efficient compression techniques because of the uniformity of data within each column.
True
Which application scenario is more suitable for MongoDB than Parquet?
Managing real-time user interactions and profiles on a social networking site.
Which application scenario is more suitable for MongoDB than Parquet?
Tracking and updating inventory levels in real-time across global warehouses for an international logistics company.
Consider the following records in a MongoDB collection:
Record 1: { “name”: “Alice”, “numbers”: [3, 5, 8] }
Record 2: { “name”: “Bob”, “numbers”: [2, 4] }
Record 3: { “name”: “Charlie”, “numbers”: [5, 2, 3] }
Record 4: { “name”: “Dana”, “numbers”: [1, 9] }]
Which record would appear first in a MongoDB collection sorted in descending order of the numbers field?
Record 4