Week 11 - MapReduce Flashcards

Question 1

Q

What are the steps involved in MapReduce?

Answer

A

Input
Split
Mapping
Shuffling
Reducing
Output
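
The steps above can be sketched in plain Python with a word count. This is only an illustrative, single-process simulation, not Hadoop code; the function names (split_input, map_phase, shuffle, reduce_phase, word_count) and the sample text are assumptions made up for this example.

```python
from collections import defaultdict


def split_input(text):
    """Split: break the input into independent chunks (here, one per line)."""
    return [line for line in text.splitlines() if line.strip()]


def map_phase(chunk):
    """Mapping: emit an intermediate (word, 1) pair for every word in a chunk."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(pairs):
    """Shuffling: group all intermediate values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


def reduce_phase(key, values):
    """Reducing: collapse the list of values for one key into a single result."""
    return key, sum(values)


def word_count(text):
    chunks = split_input(text)                                        # Input -> Split
    mapped = [pair for chunk in chunks for pair in map_phase(chunk)]  # Mapping
    grouped = shuffle(mapped)                                         # Shuffling
    return dict(reduce_phase(k, v) for k, v in grouped.items())      # Reducing -> Output


# Hypothetical sample input: three lines, so three splits.
result = word_count("deer bear river\ncar car river\ndeer car bear")
print(result)
```

In a real cluster each split would be mapped on a different node and the shuffle would move data over the network; here everything runs in one process to keep the order of the steps visible.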

Question 2

Q

What is Hive?

Answer

A

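Hive is a data warehouse tool for Hadoop. Its query language, HiveQL (HQL), is similar to SQL, and its tables are stored on HDFS as flat files.
Developed by Facebook and now open source. It provides the SQL abstraction needed to integrate SQL-like queries into the underlying Java, so queries do not have to be written against the low-level Java API.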

Question 3

Q

What is Pig?

Answer

A

Pig scripts are written in Pig Latin, a dataflow language that is a bit like Perl. Pig was developed by Yahoo and can execute its Hadoop jobs on MapReduce, Tez, or Spark. It can be extended with user-defined functions (UDFs), which can be written in a variety of languages.

Question 4

Q

What are the differences between Hive and Pig?

Answer

A

Hive

Used by analysts
Used for reporting
Declarative, SQL-like language
Works on the server side of a cluster
For structured data

Pig

Used by programmers and researchers
Used for programming
Procedural data-flow language
Works on the client side of a cluster
For semi-structured data

Question 5

Q

What are the MapReduce architecture components?

Answer

A

Job client
Job tracker: monitors resources and coordinates jobs; monitors the health of all the task trackers (transferring jobs to other nodes once failures are found); monitors the execution percentage of jobs and resource availability.
Task tracker: periodically sends a heartbeat with resource information and job execution status to the JobTracker; receives and executes commands from the JobTracker (start new tasks or kill existing tasks).
Task: map task and reduce task
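
The heartbeat relationship between the job tracker and the task trackers can be sketched as a toy model. This is not Hadoop's actual API: the class name, its methods (heartbeat, assign, live_trackers, reassign_failed), the timeout value, and the tracker and task ids are all hypothetical, chosen only to show how missed heartbeats could lead to work being moved to another node.

```python
class JobTracker:
    """Toy job tracker: records heartbeats and moves tasks off silent trackers."""

    def __init__(self, timeout):
        self.timeout = timeout   # max allowed gap between heartbeats (assumed units)
        self.last_seen = {}      # tracker id -> time of its last heartbeat
        self.assignments = {}    # task id -> tracker id currently running it

    def heartbeat(self, tracker_id, now):
        """Called by a task tracker to report that it is still alive."""
        self.last_seen[tracker_id] = now

    def assign(self, task_id, tracker_id):
        """Command a task tracker to start a task."""
        self.assignments[task_id] = tracker_id

    def live_trackers(self, now):
        """Trackers whose last heartbeat is recent enough."""
        return sorted(t for t, seen in self.last_seen.items()
                      if now - seen <= self.timeout)

    def reassign_failed(self, now):
        """Move tasks from failed trackers to the first live one; return moved ids."""
        live = self.live_trackers(now)
        moved = []
        if not live:
            return moved
        for task_id, tracker_id in sorted(self.assignments.items()):
            if tracker_id not in live:
                self.assignments[task_id] = live[0]
                moved.append(task_id)
        return moved


jt = JobTracker(timeout=10)
jt.heartbeat("tt1", now=0)
jt.heartbeat("tt2", now=0)
jt.assign("map-0", "tt1")
jt.assign("map-1", "tt2")
jt.heartbeat("tt1", now=12)          # tt2 stays silent after time 0
moved = jt.reassign_failed(now=15)   # tt2 has exceeded the timeout
print(moved, jt.assignments)
```

A real job tracker would also weigh resource availability and progress when choosing a new node; picking the first live tracker is a simplification.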

Question 6

Q

Which of the following maps input k,v pairs to a set of intermediate k,v pairs?

A. Mapper
B. Reducer
C. Both A and B

Answer

A

A. Mapper

Question 7

Q

What is the correct sequence of data flow in MapReduce?

a. InputFormat
b. Mapper
c. Combiner
d. Reducer
e. Partitioner
f. OutputFormat

A. abcdfe
B. abcedf
C. acdefb
D. abcdef

Answer

A

B. abcedf
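
The ordering in answer B can be traced in a small single-process sketch. It is an illustration under assumptions, not Hadoop code: the function names, the choice of two reducers, and the use of zlib.crc32 as the hash are all made up for the example. The point it shows is that the combiner runs on each mapper's output before the partitioner decides which reducer receives each key.

```python
import zlib
from collections import Counter, defaultdict

NUM_REDUCERS = 2  # assumed value for this sketch


def mapper(line):
    """b. Mapper: emit (word, 1) for each word."""
    return [(word, 1) for word in line.split()]


def combiner(pairs):
    """c. Combiner: pre-aggregate one mapper's output locally."""
    counts = Counter()
    for key, value in pairs:
        counts[key] += value
    return list(counts.items())


def partitioner(key, num_reducers):
    """e. Partitioner: choose the reducer index for a key."""
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def reducer(key, values):
    """d. Reducer: merge all values that arrived for one key."""
    return key, sum(values)


def run(lines):
    # a. InputFormat: here the input is already split into lines.
    partitions = [defaultdict(list) for _ in range(NUM_REDUCERS)]
    for line in lines:
        combined = combiner(mapper(line))        # b, then c
        for key, value in combined:              # e routes each combined pair
            partitions[partitioner(key, NUM_REDUCERS)][key].append(value)
    output = {}
    for part in partitions:                      # d runs once per partition
        for key, values in part.items():
            out_key, out_value = reducer(key, values)
            output[out_key] = out_value
    return output                                # f. OutputFormat: here, a dict


counts = run(["a b a", "b b c"])
print(counts)
```

Because the combiner collapses ("a", 1), ("a", 1) into ("a", 2) on the map side, fewer pairs have to be partitioned and shuffled to the reducers.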

Question 8

Q

The total number of partitioners is equal to

A. The number of reducers
B. The number of mappers
C. The number of combiners
D. All of the above

Answer

A

A. The number of reducers
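
One way to see why the count matches the reducers: a hash-and-modulo partitioner can only return indices from 0 up to the number of reducers minus one, so there is exactly one partition per reducer. The sketch below is an assumption-laden illustration; the partition function, the crc32 hash, and the sample keys are hypothetical.

```python
import zlib


def partition(key, num_reducers):
    """Return the reducer index for a key; always in range(num_reducers)."""
    # crc32 is a stable stand-in for the key's hash code (an assumption here).
    return (zlib.crc32(key.encode("utf-8")) & 0x7FFFFFFF) % num_reducers


num_reducers = 3
keys = ["deer", "bear", "river", "car", "deer"]
assignments = [(key, partition(key, num_reducers)) for key in keys]
used_partitions = {index for _, index in assignments}
print(assignments)
```

The same key always maps to the same index, which is what guarantees that all values for a key end up at a single reducer.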

(8 cards)