How distributed file systems work?
Hadoop MapReduce
Map
input data is split into discrete chunks to be processed
split-apply
Reduce
output of the map phase is aggregated to produce the desired result
(combine)
The simple nature of the programming model (MapReduce) lends itself to
efficient and large-scale implementations across thousands of cheap nodes
Key benefits of MapReduce
Limitations of MapReduce
2. Low level framework (hard to use)