The main idea of the Lambda Architecture is…
To build Big Data systems as a series of layers
Each layer satisfies a subset of the desired properties and builds upon the functionality provided by the layers beneath it
Batch Layer
Speed Layer
Speed Layer Goal
To have updated information on what happened since the last batch view was generated
Serving Layer Goal
To merge views created by the batch layer with views created by the speed layer
Difference Between Batch and Speed Layer
One big difference is that the speed layer only looks at recent data, whereas the batch layer looks at all the data at once
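The three cards above (speed layer goal, serving layer goal, batch vs. speed) can be illustrated with a small sketch. This is only a toy model under stated assumptions, not how a real Lambda Architecture deployment is built: the event log `EVENTS`, the `cutoff` timestamp, and the function names `batch_view`, `speed_view`, and `query` are all hypothetical names chosen for illustration.

```python
from collections import Counter

# Hypothetical event log of (timestamp, page) pairs; the data is made up.
EVENTS = [
    (1, "home"), (2, "about"), (3, "home"),
    (4, "home"), (5, "about"),
]

def batch_view(events, cutoff):
    """Batch layer: recompute counts from all data up to and including `cutoff`."""
    return Counter(page for ts, page in events if ts <= cutoff)

def speed_view(events, cutoff):
    """Speed layer: count only the recent events that arrived after `cutoff`."""
    return Counter(page for ts, page in events if ts > cutoff)

def query(page, batch, speed):
    """Serving step: merge the batch view and the speed view for one key."""
    return batch[page] + speed[page]

cutoff = 3  # assumed time at which the last batch view was generated
batch = batch_view(EVENTS, cutoff)
speed = speed_view(EVENTS, cutoff)
print(query("home", batch, speed))  # 3
```

The batch view covers everything up to the cutoff, the speed view covers only what happened since, and the merged answer matches a full recount over all events.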
Serving Layer
Storing data in raw format has many advantages:
Data should be stored in raw format and should be:
- Kept forever