Batch vs. Stream Processing Flashcards

(10 cards)

1
Q

What are two common approaches for processing data?

A

Batch and stream processing

2
Q

What is batch processing?

A

Processing data in large, discrete blocks typically on an interval or after meeting some threshold.
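As a rough illustration of the "threshold" idea, the sketch below collects records until a batch is full and only then processes it. The names `batches` and `process_batch`, and the batch size of 4, are assumptions made for this example, not part of any particular framework.

```python
from typing import Iterable, Iterator, List


def batches(records: Iterable[int], batch_size: int) -> Iterator[List[int]]:
    """Group records into lists of up to batch_size items."""
    batch: List[int] = []
    for record in records:
        batch.append(record)
        # Threshold reached: hand the whole block over for processing.
        if len(batch) >= batch_size:
            yield batch
            batch = []
    # Flush whatever is left once the input is exhausted.
    if batch:
        yield batch


def process_batch(batch: List[int]) -> int:
    """Hypothetical processing step: sum the records in one batch."""
    return sum(batch)


batch_totals = [process_batch(b) for b in batches(range(1, 11), batch_size=4)]
print(batch_totals)  # [10, 26, 19]
```

Note that no result is produced until a batch fills up (or the input ends), which is where the latency of batch processing comes from.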

3
Q

What are two characteristics of batch processing?

A

Latency and throughput. Batch processing generally introduces latency (results wait while data is collected into a batch) but achieves high throughput.

4
Q

What are two pros of batch processing?

A

Efficiency and simplicity.

It can be resource-efficient for systems with large volumes of data (batches can be better optimized) and is generally simpler to implement than stream processing.

5
Q

What are the two major cons of batch processing?

A

Delay in insights and inflexibility.

Because a batch typically needs some amount of data to accumulate before processing starts, results are usually delayed (making it less practical for real-time scenarios). It also typically isn’t flexible enough to react to immediate changes or to adapt based on the incoming data.

6
Q

What is stream processing?

A

Stream processing involves continually processing data as soon as it arrives.
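A minimal sketch of the same idea in code: each event updates the result the moment it is seen, rather than waiting for a block to fill. The generator name `running_totals` and the sample events are assumptions for illustration only.

```python
from typing import Iterable, Iterator


def running_totals(events: Iterable[int]) -> Iterator[int]:
    """Emit an updated total immediately after each event arrives."""
    total = 0
    for event in events:
        total += event
        # A fresh result is available after every single event.
        yield total


stream_results = list(running_totals([5, 3, 7]))
print(stream_results)  # [5, 8, 15]
```

In a real system the events would come from an unbounded source such as a message queue; a short list stands in for that here.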

7
Q

What are two characteristics of stream processing?

A

Immediate processing and real-time suitability. Each record is handled as it arrives, so results are available with low latency.

8
Q

What are the two pros of stream processing?

A

Real-time analysis and dynamic data handling.

Since data is processed in real time, systems can provide insights and take action immediately. It’s also more adaptable to changing data and conditions.

9
Q

What are the two cons of stream processing?

A

Complexity and resource-intensity.

Stream processing is generally more complex than batch processing and can require significantly more resources, since data must be processed as it arrives.

10
Q

When might you use batch processing? What about stream processing? Can you provide some real-world examples?

A

Batch processing is preferred when all of the data is already available and results are not needed immediately, such as periodic financial reporting (e.g. daily or weekly).

Stream processing is preferred when real-time insights are required, in situations like fraud detection or real-time analytics.
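To make the contrast concrete, here is a small sketch that runs both styles over the same transactions: a batch-style daily report computed once all data is available, and a stream-style check that flags a suspicious transaction as soon as it is seen. The sample data, the function names, and the `FRAUD_THRESHOLD` value are all hypothetical, and a real fraud detector would use far richer rules than a single amount cutoff.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

Transaction = Tuple[str, float]  # (day, amount)

# Hypothetical sample data and cutoff, chosen only for this example.
transactions: List[Transaction] = [
    ("2024-01-01", 40.0),
    ("2024-01-01", 2500.0),
    ("2024-01-02", 15.0),
]
FRAUD_THRESHOLD = 1000.0


def daily_report(txns: Iterable[Transaction]) -> Dict[str, float]:
    """Batch style: summarize the complete data set in one pass."""
    totals: Dict[str, float] = defaultdict(float)
    for day, amount in txns:
        totals[day] += amount
    return dict(totals)


def flag_suspicious(txn_stream: Iterable[Transaction]) -> Iterator[Transaction]:
    """Stream style: emit an alert the moment a large transaction arrives."""
    for day, amount in txn_stream:
        if amount > FRAUD_THRESHOLD:
            yield (day, amount)


report = daily_report(transactions)
alerts = list(flag_suspicious(iter(transactions)))
print(report)  # {'2024-01-01': 2540.0, '2024-01-02': 15.0}
print(alerts)  # [('2024-01-01', 2500.0)]
```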
