Batch & Stream processing Flashcards

(10 cards)

1
Q

Define batch processing.

A

A method of processing data in large groups or batches at scheduled intervals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is stream processing?

A

The continuous input, processing, and output of data in real-time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

True or false: Lambda architecture combines batch and stream processing.

A

TRUE

It allows for both real-time and historical data processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Fill in the blank: Kappa architecture simplifies the _______ architecture.

A

Lambda

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the main advantage of stream processing?

A

It enables real-time analytics and immediate insights from data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define event time in stream processing.

A

The time at which an event occurred, as opposed to when it was processed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does windowing refer to in stream processing?

A

The technique of grouping events into finite chunks for processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True or false: Micro-batching is a form of batch processing.

A

TRUE

It processes data in small batches at short intervals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a data lake?

A

A centralized repository that stores structured and unstructured data at scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Fill in the blank: Apache Kafka is commonly used for _______ processing.

A

stream

How well did you know this?
1
Not at all
2
3
4
5
Perfectly