CH1 P3 Flashcards

(8 cards)

1
Q

What are the 5 V’s of big data?

A

1- Volume
2- Veracity
3- Variety
4- Value
5- Velocity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is volume as a characteristic of big data?

A

The large scale of data that cannot be stored on a single machine and requires specialized tools and frameworks for storage, processing, and analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is velocity as a characteristic of big data?

A

The speed at which data is generated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is variety as a characteristic of big data?

A

The different forms of data. Including structured, unstructured, and semi-structured formats like text, images, audio, video, and sensor data. Big data systems must be flexible enough to handle diverse data formats effectively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is veracity as a characteristic of big data?

A

The accuracy and the reliability of the data. Data needs to be cleaned to remove noise and inaccuracies, ensuring the extracted insights are meaningful and reliable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is value as a characteristic of big data?

A

The usefulness of the data in achieving the desired outcomes.
The value of the data is linked to its accuracy and, in some cases, how quickly it can be processed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the types of processing of big data?

A

1- OLTP: Online Transaction Processing (DBMSs).
2- OLAP: Online Analytical Processing (Data Warehousing).
3- RTAP: Real-Time Analytics Processing (Big Data Architecture & Technology).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the common sources of raw data?

A

1- Logs: generated by web apps and servers, which can be used for performance monitoring.
2- Transactional Data: generated by apps such as eCommerce, banking, and financial.
3- Social Media.
4- Databases: Structured data residing in relational databases.
5- Sensor Data: generated by IoT systems.
6- Healthcare Data: generated by Electronic Health Record (EHR) and other healthcareapps.
7- Network Data: generated by network devices such as routers and firewalls.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly