What are the 5 V’s of big data?
1- Volume
2- Veracity
3- Variety
4- Value
5- Velocity
What is volume as a characteristic of big data?
The large scale of data that cannot be stored on a single machine and requires specialized tools and frameworks for storage, processing, and analysis.
What is velocity as a characteristic of big data?
The speed at which data is generated, collected, and must be processed.
What is variety as a characteristic of big data?
The different forms of data, including structured, semi-structured, and unstructured formats such as text, images, audio, video, and sensor data. Big data systems must be flexible enough to handle diverse data formats effectively.
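As a rough illustration of variety, the sketch below normalizes a structured (CSV), a semi-structured (JSON), and an unstructured (free text) input into one common record shape. The sample data, the field names `sensor_id` and `temp`, and the text pattern are all hypothetical, chosen only for this example.

```python
import csv
import io
import json
import re

# Hypothetical inputs in three different forms (all values are made up).
csv_text = "sensor_id,temp\ns1,21.5\ns2,19.0\n"                       # structured
json_text = '{"sensor_id": "s3", "temp": 22.1, "tags": ["roof"]}'     # semi-structured
free_text = "sensor s4 reported 18.4 degrees"                         # unstructured

records = []

# Structured: every row has the same fixed columns.
for row in csv.DictReader(io.StringIO(csv_text)):
    records.append({"sensor_id": row["sensor_id"], "temp": float(row["temp"])})

# Semi-structured: keys are self-describing; extra keys (tags) are ignored here.
doc = json.loads(json_text)
records.append({"sensor_id": doc["sensor_id"], "temp": float(doc["temp"])})

# Unstructured: fields must be extracted with a pattern (assumed phrasing).
match = re.search(r"sensor (\w+) reported (\d+(?:\.\d+)?)", free_text)
if match:
    records.append({"sensor_id": match.group(1), "temp": float(match.group(2))})

for record in records:
    print(record)
```

Each form needs its own parsing step before the data can be analyzed together, which is the flexibility the answer above refers to.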
What is veracity as a characteristic of big data?
The accuracy and the reliability of the data. Data needs to be cleaned to remove noise and inaccuracies, ensuring the extracted insights are meaningful and reliable.
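A minimal sketch of the cleaning step, assuming a hypothetical temperature sensor whose plausible range is -40 to 60 degrees C. The readings and the range are invented for illustration; real pipelines usually apply more rules than this.

```python
from statistics import median

# Hypothetical raw readings (deg C); None marks a sample that was lost.
raw_readings = [21.4, 21.6, None, 21.5, 250.0, 21.7, -80.0, 21.3]

# Assumed plausible physical range for this hypothetical sensor.
VALID_RANGE = (-40.0, 60.0)


def clean(readings, valid_range):
    """Drop missing values and readings outside the plausible range."""
    low, high = valid_range
    return [r for r in readings if r is not None and low <= r <= high]


cleaned = clean(raw_readings, VALID_RANGE)
print(cleaned)          # noisy and missing samples removed
print(median(cleaned))  # summary computed on the cleaned data only
```

Without the cleaning step, the two implausible readings (250.0 and -80.0) would distort any average computed from the raw data.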
What is value as a characteristic of big data?
The usefulness of the data in achieving the desired outcomes.
The value of the data is linked to its accuracy and, in some cases, how quickly it can be processed.
What are the types of processing of big data?
1- OLTP: Online Transaction Processing (DBMSs).
2- OLAP: Online Analytical Processing (Data Warehousing).
3- RTAP: Real-Time Analytics Processing (Big Data Architecture & Technology).
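The three processing styles above can be contrasted with a small sketch. It uses Python's built-in `sqlite3` module purely as a stand-in; the `orders` table, the customers, and the amounts are hypothetical, and a production system would use a dedicated DBMS, data warehouse, or streaming engine for each style.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
)

# OLTP-style: many small writes, each committed atomically.
with conn:  # commits on success, rolls back on error
    conn.execute("INSERT INTO orders (customer, amount) VALUES (?, ?)", ("alice", 30.0))
    conn.execute("INSERT INTO orders (customer, amount) VALUES (?, ?)", ("bob", 12.5))
    conn.execute("INSERT INTO orders (customer, amount) VALUES (?, ?)", ("alice", 20.0))

# OLAP-style: a read-only aggregate over the accumulated history.
totals = conn.execute(
    "SELECT customer, SUM(amount) FROM orders GROUP BY customer ORDER BY customer"
).fetchall()
print(totals)

# RTAP-style: update a running result as each event arrives,
# instead of querying stored data afterwards.
live_totals = {}
for customer, amount in [("alice", 5.0), ("bob", 1.5)]:  # hypothetical event stream
    live_totals[customer] = live_totals.get(customer, 0.0) + amount
print(live_totals)

conn.close()
```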
What are the common sources of raw data?
1- Logs: generated by web apps and servers, which can be used for performance monitoring.
2- Transactional Data: generated by applications in domains such as e-commerce, banking, and finance.
3- Social Media.
4- Databases: Structured data residing in relational databases.
5- Sensor Data: generated by IoT systems.
6- Healthcare Data: generated by Electronic Health Record (EHR) systems and other healthcare apps.
7- Network Data: generated by network devices such as routers and firewalls.
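To make the first source in the list concrete, here is a hedged sketch of using logs for performance monitoring. The log line format (timestamp, method, path, status, latency) is an assumption made for this example, not a standard, and the lines themselves are invented.

```python
import re
from collections import Counter

# Hypothetical access-log lines in an assumed format.
log_lines = [
    "2024-01-01T10:00:00 GET /home 200 120ms",
    "2024-01-01T10:00:01 GET /cart 500 950ms",
    "2024-01-01T10:00:02 POST /pay 200 300ms",
    "malformed line",
]

# timestamp, method, path, 3-digit status, latency in milliseconds
LINE_RE = re.compile(r"^(\S+) (\S+) (\S+) (\d{3}) (\d+)ms$")

status_counts = Counter()
latencies = []
for line in log_lines:
    match = LINE_RE.match(line)
    if match is None:
        continue  # skip lines that do not fit the assumed format
    status_counts[match.group(4)] += 1
    latencies.append(int(match.group(5)))

avg_latency = sum(latencies) / len(latencies)
print(dict(status_counts))
print(f"average latency: {avg_latency:.1f} ms")
```

The same pattern (parse, skip malformed input, aggregate) applies to most raw sources in the list, with only the parsing step changing.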