Data classified (3)
Structured
Semi-Structured
Unstructured
Structured Data
Semi-structured data
Unstructured data
Lacks a predifined model or consistent organizational structure.
- Ei.:
- Text documents, images, videos…
Catagories of data stores (2)
File Storage explained
Delimited Text Files (File types)
JSON (File types)
XML (File types)
Binary Large Object (BLOB) (File types)
(Storage) Optimized File Formats
Avro (Optimized File Formates)
ORC (Optimized File Formates)
Parquet (Optimized File Formates)
Databases explained
Centralized and specialized system to store data, and enable querying.
Relational Databases
Non-Relational Databases (NoSQL Databases)
They do not enforce a realational schema on the data they store.
Key-value (NoSQL Databases categories)
-One record is made up of
- a unique key
- an associated value that can be of any format
Document (NoSQL Databases categories)
-One record in made up of
- a unique key
- a JSON document
Column Family (NoSQL Databases categories)
Graph (NoSQL Databases categories)
Data is represented as nodes, that are entities with links (edges) used to define the relationships between them.
Transactional data processing
OLTP systems (Transactional data processing)
ACID properties (Transactional data processing)