What is data?
Raw, unprocessed facts (numbers, text, images, etc.) that become meaningful information once they are processed
What are the three main types of data, classified by structure?
Structured, Unstructured, Semi-structured
What is structured data?
Data organised in tables with rows and columns and a fixed schema
Give two examples of structured data
Excel spreadsheets, SQL tables
What is unstructured data?
Data with no predefined format or structure
Give two examples of unstructured data
Text documents, images, videos, social media posts
What is semi-structured data?
Data with some organisation using tags or keys but not rigid tables
Give two examples of semi-structured data
JSON files, XML files, emails
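As a minimal sketch of what "some organisation using keys but not rigid tables" can look like, the snippet below parses a small JSON document with Python's standard json module. The records and their field names are made up for illustration; the point is that each record labels its own values and the two records do not share the same fields.

```python
import json

# Hypothetical records: keys label each value, but no fixed schema is enforced.
raw = """
[
  {"name": "Ada", "email": "ada@example.com"},
  {"name": "Grace", "phones": ["555-0100", "555-0101"]}
]
"""

records = json.loads(raw)

# Each record carries its own keys, so the fields can differ between records.
field_sets = [sorted(record.keys()) for record in records]
print(field_sets)
```

A fixed-schema table would need the same columns for every row; here the second record simply omits "email" and adds a nested list.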
What is a CSV file?
A comma-separated values text file used to store tabular data
What is a TSV file?
A tab-separated values text file
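The two formats differ only in the delimiter. A short sketch using Python's standard csv module, with made-up sample rows and a hypothetical helper named read_rows, shows the same table read from both:

```python
import csv
import io

# Hypothetical sample data: the same table written with two different delimiters.
csv_text = "id,name,city\n1,Ada,London\n2,Grace,New York\n"
tsv_text = "id\tname\tcity\n1\tAda\tLondon\n2\tGrace\tNew York\n"


def read_rows(text, delimiter):
    """Parse delimited text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text), delimiter=delimiter))


csv_rows = read_rows(csv_text, ",")
tsv_rows = read_rows(tsv_text, "\t")

# Same rows and columns, regardless of which delimiter was used on disk.
print(csv_rows == tsv_rows)
```

Note that every value is read back as a string; neither format records column types.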
What is a relational database?
A database that stores structured data in related tables
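To illustrate "related tables", here is a small sketch using SQLite through Python's built-in sqlite3 module. The customers and orders tables and their contents are assumptions chosen for the example; the orders table refers to customers through a customer_id column, and a JOIN follows that relationship.

```python
import sqlite3

# In-memory database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")

# Two hypothetical tables: each order points at one customer via customer_id.
conn.executescript("""
    CREATE TABLE customers (
        id   INTEGER PRIMARY KEY,
        name TEXT NOT NULL
    );
    CREATE TABLE orders (
        id          INTEGER PRIMARY KEY,
        customer_id INTEGER NOT NULL REFERENCES customers(id),
        total       REAL NOT NULL
    );
    INSERT INTO customers VALUES (1, 'Ada'), (2, 'Grace');
    INSERT INTO orders VALUES (10, 1, 25.0), (11, 1, 40.0), (12, 2, 15.0);
""")

# The JOIN combines rows from both tables through the shared key.
rows = conn.execute("""
    SELECT c.name, o.total
    FROM orders AS o
    JOIN customers AS c ON c.id = o.customer_id
    ORDER BY o.id
""").fetchall()
print(rows)
conn.close()
```

Each customer's name is stored once and reused by every order that references it.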
What does RDBMS stand for?
Relational Database Management System
Give two examples of relational databases
MySQL, SQL Server, Oracle, IBM DB2
What is OLTP used for?
Processing day-to-day business transactions and operations (Online Transaction Processing)
What is OLAP used for?
Analysing large datasets for reporting and insights (Online Analytical Processing)
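The difference between the two workloads shows up in the shape of the queries. The sketch below runs both against a single in-memory SQLite database purely for contrast; in practice OLTP and OLAP usually run on separate systems, and the sales table here is a made-up example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales (id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
)

# OLTP-style work: many small writes, each committed as its own transaction.
for region, amount in [("north", 100.0), ("south", 50.0), ("north", 25.0)]:
    with conn:  # commits on success, rolls back if an exception is raised
        conn.execute(
            "INSERT INTO sales (region, amount) VALUES (?, ?)", (region, amount)
        )

# OLAP-style work: a read-only query that scans and summarises many rows.
totals = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()
print(totals)
conn.close()
```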
What is a non-relational database?
A database that does not rely on fixed relational tables, giving it a flexible schema suited to semi-structured or unstructured data
Give two examples of NoSQL databases
MongoDB, Cassandra, Redis
What is an information model?
A high-level abstract representation of entities, relationships, and business rules
What is a data model?
A technical blueprint for how data is stored in a database
Who uses information models?
Business analysts and stakeholders
Who uses data models?
Database designers and developers
What is the hierarchical model?
A database model that stores data in a tree structure of parent and child records, where each child has exactly one parent
Name two problems with the hierarchical model
Poor support for many-to-many relationships, data redundancy
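Both problems can be seen in a small sketch that models the tree with plain Python dictionaries. The course and student data are hypothetical. Because a child record can sit under only one parent, a student enrolled in two courses (a many-to-many relationship) has to be stored twice.

```python
# Hypothetical tree: each course (parent) owns its student records (children).
tree = {
    "Databases": [
        {"student": "Ada", "email": "ada@example.com"},
    ],
    "Networks": [
        {"student": "Ada", "email": "ada@example.com"},  # Ada stored a second time
        {"student": "Grace", "email": "grace@example.com"},
    ],
}

# Count how many copies of each student record the tree holds.
copies = {}
for students in tree.values():
    for record in students:
        name = record["student"]
        copies[name] = copies.get(name, 0) + 1

print(copies)
```

If Ada's email changes, every copy has to be updated, which is the redundancy problem in practice.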
What is an ERD?
An entity-relationship diagram, which shows the entities in a database and the relationships between them