Database systems Flashcards

Question

What are 4 differences between triggers and stored procedures?

Answer 1

- Triggers are **automatic** whereas stored procedures are **explicitly called** - Triggers are **event-driven** whereas stored procedures are **task-driven** - Triggers are **table-bound** whereas stored procedures are **independant** - Triggers are **hidden** but stored procedures are **visible**

Answer 2

- First execution: compiled and stored in cache - Subsequent calls: executed directly from cache - This reduces CPU and parsing overhead

Answer 3

* IN: accepts a value * OUT: returns a values * INOUT: accepts and returns a value ## Footnote OUT must be captured using session variables

Answer 4

- Users can be granted access to **procedures**, but **not underlying tables** - Prevents unauthorised data access

Answer 5

- Difficult to **debug** - Require **specialised skills** - Poor **portability** - Increase server resource usage

Answer 6

- A cursor **iterates** through a result set row-by-row - Used inside stored procedures only - Read-only, forward-only

Answer 7

- Use UNKNOWN logical state and can be checked with IS NULL or IS NOT NULL

Answer 8

- PDO is an **object-oriented** databased access layer in PHP - Provides a **secure**, **consistent** interface for interacting with databases - `$pdo = new PDO($dsn, $username, $password);`

Answer 9

- Can set modes as attributes - `$pdo->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_WARNING);` - Can also use `PDO::ERRMODE_SILENT` and `PDO::ERRMODE_EXCEPTION`

Answer 10

- Use placeholders that place user-supplied data into SQL - ``` $stmt = $pdo->prepare("SELECT * FROM users WHERE email = ?"); $stmt->execute([$email]);``` - Replaces ? with variable ## Footnote Prevents SQL injection

Answer 11

- Data is retrieved using fetch() or fetchAll() - `$row = $stmt->fetch(PDO::FETCH_ASSOC);` - Returns an associative array

Answer 12

- `$pwd = password_hash($password, PASSWORD_DEFAULT); ` - Protects users if database is compromised and uses strong up-to-date hashing algorithms

Answer 13

- `password_verify($plainPassword, $storedHash); ` - Returns true if password matches and false if not - Passwords are not rehashed- comparison is handled internally

Answer 14

- A form - `

Answer 15

- By checking the request method - `$_SERVER['REQUEST_METHOD']=== 'POST'` - Using `if (...)` would show the form and process it

Answer 16

- GET would expose any sensitive data in the URL

Answer 17

- `$var = $_POST['variable']` - `isset($_POST['variable'])` - Need to use "name" attribute in the HTML input otherwise it doesn't get sent to PHP

Answer 18

- A transaction is an executing program that forms a **logical unit** of database processing - Can be executed inside the **application** or the **DBMS server** - Can be **Read-only**: SELECT, or **Write-only**: INSERT, UPDATE, DELETE

Answer 19

- Multiple transactions may access the same database items at the same time - Transactions run as **independant programs**: own **namespace** and own **memory** - DBMS manages coordination to ensure **correctness**, **consistency** and **performance**

Answer 20

- **Secondary storage** is slow - Frequently accessed data is copied into **buffer** in **main memory** on the DBMS **server** - Applications **read from buffer** which improves performance and response time

Answer 21

- A WRITE updates the database **conceptually** - Actual update is stored on **DBMS buffer** - Disk/secondary storage is updated later - Writing to disk is slow so buffering allows efficient **transaction processing**

Answer 22

- Multi-user systems have multiple users trying to access the same database at the same time - Allows **concurrecy** - Critical for systems with **high availability** and that require **fast response times**

Answer 23

- Concurrency is when multiple users are **accessing** and **changing** the database at the same time - Achieved using **multiprogramming**: multiple processes admitted to **ready queue**, and they are **scheduled** so they **interleave** - Not truly parallel

Answer 24

- Interleaving is when the CPU switches between processes and created the **illusion** of simulataneous execution - Maximises system **throughput**

Answer 25

- **Lost update**: Two transactions read the same original value; one update overwrites the other - **Dirty read (Temporary update)**: A transaction reads uncommitted data that is later rolled back after abortion - **Incorrect summary**: An aggregate calculation mixes old and new values due to concurrent updates - **Unrepeatable read**: Re-reading the same item returns a different value becuase another transaction updated it

Answer 26

- **Atomicity**: transaction fully completes or fully fails- no partial changes - **Consistency**: database moves from one valid state to another- rules and constraints preserved - **Isolation**: concurrent transactions do not interfere- appear as if executed sequentially - **Durability**: commited changes are permanent, even after crashes or failures

Answer 27

- Concurrent transations are executed in an **interleaved fashion** - Schedule defines the execution **order** of all operations - We want to preserve transaction order and **avoid conflicts**

Answer 28

1. Belong to different transactions 2. Access the same item 3. At least one operation is a WRITE ## Footnote Conflict = inconsistent data

Answer 29

- **Serial** schedules are when transactions execute one after another - **Non-serial** is interleaved transactions but can produce **errors** - **Serialisable** schedule is a an interleaved schedule that is **equivalent** to some serial schedule - **Equivalence** focusses on **read/write** operations only

Answer 30

- We **can't rely on observing states** to determine if schedules are equivalent- some non-equivalent schedules may produce the same state - **Conflict equivalence**: If the **relative order** of any 2 conflicting operations is the same in both schedules

Answer 31

- Construct a dependancy graph where **nodes are transactions** and **directed edges are conflicts** - If there are **no cycles** the schedule is serializable - Ensures interleaved schedules **preserve correctness**

Answer 32

- **Key-based**: value is record or object, accessed quickly by key - **Document-based**: stores data in documents accessed by document_id - **Column-based**: each column stored in own file- for large data storage - **Graph-based**: entities=nodes, relations=edges

Answer 33

- A distributed system can only guaratee 2/3: - **Consistency**: all nodes have same data - **Availability**: system is consistently available for read/write operations - **Partition tolerance**: system is continually available during network faults

Answer 34

- **Embedding**: stores data in single document structure - **Referencing**: stores links or references from one document to another

Answer 35

- When related items are **frequently used or fetched together** - **One-to-one** relationship between documents - Document is **not a key document** - Data does **not change or grow** - Nested documents have **same volatility**

Answer 36

- When embedding would result in **substantial data duplication** - When documents **grow** - **Many-to-many** relationship between documents - The document is a **key document** - **Fast writes** are required

Answer 37

- Uses a **key-value model** - Data split into 1024 **shards** called **Buckets** using **hashing** - **Clusters** are multiple nodes that **act as one database** that **automatically share and balance the load**

Answer 38

- **Master-slave**: master becomes bottleneck so slows down scalability and if it goes down the service is stopped - **Masterless**: any node can handle reads and writes so avoids bottlenecks ## Footnote Masterless has better performance, scalability and availability

Answer 39

- Relational schema requires **migrations and joins** that can be complex and risky - Document data modelling **preserves natural structure** rather than flattening

Answer 40

- Each table mapped directly to **single JSON document** - Column becomes fields and PK becomes document keys - **One-to-many relationships** represented using **nested arrays** so related data is stored in a single document

Answer 41

- **Nested**: one-to-one or one-to-many - **Seperate**: many-to-one or many-to-many

Answer 42

- **Nested**: data reads and write are mostly parent+child fields - **Seperate**: data reads are mostly parent fields and writes are mostly parent or child fields

Database systems Flashcards

(66 cards)