What do queries run by big stores or hospitals tend to feature?
What types of queries should we avoid executing on a DBMS that serves a lot of customers simultaneously?
What are data warehouses?
How are data warehouses updated?
What is OLAP?
OLAP (Online Analytical Processing) refers to the process of analysing complex data stored in a data warehouse.
What is an OLAP query?
What is OLTP?
What do OLAP applications typically feature?
Describe an example fact table with schema Sales(productNo, date, store, price).
What do we get back when we query a data cube (fact table) with a range for each edge?
What are star schemas?
What are star schemas made up of?
What exactly does a star schema describe?
A database consisting of:
- a fact table R(A1, …, An, B1, …, Bm)
- A1 to An are the dimension attributes; in essence they are foreign keys referencing dimension tables
- Each dimension can also have further dimensions of its own
- B1, …, Bm are dependent attributes
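As a rough illustration of the definition above, here is a minimal sketch using Python's built-in sqlite3 module. It reuses the Sales(productNo, date, store, price) schema from these notes as the fact table. The dimension tables Products and Stores, their columns, and all the rows are assumptions invented for the example. productNo, date and store play the role of A1 to An, and price plays the role of a dependent attribute.

```python
import sqlite3

# Hypothetical star schema: Sales is the fact table from the notes;
# Products and Stores are assumed dimension tables made up for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Products (productNo INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE Stores   (store TEXT PRIMARY KEY, city TEXT);
CREATE TABLE Sales (
    productNo INTEGER REFERENCES Products(productNo),  -- dimension attribute
    date      TEXT,                                    -- dimension attribute
    store     TEXT REFERENCES Stores(store),           -- dimension attribute
    price     REAL                                     -- dependent attribute
);
INSERT INTO Products VALUES (1, 'drinks'), (2, 'baby');
INSERT INTO Stores VALUES ('S1', 'Leeds'), ('S2', 'York');
INSERT INTO Sales VALUES
    (1, '2024-01-05', 'S1', 4.0),
    (2, '2024-01-05', 'S1', 9.0),
    (1, '2024-01-06', 'S2', 4.5),
    (2, '2024-01-07', 'S2', 9.5);
""")

# An aggregate query in the OLAP style: join the fact table to its
# dimension tables, then group by dimension attributes and sum the price.
rows = conn.execute("""
    SELECT p.category, s.city, SUM(f.price) AS total
    FROM Sales f
    JOIN Products p ON f.productNo = p.productNo
    JOIN Stores s ON f.store = s.store
    GROUP BY p.category, s.city
    ORDER BY p.category, s.city
""").fetchall()
print(rows)
```

The fact table holds one row per sale, while descriptive details such as category and city live in the dimension tables and are reached through the foreign keys.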
What are the characteristics of star schemas?
Why don’t we mind that the data is denormalised in star schemas?
We don’t mind storing data in multiple places because we care more about making these types of queries fast.
What is Dicing?
What is the difference between slicing and dicing?
What is data mining?
It can be seen as an extended form of OLAP.
- The idea is that you have a lot of data and want answers to the questions you care about directly, rather than by issuing sequences of queries.
Why do we use data mining?
To discover patterns/knowledge in data
- We can use data mining to uncover relationships that may not be obvious (for example, nappies and beer being bought together)
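To make the nappies-and-beer idea concrete, here is a small sketch that counts how often each pair of items appears in the same basket and keeps the pairs that occur at least a chosen number of times. The baskets and the threshold `min_support` are made up for illustration. This is only a brute-force count over pairs, not an optimised mining algorithm.

```python
from collections import Counter
from itertools import combinations

# Made-up shopping baskets, for illustration only.
baskets = [
    {"nappies", "beer", "milk"},
    {"nappies", "beer"},
    {"milk", "bread"},
    {"nappies", "beer", "bread"},
    {"milk", "beer"},
]

# Count how many baskets contain each unordered pair of items.
pair_counts = Counter()
for basket in baskets:
    # Sorting gives every pair one canonical (alphabetical) order.
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

min_support = 3  # assumed threshold: a pair must appear in at least 3 baskets
frequent_pairs = {p: c for p, c in pair_counts.items() if c >= min_support}
print(frequent_pairs)
```

With this data only the pair (beer, nappies) reaches the threshold, which is the kind of non-obvious association the example in the notes refers to.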
If we have a table of films and which watchers enjoyed the films what could this help us to predict?
What are some applications of data mining?
What are the types of Discovered Knowledge from data mining?
What are the basics of the market-basket model?
What is Frequent-Itemset Mining?