When is a multidimensional data model better suitable than a relational data model?
How is data categorized in multidimensional models?
What three application areas are there for multidimensional data models?
*Means: In which processes are they used?
What is a pivot table?
A two-dimensional spreadsheet with associated subtotals and totals that supports viewing more complex data by nesting several dimensions on the x- or y-axis and displaying data on multiple pages.
Why are spreadsheets not adequate for managing and storing multidimensional data?
They tie data storage to tightly to the presentation - they do not seperate the structural information from the desired views of the information.
Why are relational databases and SQL databases not adequate for managing and storing multidimensional data?
- It is harder to combine all the dimensions you are interested in
What is a multidimensional database/data warehouse built up?
A collection of related cubes
Why can cubes easily manage the addition of new dimension values?
Because dimensions in a cube are first-class, built-in concepts with associated domains
How many dimensions can a cube have?
Unlimited, but current tools start having performance problems with more than 10 dimensions.
When do cubes become more sparse?
As there are more dimensions added to the cube and when the level of detail (granularity) becomes finer
How many dimensions can you view of a cube?
Generally, only two or three. But it can show up to 4 low-cardinality dimensions by nesting one dimension within another on the axis.
What is an important use of dimensions in a multidimensional database?
To use dimensions to provide as much context as possible for facts.
Why are data redundancies sometimes okay in multidimensional databases?
It is okay if it increases the data’s information value.
-> Since the data is derived from other sources, and not born into the multidimensional database, redundancy problems related to updates can be managed more readily
Where is in general redundancy in a multidimensional database?
In the dimensions, not the facts.
What is the relation of hierarchy and dimension?
Dimensions are used for selecting and aggregating data at the desired level of detail. These levels of details are part of the hierarhy (time -> year, month, day).
-> Sometimes define multiple hierarchies for a dimension (fiscal year and calender year)
How do you avoid duplicate definitions when defining multiple hierarchies for dimension?
The metadata of a cube defines the dimension hierarchy.
How can you avoid adding dimensions to a cube if you want to add extra information?
Add a property that you initially wanted as a dimension as a level under an existing dimension:
Dimension: product –> you add the level of package size instead of making package-size an dimension.
What kind of ordering is available in a multi-dimensional database?
What are facts?
The representation of the subject for a specific combination of dimensions
How is the granularity of a fact determined?
By the levels from which its combination of dimension values is drawn.
Which three types of facts are commonly included in data warehouses?
–> Because these 3 are complementary, a database often has them all.
Of which two components does a measure consist?
2. A formula that can combine several measure values into one (aggregation function sum)
What is a measure?
A representation of the properties of the fact that the user wants to optimize.
-> Takes on different values for various dimension combinations.
Which three classes of measures behave differently in computations?