What method did you use to combine different data sources?
Full Outer Join: keeps all the information from both tables, regardless of whether the rows match
What other methods of joining are there?
Left Join (Left Outer): keeps all the information from the left table and brings in only the matching values from the right table
Inner Join: keeps only the rows that match in both tables
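To make the three join types above concrete, here is a minimal sketch in plain Python. The `customers` and `orders` tables, the `customer_id` column, and the `join` helper are all hypothetical and used only for illustration; in a real project the same behaviour would normally come from SQL or a data library rather than hand-written code.

```python
# Hypothetical sample tables; "customer_id" is the related column.
customers = [
    {"customer_id": 1, "name": "Asha"},
    {"customer_id": 2, "name": "Ben"},
    {"customer_id": 3, "name": "Cara"},
]
orders = [
    {"customer_id": 2, "total": 40},
    {"customer_id": 3, "total": 15},
    {"customer_id": 4, "total": 99},
]


def join(left, right, key, how):
    """Join two lists of dicts on `key`; `how` is 'inner', 'left' or 'full'."""
    left_cols = {col for row in left for col in row}
    right_cols = {col for row in right for col in row}

    # Index the right table by key so matches can be looked up quickly.
    right_by_key = {}
    for row in right:
        right_by_key.setdefault(row[key], []).append(row)

    result, matched = [], set()
    for lrow in left:
        matches = right_by_key.get(lrow[key], [])
        if matches:
            matched.add(lrow[key])
            for rrow in matches:
                result.append({**lrow, **rrow})
        elif how in ("left", "full"):
            # Unmatched left row: keep it, fill the right-hand columns with None.
            result.append({**dict.fromkeys(right_cols), **lrow})

    if how == "full":
        for rrow in right:
            if rrow[key] not in matched:
                # Unmatched right row: keep it, fill the left-hand columns with None.
                result.append({**dict.fromkeys(left_cols), **rrow})
    return result


for how in ("inner", "left", "full"):
    ids = [row["customer_id"] for row in join(customers, orders, "customer_id", how)]
    print(how, ids)
```

With this sample data, the inner join keeps customers 2 and 3, the left join keeps 1, 2 and 3, and the full outer join keeps 1, 2, 3 and 4, with `None` wherever one side has no match.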
What is a join?
combines two or more tables based on a related column, allowing data to be reviewed and analysed together
What does combining data refer to?
What are the risks of combining data?
List in order all the stages of the lifecycle
Provide an example of what you have done for refine & compare for a model you created
What is Privacy by Design?
How have you applied privacy by design?
What DQ risks did you come across? (3 examples)
How did you resolve each DQ risk example?
Provide an example in which you acted logically and analytically
What was the conclusion of the analysis? (5 things)
What alternative methods or tools did you suggest for the project to be successful?
What were your customer requirements and how did you define them? (2 answers)
How did customer requirements shape the project? (7 answers)
What are the differences between open and public data?
What is administrative data?
What is research data?
information collected or generated to validate findings in research
What data structures did you take into account?
What different database system designs did you take into account? (What is a relational database?)
How do you adapt your communication depending on the audience & situational requirements? (4 answers)
Can you provide 2 examples of data classification principles you’ve applied in your project?
Can you think of a time when you needed to be flexible with classification?
If the usage data were combined with customer data or sensitive financial data, the risk would increase because the combined dataset would include PII or other sensitive information
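One way to picture the point above is that a combined dataset takes the most restrictive classification of any source that feeds it. The sketch below assumes this rule; the `Classification` levels, the `combined_classification` helper, and the levels assigned to each dataset are hypothetical and not taken from any specific policy.

```python
from enum import IntEnum


class Classification(IntEnum):
    # Hypothetical levels; a higher number means more restrictive.
    PUBLIC = 1
    INTERNAL = 2
    CONFIDENTIAL = 3


def combined_classification(*levels):
    """Return the most restrictive level among the combined sources."""
    return max(levels)


# Assumed for illustration: usage data alone is INTERNAL,
# customer data containing PII is CONFIDENTIAL.
usage_data = Classification.INTERNAL
customer_data = Classification.CONFIDENTIAL

print(combined_classification(usage_data).name)
print(combined_classification(usage_data, customer_data).name)
```

Under these assumptions the usage data on its own stays at the lower level, but once it is joined to customer data the whole result has to be handled at the higher one.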