Data Integration Flashcards

(80 cards)

1
Q

What is the primary goal of data synchronization?

A

Integrating data from multiple sources while ensuring the data remains consistent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which process refers to merging software pieces so they function in conjunction with one another?

A

Data integration.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How is the relationship between data synchronization and data integration defined?

A

Data synchronization is a subset of data integration.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What specific ability does data synchronization provide to databases?

A

Maintaining constant communication between databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Does every data integration method result in a data set that is in perfect sync?

A

No.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Which term implies that several versions of data have been brought up to date?

A

Data synchronization.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Which term implies the existence of two or more complete and identical copies of data?

A

Data replication.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the distinctive emphasis of data replication compared to synchronization?

A

The existence of identical copies rather than just being up to date.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is another name for one-way synchronization in various contexts?

A

Data pushes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Where is data commonly transferred in a one-way synchronization setup?

A

To a data warehouse or local application storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What characterizes two-way or bi-directional synchronization?

A

Editing one system automatically updates the other and vice versa.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are two common examples of applications using two-way synchronization?

A

Google Calendar and Outlook Calendar.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How does data synchronization contribute to data security?

A

By protecting data from corruption through proper harmonization techniques.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the primary benefit of data synchronization for business decision-making?

A

Ensuring access to the most accurate information possible.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How does synchronization assist in preventing significant business repercussions?

A

By assisting in preventing errors and correcting mistakes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a common negative result of inaccurate data and frequent workarounds?

A

Inconsistent reports.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Term: Data Harmonization

A

The process of creating a unified set of data from several types, fields, and formats.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the main advantage of having harmonized data for a company?

A

It makes it easier to evaluate and visualize data relevant to goals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What type of integration occurs when two applications can be immediately integrated using APIs?

A

Native integration.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Definition: Tailored Integrations

A

Software developed specifically to meet unique business requirements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What does the acronym iPaaS stand for?

A

Integration Platform as a Service.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the nature of the solution provided by an iPaaS provider?

A

It is a cloud-based solution offered by a third party.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What factor should a company prioritize when searching for a synchronization solution to handle pre-existing data?

A

Historical data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Term: Historical Data

A

Any data that was already present before the synchronization process began.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What does the acronym ETL stand for?
Extract, Transform, Load.
26
In the ETL process, when does the transformation of data occur?
Before the data is loaded into the destination system.
27
What is another common name for the ETL strategy?
Data loading.
28
What does the acronym ELT stand for?
Extract, Load, Transform.
29
In the ELT process, where are transformations conducted?
Directly within the target system (e.g., a data lake).
30
What is another common name for the ELT strategy?
Data lake integration.
31
Why is ELT well-suited for massive amounts of data and complex transformations?
It leverages the processing power and scalability of modern data platforms.
32
Which integration approach allows real-time data access without physically moving the data?
Data Virtualization.
33
How does Data Virtualization encapsulate underlying data sources?
By providing a logical or virtual layer.
34
Which strategy is most suitable for batch processing and centralized reporting?
ETL (Extract, Transform, Load).
35
Which strategy is most helpful when real-time access to disparate sources is required without consolidation?
Data Virtualization.
36
What are two examples of applications that use ETL?
Business Intelligence (BI) and Data Warehousing.
37
What is a typical use case for ELT regarding data volume?
Big Data Analytics.
38
Name an application that utilizes Data Virtualization for quick data access.
Federated Data Access.
39
What is the purpose of Data Transformation and Mapping?
Converting data from an original format to a target format while maintaining compatibility.
40
In the context of data migration, what are the two main roles of transformation and mapping?
Legacy system modernization and cloud migration.
41
How do transformation and mapping improve data quality?
Through data cleansing and data enrichment.
42
Term: Data Denormalization
A technique used in data preparation for analytics and reporting.
43
What are the four dimensions of data quality evaluation?
Accuracy, completeness, consistency, and reliability.
44
What is the primary focus of Data Profiling during the cleansing process?
Evaluating and assessing the current state of data quality.
45
How does data cleansing support compliance and regulatory requirements?
By ensuring data privacy and accuracy for standards like GDPR.
46
What is the objective of Master Data Management (MDM)?
Maintaining a single, consistent, and trustworthy source of truth for essential business entities.
47
What are the four essential entities typically managed by MDM?
Customers, products, suppliers, and employees.
48
Term: Customer 360 View
A consistent and accurate view of customer data across all interaction points enabled by MDM.
49
What does PIM stand for in the context of Product Data Management?
Product Information Management.
50
What is the goal of Supplier and Vendor Data Management in MDM?
Making procurement and supply chain activities more efficient.
51
How does MDM benefit HR processes?
By ensuring employee data is accurate across different HR systems.
52
What is the role of Database Replication in high availability systems?
Keeping several copies of a database synced to provide fault tolerance and load balancing.
53
How is data consistency maintained across multiple nodes in distributed systems?
Through data synchronization and replication.
54
In Mobile and Edge Computing, where must data be synchronized?
Between centralized systems and mobile or edge devices.
55
How does Cloud Data Integration use synchronization?
To ensure integrity across on-premises and cloud-based systems.
56
Term: CDC (Change Data Capture)
A methodology used to identify and capture changes made to data in a database for synchronization.
57
Name three common methodologies for data synchronization and replication.
Change data capture (CDC), log-based replication, and event-driven methods.
58
What is the main focus of Real-time Data Integration?
Processing and synchronizing data as it happens or near-instantaneously.
59
Why is real-time integration vital in financial services?
For real-time risk assessment, fraud detection, and trade processing.
60
How does real-time data integration support E-commerce?
Through instant inventory management and personalized marketing.
61
Which field requires real-time integration to process massive amounts of sensor data for automation?
Internet of Things (IoT).
62
What is a primary healthcare application of real-time data integration?
Remote patient monitoring.
63
How is real-time integration used in logistics?
For real-time tracking, tracing, and route optimization.
64
What architectural style is often required for effective real-time data integration?
Event-driven architecture.
65
How does real-time integration benefit business agility?
It allows companies to respond more swiftly to shifting business situations.
66
What is the term for merging data from multiple sources into a single, cohesive view?
Data consolidation.
67
What specific MDM application involves managing both vendors and customers together?
Vendor-Customer Integration (Cross-Domain Integration).
68
The process of moving data from a source system to a destination system is called _____.
Data migration.
69
Which activity involves identifying and removing duplicate records to ensure data quality?
Data deduplication.
70
What is the relationship between Data Harmonization and the safety of a company?
It improves the safety, reliability, and efficiency of the company.
71
What is the function of a 'Data Lake' in the context of ELT?
It acts as a target system where data is loaded 'as-is' before being transformed.
72
In the context of MDM, what does 'SRM' stand for?
Supplier Relationship Management.
73
What technology is typically used to enable Native Integrations?
APIs (Application Programming Interfaces).
74
Which integration strategy is best for creating a 'uniform perspective' without extensive migration?
Data Virtualization.
75
True or False: Data synchronization is a process that only occurs for new data.
False (it occurs for both new and old data).
76
What is the primary driver for using Geographic Redundancy in database replication?
High availability and disaster recovery across different locations.
77
What is the benefit of Data Enrichment in the cleansing process?
Improving data quality by adding missing or related information to existing data.
78
How does MDM assist in 'Omnichannel Commerce'?
By ensuring product information is consistent across all sales channels.
79
In real-time integration, what is meant by 'Demand-Supply Matching' in logistics?
Aligning inventory and transport capacity with customer needs in real-time.
80
What defines 'Workforce Analytics and Planning' in the context of Employee MDM?
Using accurate, integrated employee data to plan for future business needs.