Module 5 Flashcards

(53 cards)

1
Q

How do we manage data?

A

Through complex and powerful information systems
SAP Enterprise Resource Planning (ERP) system is the world’s most popular

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are databases?

A

Structuring data
Database management system - manages access and usage of all users to all data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do databases minimize?

A

Data redundancy
Data isolation
Data inconsistency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What do database management systems maximize?

A

Data security
Data integrity
Data independence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Big Data?

A

Data is being collected at an increasingly rapid pace from many sources
New types of data are being collected
Organizations and individuals must process that data
Worldwide, reports show that data is increasing by 50% per year
Big Data = data so large and complex it cannot be managed by traditional systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the most common data warehouse?

A

The central enterprise data warehouse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are independent data marts?

A

They store data for a single or a few applications, such as marketing or finance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is hub and spoke?

A

It stores data in a central data warehouse while simultaneously maintaining dependent data marts that obtain their data from the central repository.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is knowledge management?

A

It is a process that helps organizations manipulate important knowledge that is part of the organizations memory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is knowledge?

A

Information that is contextual, relevant, and useful, developed with the assistance of expertise, also known as intellectual capital (or intellectual assets)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is explicit knowledge?

A

Objective, rational, technical knowledge that has been documented

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is tacit knowledge?

A

Cumulative store of subjective or experiential learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the difficulties in managing data?

A

Amount of data increases exponentially over time
Data are scattered throughout organizations
Data obtained from multiple internal and external sources
Data degrade over time
Data subject to media rot
Data security, quality, and integrity are critical yet easily jeopardized
Information systems that do not communicate with each other can result in inconsistent data
Federal regulations require corporations to account for how their data are managed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Data Governance?

A

It is an approach to managing information across an entire organization
Uses Master Data Management

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does Data Governance provide control over?

A

It provides control over Master Data and Transaction Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the database approach?

A

Database is arranged so that one set of software programs - the database management system - provides all users with access to all data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What does database minimize?

A

Data redundancy: the same data are stored in many places
Data isolation: applications cannot access data associated with other applications
Data inconsistency: various copies of the data do not agree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What do database management systems maximize?

A

Data security: databases must have extremely high security measures in place to deter mistakes and attacks since data is stored in one place
Data integrity: data must meet certain constraints, such as no alphabetic characters in a social insurance number field
Data independence: applications and data are not linked to each other (kept separate or independent by the DBMS that controls access), so that all applications are able to access the same data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the data hierarchy?

A

Bit: (binary digit) represents the smallest unit of data a computer can process
Byte: represents a single character, often often composed of eight bits
Field: a logical grouping of related characters
Record: a logical grouping of related fields
File (or table): a logical grouping of related records
Database: a logical grouping of related files

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What do database management systems do?

A

They create and manage a database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the relational database model based on?

A

On the concept of two-dimensional tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is a data model?

A

It is a diagram that represents the entities in the database and their relationships

23
Q

What is an entity?

A

It is a person, place, thing, or event about which an organization maintains information

24
Q

What is an instance?

A

It is one specific, unique representation of the entity

25
What is an attribute?
It is a characteristic or quality of a particular entity
26
What is a primary key?
IT is a field that uniquely identifies a record
27
What are secondary keys?
They are other identifying fields that typically do not identify the file with complete accuracy
28
What are foreign key fields?
They are used to uniquely identify a row of another table that is linked to the current table
29
What are the issues of big data?
Sources can include: Untrusted sources Data could be "dirty" i.e. inaccurate, incomplete, incorrect, duplicate or erroneous Big data changes since quality issues can arise during collection that are temporary or permanent
30
How do we manage Big Data?
Deciding what needs to be done is the most important task Then, the organization needs to move from traditional processing to Big Data processing
31
What is Big Data processing?
Integrate information silos into a database environment Develop data warehouses to access multiple databases or long term data Query the data warehouse using business intelligence
32
How do we put Big Data to use?
Making data available: using public data (open data) or meta analysis Enabling organizations to conduct experiments: ex multiple views of a web site to test consumer repsonce Microsegmenting customers: dividing customers into smaller groups to provide tailored services Creating new business models Being able to analyze more data
33
How does Big Data interact with functional areas of the organization?
Human resources: discovering health care trends to lower health care costs Product development: text mining the Internet to explore customer preferences on a 3-blink turn signal Operations: reducing fuel consumption on deliveries and optimizing delivery routes Marketing: creating of 1500 tailored marketing messages by integrating data from internal systems with social media Government operations: water management systems based on water control sensors
34
What are the characteristics of data warehouses and data marts?
Organized by business dimension or subject Use On-line analytical processing Integrated Time Variant Nonvolatile Multidimensional
35
What does the environment for data warehouses and data marts include?
Source systems that provide data to the data warehouse or data mart Data-integration technology and processes that prepare the data for use Different architectures for storing data in an organization's data warehouse or data marts Different tools and applications for the variety of users Metadata, data quality, and governance to ensure that the data warehouse or data mart meets its purposes
36
What is data integration?
To extract data from source systems, transform them, and load them into a data mart or warehouse Can be performed by hand-written code or by commercial data-integration software Can be transformed to make them more useful
37
How do we store data?
The most common architecture is one central enterprise data warehouse, without data marts, called "a single version of the truth"
38
What are independent data marts?
They store data for a single or a few applications, such as in marketing or finance
39
What is metadata?
Data is about data
40
What is data quality?
The quality of the data in the warehouse must be adequate to satisfy users' needs
41
What does governance require?
It requires that people, committees and processes be in place to plan and control the development and use of the data warehouse
42
What are users?
There are a large number of potential BI users, including IT developers; front-line workers; analysts; information workers; managers and executives; and suppliers, customers, and regulators
43
What do knowledge management systems do?
They use modern information technologies to systemize, enhance, and expedite knowledge management, with the goal to make the most productive use of knowledge
44
What are query languages?
They search for information in databases or data warehouses
45
What is normalization?
A method for analyzing and reducing a relational database to its most simple form Purpose is to provide minimum redundancy Focus is to reduce non-key attributes Improves processing efficiency of the database It optimizes the tables in a relational database
46
What are joins?
They link relational database tables with common attributes
47
What are the key words of SQL (Structured Query Language)?
SELECT - what to locate FROM - specify the source files WHERE - provides conditions for the search
48
What are Entity-Relationship Models (ERD)?
Database designers plan the database design in a process called entity-relationship modeling (ERD) ER diagrams consist of entities, attributes and relationships organized using business rules Business rules describe how the organization uses its data to run its operations Entities are shown as rectangles, with relationships shown on the lines between them
49
What are the degrees of relationships for ERD's?
Unary: single entity Binary: two related entities Ternary: three related entities
50
What are the relationship classifications of ERD's?
One-to-one One-to-many Many-to-man
51
What are some ERD terms?
Cardinality: the number of times a single record/instance of one entity can be associated with a single record/instance of another entity Mandatory single: ex inventory quantity on hand with its sale price Optional single: ex employee wage rate with pay cheque Mandatory many: ex department details with employee details Optional many: customer details with customer sales details/invoices
52
What are functional dependencies?
They express how attributes are associated and need to be retained during normalization
53
What does the third normalized form show?
There is no data redundancy Foreign keys are used to link tables