intro to database systems Flashcards

(75 cards)

1
Q

where are databases?

A

everywhere

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what are some examples of databases?

A

cloud storage
YouTube
spotify
banks and credit cards
zoom and teams
social media apps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

instead of the user perspective, what perspective should we think from in this class?

A

think from the developer’s perspective

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is transactional data?

A

structured data created within an organization, with sizes ranging from gigabytes to terabytes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what are some examples of transactional data?

A

chase bank account data
amazon purchase history

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is big data?

A

data generated by multimedia applications and internet of things

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what are some examples of big data?

A

instagram
apple
youtube

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

how does big data differ from transactional data?

A

in the four v’s: volume, velocity, veracity, and variety

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what are the four v’s?

A

volume, velocity, veracity, and variety (the ways big data differs from transactional data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is volume?

A

data at rest, big data is much larger than transactional data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is velocity?

A

data in motion, the speed of streaming data, big data is much faster than transactional data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is variety?

A

data in many forms, big data has a larger variety than transactional data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is veracity?

A

uncertainty due to data inconsistency, big data has a higher veracity than transactional data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is the big data equation?

A

big data = transactions + interactions + operations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

who is Edgar F. Codd?

A

he is the author of the research paper on the relational model in 1970 (very important person)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

how many companies use databases?

A

a very large amount, banks airlines, and sales are a few examples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what is a database?

A

a very large collection of interrelated data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

what is a database management system (DBMS)?

A

a software system designed to store, manage, and facilitate access to databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

what are entities and objects?

A

equivalent statements!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

what is DBMS?

A

a software system, NOT a database!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

when will companies use a filesystem?

A

when their data isn’t growing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

when will companies use a database?

A

when their data is growing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

what is the SELECT statement in SQL?

A

the statement used to retrieve data from a database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

what is the insert into statement in SQL?

A

statement used to insert new data into a table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
what are the new demands the 3rd platform brings to databases?
global high availability large data volumes unstructured data transaction rates latency
26
what motivated developers to create the noSQL database?
the new demands the 3rd platform brings
27
what is network latency important to?
developers building applications that can be accessed globally
28
what is the old model of generating data?
few companies produce data, and everyone else consumes data
29
what is the new model of generating data?
everyone generates data, and everyone consumes data
30
it's not just the _________ of information, it's the ___________ of information
quantity, quality
31
why do we use databases?
they allow us to store a lot of content (quantity), while keeping the quality of the data high (lower veracity)
32
what are the drawbacks of using file systems to store data?
data redundancy and inconsistency (duplicate data) difficulty accessing (finding) data data isolation integrity problems (integrity constraints) atomicity of updates concurrent access by multiple users security problems
33
what does a database handle (file systems)?
all of the file system's problems!
34
what does DBMS provide?
efficient, reliable, convenient, and safe multi-user storage of and access to massive amounts of persistent data
35
what are the advantages of databases?
data independence efficient data access data integrity and security concurrent access, crash recovery reduced application development time
36
why would databases not be used?
high costs complexity not suited for special purpose tasks
37
what are the four elements of a data model?
1) data 2) data relationships 3) operations on the data 4) data constraints
38
what is a data model?
a tool for describing its elements
39
what are relationships and tables?
the same thing! (equivalent statements)
40
what are attributes?
the column names of a table (relation)
41
what is a database?
a collection of relations
42
what is a relation schema?
relation name and attribute list (optionally: data types of attributes)
43
what is a database schema?
set of all relation schemas in the database
44
what is an advantage of using a relational database?
it can control the quality of information (lower veracity)
45
what can you use to remember schema?
it is the type of information of a variable in a program
46
what can you use to remember instance?
is is the value of a variable
47
what is an instance?
the actual content of a database at a particular point in time
48
what is SQL designed for?
managing data held in a relational database management system
49
what does SQL consist of?
a data definition language (DDL) and a data manipulation language (DML)
50
what level is data definition at?
the table level
51
what level is data manipulation at?
the data level
52
what are some DDL keywords?
create alter drop
53
what are some DML keywords?
insert update delete select
54
what does create do?
creates a table
55
what does alter do?
changes the table, add, delete, or modify columns
56
what does drop do?
deletes the whole table
57
what does insert do?
allows you to insert data into the table
58
what does update do?
make changes to existing data
59
what does delete do?
deletes data from a table (one row or multiple)
60
what is the most basic element of a table?
an attribute and its type
61
what are the most common attribute types?
INT or INTEGER REAL or FLOAT CHAR(n), fixed length string of n characters VARCHAR(n), variable length string of up to n characters
62
what values can be null?
any value but a primary key
63
how many primary keys can a table have?
at most one
64
can a primary key be null?
NO, NEVER
65
can a table have more than one unique key?
yes
66
can unique keys have null entries?
yes
67
what is the generic syntax to create a multi-attribute key?
PRIMARY KEY (attribute one, attribute two)
68
what is logical data independence?
you can change the logical structure (schema) of the database (add / drop columns), without having to change or update the programs that viewers use the database on
69
what is physical data independence?
you can change the physical structure, basically where the data is stored or how it is stored, without affecting the logical structure or external schema
70
what is external schema?
what the user sees (user views)
71
what is physical schema?
the relations stored in unordered files
72
what is concurrency control?
process of managing simultaneous execution of transactions in a shred database, to ensure the serializability of transactions
73
why do we need concurrency control?
to prevent: 1. lost updates 2. uncommitted data 3. inconsistent retrievals
74
what does logical data independence refer to?
the ability to modify the conceptual schema without affecting external views (how users see the data). It doesn't relate to storage changes.
75
what is an entity?
A real-world object or concept, such as a customer or an account.