Big Data Flashcards

(8 cards)

1
Q

What is Big Data?

A

Can be described in terms of:
- Volume - too big to fit on a single server
- Velocity - streaming data arrives continuously and must be analysed and responded to within milliseconds
- Variety - data in many forms such as structured, unstructured, text, multimedia

2
Q

What is the most difficult aspect of Big Data?

A

The lack of structure

3
Q

Why is the lack of structure in Big Data a challenge?

A
  • Analysing the data is made significantly more difficult
  • Relational databases are not appropriate because they require the data to fit into a row-and-column format
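
A minimal sketch of the row-and-column problem, using made-up records (the field names are hypothetical, not from any real dataset). Forcing records of different shapes into one fixed table leaves most cells empty:

```python
# Hypothetical records of three different kinds, each with its own fields.
records = [
    {"kind": "tweet", "text": "hello", "hashtags": ["a", "b"]},
    {"kind": "sensor", "temperature_c": 21.5},
    {"kind": "video", "duration_s": 12, "frames": b"\x00\x01"},
]

# A single table needs one column for every field that appears anywhere.
columns = sorted({key for record in records for key in record})

# Each record becomes a row; fields it does not have are left as None.
rows = [[record.get(column) for column in columns] for record in records]

empty_cells = sum(cell is None for row in rows for cell in row)
print(columns)
print(f"{empty_cells} of {len(rows) * len(columns)} cells are empty")
```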
4
Q

What are the ways to handle Big Data?

A
  • Fact-graph model for representing data
  • Use machine learning to process unstructured data and extract useful information
  • Functional programming used in distributed processing of data
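
A rough sketch of why functional programming suits distributed processing, assuming the data is already split into partitions (the sample data and function names are invented for illustration). Because the functions are pure and nothing is mutated, each partition could be processed on a different machine and the partial results combined afterwards:

```python
from collections import Counter
from functools import reduce

# Hypothetical data: each inner list stands in for the records on one server.
partitions = [
    ["sensor click click", "video click"],
    ["sensor sensor video"],
]

def count_words(lines):
    """Map step: a pure function that depends only on its own partition."""
    return Counter(word for line in lines for word in line.split())

def merge_counts(left, right):
    """Reduce step: returns a new Counter and mutates neither input."""
    return left + right

# Run sequentially here; a real system would run the map step in parallel.
partial_counts = map(count_words, partitions)
total = reduce(merge_counts, partial_counts, Counter())
print(dict(total))
```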
5
Q

Why does the fact that data doesn’t fit on a single server become a problem for Big Data?

A

Relational databases do not scale well across multiple machines, so the processing has to be distributed, and functional programming is well suited to this

6
Q

What is the source of Big Data?

A

Data is continuously streamed from networked sensors, smartphones, video surveillance, mouse clicks, etc.

7
Q

Describe the fact-graph model

A
  • Raw data stored as atomic facts
  • Facts capture one single piece of information
  • Facts are immutable and eternally true because each one is timestamped
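
One way to picture atomic, timestamped facts is the sketch below. The `Fact` class, its field names and the sample values are assumptions made for illustration. A change is recorded by adding a new fact rather than overwriting an old one, so every fact stays true for its own timestamp:

```python
from dataclasses import FrozenInstanceError, dataclass
from datetime import datetime, timezone

@dataclass(frozen=True)  # frozen=True makes each fact immutable
class Fact:
    entity: str
    attribute: str
    value: str
    timestamp: datetime

# Hypothetical facts: the user moved, so a second fact is appended.
facts = [
    Fact("user:1", "city", "Leeds", datetime(2020, 1, 1, tzinfo=timezone.utc)),
    Fact("user:1", "city", "York", datetime(2023, 6, 1, tzinfo=timezone.utc)),
]

def current_value(facts, entity, attribute):
    """Return the value of the most recent matching fact."""
    matching = [f for f in facts if f.entity == entity and f.attribute == attribute]
    return max(matching, key=lambda f: f.timestamp).value

print(current_value(facts, "user:1", "city"))
```

The older fact is never deleted, so the history ("lived in Leeds as of 2020") remains available.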
8
Q

What are the components used to make up the fact-graph model?

A
  • Nodes used to represent the core entities
  • Edges used to represent the relationships
  • Properties used to capture information about the nodes
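
The three components can be sketched with plain Python structures. The entity names, relationship labels and helper functions here are hypothetical, chosen only to show how nodes, edges and properties fit together:

```python
# Nodes: the core entities, each with a dict of properties.
nodes = {
    "alice": {"type": "person", "age": 30},
    "bob": {"type": "person", "age": 25},
    "acme": {"type": "company"},
}

# Edges: relationships stored as (source, relationship, target).
edges = [
    ("alice", "knows", "bob"),
    ("alice", "works_for", "acme"),
    ("bob", "works_for", "acme"),
]

def related(source, relationship):
    """Follow edges of one relationship type outwards from a node."""
    return [t for s, r, t in edges if s == source and r == relationship]

def employees(company):
    """Follow 'works_for' edges backwards to find who works for a company."""
    return [s for s, r, t in edges if r == "works_for" and t == company]

print(related("alice", "knows"))
print(employees("acme"))
print(nodes["alice"]["age"])
```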