Data Wrangling Flashcards

To understand how Data is Wrangled (9 cards)

1
Q

What is Data Wrangling

A

Sorting,
cleaning,
and structuring
raw data so that it becomes reliable and usable for analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is data wrangling important

A

Because mistakes in data due to it being unreliable can cost a company millions or even billions of dollars.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 6 steps of Data Wrangling

A

Discovery, Structuring, Cleaning, Enriching, Verifying, Publishing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the Discovery stage

A

Familiarizing yourself with data
to get an idea of its content
and understand how it can be organized effectively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the Structuring stage

A

Transforming data so it is usable, such as making files the same type to ensure compatibility.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the Cleaning stage

A

Removing errors and useless data to make sure the dataset is accurate and free of mistakes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the Enriching stage

A

Improving weak or incomplete pieces of data to make the dataset stronger and more valuable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the Verifying stage

A

Ensuring that the data is correct and accurate, since incorrect data can lead to costly mistakes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the Publishing stage

A

Finalizing the wrangled data so it is ready to be used or shared effectively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly