Exploratory Data Analytics Techniques Flashcards

(31 cards)

1
Q

True or false: A data scientist typically spends almost 70% of their work time performing EDA on datasets.

A

TRUE

The time invested in EDA provides a comprehensive understanding of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does EDA stand for?

A

Exploratory Data Analysis

EDA is a systematic approach to uncovering insights and preparing data for further analysis or modeling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the sequence of steps known as in data analytics?

A

Exploratory Data Analysis (EDA)

EDA is crucial for exploring data and uncovering insights.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

List the five steps learned with the Pandas tools for data analysis.

A
  • Import data from various sources
  • Select specific rows and columns
  • Identify and handle missing and duplicate values
  • Explore unique values and calculate summary statistics
  • Visualize data distributions, relationships, and trends

These steps are essential for preparing data for analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the five steps learned with the Feature-engine tools?

A
  • Import data from various sources
  • Encoding techniques for categorical variables
  • Handle outliers and impute missing values
  • Transformation techniques for variable distribution
  • Select relevant variables for analysis

These steps enhance the robustness and reliability of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Who pioneered the concept of Exploratory Data Analysis (EDA)?

A

John W. Tukey

Tukey introduced EDA in the 1970s as a critical step in data analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the nine steps crucial for EDA?

A
  • Initial Insight
  • Error Detection
  • Understanding Patterns
  • Outlier Detection
  • Variable Relationships
  • Data Transformation
  • Univariate Feature Visualization
  • Hypothesis Testing
  • Multivariate Data Visualization

These steps guide the analysis process and help uncover insights.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Fill in the blank: EDA allows us to look at data in its raw form and get a feel for what it contains, providing ________ for guiding subsequent steps in the analysis.

A

preliminary insight

This insight is invaluable for effective data analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the purpose of Error Detection in EDA?

A

Identify obvious errors in the data

Early detection prevents skewed results in analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does EDA help us uncover regarding patterns and trends?

A

It helps form hypotheses and make informed decisions

Understanding patterns is essential for applying analytical techniques.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the significance of Outlier Detection in EDA?

A

Identifies anomalous events that can impact analysis

Proper handling of outliers is crucial for accurate results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How does EDA assist in understanding Variable Relationships?

A

Finds interesting relationships among variables

These relationships can lead to new insights and further investigation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the role of generative AI in the context of EDA?

A

Streamline and enhance analysis

Generative AI can assist in gathering detailed statistics and creating visualizations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What should be provided in prompts to effectively leverage AI for visualizations?

A

Clear and detailed instructions

This helps in creating meaningful visualizations that highlight trends.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the importance of model evaluation in the context of data analysis?

A

Ensures the effectiveness of the models built

Model evaluation will be covered in a future lesson.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Fill in the blank: Generative AI can propose meaningful actions based on the data, providing valuable suggestions that can inform ________.

A

policy-making and decision-making processes

This enhances the relevance of insights derived from data.

17
Q

What is the purpose of a dashboard in a business context?

A

To provide key insights into sales performance, inventory levels, and customer behaviour

A well-planned dashboard translates complex data into actionable insights.

18
Q

Effective dashboard planning involves understanding which three key elements?

A
  • Purpose
  • Audience
  • Data

These elements ensure the dashboard meets user needs and delivers valuable insights.

19
Q

True or false: A well-planned dashboard can become cluttered and confusing without careful planning.

A

TRUE

Careful planning is essential to create a visually appealing and functionally robust dashboard.

20
Q

Who are typical stakeholders involved in the dashboard planning process?

A
  • Business managers
  • Data analysts
  • IT professionals
  • End-users

Engaging with stakeholders helps gather diverse perspectives on dashboard needs.

21
Q

What is the first step in engaging with stakeholders for dashboard planning?

A

Identifying all key individuals and groups who will interact with or benefit from the dashboard

This helps ensure that all stakeholder voices are heard and understood.

22
Q

After gathering input from stakeholders, what is the next step in the dashboard planning process?

A

Documenting the requirements clearly and comprehensively

Visual aids like diagrams can help illustrate data flow and relationships.

23
Q

What approach is valid for dashboard planning to ensure user-centric solutions?

A

Design thinking approach

This approach focuses on understanding user needs and creating tailored solutions.

24
Q

What are Key Performance Indicators (KPIs) used for?

A
  • Tracking progress toward business objectives
  • Assessing strategies and actions
  • Motivating employees

KPIs link strategic goals to day-to-day operations.

25
Fill in the blank: Each KPI should be **_______**, Measurable, Achievable, Relevant, and Time-bound.
Specific ## Footnote This approach helps select KPIs that are clear and actionable.
26
What are the primary goals of effective **dashboard design**?
* Clarity * Simplicity * Functionality ## Footnote The design should communicate complex information quickly and accurately.
27
What is the purpose of creating **wireframes** and **mockups** before developing a dashboard?
* Wireframes outline structure and layout * Mockups include design elements like colours and fonts ## Footnote These tools provide a visual blueprint for the dashboard.
28
What enhances the **usability** of dashboards?
Interactivity ## Footnote Features like clickable elements and drill-down capabilities allow users to explore data dynamically.
29
Which tool is known for its integration with Microsoft products and robust data visualisation capabilities?
Microsoft Power BI ## Footnote It is highly regarded for its features and ease of use.
30
What should be evaluated when choosing the right tool for **dashboard creation**?
* Compatibility with data sources * Ease of use * Visualisation capabilities * Scalability * Cost * Integration * Security * User support ## Footnote These criteria ensure the tool meets project and organisational needs.
31
What should a comprehensive **dashboard plan document** include?
* Requirements from stakeholders * Purpose and objectives * Target audience * Key Performance Indicators (KPIs) * Design wireframes and mockups * Interactivity and usability considerations * Tool selection rationale ## Footnote This document guides the dashboard development process.