Introduction Flashcards

(29 cards)

1
Q

What is artificial intelligence

A

is the creation of intelligent machines that react like humans

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is data science

A

the use of artificial intelligence to solve real world problems with data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is weak AI

A

AI designed for specific tasks
no self awareness or understanding beyond trained domain

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is strong AI

A

Hypothetical AI with human like general reasoning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is data mining

A

subset of AI focusing on Extracting of patterns or knowledge from huge amounts of data not previously known

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What kind of data is data mining done on

A

Temporal data,
Structure data,
Text databases,
Image/videos

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is machine leanring

A

Machine learning is the field of study that gives computers the ability to learn without being explicitly programmed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the deep learning models

A

Deep neural network
Convolutional neural network
Recurrent neural network
Autoencoders

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Machine learning vs Data mining

A

“Data Mining is a subset of Artificial Intelligence and it focuses on extracting knowledge from data — how to identify previously unknown patterns, relationships, or anomalies in the large data sets that humans can then use to solve a complex problem.”
This is a manual process that requires human intervention and decision making.
“Machine Learning is a subset of Artificial Intelligence and it focuses on teaching a computer
- how to learn to analyse large datasets and “learn” the patterns in it (from the training dataset) that can help make predictions on new data.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why do deep learning models usually outperform classical statistical machine learning models when trained on very large data bases?

A

Deep learning models have many more parameters, which allows them to learn very complex patterns when enough data is available

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the primary goal of data exploration

A

Data visulisation
Summary Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why does data visualisation matter

A

summary statistics alone can be misleading
different datasets can share identical statistics but have very different structures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is data visulisation

A

a way to communicate complex information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is data visualisation able to do

A

It is critical tool in AI, it provides an effective way to identify summaries, structure, relationships, differences,
and abnormalities in the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When should you use the distribution graph

A

to find how the data is spread

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

when should you use the comparison graph

A

to find the differences between groups of data
bar charts

17
Q

when should you use the composition graph

A

to find the parts of a whole
pie charts

18
Q

when should you use the connection graph

A

network graphs
to find networks and links between data

19
Q

when should you use the location graph

A

maps/heatmaps
show geographical patterns

20
Q

What is the grammar off graphics

A

provides a structured, layered system for building data visualizations, separating the core components like data, aesthetic mappings

21
Q

What happens as you increase the span value

A

The greator the value the smaller the curve will be

22
Q

What is important about the input data for AI

A

Quality of the output depends on the quality of the input and dramitcally effects the perofmrance of the AI

23
Q

What are common issues with data that affect eh performance of AI

A

Missing values
Duplicate records
noise
invalid or inconsistent data
outliers

24
Q

What does data wrangling involve

A

data cleansing
feature selection and transformation

25
Define data cleansing
addresses problems such as removing or impute missing value merge duplicate data generate best estimates for invalid records
26
What is Data wrangling(feature selection and transformation)
the process of cleaning, transforming, and preparing raw data so it is suitable for analysis, visualisation, or machine learning.
27
What are ways if data transormation
scaling log transformation converting data types encoding variables
28
What is feature selection
improve model performance and reduce complexity remove: - highly correlated features - features with many missing values - irrelevant field
29
What is feature transformation
the process of mapping set of values for a feature to a new set of values to make the representation of the data easier for the analysis.