STATISTICS Flashcards

(25 cards)

1
Q

Define mean in statistics.

A

The average value of a set of numbers, calculated by dividing the sum by the count.
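
A minimal Python sketch of this definition. The list `scores` is made-up data for illustration, not part of the deck:

```python
from statistics import mean

scores = [4, 8, 15, 16, 23, 42]  # hypothetical data

# Mean = sum of the values divided by the number of values.
manual_mean = sum(scores) / len(scores)

print(manual_mean)              # 18.0
print(mean(scores) == manual_mean)  # the standard library agrees
```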

2
Q

True or false: The median is always the middle value in a dataset.

A

TRUE

The median is the value separating the higher half from the lower half of the sorted dataset. With an even number of values, it is the mean of the two middle values.
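
A short Python sketch of the odd and even cases, using made-up lists (both lists are assumptions for illustration):

```python
from statistics import median

odd_count = [7, 1, 5]       # sorted: 1, 5, 7
even_count = [7, 1, 5, 3]   # sorted: 1, 3, 5, 7

# Odd count: the single middle value of the sorted data.
print(median(odd_count))    # 5

# Even count: the mean of the two middle values, (3 + 5) / 2.
print(median(even_count))   # 4.0
```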

3
Q

What does standard deviation measure?

A

It measures the amount of variation or dispersion in a set of values.

4
Q

Fill in the blank: The mode is the value that appears ______ frequently.

A

most
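
As an illustration, Python's standard library can report the mode directly. The lists `rolls` and `tied` are hypothetical data:

```python
from statistics import mode, multimode

rolls = [1, 2, 2, 3, 3, 3]  # hypothetical data; 3 appears most often
print(mode(rolls))          # 3

# When several values tie for most frequent, multimode returns all of them.
tied = [1, 1, 2, 2]
print(multimode(tied))      # [1, 2]
```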

5
Q

Define variance.

A

The average of the squared differences from the mean, indicating data spread.
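
The definition can be sketched step by step in Python. The list `data` is made up for illustration. Note that dividing by the count gives the population variance; the sample variance divides by the count minus one instead. The standard deviation is the square root of the variance.

```python
from statistics import pstdev, pvariance, variance

data = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data

mu = sum(data) / len(data)                      # mean: 5.0
squared_diffs = [(x - mu) ** 2 for x in data]   # squared distance of each value from the mean
manual_variance = sum(squared_diffs) / len(data)

print(manual_variance)   # 4.0 (population variance)
print(pstdev(data))      # 2.0 (population standard deviation)

# Sample variance divides by n - 1, so it is slightly larger here.
sample_variance = variance(data)
```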

6
Q

What is a sample in statistics?

A

A subset of a population used to represent the whole population.

7
Q

True or false: A population includes all members of a defined group.

A

TRUE

8
Q

What is hypothesis testing?

A

A method to determine if there is enough evidence to reject a null hypothesis.

9
Q

Fill in the blank: A p-value indicates the probability of observing results at least as extreme as those actually observed ______ the null hypothesis is true.

A

if
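
To make this concrete, the sketch below computes a one-sided p-value for a hypothetical experiment: 9 heads in 10 coin flips, with the null hypothesis that the coin is fair. The scenario and variable names are assumptions for illustration.

```python
from math import comb

n_flips = 10
observed_heads = 9  # hypothetical result

# Under the null hypothesis (fair coin), every sequence of flips is equally likely.
# Count the outcomes with at least as many heads as observed.
as_extreme = sum(comb(n_flips, k) for k in range(observed_heads, n_flips + 1))
p_value = as_extreme / 2 ** n_flips

print(p_value)  # 0.0107421875
```

A small p-value like this one means the observed result would be unusual if the null hypothesis were true.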

10
Q

Define confidence interval.

A

A range of values, computed from a sample, that is likely to contain the population parameter at a stated confidence level (for example, 95%).
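
A rough Python sketch of a 95% confidence interval for a mean, using the normal approximation. The measurements are made up, and for a sample this small a t-distribution would usually be more appropriate; the normal version is used here only because it needs nothing beyond the standard library.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev

sample = [12.1, 11.8, 12.4, 12.0, 11.9, 12.3, 12.2, 11.7]  # hypothetical measurements

confidence = 0.95
z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)  # about 1.96 for 95%

center = mean(sample)
margin = z * stdev(sample) / sqrt(len(sample))  # z times the standard error of the mean
lower, upper = center - margin, center + margin

print(f"{confidence:.0%} CI for the mean: ({lower:.2f}, {upper:.2f})")
```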

11
Q

What is a normal distribution?

A

A probability distribution that is symmetric about the mean, forming a bell curve.

12
Q

True or false: In a skewed distribution, data is symmetrically distributed around the mean.

A

FALSE

Skewed distributions have a longer tail on one side.

13
Q

What does correlation measure?

A

The strength and direction of a linear relationship between two variables.
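
One common measure is the Pearson correlation coefficient, which ranges from -1 to 1. The sketch below computes it by hand; the function name `pearson_r` and the data are assumptions for illustration.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    co_deviation = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return co_deviation / (spread_x * spread_y)

hours = [1, 2, 3, 4, 5]          # hypothetical data
rising = [2, 4, 6, 8, 10]        # increases with hours
falling = [10, 8, 6, 4, 2]       # decreases with hours

print(round(pearson_r(hours, rising), 3))   # 1.0  (perfect positive linear relationship)
print(round(pearson_r(hours, falling), 3))  # -1.0 (perfect negative linear relationship)
```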

14
Q

Fill in the blank: Regression analysis is used to predict the value of one variable based on ______.

A

another variable
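
As a sketch, simple linear regression fits a line by least squares and uses it to predict. The helper `fit_line` and the data below are hypothetical, chosen so the points fall exactly on y = 2x + 1.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for a single predictor."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    slope = (
        sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        / sum((x - mean_x) ** 2 for x in xs)
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept

xs = [1, 2, 3, 4]   # predictor (hypothetical)
ys = [3, 5, 7, 9]   # response (hypothetical)

slope, intercept = fit_line(xs, ys)
prediction = slope * 5 + intercept  # predict the response at x = 5

print(slope, intercept, prediction)  # 2.0 1.0 11.0
```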

15
Q

Define outlier.

A

A data point that differs significantly from other observations in a dataset.
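
"Differs significantly" has no single definition. One common convention, sketched below, flags points more than 1.5 times the interquartile range beyond the quartiles. The data are made up, and `statistics.quantiles` uses its default method here; other quartile methods give slightly different cutoffs.

```python
from statistics import quantiles

data = [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 50]  # hypothetical data

q1, q2, q3 = quantiles(data, n=4)  # the three quartile cut points
iqr = q3 - q1                      # interquartile range

# Fences of the 1.5 x IQR rule (a convention, not the only definition).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

outliers = [x for x in data if x < lower_fence or x > upper_fence]
print(outliers)  # [50]
```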

16
Q

What is a box plot?

A

A graphical representation of data that shows the distribution’s quartiles and outliers.

17
Q

True or false: A scatter plot shows the relationship between two quantitative variables.

A

TRUE

18
Q

What is descriptive statistics?

A

Statistics that summarize or describe characteristics of a dataset.

19
Q

Fill in the blank: Inferential statistics allows us to make predictions about a population based on a ______.

A

sample

20
Q

Define type I error.

A

The incorrect rejection of a true null hypothesis, also known as a false positive.

21
Q

What is data normalization?

A

The process of adjusting values in a dataset to a common scale.
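
One of several ways to do this is min-max scaling, which maps values onto the range 0 to 1 (z-score standardization is another). A minimal sketch with made-up values; the function name `min_max_scale` is an assumption for illustration:

```python
def min_max_scale(values):
    """Rescale values linearly so the minimum becomes 0 and the maximum becomes 1."""
    low, high = min(values), max(values)
    if high == low:
        # All values are identical, so there is no spread to rescale.
        return [0.0 for _ in values]
    return [(v - low) / (high - low) for v in values]

incomes = [10, 20, 30, 40, 50]  # hypothetical data
scaled = min_max_scale(incomes)

print(scaled)  # [0.0, 0.25, 0.5, 0.75, 1.0]
```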

22
Q

True or false: A bar chart is used to compare categorical data.

A

TRUE

23
Q

What is a frequency distribution?

A

A summary of how often each value occurs in a dataset.
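
In Python, a frequency distribution can be sketched with `collections.Counter`. The list `grades` is hypothetical data:

```python
from collections import Counter

grades = ["A", "B", "A", "C", "B", "A"]  # hypothetical data

frequency = Counter(grades)  # maps each value to how many times it occurs

# List the values from most to least frequent.
for value, count in frequency.most_common():
    print(value, count)
```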

24
Q

Fill in the blank: The interquartile range measures the spread of the middle ______ of data.

A

50%

25
Q

Define data mining.

A

The practice of examining large datasets to discover patterns and extract valuable information.