SP1 Flashcards

(29 cards)

1
Q

Probability

A

Probability is a mathematical tool used to study randomness. It deals with the chance (the likelihood) of an event occurring. Technically, a probability is a number between zero and one, inclusive, that gives the likelihood that a specific event will occur. Example : The chance that the beautiful and bright girl sitting in front of you offers you her phone number by the end of this class. Or the chance that you rank “delicious” the sandwich you bought at the cafeteria.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Population

A

all individuals, objects, or measurements whose properties are being studied.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

random sampling

A

a method of selecting a sample that gives every member of the population an equal chance of being selected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sample

A

a subset of the population studied.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Statistic

A

a numerical characteristic of the sample; a statistic estimates the corresponding population parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Parameter

A

a number that is used to represent a population characteristic and that generally cannot be determined easily.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Representative sample

A

a subset of the population that has the same characteristics as the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Variable

A

A characteristic, measure or property of interest for each person or object in a population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Numerical versus Categorical

A

variables that take on values that are indicated by numbers ( continuous or discrete ) versus variables that take on values that are names or labels such as Liberal – Conservative – New Democrat or True and False.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Data / Datum ( singular )

A

: a set of observations (a set of possible outcomes); most data can be put into two groups: qualitative (an attribute whose value is indicated by a label) or quantitative (an attribute whose value is indicated by a number). Quantitative data can be separated into two subgroups: discrete and continuous. Data is discrete if it is the result of counting (such as the number of students of a given ethnic group in a class or the number of books on a shelf). Data is continuous if it is the result of measuring (such as distance traveled or weight of luggage).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Quantitative

A

data are always numbers. Quantitative data are the result of counting or measuring attributes of a population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Qualitative

A

data are the result of categorizing or describing attributes of a population. Qualitative data are also often called categorical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

continuous

A

Data is discrete if it is the result of counting (such as the number of students of a given ethnic group in a class or the number of books on a shelf).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

discrete

A

Data is continuous if it is the result of measuring (such as distance traveled or weight of luggage)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

pie chart

A

categories of data are represented by wedges in a circle and are proportional in size to the percent of individuals in each category. The sum of all categories should add up to 100%. Otherwise, look at a bar or Pareto graph.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

bar graph

A

the length of the bar for each category is proportional to the number or percent of individuals in each category. Bars may be vertical or horizontal.

16
Q

Pareto chart

A

consists of bars that are sorted into order by category size (largest to smallest).

17
Q

Sampling with replacement

A

Sampling with replacement is truly random sampling, once a member is picked, that member goes back into the population and thus may be chosen more than once. When does it matter? And when not?

17
Q

sampling errors

A

sampling errors and nonsampling errors. The actual process of sampling causes sampling errors. For example, the sample may not be large enough, you miscalculated twice the 50th following

18
Q

nonsampling errors

A

A defective counting or reporting device can cause a nonsampling error. Postal codes were erroneous for a part of your sample on a mailing questionnaire.

19
Q

Sampling bias

A

is created when a sample is collected from a population and some members of the population are not as likely to be chosen as others (remember, each member of the population should have an equally likely chance of being chosen).

20
Q

Variation

A

is present in any set of data. Do you really think there are precisely 16oz of drink in every soda can you buy?

21
Q

Nominal scale level

A

is qualitative (categorical), such as “weekly”, “New Democrats”, “Blue eyes”. Nominal is from “nom…”, that is, name of the category. You cannot order them, you cannot average them… You can only count elements using “this label” out of your sample.

22
Q

Ordinal scale level:

A

is like nominal but you can “rank” them, order them. We could rank a restaurant from “awful” to “delicious”. We know we all prefer the “delicious” to the “awful”, no number needed! But you cannot know the difference between awful and delicious, it does not have a meaning.

23
Interval scale level
again similar to ordinal, they are ordered. You can even do a difference between the two. Take the temperature in Celsius. It is -15 Celsius out, that is cold! And 45 degrees cooler than a nice 30 degree Celsius. All true. But you cannot say it is twice as hot at 30 than at -15 … The ratio does not have any sense. Another element: usually the interval scale data have no absolute zero…
24
Ratio scale level
is the most complete one. Real numbers, ordered, with a meaningful difference and ratio. Example : number of siblings in the family, duration time, weight, length, volume, etc…
25
Frequency
the number of times a value of the data occurs.
26
Cumulative relative frequency
is the accumulation of the previous relative frequencies.
27
relative frequency
the number of times a value of the data occurs.