quantitative variables
(numerical): yield numerical information.
qualitative variables
(categorical): does not assume a numerical value but rather, is classifiable into 2 or more non-numeric, distinct, categorical values,
Variable
A variable is a characteristic/condition that varies for different individuals/observations,
i.e. height, weight, g.p.a., gender, number of times visited by aliens, etc.
discrete variable
is a variable whose possible values can be listed, even though the list may continue indefinitely.
-involves a count of something
continuous variable
is a variable whose possible values form some interval of numbers.
-involves a measurement of something
Data
Values of a variable.
Observation
Each individual piece of date is called this.
Data set
A collection of all observations is called a data set
frequency distribution
A frequency distribution of qualitative data is a listing of the distinct values and their frequencies.
Frequency or count
In a qualitative dataset, the number of times a particular, distinct value occurs is referred to as its frequency or count.
To Construct a Frequency Distribution of Qualitative Data
Step 1 List the distinct values of the observations in the data set in the first column of a table.
Step 2 For each observation, place a tally mark in the second column of the table in the row of the appropriate distinct value.
Step 3 Count the tallies for each distinct value and record the totals in the third column of the table.
Relative-Frequency Distribution of Qualitative Data (or proportions)
A relative-frequency distribution of qualitative data is a listing of the distinct values and their relative frequencies.
-to obtain we first find a frequency distribution and then divide each freq by the total number of observations.
To Construct a Relative-Frequency Distribution of Qualitative Data
Step 1 Obtain a frequency distribution of the data.
Step 2 Divide each frequency by the total number of observations.
Pie chart
Is a disk divided into wedge-shaped pieces proportional to the relative frequencies of the qualitative data
To construct a pie chart
Step 1 Obtain a relative-frequency distribution of the data by applying
Procedure 2.2.
Step 2 Divide a disk into wedge-shaped pieces proportional to the relative frequencies
Step 3 Label the slices with the distinct values and their relative frequencies.
Bar chart
To construct a bar chart
Step 1 Obtain a relative-frequency distribution of the data by applying
Procedure 2.2.
Step 2 Draw a horizontal axis on which to place the bars and a vertical axis on which to display the relative frequencies.
Step 3 For each distinct value, construct a vertical bar whose height equals the relative frequency of that value.
Step 4 Label the bars with the distinct values, the horizontal axis with the name of the variable, and the vertical axis with “Relative frequency.”
Classes/categories/bins
To organize quantitative data, we first group the observations into classes (also known as categories or bins) and then treat the classes as the distinct values of qualitative data.
3 common sense and important guidelines for grouping quantitative data into classes are:
Single-value grouping
we use the distinct values of the observations as the classes, a method completely analogous to that used for qualitative data.
Limit grouping
A second way to group quantitative data is to use class limits. With this method, each class consists of a range of values.
Terms used in limit grouping
Lower class limit: The smallest value that could go in a class.
Upper class limit: The largest value that could go in a class
Class width: The difference between the lower limit of a class and the lower limit of the next-higher class.
Class mark: The average of the two class limits of a class.
Cutpoint grouping
A third way to group quantitative data is to use class cutpoints. As with limit grouping, each class consists of a range of values.
Terms used in cutpoint grouping
Lower class cutpoint: The smallest value that could go in a class.
Upper class cutpoint: The smallest value that could go in the next-higher class (equivalent to the lower cutpoint of the next-higher class).
Class width: The difference between the cutpoints of a class.
Class midpoint: The average of the two cutpoints of a class.