Ch.2 Flashcards

Question

Choosing the grouping method

Answer 1

Single-value grouping: Use with discrete data in which there are only a small number of distinct values. Limit grouping: Use when the data are expressed as whole numbers and there are too many distinct values to employ single-value grouping. Cutpoint grouping: Use when the data are continuous and are expressed with decimals.
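The rule of thumb above can be sketched as a small helper. This is only an illustration: the function name `choose_grouping` and the cutoff `max_distinct=10` for "a small number of distinct values" are assumptions, not part of the card, and whole-number data is used here as a stand-in for discrete data.

```python
def choose_grouping(data, max_distinct=10):
    """Suggest a grouping method for a list of numbers.

    max_distinct is an assumed cutoff for "a small number of
    distinct values"; the card does not give a specific number.
    """
    # Treat the data as discrete when every value is a whole number.
    all_whole = all(float(x).is_integer() for x in data)
    distinct = len(set(data))

    if all_whole and distinct <= max_distinct:
        return "single-value"
    if all_whole:
        return "limit"
    # Values expressed with decimals are treated as continuous.
    return "cutpoint"
```

For example, household sizes such as `[1, 2, 2, 3, 4]` would be labeled "single-value", while measurements such as `[1.5, 2.25, 3.1]` would be labeled "cutpoint".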

Answer 2

A histogram displays the classes of the quantitative data on a horizontal axis and the frequencies (relative frequencies, percents) of those classes on a vertical axis. - The frequency (relative frequency, percent) of each class is represented by a vertical bar whose height is equal to the frequency (relative frequency, percent) of that class. The bars should be positioned so that they touch each other. * For single-value grouping, we use the distinct values of the observations to label the bars, with each such value centered under its bar. * For limit grouping or cutpoint grouping, we use the lower class limits (or, equivalently, lower class cutpoints) to label the bars. Note: Some statisticians and technologies use class marks or class midpoints centered under the bars. Look for: - Central or typical value and corresponding spread - Gaps in the data or outliers - Presence of symmetry in the distribution - Number and location of peaks

Answer 3

Uses frequencies on the vertical axis. - A histogram that uses relative frequencies or percents on the vertical axis is called a relative-frequency histogram or percent histogram, respectively.

Answer 4

Step 1 Obtain a frequency (relative-frequency, percent) distribution of the data. Step 2 Draw a horizontal axis on which to place the bars and a vertical axis on which to display the frequencies (relative frequencies, percents). Step 3 For each class, construct a vertical bar whose height equals the frequency (relative frequency, percent) of that class. Step 4 Label the bars with the classes, as explained in Definition 2.9, the horizontal axis with the name of the variable, and the vertical axis with "Frequency" ("Relative frequency," "Percent").
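Steps 1 to 4 above can be sketched in plain Python for single-value grouping. This is a rough text version, not a plotting routine: the bars are drawn sideways with `#` characters instead of vertically, and the function names and the default label `"Value"` are assumptions made for the example.

```python
from collections import Counter


def frequency_distribution(data):
    """Step 1: map each distinct value to (frequency, relative frequency)."""
    counts = Counter(data)
    n = len(data)
    return {value: (counts[value], counts[value] / n) for value in sorted(counts)}


def text_histogram(data, variable="Value"):
    """Steps 2-4: one bar of '#' per class, labeled with its value."""
    dist = frequency_distribution(data)
    # Header row names the variable and the quantity shown by the bars.
    lines = [f"{variable:>8} | Frequency"]
    for value, (freq, _rel) in dist.items():
        # Bar length equals the frequency of the class.
        lines.append(f"{value:>8} | {'#' * freq} ({freq})")
    return "\n".join(lines)


# Hypothetical data, used only to show the call.
print(text_histogram([2, 3, 3, 4, 4, 4, 5], variable="Siblings"))
```

Swapping `freq` for the relative frequency in the label would give the relative-frequency version of the same display.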

Answer 5

A dotplot is a graph in which each observation is plotted as a dot at an appropriate place above a horizontal axis. Observations having equal values are stacked vertically. Dotplots are particularly useful for showing the relative positions of the data in a data set or for comparing two or more data sets. Look for: - Typical values and corresponding spread - Gaps in the data or outliers - Presence of symmetry in the distribution - Number and location of peaks

Answer 6

Step 1 Draw a horizontal axis that displays the possible values of the quantitative data. Step 2 Record each observation by placing a dot over the appropriate value on the horizontal axis. Step 3 Label the horizontal axis with the name of the variable.
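The three steps above can be sketched as a text dotplot. The sketch assumes whole-number data so that every integer between the minimum and maximum gets a position on the axis; the function name `text_dotplot` is made up for this example.

```python
from collections import Counter


def text_dotplot(data, variable="Value"):
    """Return a text dotplot for whole-number data (an assumption)."""
    counts = Counter(data)
    # Step 1: the horizontal axis covers every integer from min to max.
    axis_values = list(range(min(data), max(data) + 1))
    width = max(len(str(v)) for v in axis_values) + 1

    rows = []
    # Step 2: build rows from the top of the tallest stack downward,
    # so equal observations end up stacked vertically.
    for level in range(max(counts.values()), 0, -1):
        rows.append(
            "".join(("." if counts[v] >= level else " ").rjust(width)
                    for v in axis_values)
        )
    rows.append("".join(str(v).rjust(width) for v in axis_values))
    # Step 3: label the horizontal axis with the variable name.
    rows.append(variable)
    return "\n".join(rows)


# Hypothetical data, used only to show the call.
print(text_dotplot([1, 1, 3, 4, 4, 4], variable="Pets per household"))
```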

Answer 7

In a stem-and-leaf diagram (or stemplot), each observation is separated into two parts, namely, a stem--consisting of all but the rightmost digit--and a leaf, the rightmost digit.

Answer 8

Step 1 Think of each observation as a stem--consisting of all but the rightmost digit--and a leaf, the rightmost digit. Step 2 Write the stems from smallest to largest in a vertical column to the left of a vertical rule. Step 3 Write each leaf to the right of the vertical rule in the row that contains the appropriate stem. Step 4 Arrange the leaves in each row in ascending order.
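The four steps above can be sketched as follows, assuming nonnegative whole-number observations. The function name `stem_and_leaf` is made up for this example, and listing stems that have no leaves is a choice of this sketch rather than something the steps require.

```python
from collections import defaultdict


def stem_and_leaf(data):
    """Return a stem-and-leaf diagram for nonnegative integers (an assumption)."""
    rows = defaultdict(list)
    for x in data:
        # Step 1: the stem is all but the rightmost digit, the leaf is that digit.
        stem, leaf = divmod(x, 10)
        rows[stem].append(leaf)

    lines = []
    # Step 2: stems from smallest to largest, left of the vertical rule.
    for stem in range(min(rows), max(rows) + 1):
        # Steps 3 and 4: leaves go right of the rule, in ascending order.
        leaves = "".join(str(leaf) for leaf in sorted(rows.get(stem, [])))
        lines.append(f"{stem:>3} | {leaves}")
    return "\n".join(lines)


# Hypothetical data, used only to show the call.
print(stem_and_leaf([23, 25, 31, 47, 41, 40]))
```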

Answer 9

The distribution of a data set is a table, graph, or formula that provides the values of the observations and how often they occur.

Answer 10

When considering the shape of a distribution, you should observe its number of peaks (highest points). - A distribution is unimodal if it has one peak, - bimodal if it has two peaks, and - multimodal if it has three or more peaks.
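As a rough illustration of the classification above, the sketch below counts peaks in a list of class frequencies. It treats every local maximum as a peak and counts a flat-topped peak once; both are simplifying assumptions, and with real data small bumps are usually judged by eye rather than counted mechanically. The function names are made up for this example.

```python
def count_peaks(frequencies):
    """Count local maxima in a list of class frequencies."""
    if not frequencies:
        raise ValueError("need at least one class frequency")
    # Collapse runs of equal heights so a flat-topped peak counts once.
    heights = [f for i, f in enumerate(frequencies)
               if i == 0 or f != frequencies[i - 1]]
    # Pad both ends so the first and last classes can be peaks too.
    padded = [float("-inf")] + heights + [float("-inf")]
    return sum(
        1
        for i in range(1, len(padded) - 1)
        if padded[i - 1] < padded[i] > padded[i + 1]
    )


def modality(frequencies):
    """Label a distribution by its number of peaks."""
    peaks = count_peaks(frequencies)
    if peaks == 1:
        return "unimodal"
    if peaks == 2:
        return "bimodal"
    return "multimodal"
```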

Answer 11

A distribution that can be divided into two pieces that are mirror images of one another is called symmetric. - The three distributions called bell shaped, triangular, and uniform (or rectangular) are specific categories of symmetric distributions.

Answer 12

A unimodal distribution that is not symmetric is either right skewed or left skewed. • A right-skewed distribution rises to its peak rapidly and comes back toward the horizontal axis more slowly--its "right tail" is longer than its "left tail." • A left-skewed distribution rises to its peak slowly and comes back toward the horizontal axis more rapidly--its "left tail" is longer than its "right tail." It is important to note the following distinction between general and specific classifications of distribution shape: • Modality, symmetry, and skewness are general classifications of distribution shape. • Such designations as bell shaped, triangular, and uniform are specific classifications of distribution shape.

Answer 13

Population data: The values of a variable for the entire population. Sample data: The values of a variable for a sample of the population.

Answer 14

The distribution of population data is called the population distribution, or the distribution of the variable. The distribution of sample data is called a sample distribution.

Answer 15

For a simple random sample, the sample distribution approximates the population distribution (i.e., the distribution of the variable under consideration). The larger the sample size, the better the approximation tends to be.
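A small simulation can make this statement concrete. The population below is hypothetical (10,000 simulated die rolls), and the helper names are made up for this example. The sketch compares relative frequencies in the population with those in simple random samples of increasing size; the gap tends to shrink as the sample grows, although any single run can deviate from that pattern.

```python
import random
from collections import Counter


def relative_frequencies(values):
    """Map each distinct value to its relative frequency."""
    counts = Counter(values)
    n = len(values)
    return {v: counts[v] / n for v in sorted(counts)}


def max_gap(population, sample):
    """Largest difference between population and sample relative frequencies."""
    pop = relative_frequencies(population)
    sam = relative_frequencies(sample)
    return max(abs(pop[v] - sam.get(v, 0.0)) for v in pop)


rng = random.Random(0)  # fixed seed so the run is repeatable
population = [rng.randint(1, 6) for _ in range(10_000)]  # hypothetical population

for n in (10, 100, 1000):
    # rng.sample draws without replacement: a simple random sample.
    sample = rng.sample(population, n)
    print(n, round(max_gap(population, sample), 3))
```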

Answer 16

Misleading graphs and charts can result in this. Gives a false visual impression

Answer 17

data characterized by one variable.

Answer 18

data characterized by two variables. There are a number of different ways to summarize bivariate data: - Side-by-side/stacked plots: generally used when the variable of interest is numerical and the grouping variable is categorical (or sometimes discrete with a small number of distinct values). - Scatterplot: when both variables are numerical, a scatterplot is the preferred graphical summary. A scatterplot is a graphical summary of two numerical variables: the x-variable goes on the x-axis, the y-variable on the y-axis. - Two-way table: gives a joint frequency (or joint relative frequency), which is the number of elements in the data corresponding to one level of each categorical variable.
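The joint frequencies in a two-way table can be sketched with a counter over pairs of levels. The function name `two_way_table` and the sample data are hypothetical, chosen only to illustrate the idea.

```python
from collections import Counter


def two_way_table(pairs):
    """Return nested dicts of joint frequencies: table[row_level][col_level]."""
    joint = Counter(pairs)
    row_levels = sorted({r for r, _ in pairs})
    col_levels = sorted({c for _, c in pairs})
    # A combination that never occurs gets a joint frequency of 0.
    return {r: {c: joint[(r, c)] for c in col_levels} for r in row_levels}


# Hypothetical observations: (class section, passed the quiz?)
observations = [("A", "yes"), ("A", "no"), ("B", "yes"), ("A", "yes")]
table = two_way_table(observations)
for row_level, row in table.items():
    print(row_level, row)
```

Dividing each count by the number of observations would give the joint relative frequencies instead.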

(42 cards)