Mean/Median/Mode
Finding the Median
Cases where Mean = Median
The Mean can = Medium in the below cases:
* The Dataset is Evenly Spaced: The gap between the two numbers is equal to each other: 1, 5, 9, 13, 17→+4 btwn each term
* The Dataset is Symmetrical: The gaps between the two numbers is symmetrical: 4, 6, 9, 9, 12, 14
4+2=6, 6+3=9, 9, 9+3=12, 12+2=14
If Mean = Median
If the Mean = Median, this can make some calculations easier:
* Calculating Median/Mean: (First # + Last #) ÷ 2
* Calculating Sum of Integers: Mean x # of integers
Mode
The Weighted Mean in FQ
Calculate weighted avg of below:
Number/FQ
3/1
7/5
11/3
The average but places importance on numbers based on their weight/frequency.
Weighted Mean = (Number x FQ) ÷ Total number of #/FQ
(3 x 1) + (7 x 5) + (11 x 3) ÷ 9 = 71/9
The Weighted Mean in Grades
What is the weighted avg?
Midterm: 85% score/40% weight
Final Exam: 80% score/60% weight
Weighted Avg: (.85 x 40) + (.8 x 60) = 34 + 48→82 Weighted Avg
Quartiles
Dividing the dataset into four equal groups.
1st Quartile, 2nd Quartile, 3rd Quartile, 4th Quartile
Q₁, Q₂, Q₃ values
What are the quartiles for the below datasets?
* Dataset 1: 10,14,18,21,25,34,46
* Dataset 2: 13,17,19,22,23,40,41,46
Percentiles
Measures of Dispersion
Tells us how “spread out” a certain dataset is.
3 measures:
* The Range
* The Interquartile Range
* Standard Deviation
Range
What is the range of the below dataset?
Dataset: {62, 60, 75, 73, 80, 95}
Range = Largest # - Smallest #
* Make sure to arrange numbers in ascending order
If all values in a dataset are equal to each other, the range is
0
Outliers and Effects
Datapoints that differ significantly from most (or all) of the other data.
Outlier Effects:
* Average: Affected. If we add a large number outlier to the data, the average will increase.
* Median: is NOT affected (or only slightly so).
* Range: Affected. If we add a large number outlier to the data, the range will increase.
* Interquartile range: is NOT affected.
Interquartile Range
Calculate the interquartile range of the dataset below:
9,13,17,30,34,37,42,46,49,53
Use to find a more realistic range when there is an outlier.
Interquartile Range: Q₃ - Q₁
Boxplots
Construct a box plot with below values:
3,6,12,19,31,36,37,60
A box diagram that indicates 5 data points:
* Lowest #
* Highest #
* Q₁
* Q₂
* Q₃
Standard Deviation
A value that gives us an idea of how “spread out” a group of numbers is. The higher the standard deviation, the greater the spread.
Calculating SD (simple cases)
What’s the standard deviation of a dataset of 10 and 22?
Calculating SD (complex cases)
What is the SD of the dataset: 1,4,9,16,25?
Population vs Sample SD
Sample SD of below dataset:
5, 6, 8, 21
Population: Calculating SD using all terms
Sample: Calculating SD with only a sample (not all)
Population: Normal SD calculation
Sample: Normal SD calculation but you divide by ÷ (# of terms -1)
SD Effects
Sets vs. Lists
Sets:
* Written with (), [], or no brackets
* All elements are in order: (1, 2, 3) ≠ (3, 2, 1)
* Does not count repeats: 1, 1, 2, 2, 3 = 3 elements
* Can be finite or infinite elements
Lists:
* Written with {}
* Elements do not have to be in order
* Counts repeats: {1, 1, 2, 2, 3} = 5 elements
* Always finite
Standardization
Standardize the values below. The average is 3 and σ is √2 or approximately 1.4
1,2,3,4,5
Set Intersections
What is the set intersections for the below sets?
Set A={1,2,3,4,5,6,7,8,9,10}
Set B ={2,3,5,7,11,13,17,19}
For the below sets, what is A∩B?
Set A={1,2,3,4,5,6,7,8,9,10}
Set B ={2,3,5,7,11,13,17,19}
Since they both share 2,3,5,7 in common, the set intersection:
A∩B={2,3,5,7}
Set Unions
What is the set union for the below sets?
Set A={1,2,3,4,5,6,7,8,9,10}
Set B ={2,3,5,7,11,13,17,19}
For the below sets, what is A∪B?
Set A={1,2,3,4,5,6,7,8,9,10}
Set B ={2,3,5,7,11,13,17,19}
The set union is all the numbers:
A∪B={1,2,3,4,5,6,7,8,9,10,11,13,17,19}