Week 3 - Visualisation Flashcards

(17 cards)

1
Q

Showing Data for A Variable - Relative Freq.

A
  • proportion of observations having a given measurement
  • Calc.: freq. ÷ total number of observations
  • Proportion of occurrences of each value in the data set.
  • Describes the fraction of occurrences of each value of a variable.
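The calculation above (freq. ÷ total) can be sketched in a few lines of Python. This is a minimal illustration only; the function name `relative_frequencies` and the blood-type data are made up for the example.

```python
from collections import Counter

def relative_frequencies(observations):
    """Return {value: proportion of observations having that value}."""
    counts = Counter(observations)   # absolute freq. of each value
    total = len(observations)        # total number of observations
    return {value: count / total for value, count in counts.items()}

# Hypothetical data, invented for illustration only.
blood_types = ["A", "O", "O", "B", "A", "O", "AB", "O"]
rel_freq = relative_frequencies(blood_types)
print(rel_freq)  # {'A': 0.25, 'O': 0.5, 'B': 0.125, 'AB': 0.125}
```

The relative freqs always sum to 1, which is a quick sanity check on the table.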
2
Q

Showing Data for A Variable - Freq. Table & Bar Graph (Categorical)

A
  • Freq. table = text display of the # of occurrences of each category in the data set.
  • Bar graph uses height of rectangular bars to visualize the freq of occurrence of each category.
3
Q

Showing Data for A Variable - Making a Good Bar Graph

A
  • Illustrating freqs in each group.
  • Eye compares the areas of the bars → bars must be equal width so that area = proportional to freq.
  • Categorical variable = nominal → order by freq. of occurrence
  • Ordinal categorical variable → order naturally
  • Bars should be separate
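The ordering rules above (nominal → by freq., ordinal → natural order) can be sketched as a small freq. table builder with a text-only bar graph. The function names and the data are hypothetical, chosen just for this example.

```python
from collections import Counter

def frequency_table(observations, natural_order=None):
    """Return [(category, count), ...].

    Nominal variable (natural_order is None): sort by decreasing freq.
    Ordinal variable: keep the natural order that the caller supplies.
    """
    counts = Counter(observations)
    if natural_order is not None:
        return [(category, counts[category]) for category in natural_order]
    return counts.most_common()

def text_bar_graph(table):
    """Draw one separate, equal-width bar of '#' characters per category."""
    return "\n".join(f"{category:<8}{'#' * count} {count}" for category, count in table)

# Hypothetical data, invented for illustration only.
pets = ["cat", "dog", "dog", "fish", "dog", "cat"]    # nominal
doses = ["low", "high", "medium", "low", "low", "high"]  # ordinal

nominal_table = frequency_table(pets)
ordinal_table = frequency_table(doses, natural_order=["low", "medium", "high"])
print(text_bar_graph(nominal_table))
print(text_bar_graph(ordinal_table))
```

Here `nominal_table` comes out as dog, cat, fish (most to least frequent), while `ordinal_table` stays low, medium, high regardless of the counts.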
4
Q

Showing Data for A Variable - Pie Charts

A
  • Used to display freq. of categorical variables
  • Uses coloured wedges of a circle to rep freq. (bigger wedge = higher freq.)
  • Eye = more difficulty comparing freqs of diff. groups than w/ a bar graph
  • Hard to compare 2+ pie charts side-by-side
5
Q

Showing Data for A Variable - Histograms

A
  • Shows data for single numerical variable
  • Histogram uses the area of rectangular bars to display freq.
  • Splits data values into intervals/bins of equal width
  • Freq. of observations falling into each bin = displayed.
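The binning step above can be sketched as follows. This is a minimal illustration that assumes every value is ≥ `start`; the function name `bin_counts` and the data are invented for the example.

```python
import math

def bin_counts(values, bin_width, start):
    """Count observations in equal-width bins [start + k*w, start + (k+1)*w)."""
    if min(values) < start:
        raise ValueError("start must not exceed the smallest value")
    n_bins = math.floor((max(values) - start) / bin_width) + 1
    counts = [0] * n_bins
    for v in values:
        counts[math.floor((v - start) / bin_width)] += 1
    return counts

# Hypothetical measurements, invented for illustration only.
values = [2, 3, 5, 7, 8, 8, 9, 12, 13, 14]
counts = bin_counts(values, bin_width=5, start=0)
print(counts)  # [2, 5, 3] for the bins [0,5), [5,10), [10,15)
```

Each count would become the height of one bar; with equal-width bins, bar area and bar height tell the same story.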
6
Q

Showing Data for A Variable - Describing Histogram Shape

A
  • Reveals the shape of a freq dist.
  • Peak = any interval of the freq dist that = noticeably more frequent than surrounding intervals.
  • Mode = interval w/ the highest peak.
  • Bell-shaped freq dist = 1 peak in the center of the range of observations.
  • A freq dist w/ 2 peaks = bimodal
  • Freq dist = symmetric → left half = mirror image of right half
  • If a freq dist = not symmetric → skewed.
  • Left skew = -ve (long tail extends to the left)
  • Right skew = +ve (long tail extends to the right)
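One way to see why left skew is called -ve and right skew +ve is to compute a skewness coefficient and look at its sign. The sketch below uses the moment-based coefficient (third central moment ÷ variance^1.5), which is an assumption made here for illustration and not something the cards specify; the data are invented.

```python
def skewness(values):
    """Moment-based skewness: >0 = right skew, <0 = left skew, 0 = symmetric."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # variance (second central moment)
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5  # assumes the values are not all identical

# Hypothetical data, invented for illustration only.
right_skewed = [1, 2, 2, 3, 3, 3, 4, 10]     # long tail to the right
left_skewed = [-v for v in right_skewed]     # mirror image: long tail to the left
symmetric = [1, 2, 3, 4, 5]

print(skewness(right_skewed) > 0)  # True
print(skewness(left_skewed) < 0)   # True
print(skewness(symmetric))         # 0.0
```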
7
Q

Showing Data for A Variable - Outliers

A
  • Extreme data points lying well away from the rest of the dataset
  • If they result from mistakes in recording the data → can be removed
  • If they rep real observations from nature → leave in
8
Q

Showing Data for A Variable - Making a Good Histogram

A
  • Choice of interval (bin) width = can affect conclusions
  • Each bar must start from 0 → area = proportional to freq.
  • No space betw. bars
  • No strict rules on # of bars
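A small sketch of how interval width can change the conclusion: the same data look bimodal with narrow bins and flat with wide bins. The helper `bin_counts` and the data are hypothetical, and the sketch assumes all values are ≥ `start`.

```python
import math

def bin_counts(values, bin_width, start=0):
    """Count observations in equal-width bins, starting at `start`."""
    n_bins = math.floor((max(values) - start) / bin_width) + 1
    counts = [0] * n_bins
    for v in values:
        counts[math.floor((v - start) / bin_width)] += 1
    return counts

# Hypothetical data with two clusters, invented for illustration only.
values = [1, 2, 2, 3, 6, 7, 7, 8]

narrow = bin_counts(values, bin_width=2)  # [1, 3, 0, 3, 1] -> two peaks
wide = bin_counts(values, bin_width=5)    # [4, 4]          -> no visible peak
print(narrow, wide)
```

Trying a few widths before settling on one is a reasonable habit, since there are no strict rules on the number of bars.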
9
Q

Showing Assoc. Betw. 2 Variables & Diffs Betw. Groups - Categorical Variables

A
  • 2 categorical variables = associated → relative freqs for 1 variable = differ among categories of the other variable.
  • To reveal such assoc → use a contingency table/mosaic plot/ grouped bar graph.
  • Contingency table = freq table for 2+ categorical variables; so called b/c it shows how the freqs of categories in a response variable = contingent on the value of the explan variable
  • Cell = 1 combo of categories of row & column variables
  • Explanatory = column
  • Response = row
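The layout above (explanatory = column, response = row, one cell per combo) can be sketched as a nested dictionary. The function name `contingency_table` and the treatment/outcome data are hypothetical, made up for this example.

```python
from collections import Counter

def contingency_table(pairs):
    """Build table[response][explanatory] = freq. from (explanatory, response) pairs."""
    counts = Counter(pairs)                     # one count per combo (cell)
    columns = sorted({explanatory for explanatory, _ in counts})
    rows = sorted({response for _, response in counts})
    return {r: {c: counts[(c, r)] for c in columns} for r in rows}

# Hypothetical data, invented for illustration only.
pairs = (
    [("control", "died")] * 3 + [("control", "survived")] * 7
    + [("treated", "died")] * 6 + [("treated", "survived")] * 4
)
table = contingency_table(pairs)

for response, row in table.items():
    print(f"{response:<10}", row)
```

Reading down a column gives the freqs of each response category within one explanatory group.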
10
Q

Showing Assoc. Betw. 2 Variables & Diffs Betw. Groups - Grouped Bar Graph

A
  • Uses heights of rectangles to graph the freq. of occurrence of all combos of 2+ categorical variables
  • Diff. categories of the response variable = repped by diff. colours
11
Q

Showing Assoc. Betw. 2 Variables & Diffs Betw. Groups - Mosaic Plots

A
  • Bars w/in each treatment group = stacked on top of each other
  • W/in a stack, bar area & height = relative freq of each response
  • Easier to see assoc. betw variables
  • Assoc. = vertical position at which colours meet = differs betw stacks
  • No assoc. = meeting point = the same in every stack
  • Width = proportional to # of observations in that group
  • Only provides relative freqs not absolute freq
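The geometry described above (stack width ∝ group size, segment height = relative freq. of each response within the group) can be computed directly from counts. This is a sketch under invented data; `mosaic_geometry` is a hypothetical helper, not a plotting-library function.

```python
def mosaic_geometry(counts):
    """counts[group][response] = freq. Returns width and segment heights per stack."""
    grand_total = sum(sum(responses.values()) for responses in counts.values())
    geometry = {}
    for group, responses in counts.items():
        group_total = sum(responses.values())
        geometry[group] = {
            # Width is proportional to the number of observations in the group.
            "width": group_total / grand_total,
            # Heights are relative freqs within the group, so each stack sums to 1.
            "heights": {r: n / group_total for r, n in responses.items()},
        }
    return geometry

# Hypothetical counts, invented for illustration only.
counts = {
    "control": {"died": 3, "survived": 7},
    "treated": {"died": 12, "survived": 18},
}
geometry = mosaic_geometry(counts)
print(geometry)
```

In this made-up example the colours would meet at 0.3 in one stack and 0.4 in the other, so the meeting points differ and the plot would suggest an assoc. The absolute freqs (3 vs 12) are no longer visible, only the proportions.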
12
Q

Showing Assoc. Betw. 2 Variables & Diffs Betw. Groups - Scatter Plots

A
  • Assoc betw numerical variables
  • Can show +ve, -ve or no (0) correlation
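The three cases above can be illustrated numerically with Pearson's correlation coefficient, used here as an assumed measure since the cards do not name one. The data are invented for the example.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two numerical variables."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)  # assumes neither variable is constant

# Hypothetical data, invented for illustration only.
x = [1, 2, 3, 4, 5]
y_up = [2, 4, 6, 8, 10]     # rises with x   -> +ve
y_down = [10, 8, 6, 4, 2]   # falls with x   -> -ve
y_flat = [2, 1, 0, 1, 2]    # no linear trend -> approx. 0

print(pearson_r(x, y_up), pearson_r(x, y_down), pearson_r(x, y_flat))
```

Note that `y_flat` has a clear V-shaped pattern yet a correlation of about 0, which is one reason to look at the scatter plot itself and not only the coefficient.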
13
Q

Showing Assoc. Betw. 2 Variables & Diffs Betw. Groups - Strip Plots

A
  • Assoc betw a numerical & a categorical variable
  • Explan variable = categorical
  • Need to spread/jitter points along horizontal axis to reduce overlap → points = easier to see
  • Each observation = repped by a dot
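Jittering can be sketched as adding a small random horizontal offset to each point's group position. The function name, the `spread` default and the group labels are assumptions made for this illustration.

```python
import random

def jittered_positions(groups, spread=0.2, seed=0):
    """Return one horizontal position per observation.

    Each category gets an integer centre (0, 1, 2, ...); every point is then
    shifted by a random amount within +/- spread to reduce overlap.
    """
    rng = random.Random(seed)  # seeded so the plot is reproducible
    centres = {level: i for i, level in enumerate(sorted(set(groups)))}
    return [centres[g] + rng.uniform(-spread, spread) for g in groups]

# Hypothetical group labels, invented for illustration only.
groups = ["a", "b", "a", "b", "b"]
xs = jittered_positions(groups, spread=0.2, seed=1)
print(xs)
```

Only the horizontal (categorical) axis is jittered; the numerical values on the vertical axis stay exactly as measured.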
14
Q

Showing Assoc. Betw. 2 Variables & Diffs Betw. Groups - Violin Plots

A
  • Compact visual summary of a numerical variable in each group
  • Approxs the freq dist for each group (sim to histogram)
  • Dist = smoothed & shown w/ mirror image
  • Central dot = mean
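The "smoothed & mirrored" idea can be sketched with a Gaussian kernel density estimate, which is one common way to smooth a freq dist and is assumed here for illustration (the cards do not say which smoother is used). The function names, bandwidth and data are hypothetical.

```python
import math

def gaussian_kde(values, grid, bandwidth):
    """Smoothed density of `values`, evaluated at each point of `grid`."""
    norm = 1 / (len(values) * bandwidth * math.sqrt(2 * math.pi))
    return [
        norm * sum(math.exp(-0.5 * ((g - v) / bandwidth) ** 2) for v in values)
        for g in grid
    ]

def violin_outline(values, grid, bandwidth):
    """Left and right edges of the violin: the density and its mirror image."""
    return [(-d, d) for d in gaussian_kde(values, grid, bandwidth)]

# Hypothetical measurements for one group, invented for illustration only.
values = [4, 5, 5, 6, 5]
grid = list(range(0, 11))
outline = violin_outline(values, grid, bandwidth=1.0)

widest = max(range(len(grid)), key=lambda i: outline[i][1])
print(grid[widest])  # 5: the violin is widest where the data are densest
```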
15
Q

Showing Assoc. Betw. 2 Variables & Diffs Betw. Groups - Multi Histograms

A
  • Histogram for each group
  • Plots should be stacked one above another → easier to compare
  • Use same scale for each
16
Q

Showing Trends in Space & Time - Line Graph

A
  • Uses dots connected by lines to display trends over time
  • Helps show temporal patterns easily
  • Steepness of each line segment reps the change from one point to the next
  • Area under the line betw two points = proportional to total number during that time
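The steepness of each segment can be computed as the change in the measurement divided by the change in time. A minimal sketch with invented data; `segment_slopes` is a hypothetical helper name.

```python
def segment_slopes(times, values):
    """Slope of each line segment: change in value per unit of time."""
    points = list(zip(times, values))
    return [
        (v1 - v0) / (t1 - t0)
        for (t0, v0), (t1, v1) in zip(points, points[1:])
    ]

# Hypothetical time series, invented for illustration only.
times = [0, 1, 2, 4]
values = [10, 14, 13, 21]
slopes = segment_slopes(times, values)
print(slopes)  # [4.0, -1.0, 4.0]
```

The last segment rises by 8 over 2 time units, so it is exactly as steep as the first one even though the jump looks larger.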
17
Q

Showing Trends in Space & Time - Maps

A
  • Spatial equiv of line graph
  • Uses colour gradient to rep numerical response @ multi locations on surface
  • Explan variable = space (e.g. regions defined by political/geographical boundaries)
  • Used to show measurements @ locations