Statistics Flashcards

(24 cards)

1
Q

Variability of results.
How do you increase the validity of results?
What is the problem with this method?

A

To increase the validity of values that are used, it is normal to take replicate readings and calculate a mean value. However, this value gives no indication of how variable the original results were. Two means may have different values but the range of original values in both data sets may overlap so much that it is not possible to be confident that they really are significantly different. Alternatively, two means may have been produced from data sets that hardly overlap and there is a significant difference between them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Variability of results.
How can the degree of variability be calculated?

A

The degree of variability can be calculated as the standard deviation.

For normally distributed results, it is possible to estimate the percentage of all values that are within a particular range around the mean.
Where a graph is drawn using means, error bars may be added to show the standard deviation. This can help to assess whether the differences in mean values are statistically significant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Significance.
What do most scientific investigations produce?

A

Most scientific investigations produce quantitative results that are compared to establish the significance of any differences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Significance.
What do good scientific analysis never ‘proves’?

A

Good scientific analysis never ‘proves’ anything as there is always some remaining uncertainty. The key issue is the level of confidence that can be held over the significance of a conclusion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Significance.
What makes a result statistically significant?

A

A result is statistically significant if a result is unlikely to have occurred by random chance. The probability that the result was produced by random chance can be calculated. The lower it is, the greater the reliability if the results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Significance.
What value refers to the probability if an event being caused by random chance?

A

The P-value refers to the probability of an event being caused by random chance and is usually expressed as the probability that a single event was significant: 0.9, 0.95, 0.99, 0.995, or as a degree of confidence that the difference investigated is significant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Interpreting significance.
Comparing P-values and significance levels .
what is the significance level of P-values of:
0.1
0.05
0.01
0.005

A

Showing that a result is significant does not mean it is important or that a casual relationship has been found. Two correlated factors may both be caused by another variable that has not been investigated.
P- value Significance level: % confidence
that the results were NOT
produced by random chance.
0.1 90%
0.05 95%
0.01 99%
0.005 99.5%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Statistical tests.
What is it important you ensure when choosing a statistical test?

A

It is important that the test selected is appropriate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Statistical tests.
What are you trying to do-
if you are comparing the relationship between two variables what test should you use?

A

Spearman’s rank correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Statistical tests.
What are you trying to do-
If you are comparing the means of two data sets and the data was produced through calculated or counted what test should you use?

A

Mann-Whitney u test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Statistical tests.
What are you trying to do-
if you are comparing means of two data sets and the data was produced by being measured and the data is normally distributed what test should you use?

A

T-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Statistical tests.
What are you trying to do-
if you are comparing means of two data sets and the data was produced by being measured and the data is not normally distributed what test should you use?

A

Mann-Whitney U test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Statistical tests.
What are you trying to do-
if you are comparing the frequencies of items or events what test should you use?

A

Chi-squared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Statistical tests.
when is the spearman rank correlation coefficient used?
what does it assess?

A

This is used to see whether there is a consistent change in the value of one variable as another variable increases or decreases. It assesses how closely two variables are correlated by comparing the rank orders of the two variables in a variety of sampling situations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Statistical tests.
Examples of hypotheses that can be tested using the Spearman’s rank correlation coefficient?

A
  • crop yield increases as fertiliser application increases
  • noise levels decrease as distance from a road increases
  • per capita energy use increases as per capita income increases
    the number of moths increases as temperature rises
  • the number of earthworms declines as soils become more acidic
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Statistical tests.
What is the Chi-squared test used to compare?
What data can it not be used for?

A

The chi-squared test is used to compare the frequencies or numbers of things in different groups are statistically significant. It cannot be used for data that can be continuously variable, such as measured or calculated data.

( compares our observed data to our expected data, used when data is counted in various categories. Used to see if there is a significant relationship between variables, often used in ecology, especially in genetics (expected number of phenotypes vs observed number of phenotypes). The number (we get) is then used to see if there is significance between the variables by comparing it to a critical value chart)

17
Q

Statistical tests.
Examples of hypotheses that can be tested using the chi-squared test?

A
  • the different management practices in a range of woodlands affects the number of dormice
  • a range of new GM varieties of wheat produce more seeds per seed head than a traditional variety
18
Q

Statistical tests.
When is the t-test used?

A

This is used to see if there is a significant difference between two means, where the data were measured, for example, mass, length, wind speed, and dissolved oxygen concentration. It can only be used if the data are normally distributed, which may be difficult to determine with small samples . If there is uncertainty the the Mann-Whitney U test should be used instead.

19
Q

Statistical tests.
Examples of hypotheses that can be tested using the t-test?

A
  • The mean mass of lobsters is higher in areas where collection is banned
  • the dissolved oxygen level is lower in a sewage-polluted river
  • wind speed is lower in a woodland than in a clearing
  • soil water content is lower on the south side of a hedge than on the north side
20
Q

Statistical tests.
What is the Mann-Whitney test used to measure?

A

This is used to see if there is a significant difference between two median values, where the data were counted, such as numbers of organisms or vehicles, or were counted such as diversity indices.

21
Q

Statistical tests.
Examples of hypotheses that can be tested using the Mann-Whitney U test?

A
  • fewer cars use a road when road charging is introduced
  • biodiversity is higher in a hedge that is trimmed less frequently
  • more wading birds visit a nature reserve after the water level is raised
  • more bats feed over an uncut grassland than over a close-mown one
  • fewer seeds germinate at pH 5 than at pH 7
22
Q

How are degrees of freedom calculated?
What are they used for?

A

degrees of freedom = Number of categories (k) - 1

This number is then used to pick the correct column to see the level of significance (if any)

This is used to disprove a null hypothesis

23
Q

why do we run a chi-squared test?

A

We run a chi-squared test to see if there is in fact a significant relationship seen in the data.

24
Q

what is the general null hypothesis.

A

there is no significant difference in_______________.