What is statistics?
Statistics if the science of data. Involves classifying, summarizing, organizing, analyzing, and interpreting information.
What is Data?
Data are the things about which information can be collected and analyzed, like people, numerical information, geographical areas, etc.
Data set
A data set is a rectangular array of data that contains various categories of data collected for a particular study.
Element
The element of a data set is simply the individual and unique entry in a data set about which data has been collected, analyzed, and presented in the same manner.
Variable
A variable is a particular, measurable attribute that the researcher believes is needed to describe the element in their study.
Types of data collection: cross-sectional data
Cross sectional data is information that is collected at the same or approximately the same point in time.
For example: a date, the average GPA of graduating seniors from college, split in lap time at a race.
Types of data collection: time series data
Time series data are collected over several time periods.
For example: the average US price per gallon for conventional regular gasoline
- Often found in business and economic publications
Descriptive statistics
Descriptive statistics utilizes numerical and graphical methods to explore data.
It is simply a compiled summary of the data analysis.
the most commonly used numerical statistic is called the average or mean
5829 –> 5829/10 =
Inferential statistics
Inferential statistics utilize the sample data to make estimates, decisions, predictions, or other generalizations about a larger set of data
Level of measurement: nominal
Level of measurement: ordinal
Level of measurement: interval
Level of measurement: ratio
-Similar to interval, but add that the zero is an inherent zero
- A ratio of two data entries can be formed, so that one data entry can be expressed as a multiple of another
((DOES “TWICE AS MUCH” MAKE SENSE?))
Experimental studies
Observational studies
Data collection methods: simulation
A simulation is the use of mathematical or physical models to reproduce a situation or process
- The use of computers allows for easy collection of data
- Simulations allow for the study of situations that may be impractical or dangerous in real life
- often employed because they are cheaper and save time
Data collection methods: surveys
A survey is an investigation of one or more characteristics of a population
- Most surveys are people asking questions
- Most often by carefully crafted interviews held in person, by phone, Internet, or mail.
- Questions have to be worded to yield unbiased responses and results
Data errors
Data errors can have a greater impact on research than omitted data.
This can be as simple as transposing two numbers when entering a number in a spreadsheet or as complicated as a respondent to a questionnaire not understanding the data correctly
Researchers need to develop skills that enable them to identify when data errors occur. A researcher’s ability to notice when a value for a particular variable is out of place, either too small or too large, is vital to the validity of the study
Three key elements of a well designed experiment
Control, randomization, and replication.
Important factors affecting experimental results: Confounding variable
Confounding variable occurs when an experimenter cannot tell the difference between the effects of different factors on the variable.
- For example a researcher is studying the effect of mental fatigue on study habits. The confounding effect found early on is boredom. The researcher is not sure how to distinguish between real mental fatigue or becoming bored with the test tasks.
Important factors affecting experimental results: placebo effect
A placebo effect occurs when a subject reacts favorably to a placebo when a subject has been given a fake treatment.
Blinding
Blinding is a technique where the subjects Do not know whether they are receiving a treatment or a placebo
Double blind experiment
In double blind experiment, neither the experimenter nor the subjects know whether the subjects are receiving the treatment or the placebo.
- Experimenters informed after all data has been collected