definition:
simple random sampling
every member of the population has an equal chance of being selected into the sample
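A minimal Python sketch of simple random sampling. The population of 100 numbered members, the sample size, and the seed are illustrative assumptions, not part of the notes; `random.Random.sample` draws without replacement, so every member is equally likely to be chosen.

```python
import random

# Hypothetical population: 100 members labelled 0..99.
population = list(range(100))

# Seeded generator so the illustration is reproducible.
rng = random.Random(42)

# Draw 10 members without replacement; each member has the same
# chance of ending up in the sample.
sample = rng.sample(population, 10)
print(sample)
```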
definition:
stratified random sampling
the population is divided into several strata
a random sample is drawn from each stratum in proportion to that stratum's share of the population
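A rough sketch of proportional stratified sampling in Python. The stratum names, their sizes, and the helper `stratified_sample` are hypothetical, chosen only to illustrate the idea.

```python
import random

# Hypothetical population split into three strata (names and sizes are assumptions).
strata = {
    "large_cap": list(range(0, 500)),
    "mid_cap": list(range(500, 800)),
    "small_cap": list(range(800, 1000)),
}

def stratified_sample(strata, total_n, rng):
    """Draw a simple random sample from each stratum, sized in
    proportion to that stratum's share of the population."""
    population_size = sum(len(members) for members in strata.values())
    sample = {}
    for name, members in strata.items():
        # Proportional allocation, rounded to the nearest integer.
        k = round(total_n * len(members) / population_size)
        sample[name] = rng.sample(members, k)
    return sample

rng = random.Random(0)
sample = stratified_sample(strata, 100, rng)
print({name: len(drawn) for name, drawn in sample.items()})
# prints {'large_cap': 50, 'mid_cap': 30, 'small_cap': 20}
```

With stratum sizes that do not divide evenly, the rounding step can make the total differ slightly from `total_n`; a real implementation would need a rule for allocating the remainder.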
definition:
sampling error
the difference between a sample statistic and the corresponding population parameter
e.g.:
sampling error of the mean =
sample mean - population mean
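The sampling error of the mean can be computed directly once both means are available. The sketch below uses a simulated population (an assumption made purely for illustration; in practice the population mean is usually unknown).

```python
import random
import statistics

rng = random.Random(1)

# Hypothetical population of 1,000 values (simulated for illustration).
population = [rng.gauss(50, 10) for _ in range(1000)]
population_mean = statistics.fmean(population)

# One simple random sample of 30 observations.
sample = rng.sample(population, 30)
sample_mean = statistics.fmean(sample)

# Sampling error of the mean = sample mean - population mean.
sampling_error = sample_mean - population_mean
print(sampling_error)
```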
definition:
Sampling Distribution of a Statistic
the probability distribution of a statistic computed over all possible samples of the same size randomly drawn from the population
two different forms of data
Time-series data
Cross-sectional data
Central Limit Theorem
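Essence
2 conditions
2 conclusions
Essence: it describes which distribution the means of all possible samples follow
Conditions: if n >= 30 and the population has mean μ and finite variance σ²,
Conclusions: then 1. the sampling distribution of the sample mean is approximately normal,
2. the mean of that normal distribution equals the population mean μ, and its variance is σ²/n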
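A small simulation can make the two conclusions concrete. This is only a sketch under assumptions not in the notes: the population is taken to be exponential with mean 1 and variance 1 (deliberately non-normal), and 5,000 samples of size n = 30 are drawn.

```python
import random
import statistics

rng = random.Random(7)

n = 30              # sample size
num_samples = 5000  # number of repeated samples

# Each entry is the mean of one sample of size n drawn from a
# skewed (exponential) population with mean 1 and variance 1.
sample_means = [
    statistics.fmean([rng.expovariate(1.0) for _ in range(n)])
    for _ in range(num_samples)
]

mean_of_means = statistics.fmean(sample_means)
variance_of_means = statistics.variance(sample_means)

print(mean_of_means)      # should be close to the population mean, 1
print(variance_of_means)  # should be close to sigma^2 / n = 1/30
```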
Standard Error
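Definition and essence
Formulas
Standard error: the standard deviation of the sampling distribution of the sample mean. In essence, it measures how far the mean of a sample of size n typically deviates from the population mean; the larger n is, the smaller the deviation
known population variance:
σ_x̄ = σ/√n
unknown population variance:
s_x̄ = s/√n
(i.e., the sample standard deviation s is used as an estimate of the population standard deviation σ)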
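Both standard-error formulas in a short Python sketch. The sample values and the "known" population standard deviation of 0.25 are made-up numbers for illustration only.

```python
import math
import statistics

# Hypothetical sample of 8 observations.
sample = [2.1, 1.8, 2.5, 2.2, 1.9, 2.4, 2.0, 2.3]
n = len(sample)

# Case 1: population standard deviation is known (assumed value).
sigma = 0.25
se_known = sigma / math.sqrt(n)

# Case 2: population standard deviation is unknown, so the sample
# standard deviation s (n - 1 in the denominator) stands in for it.
s = statistics.stdev(sample)
se_estimated = s / math.sqrt(n)

print(se_known, se_estimated)
```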
Meaning of an estimator
3 properties of a good estimator
2 approaches for
estimation
Confidence Interval Estimation:
Formula
confidence interval: X̄ ± k·(σ/√n)
[Point Estimate ± k · Standard Error]
(derived from the confidence intervals of the normal distribution and the central limit theorem)
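A hedged sketch of the formula in Python, for the case where the population standard deviation is known and the reliability factor k comes from the standard normal distribution. The function name `z_confidence_interval` and the input numbers are illustrative assumptions.

```python
import math
from statistics import NormalDist

def z_confidence_interval(sample_mean, sigma, n, confidence=0.95):
    """Return (lower, upper) for: point estimate +/- k * standard error."""
    alpha = 1 - confidence
    # Two-tailed reliability factor, e.g. about 1.96 for 95% confidence.
    k = NormalDist().inv_cdf(1 - alpha / 2)
    standard_error = sigma / math.sqrt(n)
    return sample_mean - k * standard_error, sample_mean + k * standard_error

# Hypothetical inputs: sample mean 10, sigma 2, 100 observations.
lower, upper = z_confidence_interval(sample_mean=10.0, sigma=2.0, n=100)
print(round(lower, 3), round(upper, 3))  # prints 9.608 10.392
```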
width of confidence interval
and
how to decrease width
sample variance decreases,
number of observations increases
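The effect of both levers on the width can be checked numerically. In this sketch the width is taken as 2·k·σ/√n with a z-based reliability factor; the helper `interval_width` and all numbers are hypothetical.

```python
import math
from statistics import NormalDist

def interval_width(sigma, n, confidence=0.95):
    """Width of a z-based confidence interval: 2 * k * sigma / sqrt(n)."""
    k = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    return 2 * k * sigma / math.sqrt(n)

w_base = interval_width(sigma=2.0, n=100)
w_more_obs = interval_width(sigma=2.0, n=400)   # 4x the observations
w_less_var = interval_width(sigma=1.0, n=100)   # half the standard deviation

print(w_base, w_more_obs, w_less_var)
```

Quadrupling the number of observations or halving the standard deviation each cut the width in half, because the width is proportional to σ/√n.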

definition:
Level of significance
Degree of Confidence
α
Student t-distribution
Shape and 5 properties
z-Alternative: large sample, unknown population variance
equation with z and s
2. reliability factor: i.e., the z-value
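In confidence interval estimation,
when to choose the z-distribution and when the t-distribution?
3 points
When looking up the z-table versus the t-table,
what is the difference?
The t-table requires the degrees of freedom.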
Factors affecting the width of a confidence interval
what may be downsides of
bigger sample size?
5 types of bias
A statistically significant result might not be
economically meaningful once you account for
risk, transaction costs, and applicable taxes