计量经济学英文重点知识点考试必备_第1页
计量经济学英文重点知识点考试必备_第2页
计量经济学英文重点知识点考试必备_第3页
计量经济学英文重点知识点考试必备_第4页
计量经济学英文重点知识点考试必备_第5页
已阅读5页,还剩21页未读 继续免费阅读

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

1、第一章1. Econometrics(计量经济学):the social science in which the tools of economic theory, mathematics, and statistical inference are applied to the analysis of economic phenomena.the result of a certain outlook on the role of economics, consists of the application of mathematical statistics to economic da

2、ta to lend empirical support to the models constructed by mathematical economics and to obtain numerical results.2. Econometric analysis proceeds along the following lines计量经济学分析步骤1) Creating a statement of theory or hypothesis.建立一个理论假说2) Collecting data.收集数据3) Specifying the mathematical model of t

3、heory.设定数学模型4) Specifying the statistical, or econometric, model of theory.设立统计或经济计量模型5) Estimating the parameters of the chosen econometric model.估计经济计量模型参数6) Checking for model adequacy : Model specification testing.核查模型的适用性:模型设定检验7) Testing the hypothesis derived from the model.检验自模型的假设8) Using t

4、he model for prediction or forecasting.利用模型进行预测l Step2:收集数据Ø Three types of data三类可用于分析的数据1) Time series(时间序列数据):Collected over a period of time, are collected at regular intervals.按时间跨度收集得到2) Cross-sectional截面数据:Collected over a period of time, are collected at regular intervals.按时间跨度收集得到3) Po

5、oled data合并数据(上两种的结合)l Step3:设定数学模型1. plot scatter diagram or scattergram2. write the mathematical modell Step4:设立统计或经济计量模型Ø CLFPR is dependent variable应变量Ø CUNR is independent or explanatory variable独立或解释变量(自变量)Ø We give a catchall variable U to stand for all these neglected factors&

6、#216; In linear regression analysis our primary objective is to explain the behavior of the dependent variable in relation to the behavior of one or more other variables, allowing for the data that the relationship between them is inexact.线性回归分析的主要目标就是解释一个变量(应变量)与其他一个或多个变量(自变量)只见的行为关系,当然这种关系并非完全正确l

7、Step5:估计经济计量模型参数Ø In short, the estimated regression line gives the relationship between average CLFPR and CUNR 简言之,估计的回归直线给出了平均应变量和自变量之间的关系Ø That is, on average, how the dependent variable responds to a unit change in the independent variable.单位因变量的变化引起的自变量平均变化量的多少。l Step6:核查模型的适用性:模型设定检验

8、The purpose of developing an econometric model is not to capture total reality, but just its salient features.l Step7:检验自模型的假设Why do we perform hypothesis testing?We want to find our whether the estimated model makes economic sense and whether the results obtains conform with the underlying economic

9、 theory.第二章1. The meaning of regression(回归)Regression analysis is concerned with the study of the relationship between one variable called the dependent or explained variable, and one or more other variables called independent or explanatory variables.2. Objectives of regression1) Estimate the mean,

10、 or average, and the dependent values given the independent values2) Test hypotheses about the nature of the dependence -hypotheses suggested by the underlying economic theory3) Predict or forecast the mean value of the dependent variable given the values of the independents4) One or more of the pre

11、ceding objectives combined3. Population Regression Line(PRL)In short, the PRL tells us how the mean, or average, value of Y is related to each value of X in the whole population4. The dependence of Y on X, technically called the regression of Y on X.5. How do we explain it?A students S.A.T. score, s

12、ay, the ith individual, corresponding to a specific family income can be expressed as the sum of two components1) The component can be called the systematic, or deterministic, component.2) May be called the nonsystematic or random component6. What is the nature of U(stochastic error) term?1) The err

13、or term may represent the influence of those variables that are not explicitly included in the model.误差项代表了未纳入模型变量的影响2) Some intrinsic randomness in the math score is bound to occur that can not be explained even we include all relevant variables.即使模型包括了决定性数学分数的所有变量,内在随机性也不可避免,这是做任何努力都无法解释的。3) U may

14、 also represent errors of measurement. U还代表了度量误差4) The principle of Ockhams razor - the description be kept as simple as possible until proved inadequate - would suggest that we keep our regression model as simple as possible.“奥卡姆剃刀原则”,描述应该尽可能简单,只要不遗漏重要信息。这表明回归模型应尽可能简单。7. How do we estimate the PRF(

15、population regression function)?Unfortunately, in practice, We rarely have the entire population in our disposal, often we have only a sample from this population.8. Granted that the SRF is only an approximation of PRF. Can we find a method or a procedure that will make this approximation as close a

16、s possible? SRF仅仅是PRF的近似,那么能不能找到一种方法使这种近似尽可能接近真实呢?9. Special meaning of “linear”1) Linearity in the variables变量线性The conditional mean value of the dependent variable is a linear function of the independent variables2) Linearity in the Parameters参数线性The conditional mean of the dependent variable is a

17、 linear function of the parameters, the Bs; it may or may not be linear in the variables.第三章1. Unless we are willing to assume how the stochastic U terms are generated, we will not be able to tell how good an SRF is as an estimate of the true PRF.只有假定了随机误差的生成过程,才能判定SRF对PRF拟合的是好是坏。2. Classical Linear

18、 Regression Model1) Assumption 1: The regression model is linear in the parameters. It may or may not be linear in the variables.回归模型是参数线性的,但不一定是变量线性的。2) Assumption 2: The explanatory variables X is uncorrelated with the disturbance term U. Xs are nonstochastic, U is stochastic. 解释变量X与扰动误差项u不相关. X是非

19、随机的,U是随机的。3) Assumption 3: Given the value of Xi, the expected, or mean value of the disturbance term U is zero.给定Xi,扰动项的期望或均值为零。 Disturbance U represent all those factors that are not specifically introduced in the model干扰项U代表了所有未纳入模型的影响因素。4) Assumption 4:The variance of each Ui is constant, or hom

20、oscedastic. U的方差为常数,或同方差。l Homoscedasticity(同方差):a. This assumption simply means that the conditional distribution of each Y population corresponding to the given value of X has the same variance. 该假定表明,与给定的X相对应的每个Y的条件分布具有同方差。b. The individual Y values are spread around their mean values with the sa

21、me variance.即每个Y值以相同的方差分布在其均值周围。5) Assumption 5:There is no correlation between two error terms, this is the assumption of no-autocorrelation.无自相关假定,即两个误差项之间不相关。6) Assumption 6:The regression model is correctly specified.回归模型是正确假定的。There is no specification bias or specification error in the model.实

22、证分析的模型不存在设定偏差或设定误差。l This assumption can be explained informally as follows. An econometric investigation begins with the specification of the econometric model underlying the phenomenon of interest. 3.Variances and Standard errors of OLS estimators普通最小二乘估计量的方差与标准误:One immediate result of the assump

23、tions introduced is that they enable us to estimate the variances and standard errors of the OLS estimators given in Eq.(2.16) and (2.17).4.We should know:l Variances of the estimatorsl Standard errors of the estimators5.What is the value of l The homoscedastic is estimated from formula6.Standard Er

24、ror of the Regression (SER) 回归标准误l Is simply the standard deviation of the Y values about the estimated regression line. Y值偏离估计回归的标准差。7.Summary of math S.A.T.score function1) Interpretationl The standard deviation, or standard error, is 0.000245, is a measure of variability of b2 from sample to samp

25、le.l If we can say that our computed b2 lies within a certain number of standard deviation units from the true B2, we can state with some confidence how good the computed SRF is as an estimator of the true PRF.2)Sampling Distribution 抽样分布Once we determine the sampling distribution of our two estimat

26、ors, the task of hypothesis testing becomes straightforward.一旦确定了两个估计量的抽样分布,那么假设检验就是举手之劳的事情。8.Why do we use OLS ?l The properties of OLS estimatorsl The method of OLS is used popularly not only because it is easy to use but also because it has some strong theoretical properties. OLS法得到广泛使用,不仅是因为它简单易

27、行,还因为它具有很强的理论性质。9.Gauss-Markov theorem 高斯-马尔科夫定理Given the assumptions of the classical linear regression model (CLRM), the OLS estimators have minimum variance in the class of linear estimators.The OLS estimators are BLUE (best linear unbiased estimators)满足古典线性模型的基本假定,则在所有线性据计量中,OLS估计两具有最小方差性,即OLS是最

28、优线性无偏估计量(BLUE)10. BLUE property 最优线性无偏估计量的性质1) B1 and B2 are linear estimators. B1和B2是线性估计量2) They are unbiased , that is E(b1)=B1, E(b2)=B2. B1和B2是无偏估计两3) The OLS estimator of the error variance is unbiased.误差方差的OLS估计量是无偏的4) b1 and b2 are efficient estimators.B1和B2是有效估计量Var(b1) is less than the var

29、iance of any other linear unbiased estimator of B1Var(b2) is less than the variance of any other linear unbiased estimator of B211. Monte Carlo simulation 蒙特卡洛模拟l Do the experiment at labl Do it by Excell. =NORMINV(RAND(),0,2)l Do it by matlab.= NORMINV(uniform(),MU,SIGMA)l Do it by Stata. =invnorm(

30、uniform()12. Central Limit Theorems 中心极限定理If there is a large number of independent and identically distributed (iid) random variables, then, with a few exceptions , the distribution of their sum tends to be a normal distribution as the number of such variables increases indefinitely. 随着变量个数的无限增加,独立

31、同分布随机变量近似服从正态分布13. RecallU, the error term represents the influence of all those forces that affect Y but are not specifically included in the regression model because there are so many of them and the individual effect of any one such force on Y may be too minor. 误差项代表了未纳入回归模型的其他所有因素的影响。因为在这些影响中,每种

32、因素对Y的影响都很微弱If all these forces are random, if we let U represent the sum of all these forces, then by invoking the CLT, we can assume that the error term U follows the normal distribution.如果所有这些影响因素都是随机的,用U代表所有这些影响因素之和,那么根据中心极限定理,可以假定误差项服从正态分布。14. Another property of normal distribution另一个正态分布的性质Any

33、 linear function of a normally distributed variable is itself normally distributed.正态变量的性质函数仍服从正态分布。15. Hypothesis testing 假设检验Having known the distribution of OLS estimators b1 and b2, we can proceed the topic of hypothesis testing.16. Null hypothesis 零假设“zero” null hypothesis is deliberately chose

34、n to find out whether Y is related to X al all, which is also called straw man hypothesis.之所以选择这样一个假设是为了确定Y是否与X有关,也称为稻草人假设。17. We need some formal testing procedure to reject or receive the null hypothesis and make the skeptical guys shut up.需要正规的检验过程拒绝或接受零假设18. If our null hypothesis is B2=0 and th

35、e computed b2=0.0013, we can find out the probability of obtaining such a value from the Z, the standard normal distribution.如果零假设为B2=0,计算得到b2=0.0013,那么根据标准正态分布Z,能够求得获此b2值的概率If the probability is very small, we can reject the null hypothesis.如果这个概率非常小,则拒绝零假设。If the probability is larger, say , great

36、er than 10 percent, we may not reject the null hypothesis.如果这概率比较大,比如大于10%,就不拒绝零假设。19. We dont know the 2We must know the true 2, but we can estimate it by using 20. What will happen if we replace by its estimator -hat 21. Let us assume that , the level of significance or the probability of committi

37、ng a type I error, is fixed at 5 percent.假定,显著水平成犯第一类错误的概率为5%。22. red area = rejection region for 2-sided test(1-a)t0f(t)-tctca/2a/223. Loop and balla. This is a 95% confidence interval for B2 给出了B2的一个95%的置信区间。b. in repeated applications 95 out of 100 such intervals will include the true B2重复上述过程,10

38、0个这样的区间中将有95个包括真实的B2。c. Such a confidence interval is known as the region of acceptance (of H0) and the area outside the confidence interval is known as the rejection region (of H0)用假设检验的语言把这样的置信区间称为(H0的)接受区域,把置信区间以外的区间成为(H0的)拒绝区域24. 回归系数的假设检验目的:简单线性回归中,检验X对Y是否真有显著影响基本概念回顾: 临界值与概率、大概率事件与小概率事件相对于显著性水

39、平的临界值为: (单侧)或(双侧)计算的统计量为:统计量 t0(大概率事件)(小概率事件)25. ConclusionsSince this interval does not include the null-hypothesized value of 0.因为这个区间没有包括零假设值0。We can reject the null hypothesis that annual family income is not related to math S.A.T. Scores.所以拒绝假设:家庭年收入对数学SAT没有影响。Put positively, income does have a

40、 relationship to math S.A.T. scores. 换言之,收入确实与数学SAT有关系。26. A cautionary noteAlthough the statement given is true, we cannot say that the probability is 95 percent that the particular interval includes B2, for this interval is not a random interval, it is fixed, therefore, the probability is either 1

41、 ore 0 that the interval includes B2.虽然式子3.26为真,但不能说某个特定区间式3.27包括真实B2的概率为95%,因为与式子3.26不同,式3.27是固定的,而不是一根随机区间,所以区间3.27包括B2的概率为1或0.We can only say that if we construct 100 intervals like this interval, 95 out of 100 such intervals will include the true B2.我们只能说,如果建立100个像式3.27这样的区间,则有95个区间包括真实的B2.We ca

42、n not guarantee that this particular interval will necessarily includes B2.并不能保证某个区间一定有B2.27. The test of significance approach to hypothesis testing 假设检验的显著性检验方法Hypothesis testing is that of a test statistic and the sampling distribution of the test statistic under the null hypothesis, H0.假设检验方法涉及两

43、个重要的概念检验统计量和零假设下检验统计量的抽样分布。The decision to accept or reject H0 is made on the basis of the value of the test statistic obtained from the sample data.根据从样本数据求得的检验统计量的值决定接受或拒绝零假设。28. T testWe can use the t value computed here ad the test statistic, which follows the t distribution with (n-2) d.f.可以计算出

44、t值作为检验统计量,它服从自由度为(n-2)的t 分布。29. Instead of arbitrarily choosing the value , we can find the p value (the exact level of significance) and reject the null hypothesis if the computed P value is sufficiently low.为了避免选择显著水平的随意性,通常求出p值(精确的显著水平),如果计算的p值充分小,则拒绝零假设。30. ConclusionsIn the case of two-sided t

45、test 双边检验情况中If the computed |t|, the absolute value of t, exceeds the critical t value at the chosen level of significance, we can reject the null hypothesis.如果计算得到的|t|值超过临界t值,则拒绝零假设。31. P valueThe P value of that t statistic of 5.4354 is about 0.0006. t统计量(5.4354)的p值(概率值)约为0.0006。The smaller the p

46、value, the more confident we are when reject the null hypothesis.p值越小,在拒绝零假设的时候就越有自信。Thus if we were to reject the null hypothesis that the true slope coefficient is zero at this P value, we would be wrong in six out of ten thousand occasions. 如果在这个p值水平之上拒绝零假设:真实的斜率系数为0,则犯错误的机会有万分之六。32. How can we c

47、omputed tWe first compute the t value as if the null hypothesis were that B2=0, we still get the t首先计算在零假设B2=0下的t值Since this value exceeds any of the critical values shown in the preceding table, following the rules laid down. t值大与上表给出的任何临界值,附录D表D-2列出的规则,We can reject the hypothesis that annual fami

48、ly income has no relationship to math S.A.T. Scores.拒绝零假设:家庭年收入对数学SAT没有影响。33. How good is the fitted regression line: the coefficient of determination r2On the basis of t test both the estimated intercept and slope coefficients are statistically significant (i.e. significantly different from zero) s

49、uggests that the SRF seems to “fit” the data “reasonably” well.根据t检验,估计的斜率和结局都是统计显著的,这说明样本回归函数式3.16很好地拟合了样本数据。34. Coefficient of determinationCan we develop an overall measure of “goodness of fit ” that will tell us how well the estimated regression line fits the actual Y values?能否建立一个“拟合优度”的判定规则,从而

50、辨别估计的回归线拟合真实Y值的优劣程度呢?Such a measure has been developed and is known as the coefficient of determination.称之为判定系数。35. Recall 36. Rearrange it37. Decomposition1、2、 3、38. In deviation forms1、 2、 39. Square both sides and sum=the total variation of the actual Y values about their sampling mean Y bar, whi

51、ch may be called the total sum of squares (TSS)总平方和,真实Y值围绕其均值的总变异=The total variation of the estimated Y values about their mean value, Y hat bar, which may be called appropriately the sum of squares due to regression (i.e., due to the explanatory variables), or simply called the explained sum of sq

52、uares (ESS)解释平方和,估计的Y值围绕气均值的变异,也称回归平方和(由解释变量解释的部分)40. Put simplyThe total variation in the observed Y values about their mean value can be partitioned into two parts, one attributable to the regression line and the other to random forces, because not all actual Y observations lie on the fitted line.

53、Y值与其均值的总离差可以分解为两部分:一部分归于回归线,另一部分归于随机因素,因为不是所有的真实观察值Y都落在你和直线上。41. ESS vs RSSa. If the chosen SRF fits the data quite well, ESS should be much larger than RSS.如果选择的SRF很好的拟合了样本数据,则SEE远大于RSS。b. If the SRF fits the data poorly RSS will be much larger than ESS.如果SRF拟合的不好,则RSS远大于ESS。42. Let us define 定义43.

54、 R2样本判定系数l R2 measures the proportion or percentage of the total variation in Y explained by the regression model样本判定系数度量了回归模型对Y变异的解释比例(或百分比)l R2 is the coefficient of determination and is the most commonly used measure of the goodness of fit of a regression line.样本判定系数通常用来度量回归线的拟合优度。44. Properties

55、of R2a. it is a non-negative quantity.非负性b. its limits are 0 R2 1 since a part (ESS) cannot be greater than the whole (TSS). 0 R2 1,因为部分(ESS)不可能大于整体(TSS)。An R2 of 1 means a “perfect fit” for the entire variation in Y is explained by the regression.若R2=1,则表示完全拟合,即线性模型完全解释Y的变异。An R2 of zero means no r

56、elationship between Y and X whatsoever.若R2=0,则表示Y与X之间无任何关系。45. Reporting the results46. Explanationa. The figures in the first set of parentheses are the estimated standard errors (se) of the estimated regression coefficients.第一行括号内的数值表示估计回归系数的标准误b. Those in the second set of parentheses are the est

57、imated t value computed under the null hypothesis that the population value of each regression coefficient individually is zero.T values are simply computed the ratios of the estimated coefficient to their standard errors.c. 第二行括号内的数值表示在零假设下(每个回归系数的真实值为零),根据式3.22估计的t值(即估计的系数与其标准误之比)d. those in the t

58、hird set of parentheses are p values of the computed t values.e. 第三行括号内的数值表示获得t值的p值。47. As a matter of conventionFrom now on , if we do not specify a specific null hypothesis, then we will assume that it is the zero null hypothesis.从现在起,如果没有设定特殊的零假设,习惯地规定零假设为:总体参数为零。48. P valueBy quoting the P values we can determine the exact level of significance of the estimated t value. 通过列出的p值能够确定t值的精确显著水

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

评论

0/150

提交评论