版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Chap 13-1,Chapter 13 Multiple Regression,Business Statistics: A First CourseFifth Edition,Learning Objectives,In this chapter, you learn: How to develop a multiple regression model How to interpret the regression coefficients How to de
2、termine which independent variables to include in the regression model How to determine which independent variables are more important in predicting a dependent variable How to use categorical variables in a regression model,2,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,The Mult
3、iple Regression Model,Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more independent variables (Xi),Multiple Regression Model with k Independent Variables:,Y-intercept,Population slopes,Random Error,3,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Multiple Re
4、gression Equation,The coefficients of the multiple regression model are estimated using sample data,Estimated (or predicted) value of Y,Estimated slope coefficients,Multiple regression equation with k independent variables:,Estimated intercept,In this chapter we will use Excel or Minitab to obtain t
5、he regression slope coefficients and other regression summary measures.,4,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Two variable model,Y,X1,X2,Slope for variable X1,Slope for variable X2,Multiple Regression Equation,(continued),5,Business Statistics: A First Course, 5e 2009 Pr
6、entice-Hall, Inc.,Example: 2 Independent Variables,A distributor of frozen dessert pies wants to evaluate factors thought to influence demand Dependent variable: Pie sales (units per week) Independent variables: Price (in $) Advertising ($100s) Data are collected for 15 weeks,6,Business Statistics:
7、A First Course, 5e 2009 Prentice-Hall, Inc.,Pie Sales Example,Sales = b0 + b1 (Price) + b2 (Advertising),Multiple regression equation:,7,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Excel Multiple Regression Output,8,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc
8、.,Minitab Multiple Regression Output,The regression equation is Sales = 307 - 25.0 Price + 74.1 Advertising Predictor Coef SE Coef T P Constant306.50 114.30 2.68 0.020 Price -24.98 10.83 -2.31 0.040 Advertising 74.13 25.97 2.85 0.014 S = 47.4634 R-Sq = 52.1% R-Sq(adj) = 44.2% Analysis of Variance So
9、urce DF SS MS F P Regression 2 29460 14730 6.54 0.012 Residual Error12 27033 2253 Total 14 56493,9,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,The Multiple Regression Equation,b1 = -24.975: sales will decrease, on average, by 24.975 pies per week for each $1 increase in selling
10、price, net of the effects of changes due to advertising,b2 = 74.131: sales will increase, on average, by 74.131 pies per week for each $100 increase in advertising, net of the effects of changes due to price,where Sales is in number of pies per week Price is in $ Advertising is in $100s.,10,Business
11、 Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Using The Equation to Make Predictions,Predict sales for a week in which the selling price is $5.50 and advertising is $350:,Predicted sales is 428.62 pies,Note that Advertising is in $100s, so $350 means that X2 = 3.5,11,Business Statistics:
12、A First Course, 5e 2009 Prentice-Hall, Inc.,Predictions in Excel using PHStat,PHStat | regression | multiple regression ,Check the “confidence and prediction interval estimates” box,12,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Input values,Predictions in PHStat,(continued),Pre
13、dicted Y value,Confidence interval for the mean value of Y, given these X values,Prediction interval for an individual Y value, given these X values,13,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Predictions in Minitab,Input values,Predicted Values for New Observations New Obs F
14、it SE Fit 95% CI 95% PI 1 428.6 17.2 (391.1, 466.1) (318.6, 538.6) Values of Predictors for New Observations New Obs Price Advertising 1 5.50 3.50,Confidence interval for the mean value of Y, given these X values,Prediction interval for an individual Y value, given these X values,14,Business Statist
15、ics: A First Course, 5e 2009 Prentice-Hall, Inc.,Coefficient of Multiple Determination,Reports the proportion of total variation in Y explained by all X variables taken together,15,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,52.1% of the variation in pie sales is explained by th
16、e variation in price and advertising,Multiple Coefficient of Determination In Excel,16,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Multiple Coefficient of Determination In Minitab,The regression equation is Sales = 307 - 25.0 Price + 74.1 Advertising Predictor Coef SE Coef T P C
17、onstant306.50 114.30 2.68 0.020 Price -24.98 10.83 -2.31 0.040 Advertising 74.13 25.97 2.85 0.014 S = 47.4634 R-Sq = 52.1% R-Sq(adj) = 44.2% Analysis of Variance Source DF SS MS F P Regression 2 29460 14730 6.54 0.012 Residual Error12 27033 2253 Total 14 56493,52.1% of the variation in pie sales is
18、explained by the variation in price and advertising,17,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Adjusted r2,r2 never decreases when a new X variable is added to the model This can be a disadvantage when comparing models What is the net effect of adding a new variable? We lose
19、 a degree of freedom when a new X variable is added Did the new X variable add enough explanatory power to offset the loss of one degree of freedom?,18,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Shows the proportion of variation in Y explained by all X variables adjusted for th
20、e number of X variables used and sample size (where n = sample size, k = number of independent variables) Penalize excessive use of unimportant independent variables Smaller than r2 Useful in comparing among models,Adjusted r2,(continued),19,Business Statistics: A First Course, 5e 2009 Prentice-Hall
21、, Inc.,44.2% of the variation in pie sales is explained by the variation in price and advertising, taking into account the sample size and number of independent variables,Adjusted r2 in Excel,20,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Adjusted r2 in Minitab,The regression eq
22、uation is Sales = 307 - 25.0 Price + 74.1 Advertising Predictor Coef SE Coef T P Constant306.50 114.30 2.68 0.020 Price -24.98 10.83 -2.31 0.040 Advertising 74.13 25.97 2.85 0.014 S = 47.4634 R-Sq = 52.1% R-Sq(adj) = 44.2% Analysis of Variance Source DF SS MS F P Regression 2 29460 14730 6.54 0.012
23、Residual Error12 27033 2253 Total 14 56493,44.2% of the variation in pie sales is explained by the variation in price and advertising, taking into account the sample size and number of independent variables,21,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Is the Model Significant?
24、,F Test for Overall Significance of the Model Shows if there is a linear relationship between all of the X variables considered together and Y Use F-test statistic Hypotheses: H0: 1 = 2 = = k = 0 (no linear relationship) H1: at least one i 0 (at least one independent variable affects Y),22,Business
25、Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,F Test for Overall Significance,Test statistic: where FSTAT has numerator d.f. = k and denominator d.f. = (n k - 1),23,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,(continued),F Test for Overall Significance In Excel,With 2
26、and 12 degrees of freedom,P-value for the F Test,24,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,F Test for Overall Significance In Minitab,The regression equation is Sales = 307 - 25.0 Price + 74.1 Advertising Predictor Coef SE Coef T P Constant306.50 114.30 2.68 0.020 Price -24
27、.98 10.83 -2.31 0.040 Advertising 74.13 25.97 2.85 0.014 S = 47.4634 R-Sq = 52.1% R-Sq(adj) = 44.2% Analysis of Variance Source DF SS MS F P Regression 2 29460 14730 6.54 0.012 Residual Error12 27033 2253 Total 14 56493,With 2 and 12 degrees of freedom,P-value for the F Test,25,Business Statistics:
28、A First Course, 5e 2009 Prentice-Hall, Inc.,H0: 1 = 2 = 0 H1: 1 and 2 not both zero = .05 df1= 2 df2 = 12,Test Statistic: Decision: Conclusion:,Since FSTAT test statistic is in the rejection region (p-value .05), reject H0,There is evidence that at least one independent variable affects Y,0, = .05,F
29、0.05 = 3.885,Reject H0,Do not reject H0,Critical Value: F0.05 = 3.885,F Test for Overall Significance,(continued),F,26,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Two variable model,Y,X1,X2,Yi,Yi,x2i,x1i,The best fit equation is found by minimizing the sum of squared errors, e2,
30、Sample observation,Residuals in Multiple Regression,Residual = ei = (Yi Yi),27,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Multiple Regression Assumptions,Assumptions: Independence of errors Error values are statistically independent Normality of errors Error values are normally
31、 distributed for any given set of X values Equal Variance (also called Homoscedasticity) The probability distribution of the errors has constant variance,ei = (Yi Yi),Errors (residuals) from the regression model:,28,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Residual Plots Used
32、 in Multiple Regression,These residual plots are used in multiple regression: Residuals vs. Yi Residuals vs. X1i Residuals vs. X2i Residuals vs. time (if time series data),Use the residual plots to check for violations of regression assumptions,29,Business Statistics: A First Course, 5e 2009 Prentic
33、e-Hall, Inc.,Are Individual Variables Significant?,Use t tests of individual variable slopes Shows if there is a linear relationship between the variable Xj and Y holding constant the effects of other X variables Hypotheses: H0: j = 0 (no linear relationship) H1: j 0 (linear relationship does exist
34、between Xj and Y),30,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Are Individual Variables Significant?,H0: j = 0 (no linear relationship) H1: j 0 (linear relationship does exist between Xj and Y) Test Statistic: (df = n k 1),(continued),31,Business Statistics: A First Course, 5e
35、 2009 Prentice-Hall, Inc.,t Stat for Price is tSTAT = -2.306, with p-value .0398 t Stat for Advertising is tSTAT = 2.855, with p-value .0145,(continued),Are Individual Variables Significant? Excel Output,32,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Are Individual Variables Sig
36、nificant? Minitab Output,The regression equation is Sales = 307 - 25.0 Price + 74.1 Advertising Predictor Coef SE Coef T P Constant306.50 114.30 2.68 0.020 Price -24.98 10.83 -2.31 0.040 Advertising 74.13 25.97 2.85 0.014 S = 47.4634 R-Sq = 52.1% R-Sq(adj) = 44.2% Analysis of Variance Source DF SS M
37、S F P Regression 2 29460 14730 6.54 0.012 Residual Error12 27033 2253 Total 14 56493,t Stat for Price is tSTAT = -2.306, with p-value .0398 t Stat for Advertising is tSTAT = 2.855, with p-value .0145,33,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,d.f. = 15-2-1 = 12 = .05 t/2 = 2
38、.1788,Inferences about the Slope: t Test Example,H0: j = 0 H1: j 0,The test statistic for each variable falls in the rejection region (p-values .05),There is evidence that both Price and Advertising affect pie sales at = .05,From the Excel and Minitab output:,Reject H0 for each variable,Decision: Co
39、nclusion:,Reject H0,Reject H0,a/2=.025,-t/2,Do not reject H0,0,t/2,a/2=.025,-2.1788,2.1788,For Price tSTAT = -2.306, with p-value .0398 For Advertising tSTAT = 2.855, with p-value .0145,34,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Confidence Interval Estimate for the Slope,Con
40、fidence interval for the population slope j,Example: Form a 95% confidence interval for the effect of changes in price (X1) on pie sales: -24.975 (2.1788)(10.832) So the interval is (-48.576 , -1.374) (This interval does not contain zero, so price has a significant effect on sales),where t has (n k
41、1) d.f.,Here, t has (15 2 1) = 12 d.f.,35,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Confidence Interval Estimate for the Slope,Confidence interval for the population slope j,Example: Excel output also reports these interval endpoints: Weekly sales are estimated to be reduced b
42、y between 1.37 to 48.58 pies for each increase of $1 in the selling price, holding the effect of price constant,(continued),36,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Using Dummy Variables,A dummy variable is a categorical independent variable with two levels: yes or no, on
43、or off, male or female coded as 0 or 1 Assumes the slopes associated with numerical independent variables do not change with the value for the categorical variable,37,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Dummy-Variable Example (with 2 Levels),Let: Y = pie sales X1 = price
44、 X2 = holiday (X2 = 1 if a holiday occurred during the week) (X2 = 0 if there was no holiday that week),38,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Same slope,Dummy-Variable Example (with 2 Levels),(continued),X1 (Price),Y (sales),b0 + b2,b0,Holiday No Holiday,Different inter
45、cept,Holiday (X2 = 1),No Holiday (X2 = 0),If H0: 2 = 0 is rejected, then “Holiday” has a significant effect on pie sales,39,Business Statistics: A First Course, 5e 2009 Prentice-Hall, Inc.,Sales: number of pies sold per week Price: pie price in $ Holiday:,Interpreting the Dummy Variable Coefficient (with 2 Levels),Example:,1 If a holiday occurred during the week,0 If no holiday occurred,b2 = 15: on average, sales were 15 pies greater in weeks with a holiday than in weeks without a holiday, given the
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 眼内感染应急预案(3篇)
- 口腔护理团队协作:提升诊疗质量与患者体验的核心支撑
- 2026年全域旅游发展规划实施效果评估报告
- 2026年产品设计专业文创产品设计实训报告
- 湖南版一年级上册美术教案
- 固定资产管理教学
- 肾内科慢性肾病护理培训
- 痛风患者饮食护理方案
- 药物过敏应急处理流程
- 2025年公务员(社区居家养老融合)试题及答案
- 旅游资源调查评价
- 西安医专面试题及答案
- 临床医学《门静脉高压症》教学课件
- 《碳碳复合材料》课件
- DL∕T 507-2014 水轮发电机组启动试验规程
- DL-T5001-2014火力发电厂工程测量技术规程
- 2024年国家税务总局贵州省税务局所属事业单位招聘公开引进高层次人才和急需紧缺人才笔试参考题库(共500题)答案详解版
- 《畜禽营养与饲料》中职农林牧渔类高考复习题库大全-下(判断题)
- 闪修侠加盟方案
- 麦冬施工方案
- DL/T 5182-2021 火力发电厂仪表与控制就地设备安装、管路、电缆设计规程
评论
0/150
提交评论