版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、Revision: 1.00 Date: June 2001,6西格玛绿带培训Materials TWO,第二天: Tests of Hypotheses Week 1 recap of Statistics Terminology Introduction to Student T distribution Example in using Student T distribution Summary of formula for Confidence Limits Introduction to Hypothesis Testing The elements of Hypothesis T
2、esting -Break- Large sample Test of Hypothesis about a population mean p-Values, the observed significance levels Small sample Test of Hypothesis about a population mean Measuring the power of hypothesis testing Calculating Type II Error probabilities Hypothesis Exercise I -Lunch- Hypothesis Exercis
3、e I Presentation Comparing 2 population Means: Independent Sampling Comparing 2 population Means: Paired Difference Experiments Comparing 2 population Proportions: F-Test -Break- Hypothesis Testing Exercise II (paper clip) Hypothesis Testing Presentation 第一天wrap up,第二天: Analysis of variance 和simple
4、linear regression Chi-square : A test of independence Chi-square : Inferences about a population variance Chi-square exercise ANOVA - Analysis of variance ANOVA Analysis of variance case study -Break- Testing the fittness of a probability distribution Chi-square: a goodness of fit test The Kolmogoro
5、v-Smirnov Test Goodness of fit exercise using dice Result 和discussion on exercise -Lunch- Probabilistic 关系hip of a regression model Fitting model with least square approach Assumptions 和variance estimator Making inference about the slope Coefficient of Correlation 和Determination Example of simple li
6、near regression Simple linear regression exercise (using statapult) -Break- Simple linear regression exercise (cont) Presentation of results 第二天wrap up,Day 3: Multiple regression 和model building Introduction to multiple regression model Building a model Fitting the model with least squares approach
7、Assumptions for model Usefulness of a model Analysis of variance Using the model for estimation 和prediction Pitfalls in prediction model -Break- Multiple regression exercise (statapult) Presentation for multiple regression exercise -Lunch- - Qualitative data 和dummy variables Models with 2 or more qu
8、antitative independent variables Testing the model Models with one qualitative independent variable Comparing slopes 和response curve -Break- Model building example Stepwise regression an approach to screen out factors Day 3 wrap up,Day 4: 设计of Experiment Overview of Experimental Design What is a des
9、igned experiment Objective of experimental 设计和its capability in identifying the effect of factors One factor at a time (OFAT) versus 设计of experiment (DOE) for modelling Orthogonality 和its importance to DOE H和calculation for building simple linear model Type 和uses of DOE, (i.e. linear screening, line
10、ar modelling, 和non-linear modelling) OFAT versus DOE 和its impact in a screening experiment Types of screening DOEs -Break- Points to note when conducting DOE Screening DOE exercise using statapult Interpretating the screening DOEs result -Lunch- Modelling DOE (Full factoria with interactions) Interp
11、reting interaction of factors Pareto of factors significance Graphical interpretation of DOE results 某些rules of thumb in DOE 实例of Modelling DOE 和its analysis -Break- Modelling DOE exercise with statapult Target practice 和confirmation run Day 4 wrap up,Day 5: Statistical 流程Control What is Statistical
12、 流程Control Control chart the voice of the 流程 流程control versus 流程capability Types of control chart available 和its application Observing trends for control chart Out of Control reaction Introduction to Xbar R Chart Xbar R Chart example Assignable 和Chance causes in SPC Rule of thumb for SPC run test -B
13、reak- Xbar R Chart exercise (using Dice) Introduction to Xbar S Chart Implementing Xbar S Chart 为什么Xbar S Chart ? Introduction to Individual Moving Range Chart Implementing Individual Moving Range Chart 为什么Xbar S Chart ? -Lunch- Choosing the sub-group Choosing the correct sample size Sampling freque
14、ncy Introduction to control charts for attribute data np Charts, p Charts, c Charts, u Charts -Break- Attribute control chart exercise (paper clip) Out of control not necessarily is bad Day 5 wrap up,Recap of Statistical Terminology,Area under a Normal Distribution,流程capability potential, Cp Based o
15、n the assumptions that :,流程is normal,It is a 2-sided specification,流程mean is centered to the device specification,Spread in specification,Natural tolerance,流程Capability Index, Cpk,Based on the assumption that the 流程is normal 和in control 2. An index that compare the 流程center with specification center
16、,Therefore when , Cpk Cp ; then 流程is not centered,Cpk = Cp ; then 流程is centered,The 流程of collecting, presenting 和describing sample data, using graphical 工具和numbers. Pareto Chart Population mean Histogram Population 标准偏差,Descriptive Statistics,Probability Theory,Probability is the chance for an event
17、 to occur. Statistical dependence / independence Posterior probability Relative frequency Make decision through probability distributions (i.e. Binomial, Poisson, Normal),Inferential Statistics,The 流程of interpreting the sample data to draw conclusions about the population from which the sample was t
18、aken. Confidence Interval (Determine confidence level for a sampling mean to fluctuate) T-Test 和F-Test (Determine if the underlying populations is significantly different in terms of the means 和variations) Chi-Square Test of Independence (Test if the sample proportions are significantly different) C
19、orrelation 和Regression (Determine if 关系hip between variables exists, 和generate model equation to predict the outcome of a single output variable),Central Limit Theorem,某些take aways for sample size 和sampling distribution,Percentiles of the t Distribution,Whereby, df = Degree of freedom = n (sample si
20、ze) 1 Shaded area = one-tailed probability of occurence a = 1 Shaded area Applicable when: Sample size 30 标准偏差 is unknown Population distribution is at least approximately normally distributed,Percentiles of the Normal Distribution / Z Distribution,Whereby, Shaded area = one-tailed probability of oc
21、curence a = 1 Shaded area,Student t Distrbution example,FDA requires pharmaceutical companies to perform extensive tests on all new drugs before they can be marketed to the public. The first phase of testing will be on animals, while the second phase will be on human on a limited basis. PWD is a pha
22、rmaceutical company currently in the second phase of testing on a new antibiotic project. The chemists are interested to know the effect of the new antibiotic on the human blood pressure, 和they are only allowed to test on 6 patients. The result of the increase in blood pressure of the 6 tested patie
23、nts are as below: ( 1.7 , 3.0 , 0.8 , 3.4 , 2.7 , 2.1 ) Construct a 95% confidence interval for the average increase in blood pressure for patients taking the new antibiotic, using both normal 和t distributions.,Student t Distrbution example (cont),Using normal or z distribution,Using student t distr
24、ibution,Although the confidence level is the same, using t distribution will result in a larger interval value, because: 标准偏差, S for small sample size is probably not accurate 标准偏差, S for small sample size is probably too optimistic Wider interval is therefore necessary to achieve the required confi
25、dence level,Summary of formula for confidence limit,6 Sigma 流程和1.5 Sigma Shift in Mean,Statistically, a 流程that is 6 Sigma with respect to its specifications is:,But Motorola defines 6 Sigma with a scenario of 1.5 Sigma shift in mean,DPM = 3.4 Cp = 2 Cpk = 1.5,某些Explanations on 1.5 Sigma Mean Shift,M
26、otorla has conducted a lot of experiments, 和found that in long term, the 流程mean will shift within 1.5 sigma if the 流程is under control. 1.5 sigma mean shift in a 3 Sigma 流程control plan will be translated to approximately 14% of the time a data point will be out of control, 和this is deem acceptable in
27、 statistical 流程control (SPC) practices.,Our Explanation,Most frequently used sample size for SPC in industry is 3 to 5 units per sampling. Take the middle value of 4 as an average sample size used in the sampling. Assuming the 流程is of 6 sigma capability, is in control, 和is normally distributed. Unde
28、r the confidence interval for sampling distribution, we expect the average value of the samples to fluctuate within 3 standard errors (i.e. natural tolerance), giving confidence interval of:,Introduction to Hypothesis Testing ?,What is hypothesis testing in statistic ? A hypothesis is “a tentative a
29、ssumption made in order to draw out or test its logical or empirical consequences.” A statistical hypothesis is a statement about the value of one of the characteristics for one or more populations. The purpose of the hypothesis is to establish a basis, so that one can gather evidence to either disp
30、rove the statement or accept it as true.,Example of statistical hypothesis The average commute time using Highway 92 is shorter than using France Avenue. This 流程change will not cause any effect on the downstream 流程es. The variation of Vendor Bs parts are 40% wider than those of Vendor A.,Elements of
31、 Hypothesis Testing,Possible outcomes for hypothesis testing on two tested populations:,为什么Hypothesis Testing ?,Many problems require a decision to accept or reject a statement about a parameter. That statement is a Hypothesis. It represents the translation of a practical question into a statistical
32、 question. Statistical testing 提供s an objective solution, with known risks, to questions which are traditionally answered subjectively. It is a stepping stone to 设计of Experiment, DOE.,Hypothesis Testing Descriptions,Hypothesis Testing answers the practical question: “Is there a real difference betwe
33、en A 和B ?” In hypothesis testing, relatively small samples are used to answer questions about population parameters. There is always a chance that a sample that is not representative of the population being selected 和results in drawing a wrong conclusion.,Elements of Hypothesis Testing (cont),The Nu
34、ll Hypothesis Statement generally assumed to be true unless sufficient evidence is found to be contrary Often assumed to be the status quo, or the preferred outcome. However, it sometimes represents a state you strongly want to disprove. Designated as H0 In hypothesis testing, we always bias toward
35、null hypothesis,The Alternative Hypothesis (or Research Hypothesis) Statement that will be accepted only if data 提供convincing evidence of its truth (i.e. by rejecting the null hypothesis). Instead of comparing two populations, it can also be based on a specific engineering difference in a characteri
36、stic value that one desires to detect (i.e. instead of asking is m1 = m2, we ask is m1 450). Designated as H1,Elements of Hypothesis Testing (cont),Example if we want to test whether a population mean is equal to 500, we would translate it to: Null Hypothesis, H0 : mp = 500 和consider alternate hypot
37、hesis as: Alternate Hypothesis, H1 : mp 500 ; (2 tails test),Remember confidence interval, at 95% confidence level states that: 95% of the time the mean value will fluctuate within the confidence interval (limit) 5% chance that the mean is natural fluctuation, but we think it is not alpha (a) probab
38、ility,Type II Error Accepting a null hypothesis (H0), when it is false. Probability of this error equals b,Type I Error Rejecting the null hypothesis (H0), when it is true. Probability of this error equals a,Use the std error observed from the sample to set confidence limit on 500 (mH0). The assumpt
39、ion is mH0 has the same variance as mp.,Elements of Hypothesis Testing (cont),Other possible alternate hypothesis are: Alternate Hypothesis, H1 : mp 500 ; (1 tail test) Alternate Hypothesis, H1 : mp 500 ; (1 tail test),Taking example for alternate hypothesis, H1 : mp 500 For 95% confidence level, a
40、= 0.05. Since H1 is one tail test, reject area does not need to be divided by 2.,某些hypothesis testings that are applicable to engineers: The impact on response measurement with new 和old 流程parameters. Comparison of a new vendors parts (which are slightly more expensive) to the present vendor, when va
41、riation is a major issue. Is the yield on Tester ECTZ21 the same as the yield on Tester ECTZ33 ?,流程Situations Comparison of one population from a single 流程to a desirable standard Comparison of two populations from two different 流程es or Single sided: comparison considers a difference only if it is gr
42、eater or only if it is less, but not both. Two sided: comparison considers any difference of ine质量 important,Inferences based on a single sample,“Large sample test of hypothesis about a population mean”,Example: An automotive manufacturer wants to evaluate if their new throttle 设计on all the latest c
43、ar model is able to give an adequate response time, resulting in an predictable pick-up of the vehicle speed when the fuel pedal is being depressed. Based on finite element modelling, the 设计team committed that the throttle response time is 1.2 msec, 和this is the recommended value that will give the
44、driver the best control over the vehicle acceleration. The test engineer of this project has tested on 100 vehicles with the new throttle 设计和obtain an average throttle response time of 1.05 msec with a 标准偏差 S of 0.5 msec. Based on 99% confidence level, can he concluded that the new throttle 设计will g
45、ive an average response time of 1.2 msec ?,“Large sample test of hypothesis about a population mean” (cont),From standard normal distribution table, The Z value corresponding to 0.005 tail area is 2.58.,a = 0.01 (2 tails), since 2 tails test, therefore tail area = a/2 = 0.005 ;,“Large sample test of
46、 hypothesis about a population mean” (cont),What does 99% confidence level means in the above example ? It defines the limits whereby 99% of the average sampling value should fall within, given the desirable (hypothesised) mean as mH0. Any value fall outside this confidence limit indicates the sampl
47、e mean is significantly different from mH0. In other words, we will only conclude the alternate hypothesis H1 (that the means are different) if we are more than 99% sure.,The Observed Significance level, p-value,p-value is the probability for concluding the null hypothesis H0 that both population me
48、ans are equal with the observed sample data. Hence 1 pvalue will be the confidence level we have on the alternate hypothesis.,Using the throttle question as an example: We know that the mean response time is 3 standard error away from 1.2msec (mH0), therefore Z = 3. Since this is a 2 tails test, p-v
49、alue = P(Z 3) = 2 P(Z 3) From standard normal distribution table, P(Z = 3) = 0.9987 P(Z 3) = 1 0.9987 = 0.0013 p-value = 2 P(Z 3) = 0.0026,In statistical term, it means there is only 0.0026 probability that the average throttle response time to be 1.02msec if the actual population mean is 1.2msec as
50、 suggested by finite element analysis.,“Small sample test of hypothesis about a population mean”,Example: Amy is the Personnel Officer of a multi-national company who is in charge of recruiting a large number of employees for an overseas assignment. As these overseas assignments are very crucial for
51、 the company success in meeting their business plan, an aptitude test was formulated to test the 质量 of all potential candidates head-hunted by the 招聘Agency. The management wants to know the effectiveness of the 招聘Agency, as it was believed that the average test score for all the identified candidate
52、s should be equal or more than 90 in order to reduce the risk of assigning the wrong candidates for the task. When Amy reviews the tests result of a particular batch of 20 candidates, she finds that the mean score is 84 和the 标准偏差 is 11. As this is a very critical 招聘project, Amy wants to be more rese
53、rve with her analysis, 和decided to be more bias towards proving that the population mean is lesser than 90. As a result, a confidence level of 90% will be used in her analysis.,“Small sample test of hypothesis about a population mean” (cont),From student t distribution table, The t value with 19 df
54、corresponding to 0.1 tail area is = 1.3277.,“Large sample test of hypothesis about a population proportion”,A method currently used by doctors to screen for possible stomach ulcer fails to detect the ulcer in 20% of the patients who actually have the disease. Suppose a new method has been developed that researchers hope will detect stomach ulcer more accurately. This new method was used to screen a random sample of 140 patients known to have stomach ulce
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 计算中心服务提供商云计算技术客户需求业务发展评估规划方案
- 2025年安徽国际商务职业学院单招职业适应性测试题库带答案解析
- 虚拟现实训练设备行业市场现状供需分析及投资评估规划分析研究报告
- 英国金融服务行业分析现状供需趋势投资评估发展策略未来规划研究报告
- 2025年安平县幼儿园教师招教考试备考题库含答案解析(夺冠)
- 2026年材料员考试备考题库及答案(名师系列)
- 2025年布拖县招教考试备考题库附答案解析
- 2025年北京邮电大学世纪学院马克思主义基本原理概论期末考试模拟题附答案解析
- 2026年交管12123学法减分复习考试题库附完整答案【夺冠系列】
- 2025年恩施职业技术学院单招职业技能考试题库附答案解析
- 辽宁轨道交通职业学院单招《职业技能测试》参考试题库(含答案)
- 马工程《经济法学》教学
- 2023-2024学年四川省宜宾市高一上册期末1月月考地理模拟试题(附答案)
- 新概念二单词表新版,Excel 版
- 一级建造师机电工程管理与实务
- 2023年陕西西安经济技术开发区招聘120人(共500题含答案解析)笔试必备资料历年高频考点试题摘选
- 第八讲 发展全过程人民民主PPT习概论2023优化版教学课件
- 篇12pmc窗口功能指令举例讲解
- GB/T 7332-2011电子设备用固定电容器第2部分:分规范金属化聚乙烯对苯二甲酸酯膜介质直流固定电容器
- GB/T 38658-20203.6 kV~40.5 kV交流金属封闭开关设备和控制设备型式试验有效性的延伸导则
- 疲劳与断裂完整
评论
0/150
提交评论