




已阅读5页,还剩95页未读, 继续免费阅读
版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
Hence,P(4.5x8.5)=0.9864-0.3745=0.6119(approxequalto0.6123bybinomialdistribution),天马行空官方博客:,IntroductiontoSampling,Whatispopulationinstatistic?Apopulationinstatisticreferstoallitemsthathavebeenchosenforstudy.,Whatisasampleinstatistic?Asampleinstatisticreferstoaportionchosenfromapopulation,bywhichthedataobtaincanbeusedtoinferontheactualperformanceofthepopulation,Population,Sample2,Sample6,Sample8,Sample1,Sample3,Sample7,Sample4,Sample5,Samplingdistribution-adistributionofsamplemeansIfyoutake10samplesoutofthesamepopulations,youwillmostlikelyendupwith10differentsamplemeans和sample标准偏差s.ASamplingdistributiondescribestheprobabilityofallpossiblemeansofthesamplestakenfromthesamepopulation.,Whensamplesizeincreases,thestandarderror(orthestddeviationofsamplingdistribution)willgetsmaller.,Samplingdistribution(cont),CentralLimitTheorem,Exampleofsamplingdistribution,Thepopulationdistributionofannualincomeofengineersisskewednegatively.Thisdistributionhasameanof$48,000,和a标准偏差of$5000.Ifwedrawasampleof100engineers,whatistheprobabilitythattheiraverageannualincomeis$48700和more.,Exampleofsamplingdistribution(cont),Therefore,mean=48000sigma=500X=50000Z=(48700-48000)/500=700/500=1.4,Fromthestandardizednormaldistributiontable,P(X$48700)=0.9192Therefore;P(X$48700)=1-0.9192=0.0808Thus,wehavedeterminedthatithasonly8.08%chancefortheaverageannualincomeof100engineerstobemorethan$48700.,CentralLimitTheoremExercise,Breakinto4groupsasbelow:Group1:Thepopulationgroup.Group2to4:Thesamplesub-groupThepopulationgroupwillhave3oftheirmembersthrowingasingledices60timeseach.Atotalof180throwswillberecorded和thisdatawillbethepopulationdata.Eachsamplesub-groupwillhave3oftheirmembersthrowing5dicesatonetime,和collectthesum和averagevalueoftheparticularthrow.Eachmemberistoconduct20throws和obtainthesamplemeanofeachthrow.Attheendoftheexercise,atotalof180samplemeanswillbecollectedfromthe3sub-groups.Fromthearriveddata,plotthehistogram和commentonthedistributionofboththepopulation和thesamplings.,Thefinitepopulationmultiplier,Previouslywesaythat:,Finitepopulationmultiplierwithrespecttopopulation和samplesize,RuleofthumbsThefinitepopulationmultiplierneedonlybeincludedifpopulationsizetosamplesizeratioislessthan25.,ConfidenceInterval,“Pointestimates”Apointestimateisasinglenumberthatisusedtoestimateanunknownpopulationparameter.,Whatdoes95%confidenceintervalsmeans?Itdefines95%ofthetime,theaveragevalueofarandomsamplingwillfallwithinavaluerangewhichis+/-1.96standarderrorfromthesamplemean.,为什么1.96standarderror?Referringtothestandardizednormaldistributiontable,whenz=1.96,theassociatedprobabilityis0.975(or97.5%)asbelow:,Butthisisa2tailsinterval(i.e.+/-1.96),henceweneedtominusoffanother2.5%fromtheotherend,givingatotalcoverageof95%.,EquationforcomputingconfidenceintervalsForcontinuousdata:,Fordiscretedata:,ConfidenceintervalforcontinuousdataAlargediskdrivemanufacturerneedsanestimateofthemeanlifeitcanexpectfromthemagneticmediabyreciprocallyswitchingitsdigitalstateat1MHzfrequency.Thedevelopmentteamhasdeterminedpreviouslythatthevarianceofthepopulationlifeis36months,和hadconductedareliabilitytestingfor100samples,collectingdataofitsusefullifeasbelow:,Fromtheabovedata,whatisthe95%confidenceintervalfortheusefulmeanlifeofthemagneticmediainadiskdrive?Whatdoesthismean?,Confidenceintervalforcontinuousdata(cont),Applyingtheconfidenceintervalsequation:Upperconfidencelimit=27.75+1.96(0.6)=28.926Lowerconfidencelimit=27.751.96(0.6)=26.574,Assuch,thereis95%confidencelevelthattheaverageusefullifeofthemagneticmediatofallbetween26.574和28.926months.,ConfidenceintervalfordiscretedataBreakinto4or5teams,combinedalltheMthen流程isnotcentered,Cpk=Cp;then流程iscentered,The流程ofcollecting,presenting和describingsampledata,usinggraphical工具和numbers.ParetoChartPopulationmeanHistogramPopulation标准偏差,DescriptiveStatistics,ProbabilityTheory,Probabilityisthechanceforaneventtooccur.Statisticaldependence/independencePosteriorprobabilityRelativefrequencyMakedecisionthroughprobabilitydistributions(i.e.Binomial,Poisson,Normal),InferentialStatistics,The流程ofinterpretingthesampledatatodrawconclusionsaboutthepopulationfromwhichthesamplewastaken.ConfidenceInterval(Determineconfidencelevelforasamplingmeantofluctuate)T-Test和F-Test(Determineiftheunderlyingpopulationsissignificantlydifferentintermsofthemeans和variations)Chi-SquareTestofIndependence(Testifthesampleproportionsaresignificantlydifferent)Correlation和Regression(Determineif关系hipbetweenvariablesexists,和generatemodelequationtopredicttheoutcomeofasingleoutputvariable),CentralLimitTheorem,某些takeawaysforsamplesize和samplingdistribution,PercentilesofthetDistribution,Whereby,df=Degreeoffreedom=n(samplesize)1Shadedarea=one-tailedprobabilityofoccurencea=1ShadedareaApplicablewhen:Samplesize450).DesignatedasH1,ElementsofHypothesisTesting(cont),Exampleifwewanttotestwhetherapopulationmeanisequalto500,wewouldtranslateitto:NullHypothesis,H0:mp=500和consideralternatehypothesisas:AlternateHypothesis,H1:mp500;(2tailstest),Rememberconfidenceinterval,at95%confidencelevelstatesthat:95%ofthetimethemeanvaluewillfluctuatewithintheconfidenceinterval(limit)5%chancethatthemeanisnaturalfluctuation,butwethinkitisnotalpha(a)probability,TypeIIErrorAcceptinganullhypothesis(H0),whenitisfalse.Probabilityofthiserrorequalsb,TypeIErrorRejectingthenullhypothesis(H0),whenitistrue.Probabilityofthiserrorequalsa,Usethestderrorobservedfromthesampletosetconfidencelimiton500(mH0).TheassumptionismH0hasthesamevarianceasmp.,ElementsofHypothesisTesting(cont),Otherpossiblealternatehypothesisare:AlternateHypothesis,H1:mp500;(1tailtest)AlternateHypothesis,H1:mp500For95%confidencelevel,a=0.05.SinceH1isonetailtest,rejectareadoesnotneedtobedividedby2.,某些hypothesistestingsthatareapplicabletoengineers:Theimpactonresponsemeasurementwithnew和old流程parameters.Comparisonofanewvendorsparts(whichareslightlymoreexpensive)tothepresentvendor,whenvariationisamajorissue.IstheyieldonTesterECTZ21thesameastheyieldonTesterECTZ33?,流程SituationsComparisonofonepopulationfromasingle流程toadesirablestandardComparisonoftwopopulationsfromtwodifferent流程esorSinglesided:comparisonconsidersadifferenceonlyifitisgreateroronlyifitisless,butnotboth.Twosided:comparisonconsidersanydifferenceofine质量important,Inferencesbasedonasinglesample,“Largesampletestofhypothesisaboutapopulationmean”,Example:Anautomotivemanufacturerwantstoevaluateiftheirnewthrottle设计onallthelatestcarmodelisabletogiveanadequateresponsetime,resultinginanpredictablepick-upofthevehiclespeedwhenthefuelpedalisbeingdepressed.Basedonfiniteelementmodelling,the设计teamcommittedthatthethrottleresponsetimeis1.2msec,和thisistherecommendedvaluethatwillgivethedriverthebestcontroloverthevehicleacceleration.Thetestengineerofthisprojecthastestedon100vehicleswiththenewthrottle设计和obtainanaveragethrottleresponsetimeof1.05msecwitha标准偏差Sof0.5msec.Basedon99%confidencelevel,canheconcludedthatthenewthrottle设计willgiveanaverageresponsetimeof1.2msec?,“Largesampletestofhypothesisaboutapopulationmean”(cont),Fromstandardnormaldistributiontable,TheZvaluecorrespondingto0.005tailareais2.58.,a=0.01(2tails),since2tailstest,thereforetailarea=a/2=0.005;,“Largesampletestofhypothesisaboutapopulationmean”(cont),Whatdoes99%confidencelevelmeansintheaboveexample?Itdefinesthelimitswhereby99%oftheaveragesamplingvalueshouldfallwithin,giventhedesirable(hypothesised)meanasmH0.AnyvaluefalloutsidethisconfidencelimitindicatesthesamplemeanissignificantlydifferentfrommH0.Inotherwords,wewillonlyconcludethealternatehypothesisH1(thatthemeansaredifferent)ifwearemorethan99%sure.,TheObservedSignificancelevel,p-value,p-valueistheprobabilityforconcludingthenullhypothesisH0thatbothpopulationmeansareequalwiththeobservedsampledata.Hence1pvaluewillbetheconfidencelevelwehaveonthealternatehypothesis.,Usingthethrottlequestionasanexample:Weknowthatthemeanresponsetimeis3standarderrorawayfrom1.2msec(mH0),thereforeZ=3.Sincethisisa2tailstest,p-value=P(Z3)=2P(Z3)Fromstandardnormaldistributiontable,P(Z=3)=0.9987P(Z3)=10.9987=0.0013p-value=2P(Z3)=0.0026,Instatisticalterm,itmeansthereisonly0.0026probabilitythattheaveragethrottleresponsetimetobe1.02mseciftheactualpopulationmeanis1.2msecassuggestedbyfiniteelementanalysis.,“Smallsampletestofhypothesisaboutapopulationmean”,Example:AmyisthePersonnelOfficerofamulti-nationalcompanywhoisinchargeofrecruitingalargenumberofemployeesforanoverseasassignment.Astheseoverseasassignmentsareverycrucialforthecompanysuccessinmeetingtheirbusinessplan,anaptitudetestwasformulatedtotestthe质量ofallpotentialcandidateshead-huntedbythe招聘Agency.Themanagementwantstoknowtheeffectivenessofthe招聘Agency,asitwasbelievedthattheaveragetestscoreforalltheidentifiedcandidatesshouldbeequalormorethan90inordertoreducetheriskofassigningthewrongcandidatesforthetask.WhenAmyreviewsthetestsresultofaparticularbatchof20candidates,shefindsthatthemeanscoreis84和the标准偏差is11.Asthisisaverycritical招聘project,Amywantstobemorereservewithheranalysis,和decidedtobemorebiastowardsprovingthatthepopulationmeanislesserthan90.Asaresult,aconfidencelevelof90%willbeusedinheranalysis.,“Smallsampletestofhypothesisaboutapopulationmean”(cont),Fromstudenttdistributiontable,Thetvaluewith19dfcorrespondingto0.1tailareais=1.3277.,“Largesampletestofhypothesisaboutapopulationproportion”,Amethodcurrentlyusedbydoctorstoscreenforpossiblestomachulcerfailstodetecttheulcerin20%ofthepatientswhoactuallyhavethedisease.Supposeanewmethodhasbeendevelopedthatresearchershopewilldetectstomachulcermoreaccurately.Thisnewmethodwasusedtoscreenarandomsampleof140patientsknowntohavestomachulcer.Ofthese,thenewmethodfailedtodetectulcerin12ofthepatients.Using95%confidencelevel,doesthissample提供evidencethatthefailurerateofthenewmethoddiffersfromtheonecurrentlyinuse?,Solution:Lettheprobabilityofsuccessinmisseddetectionasptherefore;H0:p=0.2(i.e.pH0)H1:p0.2Samplesize,n=140(i.e.usestandardnormalzastheteststatistic)Computethestandarderrorfornullhypothesis(i.e.whenp=0.2),TestifpH03stderrorwillgivereasonablevalue(i.e.between0to1)pH03stderror0.23(0.034)=(0.166,0.234),“Largesampletestofhypothesisaboutapopulationproportion”(cont),Calculatethenumberofstandarderrorsbetweenthesampled和hypothesisedvalue:,Conclusions:Sincep=0.086isinrejectarea,werejectnullhypothesis和concludethatthenewscreenmethodissignificantlydifferentthantheoldscreenmethodwith95%confidencelevel.With3.36stderrorfrompH0(0.2),thep-valueiscalculatedtobe0.00078,hencethereisa99.922%confidenceinthealternatehypothesis.Itappearsthatthenewscreenmethodwillgivelessermissdetectionforstomachulcer.,PowerofaHypothesisTesting,TypeIErrorRejectingthenullhypothesis(H0),whenitistrue.Probabilityofthiserrorequalsa.TypeIIErrorAcceptinganullhypothesis(H0),whenitisfalse.Probabilityofthiserrorequalsb.Inhypothesistesting,wearealwaysbiastowardsH0.Thereforea95%confidencelimitwillonlytellusifwearemorethan95%surethatthetwopopulationmeansaredifferent.Howeverthetruestatesofthepopulationmeanscanbedifferentevenifwearelessthan95%sure.Inotherwords,ifthereisnosignificantdifferencebetweenthe2means,itdoesnotindicatethattheyareequal,itcouldbethattheyarenotfarenoughapart.Illustrateintheabovedistribution,assumingm1hasthesamevarianceasmH0和theyaredifferent,theareaunderm1curvethatisfallwithin95%confidencelimitofmH0willbeb(probabilityfortypeIIerror).,PowerofaHypothesisTesting(cont),Assuchthereisatradeoffbetweena和b.Asadecreases,bincreases和viceversa.,mH0,(a/2)RejectArea,(a/2)RejectArea,PowerofaHypothesisTesting(cont),Ahospitaluseslargequantitiesofpackageddosesofaparticulardrug.Theindividualdoseofthisdrugis100cc.Theactionofthedrugissuchthatthebodywillharmlesslypassoffexcessivedoses.Ontheotherhand,insufficientdoses(i.e.99.6cc和below)donotproducethedesiredmedicaleffect,和theyinterferewithpatienttreatment.Thehospitalhaspurchaseditsrequirementsofthisdrugfromthesamemanufacturerforanumberofyears和knowsthatthepopulation标准偏差is2cc.Thehospitalinspects50dosesofthisdrugatrandomfromaverylargeshippment和findsthemeanofthesedosestobe99.75cc.With90%confidencelevel,howcanthehospitalconcludewetherthedosagesinthisshipmentaretoosmall?,mH0=100cc(hypothesisedvalueofpopulationmean)=2(knownpopulation标准偏差)X=99.75(samplemean)n=50(samplesize)H0:m=100(nullhypothesisthatmeandosagefromshippmentis100
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2025年福建省古田县人力资源和社会保障局招聘10人考前自测高频考点模拟试题带答案详解
- 2025湖南永州市零陵区第二批公开引进急需紧缺专业人才(医疗岗9人)模拟试卷及答案详解(历年真题)
- 2025赤峰市中心医院招聘8控制数人员考前自测高频考点模拟试题及答案详解(易错题)
- 2025广东汕头大学医学院教务处医学教育拓展项目教辅人员招聘1人考前自测高频考点模拟试题附答案详解(考试直接用)
- 2025年宁波市鄞州区面向社会公开招聘社区专职工作者55人模拟试卷及答案详解(典优)
- 2025贵州省体育局直属事业单位第十三届贵州人才博览会引才1人模拟试卷有完整答案详解
- 2025重庆泰科防务科技有限公司招聘8人笔试历年参考题库附带答案详解
- 2025重庆两江新区人才发展集团有限公司派往泰科防务科技(重庆)有限公司招聘8人笔试历年参考题库附带答案详解
- 2025辽宁葫芦岛市南票区招聘区属国有企业高级管理人员3人笔试历年参考题库附带答案详解
- 2025贵州关岭自治县农旅产业投资(集团)有限责任公司引聘人才(第一批次)笔试历年参考题库附带答案详解
- 2025广西崇左凭祥市委宣传部招聘编外工作人员1人考试参考题库及答案解析
- 2025江西赣州南康赣商村镇银行招聘4人考试参考题库及答案解析
- 社保协议书模板6篇
- 企业安全生产责任书范本大全
- 工艺设备变更风险评估报告模板
- 红星照耀中国考试真题及答案
- 2025离婚起诉状民事诉状(离婚案件用)
- 前端Vue3项目实战教程
- 国开(河北)2024年秋《现代产权法律制度专题》形考作业1-4答案
- 运用PDCA循环降低住院患者雾化吸入的不规范率品管圈成果汇报
- 感触最深的一件事七年级作文大全600字
评论
0/150
提交评论