版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、Anomaly detec-onProblem mo-va-onMachine LearningAnomalydetec-on exampleAircra9 engine features:= heat generated= vibra-on intensityDataset:New engine:(heat)Andrew Ng(vibra-on)Densityes-ma-onDataset:Isanomalous?(heat)Andrew Ng(vibra-on)Anomalydetec-on exampleFraud detec-on:= features of users ac-vi-e
2、sModelfrom data.Iden-fy unusual users by checking which haveManufacturingMonitoring computers in a data center.= features of machine= memory use,= CPU load,= number of disk accesses/sec,= CPU load/network trac.Andrew NgAnomaly detec-onGaussian distribu-onMachine LearningGaussian (Normal) distribu-on
3、Say. Ifis a distributed Gaussian with mean, variance.Andrew NgGaussian distribu-on exampleAndrew NgParameter es-ma-onDataset:Andrew NgAnomalydetec-onAlgorithmMachine LearningDensityes-ma-onTraining set:Each example isAndrew NgAnomalydetec-onalgorithm1.Choose featuresthat you think might be indica-ve
4、 ofanomalous examples.Fit parameters2.3.Given new example, compute:Anomaly ifAndrew NgAnomalydetec-on exampleAndrew NgAnomaly detec-onDeveloping and evalua-ng an anomaly detec-on systemMachine LearningThe importance of real-number evalua-onWhen developing a learning algorithm (choosing features, etc
5、.), making decisions is much easier if we have a way of evalua-ng our learning algorithm.Assume we have some labeled data, of anomalous and non-anomalous examples.(if normal,if anomalous).Training set:anomalous)(assume normal examples/notCross valida-on set:Test set:Andrew NgAircraA engines mo-va-ng
6、 example10000good (normal) engines20awed engines (anomalous)Training set: 6000 good enginesCV: 2000 good engines( Test: 2000 good engines), 10 anomalous (), 10 anomalous ()Alterna-ve:Training set: 6000 good enginesCV: 4000 good engines( Test: 4000 good engines), 10 anomalous (), 10 anomalous ()Andre
7、w NgAlgorithm evalua-onFit modelon training setOn a cross valida-on/testexample, predictPossible evalua-on metrics:- True posi-ve, false posi-ve, false nega-ve, true nega-ve- Precision/Recall- F1-scoreCan also use cross valida-on set to choose parameterAndrew NgAnomaly detec-onAnomaly detec-on vs. s
8、upervised learningMachine LearningAnomaly detec-onVery small number ofposi-vevs.Supervised learningLarge number of posi-ve and nega-ve examples.examples( commonLarge number of nega-ve ( examples.Many dierent “types” of). (0-20 is)Enough posi-ve examples foralgorithm to get a sense of what posi-ve ex
9、amples are like, future posi-ve examples likely to be similar to ones in training set.anomalies. Hard for any algorithmto learn from posi-ve exampleswhat the anomalies look like; future anomalies may look nothing like any of the anomalousexamples weve seen so far.Andrew NgAnomaly detec-onvs.Supervis
10、ed learningEmail spam classica-onFraud detec-onWeather predic-on (sunny/rainy/etc).Manufacturing (e.g. aircra9engines)Monitoring machines in a dataCancer classica-oncenterAndrew NgAnomalydetec-onChoosing whatfeatures to useMachine LearningNon-gaussian featuresError analysis for anomaly detec-onWantl
11、arge for normal examples.small for anomalous examplesMost common problem:.is comparable (say, both large) for normaland anomalous examplesMonitoring computers in a data centerChoose features that might take on unusually large or small values in the event of an anomaly.= memory use of computer= numbe
12、r of disk accesses/sec= CPU load= network tracAnomalydetec-onMul-variateGaussiandistribu-onMachine LearningMo-va-ng example: Monitoring machines in a data center(CPU Load)(CPU Load)(Memory Use)Andrew Ng(Memory Use)Mul-variate Gaussian (Normal) distribu-on. Dont modeletc. separately.ModelParameters:a
13、ll in one go.(covariance matrix)Andrew NgMul-variate Gaussian (Normal) examplesAndrew NgMul-variate Gaussian (Normal) examplesAndrew NgMul-variate Gaussian (Normal) examplesAndrew NgMul-variate Gaussian (Normal) examplesAndrew NgMul-variate Gaussian (Normal) examplesAndrew NgMul-variate Gaussian (No
14、rmal) examplesAndrew NgAnomaly detec-onAnomaly detec-on using the mul-variate Gaussian distribu-onMachine LearningMul-variate Gaussian (Normal) distribu-onParametersParameter fng:Given training setAndrew NgAnomaly detec-on with the mul-variate Gaussian1. Fit modelby sefng2. Given a new example, computeFlag an anomaly ifAndrew NgRela-onship tooriginal modelOriginal model:Corresponds to mul-variateGaussianwhereAndrew NgOriginal modelvs.Mul-variate GaussianManually create features tocapture anomalies where take unusual combi
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2025江苏无锡市梁溪区卫生健康委下属医疗卫生事业单位公开招聘工作人员34人(普通类)笔试历年典型考题及考点剖析附带答案详解试卷2套
- 隧道二衬混凝土施工方案
- 2025年造纸纸制品统计报告
- 2026散装浓缩果汁冷链配送网络优化报告
- 2025年公司治理结构优化方案
- 2025年产品上市报告范文
- 2026散装建材贸易市场供需分析与投资回报预测报告
- 2026散装建材市场供需状况与投资策略规划研究报告
- 2026散装干果市场消费特征与渠道建设研究报告
- 2026散装宠物食品市场需求增长与投资可行性研究报告
- 2026年人教版新教材数学三年级下册教学计划(含进度表)
- 小学元宵节主题班会 课件(希沃版 )
- 2025年江西电力职业技术学院单招职业技能考试题库附答案解析
- pp板施工项方案
- 2026湖北武汉东风延锋汽车座椅有限公司招聘备考题库及一套完整答案详解
- 河北省“五个一”名校联盟2025-2026学年高一上学期期末语文试题(含答案)
- 易制毒、易制爆化学品安全管理制度
- 2026年CGTN招聘考试试题
- 白描笔法课件
- 诸暨袜业行业现状分析报告
- 2026年河南经贸职业学院单招职业技能测试题库完美版
评论
0/150
提交评论