版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、Anomaly detec-onProblem mo-va-onMachine LearningAnomalydetec-on exampleAircra9 engine features:= heat generated= vibra-on intensityDataset:New engine:(heat)Andrew Ng(vibra-on)Densityes-ma-onDataset:Isanomalous?(heat)Andrew Ng(vibra-on)Anomalydetec-on exampleFraud detec-on:= features of users ac-vi-e
2、sModelfrom data.Iden-fy unusual users by checking which haveManufacturingMonitoring computers in a data center.= features of machine= memory use,= CPU load,= number of disk accesses/sec,= CPU load/network trac.Andrew NgAnomaly detec-onGaussian distribu-onMachine LearningGaussian (Normal) distribu-on
3、Say. Ifis a distributed Gaussian with mean, variance.Andrew NgGaussian distribu-on exampleAndrew NgParameter es-ma-onDataset:Andrew NgAnomalydetec-onAlgorithmMachine LearningDensityes-ma-onTraining set:Each example isAndrew NgAnomalydetec-onalgorithm1.Choose featuresthat you think might be indica-ve
4、 ofanomalous examples.Fit parameters2.3.Given new example, compute:Anomaly ifAndrew NgAnomalydetec-on exampleAndrew NgAnomaly detec-onDeveloping and evalua-ng an anomaly detec-on systemMachine LearningThe importance of real-number evalua-onWhen developing a learning algorithm (choosing features, etc
5、.), making decisions is much easier if we have a way of evalua-ng our learning algorithm.Assume we have some labeled data, of anomalous and non-anomalous examples.(if normal,if anomalous).Training set:anomalous)(assume normal examples/notCross valida-on set:Test set:Andrew NgAircraA engines mo-va-ng
6、 example10000good (normal) engines20awed engines (anomalous)Training set: 6000 good enginesCV: 2000 good engines( Test: 2000 good engines), 10 anomalous (), 10 anomalous ()Alterna-ve:Training set: 6000 good enginesCV: 4000 good engines( Test: 4000 good engines), 10 anomalous (), 10 anomalous ()Andre
7、w NgAlgorithm evalua-onFit modelon training setOn a cross valida-on/testexample, predictPossible evalua-on metrics:- True posi-ve, false posi-ve, false nega-ve, true nega-ve- Precision/Recall- F1-scoreCan also use cross valida-on set to choose parameterAndrew NgAnomaly detec-onAnomaly detec-on vs. s
8、upervised learningMachine LearningAnomaly detec-onVery small number ofposi-vevs.Supervised learningLarge number of posi-ve and nega-ve examples.examples( commonLarge number of nega-ve ( examples.Many dierent “types” of). (0-20 is)Enough posi-ve examples foralgorithm to get a sense of what posi-ve ex
9、amples are like, future posi-ve examples likely to be similar to ones in training set.anomalies. Hard for any algorithmto learn from posi-ve exampleswhat the anomalies look like; future anomalies may look nothing like any of the anomalousexamples weve seen so far.Andrew NgAnomaly detec-onvs.Supervis
10、ed learningEmail spam classica-onFraud detec-onWeather predic-on (sunny/rainy/etc).Manufacturing (e.g. aircra9engines)Monitoring machines in a dataCancer classica-oncenterAndrew NgAnomalydetec-onChoosing whatfeatures to useMachine LearningNon-gaussian featuresError analysis for anomaly detec-onWantl
11、arge for normal examples.small for anomalous examplesMost common problem:.is comparable (say, both large) for normaland anomalous examplesMonitoring computers in a data centerChoose features that might take on unusually large or small values in the event of an anomaly.= memory use of computer= numbe
12、r of disk accesses/sec= CPU load= network tracAnomalydetec-onMul-variateGaussiandistribu-onMachine LearningMo-va-ng example: Monitoring machines in a data center(CPU Load)(CPU Load)(Memory Use)Andrew Ng(Memory Use)Mul-variate Gaussian (Normal) distribu-on. Dont modeletc. separately.ModelParameters:a
13、ll in one go.(covariance matrix)Andrew NgMul-variate Gaussian (Normal) examplesAndrew NgMul-variate Gaussian (Normal) examplesAndrew NgMul-variate Gaussian (Normal) examplesAndrew NgMul-variate Gaussian (Normal) examplesAndrew NgMul-variate Gaussian (Normal) examplesAndrew NgMul-variate Gaussian (No
14、rmal) examplesAndrew NgAnomaly detec-onAnomaly detec-on using the mul-variate Gaussian distribu-onMachine LearningMul-variate Gaussian (Normal) distribu-onParametersParameter fng:Given training setAndrew NgAnomaly detec-on with the mul-variate Gaussian1. Fit modelby sefng2. Given a new example, computeFlag an anomaly ifAndrew NgRela-onship tooriginal modelOriginal model:Corresponds to mul-variateGaussianwhereAndrew NgOriginal modelvs.Mul-variate GaussianManually create features tocapture anomalies where take unusual combi
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2025-2026学年饮品拼音教学设计
- 2026 年中职地理观测技术(地理观测基础)试题及答案
- 2025-2026学年青蛙卖泥塘微课教学设计
- 2026年前沿技术对机械设计的影响
- 湖南吉利汽车职业技术学院《小学语文教学设计与实施》2024-2025学年第二学期期末试卷
- 广西职业技术学院《艺术学概论》2024-2025学年第二学期期末试卷
- 南宁职业技术学院《日本社会》2024-2025学年第二学期期末试卷
- 桂林信息工程职业学院《食品安全风险分析(实验)》2024-2025学年第二学期期末试卷
- 湖北师范大学文理学院《现代设施园艺新技术讲座》2024-2025学年第二学期期末试卷
- 2026届河北省承德市联校数学高一下期末质量检测试题含解析
- 期货入门基础知识【期货新手基础入门】
- 孕妇孕期心理健康指导健康宣教
- 锂产业发展现状及趋势课件
- 第一章 组织工程学-概述
- 211和985工程大学简介PPT
- 【基于7P理论的汉庭酒店服务营销策略14000字(论文)】
- 初中数学:《二次根式》大单元教学设计
- 分清轻重缓急
- 山东大学核心期刊目录(文科)
- 2023年医技类-康复医学治疗技术(中级)代码:381历年考试真题(易错、难点与常考点摘编)有答案
- 噪声及振动环境课件
评论
0/150
提交评论