YOLOdeepsystems.io算法讲解-电子版说明文档_第1页
YOLOdeepsystems.io算法讲解-电子版说明文档_第2页
YOLOdeepsystems.io算法讲解-电子版说明文档_第3页
YOLOdeepsystems.io算法讲解-电子版说明文档_第4页
YOLOdeepsystems.io算法讲解-电子版说明文档_第5页
已阅读5页,还剩68页未读 继续免费阅读

付费下载

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

Обзор

от deepsystems.ioYOLOYouOnlyLookOnce:Unified,Real-TimeObjectDetectionJosephRedmon,SantoshDivvala,RossGirshick,Ali

FarhadiBig

pictureTimeFast

R-CNNFPS:

0.5mAP:

70Faster

R-CNNFPS:

7mAP:

73.2R-CNNFPS:-mAP:

58.5DPMFPS:

0.5mAP:

34.3YOLOFPS:

45mAP:

63.4SSDFPS:

58mAP:

72.1Nov

2013 Apr

2015 June

2015 Dec

2015РезультатынатестовойвыборкиPascalVOC2007.Обучениенаtrainvalsets

2007+2012mAP2deepsystems.io448x448x3InputimageInference3deepsystems.io448x448x3InputimageGoogLeNetmodification(20

layers)14x14x1024Inference4deepsystems.io448x448x3InputimageGoogLeNetmodification(20

layers)14x14x102414x14x1024C,RInference5deepsystems.io448x448x3InputimageGoogLeNetmodification(20

layers)14x14x102414x14x1024C,R14x14x1024C,RInference6deepsystems.io448x448x3InputimageGoogLeNetmodification(20

layers)14x14x102414x14x1024C,R14x14x1024C,R7x7x1024C,RInference7deepsystems.io448x448x3InputimageGoogLeNetmodification(20

layers)14x14x102414x14x1024C,R14x14x1024C,R7x7x1024C,R7x7x1024C,RInference8deepsystems.io448x448x3InputimageGoogLeNetmodification(20

layers)14x14x102414x14x1024C,R14x14x1024C,R7x7x1024C,R7x7x1024C,RFC,R4096x1Inference9deepsystems.io448x448x3InputimageGoogLeNetmodification(20

layers)14x14x102414x14x1024C,R14x14x1024C,R7x7x1024C,R7x7x1024C,RFC,R4096x1FC1470x1Inference10deepsystems.io448x448x3InputimageGoogLeNetmodification(20

layers)14x14x102414x14x1024C,R14x14x1024C,R7x7x1024C,R7x7x1024C,RFC,R4096x1FC1470x1Reshape7x7x30Inference11deepsystems.ioGoogLeNetmodification(20

layers)448x448x314x14x102414x14x1024C,R14x14x10247x7x10247x7x1024FC,R4096x1FC1470x1Reshape7x7x30C,RC,RC,RDetection

ProcedureInputimageInference12deepsystems.ioGoogLeNetmodification(20

layers)448x448x314x14x102414x14x1024C,R14x14x10247x7x10247x7x1024FC,R4096x1FC1470x1Reshape7x7x30C,RC,RC,RDetection

ProcedureInputimageInference13deepsystems.ioTrainfrom

scratchGoogLeNetmodification(20

layers)448x448x314x14x102414x14x1024C,R14x14x10247x7x10247x7x1024FC,R4096x1FC1470x1Reshape7x7x30C,RC,RC,RDetection

ProcedureimageInferenceusenewadditionalconvlayers=>better

performanceInput14deepsystems.ioGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage8x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x11470x17x7x715deepsystems.io730InferenceGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage8x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x11470x17x7x716deepsystems.io730InferenceGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage8x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x11470x17x7x7730grid

cell7717deepsystems.ioInferenceGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage8x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x11470x17x7x7730grid

cell7Inference718deepsystems.ioGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage8x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x11470x17x7x7730grid

cell71x30719deepsystems.ioInferenceGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage7730grid

cell71x308x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x11470x17x7x5x-coordinateofbboxcenterinsidecell([0;1]wrtgridcell

size)y-coordinateofbboxcenterinsidecell([0;1]wrtgridcell

size)w-bboxwidth([0;1]wrt

image)h-bboxheight([0;1]wrt

image)c-bboxconfidence~P(objin

bbox1)Inference720deepsystems.ioGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage7730grid

cell71x308x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x11470x17x7x5 5x-coordinateofbboxcenterinsidecell([0;1]wrtgridcell

size)y-coordinateofbboxcenterinsidecell([0;1]wrtgridcell

size)w-bboxwidth([0;1]wrt

image)h-bboxheight([0;1]wrt

image)c-bboxconfidence~P(objin

bbox2)Inference721deepsystems.ioGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage7730grid

cell71x308x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x11470x17x7x5 5twobboxesforeachgrid

cellInference722deepsystems.ioGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage7730grid

cell71x30Inference8x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x1 1470x17x7x5 520-numberof

classes723deepsystems.ioGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage7730grid

cell71x30Inference8x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x1 1470x17x7x5 5i(from1to

20)20-numberof

classesClassscore~P(objisclass_i|objin

box)724deepsystems.ioGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage7730grid

cell71x30InferenceClassscoresfor

bb18x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x11470x17x7xbb1

confidence5 5MUL20725deepsystems.io20x1GoogLeNetmodification(20

layers)448x448x314x14x102414x14x1024C,R14x14x10247x7x10247x7x1024FC,R4096x1FC1470x1Reshape7x7x30C,RC,RC,RDetectionProcedureInputimageTensorvalues

interpretation730grid

cell755Inference20bb2

confidence1x30MUL7Classscoresfor

bb2726deepsystems.io20x1GoogLeNetmodification(20

layers)448x448x314x14x102414x14x1024C,R14x14x10247x7x10247x7x1024FC,R4096x1FC1470x1Reshape7x7x30C,RC,RC,RDetectionProcedureInputimageTensorvalues

interpretation730grid

cell755Inference20bb2

confidence1x30MUL7Classscoresfor

bb2Dothisoperationforeachbboxineachgrid

cell727deepsystems.io20x1deepsystems.ioGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage7730gridcell(1,

1)77Inference8x314x14x102414x14x1024 14x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x11470x17x7x2bboxesforfirstcell(1,

1)bb1

bb220x1

20x128deepsystems.ioGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage7730gridcell(1,

2)77Inference8x314x14x102414x14x1024 14x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x11470x17x7x2bboxesforsecondcell(1,

2)bb1bb2bb3

bb420x120x120x1

20x129deepsystems.ioGoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage7730gridcell(7,

7)77Inference…….988x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x1 1470x17x7xbb1bb2bb3

bb42bboxesforlastcell(7,

7)bb97

bb20x120x120x120x120x1

20x120x120x120x120x120x120x120x120x1

20x130GoogLeNetmodification(20

layers)448x4C,RFC,RFCReshape30C,RC,RC,RDetectionProcedureInputimage7730gridcell(7,

7)77Inference…….988x314x14x102414x14x102414x14x1024 7x7x1024Tensorvalues

interpretation7x7x10244096x1 1470x17x7xbb1bb2bb3

bb4Total7*7*2=98

bboxesbb97

bb31deepsystems.ioLookatdetection

procedure32deepsystems.ioDetectionProcedure7x7x30…….bb97

bb98bb1bb2bb3

bb4Classscoresforeach

bbox33deepsystems.io20x120x1bb97

bb98bb1bb2bb3

bb4Dog

scores…….Classscoresforeach

bbox34deepsystems.ioGetfirstclassscoresforeach

bbox20x120x1bb97

bb98bb1bb2bb3

bb4Dog

scores…….Set

zeroifscore<thresh1

(0.2)35deepsystems.iobb97

bb98bb1bb2bb3

bb4000…….20x120x1bb97

bb98bb1bb2bb3

bb4Dog

scores…….Set

zeroifscore<thresh1

(0.2)bb97

bb98bb1bb2bb3

bb4000…….Sort

descending36deepsystems.iobb2bb4

bb98bb3bb1

bb98000…….20x120x1bb97

bb98bb1bb2bb3

bb4Dog

scores…….Set

zeroifscore<thresh1

(0.2)bb97

bb98bb1bb2bb3

bb4000…….Sort

descendingbb2bb4

bb98bb3bb1

bb98000…….NMSalgorithm

setscorestozeroforredundant

bboxesbb2bb4

bb9837deepsystems.iobb3bb1

bb980000…….20x120x1bb97

bb98bb1bb2bb3

bb4Dog

scores…….Set

zeroifscore<thresh1

(0.2)bb97

bb98bb1bb2bb3

bb4000…….Sort

descending…….bb3bb1

bb98bb2bb4

bb980 0 0NMSalgorithm

setscorestozeroforredundant

bboxes…….bb3bb1

bb98bb2bb4

bb980 0 0020x120x1Howit

works38deepsystems.ioNon-MaximumSuppression:

intuition39deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bbox0.50.30.20.1000040deepsystems.ioNon-MaximumSuppression:

intuitionbb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bbox0.50.30.20.10000Non-MaximumSuppression:

intuition41deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bbox0.50.30.20.10000Non-MaximumSuppression:

intuition42deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bbox0.50.30.20.10000Non-MaximumSuppression:

intuition43deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bbox0.50.30.20.10000Non-MaximumSuppression:

intuition44deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bbox0.50.30.20.10000Non-MaximumSuppression:

intuition45deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bboxGetbboxwithmaxscore.Let’sdenoteit

“bbox_max”0.50.30.20.10000Non-MaximumSuppression:

intuition46deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bboxCompare“bbox_max”withotherslessscore(non-zero!)bboxes.Let’sdenoteit“bbox_cur”0.50.30.20.10000Non-MaximumSuppression:

intuition47deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bboxIfIoU(bbox_max,bbox_cur)>0.5thenset0scoreto

bbox_cur.0.50.30.20.10000Non-MaximumSuppression:

intuition48deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bboxIfIoU(bbox_max,bbox_cur)>0.5thenset0scoreto

bbox_cur.Inthiscase:setto

0.0.500.20.10000Non-MaximumSuppression:

intuition49deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bboxGotonext

bbox_cur.0.500.20.10000Non-MaximumSuppression:

intuition50deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bboxIfIoU(bbox_max,bbox_cur)>0.5thenset0scoreto

bbox_cur.Gotonext

bbox_cur.0.500.20.10000Non-MaximumSuppression:

intuition51deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bboxNon-MaximumSuppression:

intuitionIfIoU(bbox_max,bbox_cur)>0.5thenset0scoreto

bbox_cur.Inthiscase:

continue.Gotonext

bbox_cur.0.500.20.1000052deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bboxGotonext

bbox_cur.0.500.20.10000Non-MaximumSuppression:

intuition53deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bboxGotonext

bbox_cur.0.500.20.10000IfIoU(bbox_max,bbox_cur)>0.5thenset0scoreto

bbox_cur.54deepsystems.ioNon-MaximumSuppression:

intuitionbb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bboxGotonext

bbox_cur.0.500.20.10000IfIoU(bbox_max,bbox_cur)>0.5thenset0scoreto

bbox_cur.Inthiscase:

continue.55deepsystems.ioNon-MaximumSuppression:

intuitionbb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bboxGotonext

bbox_cur.0.500.20.10000IfIoU(bbox_max,bbox_cur)>0.5thenset0scoreto

bbox_cur.Inthiscase:

continue.Dothisprocedureforother“bbox_cur”.Afterthat

...56deepsystems.ioNon-MaximumSuppression:

intuitionbb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bboxGotonextbboxwithbigscore.Let’sdenoteit“bbox_max”0.500.20.10000Non-MaximumSuppression:

intuition57deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bbox0.500.20.10000Gotonext

bbox_cur.Non-MaximumSuppression:

intuition58deepsystems.iobb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bbox0.500.20.10000Gotonext

bbox_cur.IfIoU(bbox_max,bbox_cur)>0.5thenset0scoreto

bbox_cur.59deepsystems.ioNon-MaximumSuppression:

intuitionbb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bbox0.500.200000Gotonext

bbox_cur.IfIoU(bbox_max,bbox_cur)>0.5thenset0scoreto

bbox_cur.Inthiscase:setto

0.Dothisprocedureforother“bbox_max”andforothercorresponding

“bbox_cur”.60deepsystems.ioNon-MaximumSuppression:

intuitionbb47bb20bb15

bb7class:

dogbb1bb4bb8

bb981x98class(dog)scoresforeach

bbox0.500.200000Aftercomparisonalmostallpairsofbboxestheonlytwobboxesleftwithnon-zeroclassscore

value.Non-MaximumSuppression:

intuition61deepsystems.iobb97

bb98bb1bb2bb3

bb4Cat

scores…….Set

zeroifscore<thresh1

(0.2)bb97

bb98bb1bb2bb3

bb4000…….Sort

descending…….bb2bb4

bb98bb3bb1

bb980 0 0NMSalgorithm

setscorestozeroforredundant

bboxes…….bb2bb4

bb98bb3bb1

bb980 0 062deepsystems.io0Dothisprocedurefornextclass20x120x1bb97

bb98bb1bb2bb3

bb4person

scores…….Set

zeroifscore<thresh1

(0.2)bb97

bb98bb1bb2bb3

bb4…….000Sort

descendingbb2bb4

bb98bb3bb1

bb98…….000bb2bb4

bb9863deepsystems.iobb3bb1

bb98…….0000Dothisprocedureforall

classesNMSalgorithm

setscorestozeroforredundant

bboxes20x120x1bb97

bb98bb1bb2bb3

bb4person

scores…….Set

zeroifscore<thresh1

(0.2)bb97

bb98bb1bb2bb3

bb4…….000Sort

descendingbb2bb4

bb98bb3bb1

bb98…….000bb2bb4

bb98bb3bb1

bb98Afterthisprocedure-alotof

zerosNMSalgorithm

setscorestozeroforredundant

bboxes000…….000000064deepsystems.io20x120x1bb97

bb98bb1bb2bb3

bb4person

scores…….Set

zeroifscore<thresh1

(0.2)bb97

bb98bb1bb2bb3

bb4…….000Sort

descendingbb2bb4

bb98bb3bb1

bb98…….000bb2bb4

bb98bb3bb1

bb98Selectbboxestodrawbyclassscore

valuesNMSalgorithm

setscorestozeroforredundant

bboxes000…….000000065deepsystems.io20x120x1…….bb97

bb98bb1bb2bb3

bb4Set

zeroifscore<thresh1

(0.2)…….bb97

bb98bb1bb2bb3

bb4000Sort

descending…….bb2bb4

bb97bb3bb1

bb98000…….bb2bb4

bb97bb3bb1

bb980class=max_index(scoresfor

bb3);score=max(scoresfor

bb3);Score>

0yesnodrawbboxwithclass

colorskip

bboxNMSalgorithm

setscorestozeroforredundant

bboxes66deepsystems.io00000000020x120x1…….bb97

bb98bb1bb2bb3

bb4Set

zeroifscore<thresh1

(0.2)…….bb97

bb98bb1bb2bb3

bb4000Sort

descending…….bb2bb4

bb97bb3bb1

bb98000…….bb2bb4

bb97bb3bb1

bb980class=max_index(scoresfor

bb1);score=max(scoresfor

bb1);Score>

0yesnodrawbboxwithclass

colorskip

bboxNMSalgorithm

setscorestozeroforredundant

bboxes67deepsystems.io00000000020x120x1…….bb97

bb98bb1bb2bb3

bb4Set

zeroifscore<thresh1

(0.2)…….bb97

bb98bb1bb2bb3

bb4000Sort

descending…….bb2bb4

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

最新文档

评论

0/150

提交评论