21 - Convolutional Networks for Vision Applications (Part 1)

Uploaded by: h*** | IP location: Shandong | Upload date: 2026-03-02 | Format: PPTX | Pages: 28 | Size: 8.14 MB


Document summary

Computer Vision Tasks, Part I
Deep Learning and its Application
SJTU Deep Learning Lecture

Practices: Data Augmentation
- Change the pixels without changing the label
- Train on transformed data
- Very widely used
- Horizontal flips
- Random crops/scales
- Color jitter
- ...
- Especially useful for small datasets

Practices: Data Augmentation: Horizontal flips

Practices: Data Augmentation: Random crops/scales
Take ResNet as an example.
Training: sample random crops/scales
1. Pick a random L in the range [256, 480]
2. Resize the training image so that its short side = L
3. Sample random 224 x 224 patches
Testing: average a fixed set of crops
1. Resize the image at 5 scales: {224, 256, 384, 480, 640}
2. For each size, use ten 224 x 224 crops: 4 corners + center, plus their flips

Practices: Data Augmentation: Color jitter
- Randomly jitter contrast
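The augmentation recipe on the Data Augmentation slides can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions, not the lecture's own code: the helper names, the nearest-neighbour resize, the 50% flip probability, and the contrast `strength` of 0.4 are all hypothetical choices. Only the scale range [256, 480], the 224 x 224 crop size, and the "4 corners + center, plus flips" test-time scheme come from the slides.

```python
import numpy as np

def resize_short_side(img, short):
    """Nearest-neighbour resize so that the shorter side equals `short` (assumed method)."""
    h, w = img.shape[:2]
    scale = short / min(h, w)
    new_h = max(short, round(h * scale))
    new_w = max(short, round(w * scale))
    rows = np.minimum((np.arange(new_h) / scale).astype(int), h - 1)
    cols = np.minimum((np.arange(new_w) / scale).astype(int), w - 1)
    return img[rows][:, cols]

def random_crop(img, size, rng):
    """Sample one random size x size patch."""
    h, w = img.shape[:2]
    top = int(rng.integers(0, h - size + 1))
    left = int(rng.integers(0, w - size + 1))
    return img[top:top + size, left:left + size]

def horizontal_flip(img):
    """Mirror the image left to right; the label is unchanged."""
    return img[:, ::-1]

def jitter_contrast(img, rng, strength=0.4):
    """Scale deviations from the mean by a random factor (strength is an assumption)."""
    factor = 1.0 + rng.uniform(-strength, strength)
    mean = img.mean()
    out = (img.astype(np.float32) - mean) * factor + mean
    return np.clip(out, 0, 255).astype(np.uint8)

def train_augment(img, rng, crop=224, l_range=(256, 480)):
    """Training: random scale L, random crop, random flip, contrast jitter."""
    short = int(rng.integers(l_range[0], l_range[1] + 1))
    img = random_crop(resize_short_side(img, short), crop, rng)
    if rng.random() < 0.5:  # assumed flip probability
        img = horizontal_flip(img)
    return jitter_contrast(img, rng)

def ten_crop(img, size=224):
    """Testing: 4 corners + center, plus the flip of each (10 crops per scale)."""
    h, w = img.shape[:2]
    corners = [(0, 0), (0, w - size), (h - size, 0), (h - size, w - size),
               ((h - size) // 2, (w - size) // 2)]
    crops = [img[t:t + size, l:l + size] for t, l in corners]
    return crops + [horizontal_flip(c) for c in crops]

rng = np.random.default_rng(0)
image = rng.integers(0, 256, size=(300, 400, 3), dtype=np.uint8)  # stand-in image
patch = train_augment(image, rng)
test_crops = ten_crop(resize_short_side(image, 256))
```

At test time the model's predictions over all ten crops (and over each of the five scales) would be averaged; that step is omitted here because it depends on the model.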

Practices: Transfer Learning
[Four slides with this title; no other text was recovered.]

Computer Vision Tasks
- Classification
- Classification + Localization
- Detection
- Segmentation

Semantic Segmentation
- Label each pixel in the image with a category label
- Don't differentiate instances, only care about pixels
[Figure: example images with per-pixel labels (Cow, Grass, Sky, Trees; Grass, Cat, Sky, Trees). This image is CC0 public domain.]

Semantic Segmentation: Sliding Window
Full image, then extract patch, then classify center pixel with CNN (example patch labels: Cow, Cow, Grass)
Problem: very inefficient! Shared features are not reused between overlapping patches.

Farabet et al, "Learning Hierarchical Features for Scene Labeling," TPAMI 2013
Pinheiro and Collobert, "Recurrent Convolutional Neural Networks for Scene Labeling", ICML 2014
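To make the inefficiency of the sliding-window approach concrete, here is a minimal sketch that labels every pixel by classifying the patch centred on it. `classify_patch` is a hypothetical stand-in for the CNN (it simply thresholds mean brightness), and the 5 x 5 patch size and edge padding are assumptions made for the example.

```python
import numpy as np

def classify_patch(patch):
    # Hypothetical stand-in for a CNN: label 1 if the patch is bright on average, else 0.
    return int(patch.mean() > 0.5)

def sliding_window_segment(image, patch=5):
    """Label each pixel by classifying the patch centred on it."""
    h, w = image.shape
    r = patch // 2
    padded = np.pad(image, r, mode="edge")  # so border pixels also get a full patch
    labels = np.empty((h, w), dtype=np.int64)
    for y in range(h):
        for x in range(w):
            labels[y, x] = classify_patch(padded[y:y + patch, x:x + patch])
    return labels

# Toy image: dark left half, bright right half.
image = np.zeros((8, 8))
image[:, 4:] = 1.0
labels = sliding_window_segment(image)

# Every pixel triggers a separate pass over a 5 x 5 patch, so neighbouring
# patches re-read (and a real CNN would re-compute features for) the same pixels.
pixels_read = image.size * 5 * 5
```

Here 64 pixels lead to 1600 pixel reads, because each input pixel is revisited by up to 25 overlapping patches; a real CNN would repeat its feature computation the same way.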

Semantic Segmentation: Fully Convolutional
Design a network as a bunch of convolutional layers to make predictions for all pixels at once!
Input: 3 x H x W
Conv, Conv, Conv, Conv (convolutions: D x H x W)
Scores: C x H x W
argmax
Predictions: H x W
Problem: convolutions at the original image resolution will be very expensive ...
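Two parts of this slide are easy to check numerically: the argmax that turns C x H x W scores into an H x W label map, and the cost of convolving at full resolution. The sketch below uses random scores in place of a real network, and the layer sizes (64 channels, 3 x 3 kernels, a 224 x 224 input) are assumptions chosen only for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
C, H, W = 4, 6, 8                        # assumed number of classes and image size
scores = rng.normal(size=(C, H, W))      # stand-in for the network's per-pixel class scores
predictions = scores.argmax(axis=0)      # H x W map holding the best class per pixel

def conv_macs(h, w, c_in, c_out, k=3):
    """Multiply-accumulates for one k x k convolution that keeps the spatial size."""
    return h * w * c_in * c_out * k * k

full_res = conv_macs(224, 224, 64, 64)              # at the original resolution
low_res = conv_macs(224 // 8, 224 // 8, 64, 64)     # same layer at H/8 x W/8
ratio = full_res // low_res
```

With these assumed sizes the same layer costs 64 times more at full resolution than at H/8 x W/8, which is the motivation for downsampling inside the network.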

Semantic Segmentation Idea: Fully Convolutional
Design the network as a bunch of convolutional layers, with downsampling and upsampling inside the network!
Input: 3 x H x W
High-res: D1 x H/2 x W/2
Med-res: D2 x H/4 x W/4
Low-res: D3 x H/8 x W/8
Med-res: D2 x H/4 x W/4
High-res: D1 x H/2 x W/2
Predictions: H x W
Downsampling: pooling, strided convolution
Upsampling: unpooling or strided transpose convolution

Long, Shelhamer, and Darrell, "Fully Convolutional Networks for Semantic Segmentation", CVPR 2015
Noh et al, "Learning Deconvolution Network for Semantic Segmentation", ICCV 2015
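The downsampling and upsampling operations named on this slide can be sketched in plain NumPy on a single-channel feature map. This is an illustrative sketch only: the function names are hypothetical, nearest-neighbour unpooling is just one of several unpooling variants, and a real network would learn the transpose-convolution kernel rather than use a fixed one.

```python
import numpy as np

def max_pool_2x2(x):
    """Downsampling: 2 x 2 max pooling with stride 2 (H and W must be even)."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))

def nearest_unpool_2x2(x):
    """Upsampling: copy each value into a 2 x 2 block (nearest-neighbour unpooling)."""
    return x.repeat(2, axis=0).repeat(2, axis=1)

def transpose_conv2d(x, kernel, stride=2):
    """Upsampling: each input value pastes a scaled copy of the kernel into the
    output, `stride` pixels apart; overlapping copies are summed."""
    h, w = x.shape
    k = kernel.shape[0]
    out = np.zeros(((h - 1) * stride + k, (w - 1) * stride + k))
    for i in range(h):
        for j in range(w):
            out[i * stride:i * stride + k, j * stride:j * stride + k] += x[i, j] * kernel
    return out

x = np.arange(16, dtype=float).reshape(4, 4)
pooled = max_pool_2x2(x)                                      # 2 x 2
unpooled = nearest_unpool_2x2(pooled)                         # back to 4 x 4
tconv_2x2 = transpose_conv2d(pooled, np.ones((2, 2)))         # 4 x 4, no overlap
tconv_3x3 = transpose_conv2d(pooled, np.ones((3, 3)) / 9.0)   # 5 x 5, overlaps are summed
```

With a 2 x 2 kernel of ones and stride 2, the transpose convolution reproduces nearest-neighbour unpooling exactly, which shows that unpooling is a special case of what a learned transpose convolution can express.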

Classification + Localization
Treat localization as a regression problem!
The network (often pretrained on ImageNet, i.e. transfer learning) produces a vector of size 4096, which feeds two heads:
- Fully connected, 4096 to 1000: class scores (Cat: 0.9, Dog: 0.05, Car: 0.01, ...), trained with a softmax loss against the correct label (Cat)
- Fully connected, 4096 to 4: box coordinates (x, y, w, h), trained with an L2 loss against the correct box (x', y', w', h')
Multitask loss: softmax loss + L2 loss
[This image is CC0 public domain]
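The two-headed design and its multitask loss can be written out as a small NumPy sketch. The random weights and features stand in for a trained network, the label index 0 and the target box are made up for the example, and `box_weight` is a hypothetical balancing factor that the slide does not specify (the slide simply adds the two losses).

```python
import numpy as np

def softmax_cross_entropy(scores, label):
    """Softmax loss for one example, computed in a numerically stable way."""
    shifted = scores - scores.max()
    log_probs = shifted - np.log(np.exp(shifted).sum())
    return float(-log_probs[label])

def l2_box_loss(pred_box, true_box):
    """Squared L2 distance between predicted and correct (x, y, w, h)."""
    diff = np.asarray(pred_box, dtype=float) - np.asarray(true_box, dtype=float)
    return float(np.sum(diff ** 2))

def multitask_loss(scores, label, pred_box, true_box, box_weight=1.0):
    """Sum of the classification and localization losses (box_weight is an assumption)."""
    return softmax_cross_entropy(scores, label) + box_weight * l2_box_loss(pred_box, true_box)

rng = np.random.default_rng(0)
features = rng.normal(size=4096)                     # stand-in for the 4096-d vector
W_cls = rng.normal(scale=0.01, size=(1000, 4096))    # fully connected: 4096 to 1000
W_box = rng.normal(scale=0.01, size=(4, 4096))       # fully connected: 4096 to 4

scores = W_cls @ features        # class scores
pred_box = W_box @ features      # box coordinates (x, y, w, h)
loss = multitask_loss(scores, label=0, pred_box=pred_box, true_box=[0.2, 0.3, 0.5, 0.4])
```

Both heads share the same feature vector, so gradients from the classification and regression losses both flow back into the shared backbone.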

Object Detection as Regression?
Each image needs a different number of outputs!
- CAT: (x, y, w, h): 4 numbers
- DOG: (x, y, w, h), DOG: (x, y, w, h), CAT: (x, y, w, h): 12 numbers
- DUCK: (x, y, w, h), DUCK: (x, y, w, h), ...: many numbers!

Object Detection as Classification: Sliding Window
Apply a CNN to many different crops of the image; the CNN classifies each crop as object or background.
- First crop: Dog? NO. Cat? NO. Background? YES.
- Second crop: Dog? YES. Cat? NO. Background? NO.
- Third crop: Dog? NO. Cat? YES. Background? NO.
Problem: need to apply the CNN to a huge number of locations, scales, and aspect ratios; very computationally expensive!
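A quick count shows why exhaustive sliding-window detection is expensive. The function below is a rough sketch: the way window width and height are derived from a size and an aspect ratio, and the example image size, window sizes, ratios, and stride, are all assumptions made for illustration.

```python
def count_windows(img_h, img_w, sizes, aspect_ratios, stride):
    """Number of crops a sliding-window detector would have to classify."""
    total = 0
    for s in sizes:
        for ar in aspect_ratios:
            # Assumed parameterisation: area stays about s*s, width/height = ar.
            win_w = int(round(s * ar ** 0.5))
            win_h = int(round(s / ar ** 0.5))
            if win_h > img_h or win_w > img_w:
                continue  # window does not fit in the image
            rows = (img_h - win_h) // stride + 1
            cols = (img_w - win_w) // stride + 1
            total += rows * cols
    return total

# Assumed settings: a 480 x 640 image, 5 window sizes, 3 aspect ratios, stride 8.
n_crops = count_windows(480, 640,
                        sizes=[64, 128, 192, 256, 320],
                        aspect_ratios=[0.5, 1.0, 2.0],
                        stride=8)
```

Each counted crop would require one CNN forward pass, and finer strides or more scales increase the total quickly, which is the cost that region proposals aim to avoid.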

Region Proposals / Selective Search
- Find "blobby" image regions that are likely to contain objects
- Relatively fast to run; e.g. Selective Search gives 2000 region proposals in a few seconds on CPU

[1] Alexe et al, "Measuring the objectness of image windows", TPAMI 2012
[2] Uijlings et al, "Selective Search for Object Recognition", IJCV 2013
[3] Cheng et al, "BING: Binarized normed gradients for objectness estimation at 300fps", CVPR 2014
[4] Zitnick and Dollar, "Edge boxes: Locating object proposals from edges", ECCV 2014

Region Proposals / Selective Search
- Efficient Graph-Based Image Segmentation
- Hierarchical Grouping Algorithm

R-CNN
Girshick et al, "Rich feature hierarchies for accurate object detection and semantic segmentation", CVPR 2014.
Figure copyright Ross Girshick, 2015; source.
