
arXiv:2303.06930v1 [cs.CV] 13 Mar 2023

Twin Contrastive Learning with Noisy Labels

Zhizhong Huang1    Hongming Shan2,3*    Junping Zhang1

1 Shanghai Key Lab of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai 200433, China
2 Institute of Science and Technology for Brain-inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, Shanghai 200433, China
3 Shanghai Center for Brain Science and Brain-inspired Technology, Shanghai 200031, China
{zzhuang19, jpzhang, hmshan}@

*Corresponding author

Abstract

Learning from noisy data is a challenging task that significantly degenerates the model performance. In this paper, we present TCL, a novel twin contrastive learning model to learn robust representations and handle noisy labels for classification. Specifically, we construct a Gaussian mixture model (GMM) over the representations by injecting the supervised model predictions into GMM to link label-free latent variables in GMM with label-noisy annotations. Then, TCL detects the examples with wrong labels as the out-of-distribution examples by another two-component GMM, taking into account the data distribution. We further propose a cross-supervision with an entropy regularization loss that bootstraps the true targets from model predictions to handle the noisy labels. As a result, TCL can learn discriminative representations aligned with estimated labels through mixup and contrastive learning. Extensive experimental results on several standard benchmarks and real-world datasets demonstrate the superior performance of TCL. In particular, TCL achieves 7.5% improvements on CIFAR-10 with 90% noisy label, an extremely noisy scenario. The source code is available at /Hzzone/TCL.

1. Introduction

Deep neural networks have shown exciting performance for classification tasks [13]. Their success largely results from large-scale curated datasets with clean human annotations, such as CIFAR-10 [19] and ImageNet [6], in which the annotation process, however, is tedious and cumbersome. In contrast, one can easily obtain datasets with some noisy annotations (from online shopping websites [40], crowd-sourcing [42, 45], or Wikipedia [32]) for training a classification neural network. Unfortunately, the mislabelled data are prone to significantly degrade the performance of deep neural networks. Therefore, there has been considerable interest in training noise-robust classification networks in recent years [20, 21, 25, 29, 31, 48].

To mitigate the influence of noisy labels, most of the methods in the literature propose robust loss functions [37, 47], reduce the weights of noisy labels [35, 39], or correct the noisy labels [20, 29, 31]. In particular, label correction methods have shown great potential for better performance on datasets with a high noise ratio. Typically, they correct the labels by using the combination of noisy labels and model predictions [31], which usually requires an essential iterative sample selection process [1, 20, 21, 29]. For example, Arazo et al. [1] use the small-loss trick to carry out sample selection and correct labels via a weighted combination. In recent years, contrastive learning has shown promising results in handling noisy labels [21, 29]. These methods usually leverage contrastive learning to learn discriminative representations, and then clean the labels [21, 29] or construct the positive pairs by introducing the information of nearest neighbors in the embedding space. However, using the nearest neighbors only considers the label noise within a small neighborhood, which is sub-optimal and cannot handle extreme label noise scenarios, as the neighboring examples may also be mislabeled at the same time.

To address this issue, this paper presents TCL, a novel twin contrastive learning model that explores the label-free unsupervised representations and label-noisy annotations for learning from noisy labels. Specifically, we leverage contrastive learning to learn discriminative image representations in an unsupervised manner and construct a Gaussian mixture model (GMM) over these representations. Unlike an unsupervised GMM, TCL links the label-free GMM and label-noisy annotations by replacing the latent variable of the GMM with the model predictions for updating the parameters of the GMM. Then, benefitting from the learned data distribution, we propose to formulate label noise detection as an out-of-distribution (OOD) problem, utilizing another two-component GMM to model the samples with clean and wrong labels. The merit of the proposed OOD label noise detection is that it takes the full data distribution into account, which is robust to neighborhoods with strong label noise. Furthermore, we propose a bootstrap cross-supervision with an entropy regularization loss to reduce the impact of wrong labels, in which the true labels of the samples with wrong labels are estimated from another data augmentation. Last, to further learn robust representations, we leverage contrastive learning and mixup techniques to inject the structural knowledge of classes into the embedding space, which helps align the representations with the estimated labels.

The contributions are summarized as follows:

• We present TCL, a novel twin contrastive learning model that explores the label-free GMM for unsupervised representations and label-noisy annotations for learning from noisy labels.

• We propose a novel OOD label noise detection method by modeling the data distribution, which excels at handling extremely noisy scenarios.

• We propose an effective cross-supervision, which can bootstrap the true targets with an entropy loss to regularize the model.

• Experimental results on several benchmark datasets and real-world datasets demonstrate that our method outperforms the existing state-of-the-art methods by a significant margin. In particular, we achieve 7.5% improvements in extremely noisy scenarios.

2. Related Work

Contrastive learning. Contrastive learning methods [3, 12, 38] have shown promising results for both representation learning and downstream tasks. The popular loss function is the InfoNCE loss [28], which can pull together the data augmentations from the same example and push away the other negative examples. MoCo [12] uses a memory queue to store consistent representations. SimCLR [3] optimizes InfoNCE within a mini-batch and has found some effective training tricks, e.g., data augmentation. However, as unsupervised learning methods, they mainly focus on inducing transferable representations for downstream tasks instead of training with noisy annotations. Although supervised contrastive learning [17] can improve the representations with human labels, it harms the performance when label noise exists [23].

Learning with noisy labels. Most of the methods in the literature mitigate the label noise by robust loss functions [7, 25, 36, 37, 41, 47], noise transition matrices [8, 30, 35, 39], sample selection [11, 44], and label correction [18, 20-22, 25, 29, 31, 34].

In particular, label correction methods have shown more promising results than other methods. Arazo et al. [1] applied a mixture model to the losses of each sample to distinguish the noisy and clean labels, inspired by the fact that the noisy samples have a higher loss during the early epochs of training. Similarly, DivideMix [20] employs two networks to perform the sample selection for each other and applies a semi-supervised learning technique where the targets are computed from the average predictions of different data augmentations. Due to the success of contrastive learning, many attempts have been made to improve the robustness of classification tasks by combining the advantages of contrastive learning. Zheltonozhskii et al. [48] used contrastive learning to pre-train the classification model. MOIT [29] quantifies the agreement between feature representations and original labels to identify mislabeled samples by utilizing a k-nearest neighbor (k-NN) search. RRL [21] performs label cleaning with two thresholds on the soft labels, which are calculated from the predictions of previous epochs and their nearest neighbors. Sel-CL [23] leverages the nearest neighbors to select confident pairs for supervised contrastive learning [17].

Unlike existing methods [21, 23, 29] that detect the wrong labels within the neighborhood, TCL formulates the wrong labels as out-of-distribution examples by modeling the data distribution of the representations learned by contrastive learning. In addition, we propose a cross-supervision with entropy regularization to better estimate the true labels and handle the noisy labels.

3. The Proposed TCL

Each image $x_i$ in the dataset $\mathcal{D} = \{x_i\}$ is associated with an annotation $y_i \in \{1, 2, \ldots, K\}$. In practice, some examples may be mislabeled. We aim to train a classification network, $p_\theta(y \mid x) = g(x; \theta) \in \mathbb{R}^K$, that is resistant to the noisy labels in the training data and generalizes well on the clean testing data. Fig. 1 illustrates the framework of our proposed TCL.

Overview. In the context of our framework, $f(\cdot)$ and $g(\cdot)$ share the same backbone and have additional individual heads to output representations and class predictions from two random and one mixup data augmentations. Afterward, there are four components in TCL, including (i) modeling the data distribution via a GMM in Sec. 3.1 from the model predictions and representations; (ii) detecting the examples with wrong labels as out-of-distribution samples in Sec. 3.2; (iii) cross-supervision by bootstrapping the true targets in Sec. 3.3; and (iv) learning robust representations through contrastive learning and mixup in Sec. 3.4.

3.1. Modeling Data Distribution

Given the image dataset consisting of $N$ images, we opt to model the distribution of $x$ over its representations $v = f(x)$ via a spherical Gaussian mixture model (GMM).

Figure 1. Illustration of the proposed TCL. The networks $g$ and $f$, with a shared encoder and independent two-layer MLP heads, output the class predictions and representations. Then, TCL models the data distribution via a GMM and detects the examples with wrong labels as out-of-distribution examples. To optimize TCL, these results lead to cross-supervision and robust representation learning.

After introducing discrete latent variables $z \in \{1, 2, \ldots, K\}$ that determine the assignment of observations to mixture components, the unsupervised GMM can be defined as

$$p(v) = \sum_{k=1}^{K} p(v, z = k) = \sum_{k=1}^{K} p(z = k)\, \mathcal{N}(v \mid \mu_k, \sigma_k), \tag{1}$$

where $\mu_k$ is the mean and $\sigma_k$ a scalar deviation. If we assume that the latent variables $z$ are uniformly distributed, that is, $p(z = k) = 1/K$, we can define the posterior probability that assigns $x_i$ to the $k$-th cluster:

$$\gamma_{ik} = p(z_i = k \mid x_i) \propto \mathcal{N}(v_i \mid \mu_k, \sigma_k). \tag{2}$$

In an ideal scenario where all the samples have clean labels $y \in \{1, 2, \ldots, K\}$, the discrete latent variables $z$ would be identical to the annotations $y$, and the parameters $\mu_k$, $\sigma_k$ and latent variables $z$ could be solved through a standard Expectation-Maximization (EM) algorithm [5].

However, in practice, the labels are often noisy and the latent variable $z$, estimated in an unsupervised manner, has nothing to do with the label $y$. Therefore, we are interested in connecting the latent variable $z$, estimated in an unsupervised (i.e., label-free) fashion, with the available label-noisy annotations $y$ for the task of learning from noisy labels.

To link them together, we propose to inject the model predictions $p_\theta(y_i = k \mid x_i)$, learned from the noisy labels, into the latent variables $z$. Specifically, we propose to replace the unsupervised assignment $p(z_i = k \mid x_i)$ with the noisy-supervised assignment $p_\theta(y_i = k \mid x_i)$. As a result, we can connect the latent variable $z$ with the label $y$, and thus use the noisy supervision to guide the update of the parameters of the GMM.

In particular, the update of the GMM parameters becomes

$$\mu_k = \operatorname{norm}\Big(\sum\nolimits_{i} p_\theta(y_i = k \mid x_i)\, v_i\Big), \tag{3}$$

$$\sigma_k = \frac{\sum_{i} p_\theta(y_i = k \mid x_i)\, (v_i - \mu_k)^{\top}(v_i - \mu_k)}{\sum_{i} p_\theta(y_i = k \mid x_i)}, \tag{4}$$

where $\operatorname{norm}(\cdot)$ is $\ell_2$-normalization such that $\|\mu_k\|_2 = 1$.
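To make the update concrete, below is a minimal PyTorch-style sketch of Eqs. (3) and (4), assuming `embeddings` holds the $\ell_2$-normalized representations $v_i$ of the training set and `probs` holds the softmax predictions $p_\theta(y_i = k \mid x_i)$; the function and variable names are ours for illustration, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def update_gmm_params(embeddings: torch.Tensor, probs: torch.Tensor):
    """Noisy-supervised GMM update, Eqs. (3)-(4).

    embeddings: (N, D) l2-normalized representations v_i.
    probs:      (N, K) softmax predictions p_theta(y_i = k | x_i).
    Returns mu of shape (K, D) with unit norm and sigma of shape (K,).
    """
    weights = probs.sum(dim=0)                            # sum_i p_theta(y_i = k | x_i)
    mu = F.normalize(probs.t() @ embeddings, dim=1)       # Eq. (3): norm(sum_i p * v_i)
    # Eq. (4): prediction-weighted mean squared distance to each center.
    sq_dist = (embeddings.unsqueeze(1) - mu.unsqueeze(0)).pow(2).sum(dim=-1)   # (N, K)
    sigma = (probs * sq_dist).sum(dim=0) / weights.clamp_min(1e-8)
    return mu, sigma
```

In practice the $(N, K)$ distance matrix may be computed in chunks to save memory; the sketch favors clarity over efficiency.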

3.2. Out-of-Distribution Label Noise Detection

Previous works [21, 23, 29] typically detect the wrong labels within the neighborhood, that is, using the information from nearest neighbors. This is limited, as the neighboring examples are usually mislabeled at the same time. To address this issue, we propose to formulate label noise detection as detecting out-of-distribution examples.

After building the connection between the latent variables $z$ and the labels $y$, we are able to detect the samples with wrong labels through the posterior probability in Eq. (2). We implement it as a normalized version to take into account the intra-cluster distance, which allows for detecting the samples with likely wrong labels:

$$\gamma_{ik} = \frac{\exp\big(-(v_i - \mu_k)^{\top}(v_i - \mu_k)/2\sigma_k\big)}{\sum_{k'} \exp\big(-(v_i - \mu_{k'})^{\top}(v_i - \mu_{k'})/2\sigma_{k'}\big)}. \tag{5}$$

Since $\ell_2$-normalization has been applied to both the embeddings $v$ and the cluster centers $\mu_k$, we have $(v - \mu_k)^{\top}(v - \mu_k) = 2 - 2\, v^{\top}\mu_k$, so Eq. (5) can be rewritten as

$$\gamma_{ik} = p(z_i = k \mid x_i) = \frac{\exp(v_i^{\top}\mu_k/\sigma_k)}{\sum_{k'} \exp(v_i^{\top}\mu_{k'}/\sigma_{k'})}. \tag{6}$$

Having built the GMM over the distribution of representations, we propose to formulate the conventional noisy label detection problem as an out-of-distribution sample detection problem. Our idea is that the samples with clean labels should have the same cluster indices after linking the cluster index and class label. Specifically, given one particular class $y = k$, the samples within this class can be divided into two types: in-distribution samples with clean labels, and out-of-distribution samples with wrong labels. Therefore, we define the following conditional probability to measure the probability that one sample has a clean label:

$$\gamma_{i y_i} = p(y_i = z_i \mid x_i) = \frac{\exp(v_i^{\top}\mu_{y_i}/\sigma_{y_i})}{\sum_{k} \exp(v_i^{\top}\mu_{k}/\sigma_{k})}. \tag{7}$$

Although Eqs. (6) and (7) share similar calculations, they have different meanings: Eq. (6) calculates the probability of one example belonging to the $k$-th cluster, while Eq. (7) calculates the probability of one example having a clean label, that is, $y_i = z_i$. Therefore, the probability of one example having a wrong label can be written as $p(y_i \neq z_i \mid x_i) = 1 - p(y_i = z_i \mid x_i) = 1 - \gamma_{i y_i}$.

Furthermore, instead of setting a human-tuned threshold on $\gamma_{i y_i}$, we opt to employ another two-component GMM, following [1, 20], to automatically estimate the clean probability for each example. Similar to the definition of the GMM in Eq. (1), this two-component GMM is defined as

$$p(\gamma_{i y_i}) = \sum_{c=0}^{1} p(\gamma_{i y_i}, c) = \sum_{c=0}^{1} p(c)\, p(\gamma_{i y_i} \mid c), \tag{8}$$

where $c$ is the newly introduced latent variable: $c = 1$ indicates the cluster of clean labels with the higher mean value, and vice versa for $c = 0$. After modeling this GMM over the probability of one example having a clean label, $\gamma_{i y_i}$, we are able to infer the posterior probability of one example having a clean label through the two-component GMM.
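A minimal way to realize Eq. (8) is to fit a one-dimensional, two-component Gaussian mixture to the clean-label probabilities and take the posterior of the higher-mean component as $w_i$; the sketch below uses scikit-learn's `GaussianMixture`, which is our choice of tooling rather than anything specified by the paper.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def estimate_clean_posterior(clean_prob: np.ndarray) -> np.ndarray:
    """Eq. (8): two-component GMM over gamma_{i y_i}; returns w_i = p(c=1 | gamma_{i y_i})."""
    gmm = GaussianMixture(n_components=2, max_iter=100, reg_covar=1e-4)
    gmm.fit(clean_prob.reshape(-1, 1))
    posterior = gmm.predict_proba(clean_prob.reshape(-1, 1))    # (N, 2)
    clean_component = int(np.argmax(gmm.means_))                # c = 1: the higher-mean component
    return posterior[:, clean_component]
```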

3.3. Cross-Supervision with Entropy Regularization

After the label noise detection, the next important step is to estimate the true targets by correcting the wrong labels to reduce their impact, called label correction. Previous works usually perform label correction using temporal ensembling [25] or from the model predictions [1, 20] before mixup augmentation, without back-propagation.

TCL leverages a similar idea to bootstrap the targets through the convex combination of the noisy labels and the predictions from the model itself:

$$t_i^{(1)} = w_i\, y_i + (1 - w_i)\, g(x_i^{(1)}), \qquad t_i^{(2)} = w_i\, y_i + (1 - w_i)\, g(x_i^{(2)}), \tag{9}$$

where $g(x_i^{(1)})$ and $g(x_i^{(2)})$ are the predictions of the two augmentations, $y_i$ is the noisy one-hot label, and $w_i \in [0, 1]$ represents the posterior probability $p(c = 1 \mid \gamma_{i y_i})$ from the two-component GMM defined in Eq. (8). When computing Eq. (9), we stop the gradient from $g$ to avoid the model predictions collapsing into a constant, inspired by [4, 10].

Guided by the corrected labels $t_i$, we swap the two augmentations to compute the classification loss twice, leading to the bootstrap cross-supervision:

$$\mathcal{L}_{\mathrm{cross}} = \ell\big(g(x^{(1)}), t^{(2)}\big) + \ell\big(g(x^{(2)}), t^{(1)}\big), \tag{10}$$

where $\ell$ is the cross-entropy loss. This loss makes the predictions of the model from the two data augmentations close to the corrected labels bootstrapped from each other. In a sense, if $w_i = 0$, the model is encouraged to make consistent class predictions between different data augmentations; otherwise, if $w_i = 1$, it is supervised by the clean labels.
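The two equations above can be sketched as follows, assuming soft targets and a cross-entropy that accepts them; the stop-gradient on the bootstrapped targets follows the description around Eq. (9), and all names are illustrative rather than the authors' code.

```python
import torch
import torch.nn.functional as F

def soft_cross_entropy(logits, targets):
    """Cross-entropy against soft targets."""
    return -(targets * F.log_softmax(logits, dim=1)).sum(dim=1).mean()

def bootstrap_cross_supervision(logits1, logits2, noisy_onehot, w):
    """Eqs. (9)-(10): convex-combination targets, swapped between the two views.

    logits1, logits2: (B, K) classifier outputs for the two augmentations.
    noisy_onehot:     (B, K) one-hot noisy labels y_i.
    w:                (B, 1) clean-label posterior p(c=1 | gamma_{i y_i}) from Eq. (8).
    """
    with torch.no_grad():   # stop the gradient through the bootstrapped targets
        t1 = w * noisy_onehot + (1.0 - w) * torch.softmax(logits1, dim=1)
        t2 = w * noisy_onehot + (1.0 - w) * torch.softmax(logits2, dim=1)
    # Swap: each view is supervised by the target built from the other view.
    return soft_cross_entropy(logits1, t2) + soft_cross_entropy(logits2, t1)
```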

In addition, we leverage an additional entropy regularization loss on the predictions within a mini-batch $\mathcal{B}$:

$$\mathcal{L}_{\mathrm{reg}} = -H\Big(\frac{1}{|\mathcal{B}|}\sum_{x \in \mathcal{B}} g(x)\Big) + \frac{1}{|\mathcal{B}|}\sum_{x \in \mathcal{B}} H\big(g(x)\big), \tag{11}$$

where $H(\cdot)$ is the entropy of the predictions [33]. The first term avoids the predictions collapsing into a single class by maximizing the entropy of the average prediction. The second term is the minimum-entropy regularization that encourages the model to make high-confidence predictions, which was previously studied in the semi-supervised learning literature [9].

Although both use the model predictions, we would emphasize that the cross-supervision in TCL differs from [1, 20, 25] in three aspects: (i) both $x^{(1)}$ and $x^{(2)}$ are involved in back-propagation; (ii) the strong augmentations [3] used to estimate the true targets can prevent overfitting to the estimated targets; and (iii) TCL employs two entropy regularization terms to avoid the model collapsing to one class. The final classification loss is given as follows:

$$\mathcal{L}_{\mathrm{cls}} = \mathcal{L}_{\mathrm{cross}} + \mathcal{L}_{\mathrm{reg}}. \tag{12}$$
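Under our reading of Eq. (11), the regularizer is computed per mini-batch from the softmax predictions; the sketch below handles one view, and in TCL it would be evaluated for each augmentation and combined with the cross-supervision term as in Eq. (12). This is an assumption on our part, not a verbatim reproduction.

```python
import torch

def entropy_regularization(logits: torch.Tensor) -> torch.Tensor:
    """Eq. (11) for one batch of class logits."""
    probs = torch.softmax(logits, dim=1)
    mean_probs = probs.mean(dim=0)
    # -H(average prediction): minimizing this maximizes the entropy of the mean,
    # which prevents the classifier from collapsing to a single class.
    neg_mean_entropy = (mean_probs * torch.log(mean_probs + 1e-8)).sum()
    # +H(individual predictions): encourages confident (low-entropy) outputs.
    per_sample_entropy = -(probs * torch.log(probs + 1e-8)).sum(dim=1).mean()
    return neg_mean_entropy + per_sample_entropy
```

With the assumed helpers above, Eq. (12) would then read `bootstrap_cross_supervision(...) + entropy_regularization(logits1) + entropy_regularization(logits2)`.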

3.4. Learning Robust Representations

To model the data distribution in a way that is robust to noisy labels, we leverage contrastive learning to learn the representations of images. Specifically, contrastive learning performs instance-wise discrimination [38] using the InfoNCE loss [28] to enforce the model to output similar embeddings for images with semantic-preserving perturbations. Formally, the contrastive loss is defined as follows:

$$\mathcal{L}_{\mathrm{ctr}} = -\log \frac{\exp\big(f(x^{(1)})^{\top} f(x^{(2)})/\tau\big)}{\sum_{x' \neq x^{(1)}} \exp\big(f(x^{(1)})^{\top} f(x')/\tau\big)}, \tag{13}$$

where $\tau$ is the temperature and $x'$ ranges over the augmented examples in the mini-batch $\mathcal{B}$ except $x^{(1)}$; $x^{(1)}$ and $x^{(2)}$ are two augmentations of $x$. Intuitively, the InfoNCE loss aims to pull together the positive pair $(x^{(1)}, x^{(2)})$ from two different augmentations of the same instance, and push them away from the negative examples of other instances. Consequently, it can encourage discriminative representations in a purely unsupervised, or label-free, manner.
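For reference, the InfoNCE loss of Eq. (13) can be computed over a mini-batch in the usual SimCLR fashion; the sketch below assumes the projections are already $\ell_2$-normalized and treats every other augmented sample in the batch as a negative. The temperature value and batching are ours for illustration.

```python
import torch
import torch.nn.functional as F

def info_nce_loss(z1: torch.Tensor, z2: torch.Tensor, tau: float) -> torch.Tensor:
    """Eq. (13) over a mini-batch: z1, z2 are (B, D) l2-normalized projections
    f(x^(1)) and f(x^(2)); the positive of each anchor is its other augmentation."""
    b = z1.size(0)
    z = torch.cat([z1, z2], dim=0)                     # (2B, D)
    sim = z @ z.t() / tau                              # scaled cosine similarities
    sim.fill_diagonal_(float('-inf'))                  # an anchor is never its own candidate
    # Positive index: sample i pairs with i + B, and i + B pairs with i.
    pos_index = torch.cat([torch.arange(b) + b, torch.arange(b)]).to(z.device)
    return F.cross_entropy(sim, pos_index)             # softmax over the 2B - 1 candidates
```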

Although beneficial for modeling latent representations, contrastive learning cannot introduce compact classes without using the true labels. Since the label $y$ is noisy, we leverage mixup [46] to improve within-class compactness, which has been shown to be effective against label noise in the literature [1, 20]. Specifically, a mixup training pair $(x'_i, t'_i)$ is linearly interpolated between $(x_i, t_i)$ and $(x_j, t_j)$ under a control coefficient $\lambda \sim \mathrm{Beta}(\alpha, \alpha)$:

$$x'_i = \lambda x_i + (1 - \lambda)\, x_j, \qquad t'_i = \lambda t_i + (1 - \lambda)\, t_j, \tag{14}$$

where $x_j$ is randomly selected within a mini-batch, and $t_i = (t_i^{(1)} + t_i^{(2)})/2$ is the average of the estimated true labels of the two data augmentations. Intuitively, we can inject the structural knowledge of classes into the embedding space learned by contrastive learning. This loss can be written as

$$\mathcal{L}_{\mathrm{align}} = \ell\big(g(x'_i), t'_i\big) + \ell\big(p(z \mid x'_i), t'_i\big), \tag{15}$$

where the second term can align the representations with the estimated labels. In a sense, $\mathcal{L}_{\mathrm{align}}$ regularizes the classification network $g$ and encourages $f$ to learn compact and well-separated representations. Furthermore, we would point out two differences between TCL and [21], although both use mixup to boost the representations. First, [21] does not explicitly model the data distribution $p(z \mid x)$ like TCL. Second, TCL leverages the full training dataset via the corrected labels instead of a subset of clean examples as in [21], which leads to stronger robustness of TCL over [21] at extremely high label noise ratios.
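The mixup interpolation of Eq. (14) can be sketched as below; the partner index is drawn by permuting the mini-batch, and the images and soft targets are mixed with the same coefficient. The mixed batch would then be scored against the mixed targets by the two terms of Eq. (15). Names and shapes are illustrative assumptions.

```python
import torch

def mixup_pair(x: torch.Tensor, targets: torch.Tensor, alpha: float):
    """Eq. (14): linear interpolation of images and soft targets within a mini-batch.

    x:       (B, C, H, W) images.
    targets: (B, K) soft targets, e.g. t_i = (t_i^(1) + t_i^(2)) / 2.
    """
    lam = torch.distributions.Beta(alpha, alpha).sample().item()   # lambda ~ Beta(alpha, alpha)
    perm = torch.randperm(x.size(0), device=x.device)              # random partner j for each i
    mixed_x = lam * x + (1.0 - lam) * x[perm]
    mixed_t = lam * targets + (1.0 - lam) * targets[perm]
    return mixed_x, mixed_t
```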

3.5. Training and inference

The overall training objective is to minimize the sum of all losses:

$$\mathcal{L} = \mathcal{L}_{\mathrm{cls}} + \mathcal{L}_{\mathrm{ctr}} + \mathcal{L}_{\mathrm{align}}. \tag{16}$$

We find that a simple summation of all losses works well for all datasets and noise levels, which indicates the strong generalization of the proposed method. During inference, the data augmentations are disabled and the class predictions are obtained by $\arg\max_k p_\theta(k \mid x)$.

The training algorithm of the proposed method is shown in Alg. 1. In a sense, the architecture of our method leads to an EM-like algorithm: (1) the E-step updates $\{(\mu_k, \sigma_k)\}$ for TCL, and $\{w_i\}$ for each sample in $\mathcal{D}$ to form the true targets with the predictions from another data augmentation, and (2) the M-step optimizes the model parameters by Eq. (16) to better fit those estimated targets. Therefore, the convergence of TCL can be theoretically guaranteed, following the standard EM algorithm.

Algorithm 1: Training Algorithm
Input: Dataset $\mathcal{D} = \{(x_i, y_i)\}$; functions $\{f, g\}$. Output: Classification network $g$.
repeat
    E-step: update $\{(\mu_k, \sigma_k)\}$ for TCL, and $\{w_i\}$ for each sample in $\mathcal{D}$
    M-step: repeat
        Randomly sample a mini-batch $\mathcal{B}$ from $\mathcal{D}$
        for each $x_i$ in $\mathcal{B}$ do
            Randomly sample two augmentations and a mixup one: $\{x_i^{(1)}, x_i^{(2)}, x_i'\}$
            $\mathcal{L} \leftarrow$ Eq. (16)
        end
        Update $f$ and $g$ with the SGD optimizer.
    until an epoch finishes;
until reaching max epochs;

4. Experiments

In this section, we conduct experiments on multiple benchmark datasets with simulated and real-world label noise. We strictly follow the experimental settings in previous literature [20, 21, 25, 29] for fair comparisons.

4.1. Experiments on simulated datasets

Datasets. Following [20, 21, 25, 29], we validate our method on CIFAR-10/100 [19], which contain 50K and 10K images of size 32×32 for training and testing, respectively. We leave 5K images from the training set as the validation set for hyperparameter tuning, and then train the model on the full training set for fair comparisons. Two types of label noise are simulated: symmetric and asymmetric label noise. Symmetric noise randomly assigns the labels of the training set to random labels with predefined percentages, a.k.a. the noise ratio, which includes 20%, 50%, 80%, and 90% on the two datasets in this paper. Asymmetric noise takes into account the class semantic information, and the labels are only changed to similar classes (e.g., truck → automobile). Here, only experiments on the CIFAR-10 dataset with a 40% noise ratio for asymmetric noise are conducted; otherwise, the classes with above 50% label noise cannot be distinguished.
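For illustration, symmetric noise of a given ratio could be injected into a clean label vector as below; note that conventions differ on whether a corrupted sample may keep its original label, and this hypothetical sketch allows it.

```python
import numpy as np

def inject_symmetric_noise(labels: np.ndarray, noise_ratio: float,
                           num_classes: int, seed: int = 0) -> np.ndarray:
    """Replace a `noise_ratio` fraction of labels with uniformly random classes."""
    rng = np.random.default_rng(seed)
    noisy = labels.copy()
    n_corrupt = int(round(noise_ratio * len(labels)))
    idx = rng.choice(len(labels), size=n_corrupt, replace=False)
    noisy[idx] = rng.integers(0, num_classes, size=n_corrupt)
    return noisy
```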

Training details. Same as previous works [20, 21, 25, 29], we use a PreAct ResNet-18 [14] as the encoder. We adopt the SGD optimizer to train our model with a momentum of 0.9, a weight decay of 0.001, and a batch size of 256 for 200 epochs. The learning rate is lin
