GraphConvolutionalRecurrentNeuralNetworkData-DrivenTrafficForecasting_第1页
GraphConvolutionalRecurrentNeuralNetworkData-DrivenTrafficForecasting_第2页
GraphConvolutionalRecurrentNeuralNetworkData-DrivenTrafficForecasting_第3页
GraphConvolutionalRecurrentNeuralNetworkData-DrivenTrafficForecasting_第4页
GraphConvolutionalRecurrentNeuralNetworkData-DrivenTrafficForecasting_第5页
已阅读5页,还剩7页未读 继续免费阅读

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

arXiv:1707.01926v1cs.LG6Jul20171GraphConvolutionalRecurrentNeuralNetwork:Data-DrivenTracForecastingYaguangLiRoseYuCyrusShahabiYanLiuDepartmentofComputerScience,UniversityofSouthernCaliforniayaguang,qiyu,shahabi,AbstractSpatiotemporalforecastinghassignicantimplicationsinsustainability,transportationandhealth-caredomain.Tracforecastingisonecanonicalexampleofsuchlearningtask.Thistaskischallengingdueto(1)non-lineartemporaldynamicswithchangingroadconditions,(2)complexspatialdependenciesonroadnetworkstopologyand(3)inherentdicultyoflong-termtimeseriesforecasting.Toaddressthesechallenges,weproposeGraphConvolutionalRecurrentNeuralNetworktoincorporatebothspatialandtemporaldependencyintracow.Wefurtherintegratetheencoder-decoderframeworkandscheduledsamplingtoimprovelong-termforecasting.Whenevaluatedonreal-worldroadnetworktracdata,ourapproachcanaccuratelycapturespatiotemporalcorrelationsandconsistentlyoutperformsstate-of-the-artbaselinesby12%-15%.IntroductionSpatiotemporalforecastingisacrucialtaskforalearningsystemthatoperatesinadynamicenvironment.Accuratespatiotemporalforecastinghasawiderangeofapplicationsrangingfromvideocompressionandunderstanding,astoenergyandsmartgridmanagement,economicsandnance,toenvironmentalandhealthcare.Inthispaper,westudyoneexampleofspatiotemporalforecastingtask:tracforecasting,thecorecomponentoftheintelligenttransportationsystems.Webelieveourapproachisnotlimitedtotransportation,andisreadilyapplicabletootherdomainsaswell.Thegoaloftracforecastingistopredictthefuturespeedsofasensornetworkusingprevioustracspeedsaswellastheunderlyingroadnetworksstructure.Thistaskischallengingmainlyduetothecomplexspatialandtemporaldependencies.Ononehand,tractimeseriesdemonstratestrongtemporaldynamic.Recurringincidentssuchasrushhoursoraccidentscancausenon-stationarybehaviorintracspeeds,leadingtodicultyinlong-termforecasting.Ontheotherhand,multivariatetimeseriesfromasensornetworkcontaincomplexspatialcorrelations.Itisoftenthecasethatsuchspatialcorrelationarehighlylocalized.Figure1showstheweightslearnedfromtheauto-regressivemodelusingweightedaverageforasinglesensorprediction.Thelearnedweightshighlyconcentrateonitscloseneighbors.Anotherimportantcharacteristicsoftracisthe“conservationofow”,whichmeansthenumberofvehiclesinaroadnetworkstaysrelativelythesameduringashorttimeperiod.Intheliterature,tracforecastinghasbeenstudiedfordecades,fallingintotwomaincate-gories:data-drivenapproachandknowledge-drivenapproach.Intransportationandoperationalresearch,knowledge-drivenmethodsusuallyapplyqueuingtheoryandsimulateuserbehaviorsintrac5.Intimeseriescommunity,data-drivenmethodssuchasautoregressiveintegratedmovingaverage(ARIMA)modelandKalmanlteringremainpopular15,14.However,simpletimese-riesmodelsusuallyrelyonthestationarityassumptionofthetimeseries,andhavelimitedcapacity1torepresenthighlynonlineardynamics.Mostrecently,deeplearningmodelsfortracforecastingforecastinghavebeendevelopedin16,28.In10,theauthorsdevelopdeepauto-regressivemodelsformoregeneralspatiotemporalforecastingtask,e.g.,inventoryforecasting.However,thesedeeplearningmodelsonlyapplytounivariatetimeseriesorfocusonshort-termforecasting.Deepneuralnetworkmodelsforthedomainofspatiotemporalforecastingstaylargelyelusive.Ourworkservesasanimportantsteptointegratemanyimportantdevelopmentsindeeprecurrentneuralnetworksintotimeseriesanalysis,particularlyforspa-tiotemporalforecasting.Weleveragerecentadvancesingraphconvolution7,21andsequencemodeling6,3todesigntheGraphConvolutionalRecurrentNeuralNetwork(GCRNN).GCRNNmodelsboththespatialandthetemporaldependenceinthetracnetwork.Specically,weresorttorecurrentneuralnetworktocapturethenon-lineardynamics,andmodifytheGatedRecurrentUnittoincorporatetheunderlyingsensornet-workstructure.Thisisdonethroughtransformationofinputsequencethroughagraphconvolutionalkernel.Figure1:LocalspatialdependencyforToaddresstheerrorpropagationissueinlong-termfore-singlesensorlearnedfromweightedav-castingtask,wefurtherintegratetheencoder-decodererage.Largerweightsindicatehigherframeworkandscheduledsamplingtechnique3.Whencorrelation.evaluatedonthereal-worldtracdata,GCRNNconsistentlyoutperformsstate-of-the-arttracforecastingbaselinesbyalargemargin.Ourcontributionscanbesummarizedasfollows:Weinvestigatedtracforecasting,animportantmultivariatespatiotemporalforecastingtask,andidentieditsuniquespatiotemporaldependencystructure.Weproposedgraphconvolutionrecurrentneuralnetworkasawholisticframeworktoecientlycapturebothspatialandtemporalstructure.Theproposedapproachachievesthebestreportedresultsonreal-worldtracforecastingandobtainedsignicantimprovementoverstate-of-the-artmethods.2RelatedWorkTracforecastingisaclassicproblemintransportationandoperationalresearchwhicharelargelybasedonqueuingtheoryandsimulations9.Data-drivenapproachesfortracforecastinghavereceivedconsiderableattention,detailscanbefoundinarecentsurveypaper25andthereferencestherein.However,existingmachinelearningmodelseitherimposestrongstationaryassumptionsofthedata(e.g.,auto-regressivemodel)orfailtoaccountforhighlynon-lineartemporaldependency(e.g.,latentspacemodel27,8).Recently,deeplearningmodelsdelivernewpromisefortimeseriesforecastingproblem.Forexample,in28,theauthorsstudyunivaritetimeseriesforecastingusingdeepLSTMnetwork.In10,theauthorsproposeaprobabilisticdeepauto-regressiverecurrentframeworktoforecastinventorytimeseriesacrossdierentdomains.Theforecastingproblemwearefacinghereisspatiallycorrelatedtimeseries,whichrequirescarefulmodelingofbothspatialandtemporaldependency.Intermsofgeneralsequencemodeling,RecurrentNeuralNetworks(RNNs)havebecomethestate-of-the-artchoice,leadingtosuccessfulapplicationsinlanguagemodeling2,videogeneration23,speechrecognition17andweathernowcasting26.However,mostexisting2GraphConvolutionalGraphConvolutionalRecurrentLayerRecurrentLayerGraphConvolutionalGraphConvolutionalRecurrentLayerRecurrentLayerInputGraphSignalsPredictions.TimeDelay=1EncoderCopyStatesDecoderFigure2:SystemarchitectureforGraphConvolutionalRecurrentNeuralNetworkdesignedforspatiotemporaltracforecasting.deepsequencemodelsdealswitheitherdiscretetimesequenceorsequencesthatareevenlydistributedoveraregulargrid.Forinstance,convolutionalLSTMnetwork26capturesthespatiotemporalstructureamongpixelsbyapplyingaconvolutionallterovereachframeofthevideostream.Thelanguagesequencesareoftenencodedasdiscretetimeseries.Onthecontrary,timeseriesfromsensornetworksintracforecastingarecontinuoustimesequencesdistributedoveragraph.Closelyrelatedtoourworkisthedeeplearningmodelsfornon-Euclideanstructureddata.Forexample,in20,theauthorsproposeGraphNeuralNetworks(GNN)modelinthevertexspace,whichlearnsnoderepresentationsforthegraph.Lietal.13extendsGNNforsequencemodeling.TheresultingGatedGraphSequenceNeuralnetworkachievesthestate-of-the-artperformanceforprogramverication.Goingfromvertexdomaintospectraldomain,spectralgraphconvolutionalneuralnetworks(GCN)arerstintroducedin4,whichbridgesthespectralgraphtheoryanddeepneuralnetworks.In7,theauthorsfurtherimproveGCNwithfastlocalizedconvolutionslters.OurmodelextendsGCNtomodelmultivariatetimeseriesdistributedonanetwork.OurmodelcoincideswitharecentworkonsequentialgeneralizationofGCN21,however,wefocusoncontinuoustimepredictionandlong-termforecastingbyincorporatingencoder-decoderarchitecture24andscheduledsampling3techniques.3MethodologyWerstformalizethelearningproblemoftracforecastingandidentifyuniquespatiotemporaldependencystructures.Wethenproposeavariationofthedeeprecurrentneuralnetworkmodel.Givenaseriesofroadnetworksnapshots,ourmodeladdressesthreetechnicaldiculties:(1)localizedspatialdependency,(2)temporaldynamicsingraphs,and(3)long-termforecasting.3.1TracForecastingProblemThegoaloftracforecastingistopredictthefuturetracspeedbasedonpreviouslyobservedtracow.Thetracowismeasuredbynspatiotemporalcorrelatedsensorsontheroadnetwork.Thepair-wiserelationshipbetweenthosesensorscanbemodeledasaweightedgraphG=(V,E,A),whereVisanitesetof|V|=nvertices,whileEisasetofedgesandARnnis3Htimesteps,i.e.,Xt+1,Xt+H,whereHistheforecastinghorizon.aweightedadjacencymatrixrepresentingtheconnectivitybetweensensors.Thus,anobservationoftracspeedsatatimetcanbeviewedasagraphsignal,Xt:VRdx,wheredxisthedimensionofsignalineachnode.Thetracforecastingproblemcanbeformulatedasfollows:givensensorgraphGandhistoricaltracmeasurementsofsensors,inferthemostlikelytracmeasurementsinthenextXt+H,Xt+1=argmaxlogP(Xt+H,Xt+1|Xt,XtK+1;G,)Xt+H,Xt+1Supposethateachtimestepis5minutesandHis12,theoutputsofthemodelwillbethetracmeasurementofevery5minutesforallthesensorsinthenexthour.Notethattheaforementionedtracforecastingproblemisdierentfromthesingle-steptimeseriesforecastingproblem.ThepredictiontargetofourproblemisasequenceofmultivariatetimeseriesdistributedoveragraphGwhichcontainsbothspatialandtemporalstructures.Moreover,thepredictionproblemdenedin26canbeconsideredasaspecialcaseofthisproblemwhereGisaregulargrid.3.2SpatialDependencyModelingTractimeseriesfromroadnetworksensorsdemonstratestrongspatialdependency.Itismainlydueto(1)networkconnectivity:highwaynetworksusuallyhavesensorsinstalledevery1-2miles,andtracowofadjacentsensorsarehighlycorrelated;(2)owconservation:thenumberofvehiclesenteringandexitingtheroadsareapproximatelythesame.Unfortunately,recurrentneuralnetworks(RNNs)donotexplicitlymodelsuchspatialdependency.Inthiswork,weaugmentRNNsbyconsideringspatialcorrelationsamongmultivariatetimeseries.GraphAttentionMechanismInRNNs,theactivationisaweightedcombinationofallthehistoricalobservationsandhiddenstates,whilethespatialdependencyoftracisratherlocalized.Inordertoaccountforsuchlocaldependency,wegeneralizetheattentionmechanismfromsequencemodelingtospatialmodeling.Inparticular,weallowthemodeltolearntofocusoncloseneighborhoodsinsteadoftheentirenetwork.Thisisachievedbyrepresentingthehiddenstateofasensorusingacombinationofthehiddenstatesfromnearbysensorsweightedbyattention.Theattentionmechanismisdenedas:fatt(hi,hj)=hiWahj,aij=exp(fatt(hi,hj)knb(i,K)exp(fatt(hi,hk),gi=jnb(i,K)aijhj,(1)wherehidenotesthehiddenstatesofsensoriwhichisextractedusingaRNNsharedacrossallthenodes.nb(i,K)returnsthesetofneighborsthatarewithinK-hopfromnodei,andgirepresentstheaggregatedhiddenstatefornodeithatincorporatesinformationfromneighborhoodnodes.Then,theforecastingtaskofnodeiisimplementedusingafullyconnectedfeedforwardnetworkwithgiastheinput.GraphLaplacianTransformationGraphattentionmechanismenablesexplicitnetworkstructuremodeling,butinpracticeitonlyleadstomarginalperformanceimprovement.Thisispartlybecauseofthedicultyintrainingsuchmodelasitistimeconsumingtocomputepair-wiseattentionforlargenumberofnodes.Anotherreasonisthatgraphattentiononlymodelsthetopologicaldependencyinthevertexdomain,andyetitfailstocapturethe“conservationofow”propertyintrac.WeresolvethisissuebytransformingthetractimeseriesfromvertexdomainintothespectraldomainusinggraphLaplacian.4Figure3:Visualizationofeigen-functions.(a)showssensorlocationsonthemap.(b)and(c)correspondtoeigen-functionswithsmalleigenvalue(lowfrequencyandsmooth)while(d)and(e)correspondtooneswithlargeeigenvalues(highfrequencyandnon-smooth).GraphLaplacianisadiscreteversionofLaplacianoperator,whichcharacterizesthecon-nectivityofthegraph.ApplyingLaplacianoperatory=Lxtothesignalrepresentsone-stepdiusionofthesignalonthegraph.WearguethatitisnaturaltouseGraphLaplacianoperatorfortracforecastingproblems.Ifwemodelthechangeoftracowasxti(t)=jAij(xixj),wehavexti(t)=cLix,whereAijistheelementoftheadjacencymatrixofthegraph,ListhegraphLaplacianandcisaconstant.Thissharessimilarformastheheatequation,whichisgivenbythelaw“conservationofenergyinphysics.Inimageprocessing,thistransformationisknownasgraphconvolutionalkernel,denotedasg.Tracforecastingproblemprovidesanalternativemotivationofperformingsuchtransformation.ToobtaintheLaplacianmatrix,weconstructtheadjacencymatrixbasedonroadnetworkdistancewithathresholdedGaussiankernel22.Figure3visualizestheeigen-functionsofthenormalizedLaplacianmatrixforpartoftheroadnetworkinLosAngeles.Smalleigen-functionsrepresentsmoothspatialdependencywhilelargeonesdenotehighoscillation.WecanmakesomeinterestingobservationsinFigure3,whichcouldhelpexplainthespatialdependencycapturedbyLaplacian.Forexample,in(c)NearUniversalStudiosHollywood,atthecrossingofhighway101,134and170(d)NearRoseBowlStadium,atthecrossingofhighway2and134.Todealwithspatialdependencyatdierentresolutions,wecomputeaweightedsumofkthpowerofLaplacianasthespectraltransformation.ThisisbasedonthefactthatkthpowerofLaplacianissupportedbyexactlyk-hopneighbors22,representingthespreadoftracowatdierentscale.ComputingthekthpowerLaplacianmatrixcanbecomputationallyexpensive,soweapplyChebyshevpolynomialexpansion7forecientapproximation.K1K1K1y=gw(L)x=wkLx=UwkUxwkTk()xtttktkt(2)k=0k=0k=0ofChebyshevcoecientswhileTk()RnnistheChebyshevpolynomialoforderkevaluatedat=2/maxI.ThisapproximationreducesthelteringcomputationalcostfromO(|V|2)wherethegw(L)isthelearnedlterbasedonLaplacianmatrix.parameterwRKisavectortoO(K|E|).3.3TemporalDynamicsModelingWemodelthetemporaldynamicsintheframeworkofrecurrentneuralnetworks.OneofthevariantsofRNNistheGatedRecurrentUnits(GRU)6whichhasasimplerstructureandcompetitiveperformancecomparingwithLSTM.GatedGraphConvolutionWeincorporatespatialdependencyintoGRUbyreplacingthematrixmultiplicationwiththegraphconvolutionGdenedinEquation2.Thisgraphconvolu-5Figure4:Visualizationof24hoursroadnetworktractimeseriesevolutioninspectraldomainwithLaplaciantransformedinput(toprow)andvertexdomainwithrawinput(bottomrow).Spectraldomainenjoysbettersparsity.Theskewnessofthedistributionofthetransformedinputreectsthetraccongestioncondition.tionaloperationisappliedtobothinputsandhiddenstatestoobtainaGraphConvolutionalGatedRecurrentUnit(GCGRU).rt=(WrGxt+UrGht1+br)ut=(WuGxt+UuGht1+bu)ct=tanh(WcGxt+UcG(rtht1)+bc)ht=utht1+(1ut)ctWestackGRUandunrolltherecurrenceforaxednumberofstepsTanduseback-propagationthroughtimeinordertocomputegradients.Figure4showstheroadnetworktracevolutionin24hours,goingthroughmorningrushhourandafternoonrushhour.Wecanseethatinspectraldomain,thetracspeedtimeseriesenjoysbettersparsitythaninthevertexdomain.Thedistributionofthetransformedinputreectthetraccongestioncondition.Withheavycongestioninrushhour,thespectraldistributionofthetimeseriesbecomemoreheavy-tailed.Long-TermForecastingInlong-termforecasting,simplytrainingthemodelforonestepaheadprediction,andthenback-feedingthepredictionsattesttimeispronetoerrorpropagation.Theforecastingerrorinearlierstepscouldbequicklyampliedoverlong-timespan.Wedrawinspirationfromtheencoder-decoderarchitecture24aswellasscheduledsam

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

评论

0/150

提交评论