




Information Networks
Graph Clustering
Lecture 14

Clustering
Given a set of objects $V$ and a notion of similarity (or distance) between them, partition the objects into disjoint sets $S_1, S_2, \dots, S_k$ such that objects within each set are similar, while objects across different sets are dissimilar.

Graph Clustering
- Input: a graph $G = (V, E)$
  - an edge $(u, v)$ denotes similarity between $u$ and $v$
  - weighted graphs: the weight of an edge captures the degree of similarity
- Clustering: partition the nodes in the graph such that nodes within clusters are well interconnected (high edge weights), and nodes across clusters are sparsely interconnected (low edge weights).
- Most graph partitioning problems are NP-hard.

Measuring connectivity
What does it mean for a set of nodes to be well interconnected?
- min-cut: the minimum number of edges whose removal causes the graph to become disconnected
- A large min-cut implies strong connectivity.
- Min-cut is not always a reliable measure: for example, a small cut may only isolate a single low-degree node.

Graph expansion
Normalize the cut by the size of the smaller component:
$$h(G) = \min_{U \subset V} \frac{E(U, V \setminus U)}{\min\{|U|, |V \setminus U|\}}$$
where $E(U, V \setminus U)$ is the number of edges crossing the cut.
We will now see how the graph expansion relates to the eigenvalues of the adjacency matrix $A$.

Spectral analysis
The Laplacian matrix is $L = D - A$, where
- $A$ = the adjacency matrix
- $D = \mathrm{diag}(d_1, d_2, \dots, d_n)$, with $d_i$ = degree of node $i$
Therefore
- $L(i, i) = d_i$
- $L(i, j) = -1$ if there is an edge $(i, j)$

Laplacian matrix properties
- The matrix $L$ is symmetric and positive semi-definite: all eigenvalues of $L$ are non-negative.
- The matrix $L$ has 0 as an eigenvalue, with corresponding eigenvector $w_1 = (1, 1, \dots, 1)$.
- $\lambda_1 = 0$ is the smallest eigenvalue.

The second smallest eigenvalue
The second smallest eigenvalue $\lambda_2$ (also known as the Fiedler value) satisfies
$$\lambda_2 = \min_{x \perp w_1,\ \|x\| = 1} x^T L x$$
The vector that attains this minimum is called the Fiedler vector. It minimizes
$$\lambda_2 = \min_{x \neq 0} \frac{\sum_{(i,j) \in E} (x_i - x_j)^2}{\sum_i x_i^2} \quad \text{where } \sum_i x_i = 0$$

Fiedler value
The value $\lambda_2$ is a good approximation of the graph expansion.

Spectral ordering
- The values of $x$ minimize $\sum_{(i,j) \in E} (x_i - x_j)^2$.
- For weighted matrices: $\sum_{(i,j)} A[i,j] (x_i - x_j)^2$.
- The ordering according to the $x_i$ values will group similar (connected) nodes together.
- Physical interpretation: the stable state of springs placed on the edges of the graph.

Spectral partition
Partition the nodes according to the ordering induced by the Fiedler vector. If $u = (u_1, u_2, \dots, u_n)$ is the Fiedler vector, then split the nodes according to a value $s$:
- bisection: $s$ is the median value in $u$
- ratio cut: $s$ is the value that minimizes the expansion $h(G)$ of the resulting cut
- sign: separate positive and negative values ($s = 0$)
- gap: separate according to the largest gap in the values of $u$
This works provably well for special cases.

Conductance
The nodes with high degree are more important.
Graph conductance:
$$\phi(G) = \min_{U \subset V} \frac{E(U, V \setminus U)}{\min\{d(U), d(V \setminus U)\}}, \quad d(U) = \sum_{i \in U} d_i$$
Conductance is related to the eigenvalue of the matrix $M = D^{-1}A$.

Clustering conductance
The conductance of a clustering is defined as the minimum conductance over all clusters in the clustering.
Maximizing conductance seems like a natural choice, but it does not handle outliers well.

A clustering bi-criterion
Maximize the conductance, but at the same time minimize the inter-cluster edges.
A clustering $C = \{C_1, C_2, \dots, C_n\}$ is a $(c, e)$-clustering if:
- the conductance of each $C_i$ is at least $c$
- the total number of inter-cluster edges is at most a fraction $e$ of the total edges

The clustering problem
- Problem 1: given $c$, find a $(c, e)$-clustering that minimizes $e$.
- Problem 2: given $e$, find a $(c, e)$-clustering that maximizes $c$.
The problems are NP-hard.

A spectral algorithm
1. Create the matrix $M = D^{-1/2}A$.
2. Find the eigenvector $v$ of the second largest eigenvalue.
3. Find the best ratio cut (minimum conductance cut) with respect to $v$.
4. Recurse on the pieces induced by the cut.
The algorithm has provable guarantees.

Discovering communities
Community: a set of nodes $S$ where the number of edges within the community is larger than the number of edges leaving the community.

Min-cut / Max-flow
Given a graph $G = (V, E)$ where each edge has some capacity $c(u, v)$, a source node $s$, and a destination node $t$, find the maximum amount of flow that can be sent from $s$ to $t$ without violating the capacity constraints.
- The max-flow is equal to the min-cut in the graph (weighted min-cut).
- Solvable in polynomial time.

A seeded community
The community of node $s$ with respect to node $t$ is the set of nodes reachable from $s$ in the min-cut that contains $s$. This set defines a community.

Discovering Web communities
1. Start with a set of seed nodes $S$.
2. Add a virtual source $s$.
3. Find neighbors a few links away.
4. Create a virtual sink $t$.
5. Find the community of $s$ with respect to $t$.

A more structured approach
Add a virtual sink $t$ to the graph, and connect all nodes to $t$ with edges of capacity $\alpha$.
Let $S$ be the community of node $s$ with respect to $t$. For every subset $U$ of $S$ we have [inequality missing in source].
Surprisingly, this simple algorithm gives guarantees for the expansion and the inter-community density.

Min-Cut Trees
Given a graph $G = (V, E)$, the min-cut tree $T$ for graph $G$ is defined as a tree over the set of vertices $V$, where
- the edges are weighted
- the min-cut between nodes $u$ and $v$ is the smallest weight among the edges on the path from $u$ to $v$
- removing this edge from $T$ gives the same partition as removing the min-cut in $G$

Lemma 1 (figure: nodes $u$, $w$; sets $U$, $U_1$, $U_2$, $W$): a relation between $c(W, U_2)$ and $c(U_1, U_2)$ [relation sign missing in source].

Lemma 2: Let $S$ be the community of the node $s$ with respect to the artificial sink $t$. For any subset $U$ of $S$ we have [inequality missing in source].

Lemma 3: Let $S$ be the community of node $s$ with respect to $t$. Then we have [inequality missing in source].

Algorithm for finding communities
1. Add a virtual sink $t$ to the graph $G$ and connect all nodes to it with capacity $\alpha$, giving the graph $G'$.
2. Create the min-cut tree $T'$ of graph $G'$.
3. Remove $t$ from $T'$.
4. Return the disconnected components as clusters.

Effect of $\alpha$
- When $\alpha$ is too small, the algorithm returns a single cluster (the easy thing to do is to remove the sink $t$).
- When $\alpha$ is too large, the algorithm returns singletons.
- In between is the interesting area.
We can explore for the right value of $\alpha$. We can run the algorithm hierarchically: start with a small $\alpha$ and increase it gradually. The clusters returned are nested.
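
As a rough illustration of the spectral partition step (order the nodes by the Fiedler vector, then split by sign or by the best ratio cut), here is a minimal pure-Python sketch. It is not taken from the lecture: the function names (`laplacian`, `fiedler`, `sign_split`, `ratio_cut_split`), the toy graph, and the use of power iteration on $cI - L$ to approximate the Fiedler vector are all assumptions made for the example. A practical implementation would more likely call a sparse eigensolver.

```python
import math

def laplacian(n, edges):
    """Dense Laplacian L = D - A of an undirected, unweighted graph."""
    L = [[0.0] * n for _ in range(n)]
    for u, v in edges:
        L[u][u] += 1.0
        L[v][v] += 1.0
        L[u][v] -= 1.0
        L[v][u] -= 1.0
    return L

def _center_and_normalize(x):
    """Project out the all-ones direction (sum = 0) and scale to unit norm."""
    mean = sum(x) / len(x)
    x = [xi - mean for xi in x]
    norm = math.sqrt(sum(xi * xi for xi in x))
    return [xi / norm for xi in x]

def fiedler(n, edges, iterations=500):
    """Approximate (lambda_2, Fiedler vector) of a connected graph.

    Assumption of this sketch: power iteration on B = c*I - L with
    c = 2 * max degree. All eigenvalues of L lie in [0, c], so once the
    all-ones eigenvector is projected out, the dominant direction of B
    belongs to the second smallest eigenvalue of L.
    """
    L = laplacian(n, edges)
    c = 2.0 * max(L[i][i] for i in range(n))
    x = _center_and_normalize([float(i) for i in range(n)])  # arbitrary start
    for _ in range(iterations):
        Lx = [sum(L[i][j] * x[j] for j in range(n)) for i in range(n)]
        x = _center_and_normalize([c * x[i] - Lx[i] for i in range(n)])
    Lx = [sum(L[i][j] * x[j] for j in range(n)) for i in range(n)]
    lam2 = sum(x[i] * Lx[i] for i in range(n))  # Rayleigh quotient, ||x|| = 1
    return lam2, x

def sign_split(u):
    """'sign' rule: separate negative from non-negative entries (s = 0)."""
    left = {i for i, ui in enumerate(u) if ui < 0}
    return left, set(range(len(u))) - left

def ratio_cut_split(n, edges, u):
    """'ratio cut' rule: sweep the ordering induced by u and keep the prefix
    with the smallest cut / min(|U|, |V - U|)."""
    order = sorted(range(n), key=lambda i: u[i])
    best = None
    for k in range(1, n):
        U = set(order[:k])
        cut = sum(1 for a, b in edges if (a in U) != (b in U))
        ratio = cut / min(k, n - k)
        if best is None or ratio < best[0]:
            best = (ratio, U)
    ratio, U = best
    return ratio, U, set(range(n)) - U

# Toy graph: two triangles {0,1,2} and {3,4,5} joined by the edge (2, 3).
edges = [(0, 1), (0, 2), (1, 2), (2, 3), (3, 4), (3, 5), (4, 5)]
lam2, u = fiedler(6, edges)
left, right = sign_split(u)
ratio, U, rest = ratio_cut_split(6, edges, u)
print(lam2, left, right, ratio)
```

On this toy graph both rules should separate the two triangles, with the ratio cut crossing only the bridge edge. The fixed start vector is a simplification: if it happened to be orthogonal to the Fiedler vector, the iteration would converge to a different eigenvector.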
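
The seeded-community idea and the effect of the sink capacity can also be sketched with a plain max-flow computation. The code below is an illustration under stated assumptions, not the lecture's algorithm: it uses Edmonds-Karp (BFS augmenting paths) as the max-flow routine, treats every edge as undirected, computes the community of a single seed $s$ rather than the full min-cut tree, and the names `seeded_community`, `community_with_sink`, and the sentinel sink label are hypothetical.

```python
from collections import defaultdict, deque

def seeded_community(edges, s, t):
    """Return (max-flow value, community of s with respect to t).

    edges: iterable of (u, v, capacity) for an undirected graph.
    The community is the set of nodes still reachable from s in the
    residual graph once no augmenting path to t remains, i.e. the
    source side of a minimum s-t cut.
    """
    cap = defaultdict(lambda: defaultdict(int))  # residual capacities
    for u, v, c in edges:
        cap[u][v] += c  # an undirected edge carries capacity c both ways
        cap[v][u] += c

    def reachable_from_s():
        """BFS over arcs with positive residual capacity; returns parents."""
        parent = {s: None}
        queue = deque([s])
        while queue:
            u = queue.popleft()
            for v, c in cap[u].items():
                if c > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        return parent

    flow = 0
    while True:
        parent = reachable_from_s()
        if t not in parent:
            return flow, set(parent)
        # Find the bottleneck capacity on the augmenting path.
        bottleneck = float("inf")
        v = t
        while parent[v] is not None:
            bottleneck = min(bottleneck, cap[parent[v]][v])
            v = parent[v]
        # Push the bottleneck along the path and update residuals.
        v = t
        while parent[v] is not None:
            u = parent[v]
            cap[u][v] -= bottleneck
            cap[v][u] += bottleneck
            v = u
        flow += bottleneck

def community_with_sink(edges, s, alpha, sink="__sink__"):
    """Connect every node to an artificial sink with capacity alpha and
    return the community of s with respect to that sink.
    Assumption: the label `sink` is not already a node of the graph."""
    nodes = {u for u, _, _ in edges} | {v for _, v, _ in edges}
    augmented = list(edges) + [(v, sink, alpha) for v in nodes]
    _, community = seeded_community(augmented, s, sink)
    return community

# Toy graph: two triangles joined by the edge (2, 3), every capacity 10.
edges = [(0, 1, 10), (0, 2, 10), (1, 2, 10), (2, 3, 10),
         (3, 4, 10), (3, 5, 10), (4, 5, 10)]
flow, community = seeded_community(edges, 0, 5)
by_alpha = {alpha: community_with_sink(edges, 0, alpha) for alpha in (2, 4, 8)}
print(flow, community, by_alpha)
```

With integer capacities chosen to avoid floating-point residuals, a small sink capacity keeps the whole graph together, an intermediate one isolates the seed's triangle, and a large one leaves the seed alone. This matches the qualitative behaviour described for the sink capacity, although only for a single seed rather than for the min-cut tree.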