版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
中国机器学习白皮书中国人工智能学会二○一五年十一月《中国人工智能系列白皮书》编委会主任:李德毅执行主任:王国胤副主任:杨放春谭铁牛黄河燕焦李成马少平刘宏蒋昌俊任福继杨强委员:陈杰董振江杜军平桂卫华韩力群何清黄心汉贾英民李斌刘民刘成林刘增良鲁华祥马华东马世龙苗夺谦朴松昊乔俊飞任友群孙富春孙长银王轩王飞跃王捍贫王万森王卫宁王小捷王亚杰王志良吴朝晖吴晓蓓夏桂华严新平杨春燕余凯余有成张学工赵春江周志华祝烈煌庄越挺《中国机器学习白皮书》编写组组长:陈松灿高阳组员:黄圣君李武军薛晖俞扬余志文詹德川詹志辉张利军张敏灵 庄福振
目录第1章引言 [229]等反馈受限问题中,主要目的是支持模糊决策,在探索和利用之间寻找最优的平衡。在解决这些实际问题时,又会发现一些新的问题,产生新的研究方向,促进在线学习算法和理论的发展。完全信息下的在线学习研究前沿包括非凸函数在线学习、非线性函数在线学习等问题。赌博机在线学习的研究热点主要围绕如何将算法和理论拓展到弱反馈场景,比如基于比较的赌博机。
第5章结束语本白皮书从主流机器学习技术、新兴机器学习技术以及大数据机器学习三方面对机器学习的研究和应用现状做了有选择的简要介绍。机器学习经过30余年的发展,目前已成为计算机科学中研究内涵极其丰富、新技术、新应用层出不穷的重要研究分支。国际上关于机器学习的主要学术会议包括每年定期举行的国际机器学习会议(ICML)、国际神经信息处理系统会议(NIPS)、欧洲机器学习会议(ECML)以及亚洲机器学习会议(ACML)等,主要学术期刊包括《MachineLearning》、《JournalofMachineLearningResearch》、《IEEETransactionsonNeuralNetworksandLearningSystems》等。此外,人工智能领域的一些主要国际会议(如IJCAI、AAAI等)和国际期刊(如《ArtificialIntelligence》、《IEEETransactionsonPatternAnalysisandMachineIntelligence》等)也经常发表与机器学习相关的最新研究成果。国内机器学习的重要学术活动包括每两年举行一次的中国机器学习会议(ChinaConferenceonMachineLearning,CCML),该会议目前由中国人工智能学会和中国计算机学会联合主办,中国人工智能学会机器学习专业委员会和中国计算机学会人工智能与模式识别专业委员会协办,目前已历经15届。此外,每年举行的中国机器学习及其应用研讨会(ChineseWorkshoponMachineLearningandApplications,MLA),该会议遵循“学术至上、其余从简”的原则,每届会议邀请海内外从事机器学习及相关领域研究的多位专家与会进行学术交流,包括特邀报告、顶会交流、以及TopConferenceReview等部分。迄今已历经13届,2015年度参会人数超过1200人。目前,大数据浪潮正对人类社会生活、科学研究的方方面面产生深刻影响。早期机器学习研究通常假设数据具有相对简单的特性,如数据来源单一、概念语义明确、数据规模适中、结构静态稳定等。当数据具有以上简单特性时,基于现有的机器学习理论与方法可以有效实现数据的智能化处理。然而,在大数据时代背景下,数据往往体现出多源异构、语义复杂、规模巨大、动态多变等特殊性质,为传统机器学习技术带来了新的挑战。为应对这一挑战,国内外科技企业巨头如谷歌、微软、亚马逊、华为、百度等纷纷成立以机器学习技术为核心的研究院,以充分挖掘大数据中蕴含的巨大商业与应用价值。可以预见,在未来相当长的一段时期内,机器学习领域的研究将以更广泛、更紧密的方式与工业界深度耦合,推动信息技术及产业的快速发展。
参考文献周志华.机器学习与数据挖掘.中国计算机学会通讯,2007,3(12):35-44.T.Mitchell.MachineLearning,NewYork:McGraw-Hill,1997.A.N.Meltzoff,P.K.Kuhl,J.Movellan,T.J.Sejnowski.Foundationsforanewscienceoflearning.Science,2009,325(5938):284-288.X.
Wang,A.
Mueen,
H.
Ding,
G.
Trajcevski,
P.
Scheuermann,E.
Keogh.Experimentalcomparisonofrepresentationmethodsanddistancemeasuresfortimeseriesdata.DataMiningandKnowledgeDiscovery.2013,26(2):275-309,2013.E.Levina,P.Bicke.Theearthmover'sdistanceistheMallowsdistance:Someinsightsfromstatistics.InProceedingsofthe8thInternationalConferenceonComputerVision,Vancouver,Canada,2001,251–256.E.Xing,A.Ng,M.Jordan,S.Russell.Distancemetriclearning,withapplicationtoclusteringwithside-information.InAdvancesinNeuralInformationProcessingSystems15,Cambridge,MA:MITPress,2003,505-512.A.Bar-Hillel,T.Hertz,N.Shental,D.Weinshall.Learningdistancefunctionsusingequivalencerelations.InProceedingsofthe20thInternationalConferenceonMachineLearning,Washington,D.C.,2003,11-18.J.Davis,B.Kulis,P.Jain,S.Sra,I.Dhillon.Information-theoreticmetriclearning.InProceedingsofthe24thInternationalConferenceonMachineLearning,Corvallis,OR.,2007,209-216.S.Shalev-Shwartz,Y.Singer,A.Ng.Onlineandbatchlearningofpseudo-metrics.InProceedingsofthe21stInternationalConferenceonMachineLearning,Alberta,P.Jain,B.Kulis,I.Dhillon,K.Grauman.Onlinemetriclearningandfastsimilaritysearch.InAdvancesinNeuralInformationProcessingSystems21,Cambridge,MA:MITPress,2008,761-768.K.Weinberger,L.Saul.Fastsolversandefficientimplementationsfordistancemetriclearning.InProceedingsofthe25thInternationalConferenceonMachineLearning,Helsinki,Finland,2008,1160–1167.S.Paramswaran,K.Weinberger.Largemarginmulti-taskmetriclearning.InAdvancesinNeuralInformationProcessingSystems23,Cambridge,MA:MITPress,2010,1867-1875.K.Huang,R.Jin,Z.Xu,C.-L.Liu.Robustmetriclearningbysmoothoptimization.InProceedingsofthe26thConferenceonUncertaintyinArtificialIntelligence,CatalinaIsland,CA,2010,244-251.G.Checik,U.Shalit,V.Sharma,S.Bengio.Anonlinealgorithmforlargescaleimagesimilaritylearning.InAdvancesinNeuralInformationProcessingSystems22,Cambridge,MA:MITPress,2009,306-314.M.Cuturi,D.Avis.Groundmetriclearning.JournalofMachineLearningResearch,2014,15:533-564.D.-C.Zhan,Y.-F.Li,Z.-H.Zhou.Learninginstancespecificdistancesusingmetricpropagation.InProceedingsofthe26thInternationalConferenceonMachineLearning,Montreal,Canada,2009,1225–1232.J.Goldberger,S.Roweis,G.Hinton,R.Salakhutdinov.NeighbourhoodComponentsAnalysis.In:AdvancesinNeuralInformationProcessingSystems17,Cambridge,MA:MITPress,2004,513–520.A.Bellet,A.Habrard,M.Sebban.Metriclearning.In:SynthesisLecturesonArtificialIntelligenceandMachineLearning,SanFrancisco,CA:MorganandClaypoolPublishers,2015,1-151.Y.Shi,A.Bellet,F.Sha.Sparsecompositionalmetriclearning.In:Proceedingsofthe28thAAAIConferenceonArtificialIntelligence,QuébecCity,Q.Qian,R.Jin,S.Zhu,Y.Lin.Anintegratedframeworkforhighdimensionaldistancemetriclearninganditsapplicationtofine-grainedvisualcategorization.arXiv:1402.0453,2014.M.Schultz,T.Joachims.Learningadistancemetricfromrelativecomparisons.InAdvancesinNeuralInformationProcessingSystems16,Cambridge,MA:MITPress,2004,41-48.X.Gao,S.Hoi,Y.Zhang,J.Wan,J.Li.SOML:Sparseonlinemetriclearningwithapplicationtoimageretrieval.In:Proceedingsofthe28thAAAIConferenceonArtificialIntelligence,QuébecCity,Canada,2014,1206–1212.K.Liu,A.Bellet,F.Sha.Similaritylearningforhigh-dimensionalsparsedata.arXiv:1411.2374,2014.T.Mensink,J.Verbeek,F.Perronnin,G.Csurka.Metriclearningforlargescaleimageclassification:Generalizingtonewclassesatnear-zerocost.InProceedingsofthe12thEuropeanConferenceonComputerVision,Firenze,Italy,2012,488-501.N.Verma,D.Mahajan,S.Sellamanickam,V.Nair.Learninghierarchicalsimilaritymetrics.InProceedingsoftheIEEEConferenceonComputerVisionandPatternRecognition,Providence,RI,2012,2280-2287.N.Jiang,W.Liu,Y.Wu.Orderdeterminationandsparsity-regularizedmetriclearningadaptivevisualtracking.InProceedingsoftheIEEEConferenceonComputerVisionandPatternRecognition,Providence,RI,2012,1956-1964.G.Lebanon.Metriclearningfortextdocuments.IEEETransactionsonPatternAnalysisandMachineIntelligence,2006,28(4):497-508.D.Lim,B.McFee,G.Lanckriet.Robuststructuremetriclearning.InProceedingsofthe30thInternationalConferenceonMachineLearning.Atlanta,GA,2013,615-623.T.Kato,N.Nagano.Metriclearningforenzymeactive-sitesearch.Bioinformatics,2010,26(21):2698-2704.J.Wang,X.Gao,Q.Wang,Y.Li.ProDis-ContSHC:Learningproteindissimilaritymeasuresandhierarchicalcontextcoherentlyforprotein-proteincomparisoninproteindatabaseretrieval.BMCBioinformatics,2012,13(S-7):S2.汪洪桥,孙富春,蔡艳宁,陈宁.多核学习方法.自动化学报,2010,36(8):1037-1050.G.R.G.Lanckriet,T.D.Bie,N.Cristianini,M.I.Jordan,W.S.Noble.Astatisticalframeworkforgenomicdatafusion.Bioinformatics,2004,20:2626-2635.F.R.Bach,G.R.G.Lanckriet,andM.I.Jordan.Multiplekernellearning,conicduality,andtheSMOalgorithm.In:Proceedingsofthe21stInternationalConferenceonMachineLearning,Banff,Canada,2004,41-48.G.R.G.Lanckriet,N.Cristianini,P.Bartlett,L.E.Ghaoui,M.I.Jordan.Learningthekernelmatrixwithsemidefiniteprogramming.JournalofMachineLearningResearch,2004,5:27-72.S.Sonnenburg,G.Rätsch,C.Schäfer,B.Schölkopf.Largescalemultiplekernellearning.JournalofMachineLearningResearch,2006,7:1531-1565.A.Rakotomamonjy,F.Bach,S.Canu,Y.Grandvalet.Moreefficiencyinmultiplekernellearning.In:Proceedingsofthe24thInternationalConferenceonMachineLearning,Corvallis,ORA.Rakotomamonjy,F.Bach,S.Canu,Y.Grandvalet.SimpleMKL.JournalofMachineLearningResearch,2008,9:2491-2521.Z.Xu,R.Jin,I.King,M.R.Lyu.Anextendedlevelmethodforefficientmultiplekernellearning.In:AdvancesinNeuralInformationProcessingSystems22,Cambridge,MA:MITPress,2009,1825-1832.Z.Xu,R.Jin,H.Yang,I.King,M.R.Lyu.Simpleandefficientmultiplekernellearningbygrouplasso.In:Proceedingsof27thInternationalConferenceonMachineLearning,Haifa,Israel,2010,1175--1182.S.V.N.Vishwanathan,Z.Sun,N.Ampornpunt.MultiplekernellearningandtheSMOalgorithm.In:AdvancesinNeuralInformationProcessingSystems23,Cambridge,MA:MITPress,2010,2361--2369.R.Jin,T.Yang,M.Mahdavi.Sparsemultiplekernellearningwithgeometricconvergencerate.arXiv:1302.0315v1,2013.M.Kloft,U.Brefeld,S.Sonnenburg,P.Laskov.Efficientandaccuratelp-normmultiplekernellearning.In:AdvancesinNeuralInformationProcessingSystems22,Cambridge,MA:MITPress,2009,997-1005.M.Varma,B.R.Babu.Moregeneralityinefficientmultiplekernellearning.In:Proceedingsofthe26thInternationalConferenceonMachineLearning,Montreal,Canada,2009,1065-1072.A.Jain,S.V.N.Vishwanathan,M.Varma.SPG-GMKL:Generalizedmultiplekernellearningwithamillionkernels.In:Proceedingsofthe18thACMSIGKDDInternationalConferenceonKnowledgeDiscoveryandDataMining,Beijing,China,2012,750-758.C.Hinrichs,V.Singh,J.Peng,S.C.Johnson.Q-MKL:matrix-inducedregularizationinmulti-kernellearningwithapplicationstoneuroimaging.In:AdvancesinNeuralInformationProcessingSystems25,Cambridge,MA:MITPress,2012,1421-1429.C.Cortes,M.Mohri,A.Rostamizadeh.Learningnon-linearcomibinationsofkernels.In:AdvancesinNeuralInformationProcessingSystems22,Cambridge,MA:MITPress,2009,396-404.Q.Mao,I.W.Tsang,S.Gao,L.Wang.Generalizedmultiplekernellearningwithdata-dependentpriors.IEEETransactionsonNeuralNetworksandLearningSystems,2015,26(6):1134-1148.A.Nazarpour,P.Adibi.Two-stagemultiplekernellearningforsuperviseddimensionalityreduction.PatternRecognition,2015,48(5):1854-1862.C.Xu,D.Tao,C.Xu.Asurveyonmulti-viewlearning.arXiv:1304.5434v1,2013.A.Blum,T.Mitchell.Combininglabeledandunlabeleddatawithco-training.In:Proceedingsofthe11thAnnualConferenceonComputationalLearningTheory,Madison,WI,1998,92-100.K.Nigam,R.Ghani.Analyzingtheeffectivenessandapplicabilityofco-training.In:Proceedingsofthe9thInternationalConferenceonInformationandKnowledgeManagement,McLean,VA,2000,86-93.V.Sindhwani,D.S.Rosenberg.AnRKHSformulti-viewlearningandmanifoldco-regularization.In:Proceedingsofthe25thInternationalConferenceonMachineLearning,Montreal,Canada,2009,976-983.Z.-H.Zhou,M.Li.Semi-supervisedregressionwithco-training.In:Proceedingsofthe19thInternationalJointConferencesonArtificialIntelligence,Edinburgh,UK,2005,908-916.S.Bickel,T.Scheffer.Multi-viewclustering.In:Proceedingsofthe4thIEEEInternationalConferenceonDataMining,Brighton,UK,2004,19-26.S.Yu,K.Yu,V.Tresp,H.P.Kriegel.Multi-outputregularizedfeatureprojection.IEEETransactionsonKnowledgeandDataEngineering,2006,18(12):1600-1613.A.Sharma,A.Kumar,H.Daume,D.W.Jacobs.Generalizedmultiviewanalysis:Adiscriminativelatentspace.In:ProceedingsoftheIEEEConferenceonComputerVisionandPatternRecognition,Providence,RI,2012,2160-2167.Z.-H.Zhou,D.-C.Zhan,Q.Yang.Semi-supervisedlearningwithveryfewlabeledtrainingsamples.In:Proceedingsofthe22ndNationalConferenceonArtificialIntelligence,Vancouver,Canada,2007,675-680.J.He,R.Lawrence.Agraph-basedframeworkformulti-taskmulti-viewlearning.In:Proceedingsofthe28thInternationalConferenceonMachineLearning,Bellevue,Washington,2011,25-32.J.Zhang,J.Huan.Inductivemulti-tasklearningwithmultipleviewdata.In:Proceedingsofthe18thACMSIGKDDInternationalConferenceonKnowledgeDiscoveryandDataMining,Beijing,China,2012,543-551.X.Jin,F.Zhuang,S.Wang,Q.He,Z.Shi.Sharedstructurelearningformultipletaskswithmultipleviews.In:LectureNotesinArtificialIntelligence8189,Berlin:Springer,2013,353-368.M.Hodosh,P.Young,J.Hockenmaier.Framingimagedescriptionasarankingtask:Data,modelsandevaluationmetrics.JournalofArtificialIntelligenceResearch,2013,47(1):853-899.L.Ma,Z.Lu,L.Shang,H.Li.Multimodalconvolutionalneuralnetworksformatchingimageandsentences.arXiv:1504.06063v1,2015.M.Hall,E.Frank,G.Holmes,B.Pfahringer,P.Reutemann,I.H.Witten.TheWEKAdataminingsoftware:Anupdate.SIGKDDExplorations,2009,11(1):10-18.J.Alcala-Fdez,A.Fernandez,J.Luengo,J.Derrac,S.Garcaa,L.Sanchez,F.Herrera.KEELdata-miningsoftwaretool:datasetrepository,integrationofalgorithmsandexperimentalanalysisframework.JournalofMultiple-ValuedLogicandSoftComputing,2011,17(2-3):255-287.M.Kearns,L.G.Valiant.Crytographiclimitationonlearningbooleanformulaeandfiniteautomata.In:Proceedingsofthe21stAnnualACMSymposiumonTheoryofComputing,Seattle,Washington,1989,433-444.L.Breiman.Baggingpredictors.MachineLearning,1996,24(2):123-140.Y.Freund,R.E.Schapire.Adecision-theoreticgeneralizationofonlinelearningandanapplicationtoboosting.JournalofComputerandSystemSciences,1997,55(1):119-139.L.Breiman.Randomforests.MachineLearning,2011,45(1):5-32.T.K.Ho.Therandomsubspacemethodforconstructingdecisionforests.IEEETransactionsPatternAnalysisandMachineIntelligence,1998,20(8):832-844.J.J.Rodriguez,L.I.Kuncheva,C.J.Alonso.Rotationforest:Anewclassifierensemblemethod.IEEETransactionsonPatternAnalysisandMachineIntelligence,2006,28(10):1619-1630.L.I.Kuncheva,J.J.Rodriguez.Classifierensembleswitharandomlinearoracle.IEEETransactionsonKnowledgeandDataEngineering,2007,19(4):500-508.Z.-H.Zhou,J.Wu,W.Tang.Ensemblingneuralnetworks:Manycouldbebetterthanall.ArtificialIntelligence,2002,137(1-2):239-263.Z.Yu,L.Li,J.Liu,G.Han.Hybridadaptiveclassifierensemble.IEEETransactionsonCybernetics,2015,42(2):177-190.Z.-H.Zhou.EnsembleMethods:FoundationsandAlgorithms,BocaRaton,FL:Chapman&Hall/CRC,2012.Z.Yu,Z.Deng,H.-S.Wong,L.Tan.Identifyingproteinkinase-specificphosphorylationsitesbasedonthebagging-adaboostensembleapproach.IEEETransactionsonNanoBioScience,2010,9(2):132-143.X.Zhu,P.Zhang,X.Lin,Y.Shi.Activelearningfromstreamdatausingoptimalweightclassifierensemble.IEEETransactionsonSystems,Man,andCybernetics-PartB:Cybernetics,2010,40(6):1607-1621.Y.Xu,X.Cao,H.Qiao.Anefficienttreeclassifierensemble-basedapproachforpedestriandetection.IEEETransactionsonSystems,Man,andCybernetics-PartB:Cybernetics,2011,41(1):107-117.X.Zhu.Semi-supervisedlearningwithgraphs.PhDthesis,CarnegieMellonB.Settles.Activelearningliteraturesurvey.ComputerSciencesTechnicalReport1648,UniversityofWisconsin–Madison,2009.S.Tong,D.Koller.Supportvectormachineactivelearningwithapplicationstotextclassification.In:Proceedingsofthe17thInternationalConferenceonMachineLearning,Stanford,CA,2000,999–1006.N.Roy,A.McCallum.Towardoptimalactivelearningthroughsamplingestimationoferrorreduction.In:Proceedingsofthe18thInternationalConferenceonMachineLearning,Williamstown,MA,2001,441–448.Y.Freund,H.S.Seung,E.Shamir,N.Tishby.Selectivesamplingusingthequerybycommitteealgorithm.MachineLearning,1997.28(2-3):133–168.S.Dasgupta,D.Hsu.Hierarchicalsamplingforactivelearning.In:Proceedingsofthe25thInternationalConferenceonMachineLearning,Helsinki,Finland,2008,208–215.B.Settles,M.Craven.Ananalysisofactivelearningstrategiesforsequencelabelingtasks.In:ProceedingsoftheConferenceonEmpiricalMethodsinNaturalLanguageProcessing,Honolulu,HI,2008,1069–1078.S.-J.Huang,R.Jin,Z.-H.Zhou.Activelearningbyqueryinginformativeandrepresentativeexamples.IEEETransactionsonPatternAnalysisandMachineIntelligence,2014.36(10):1936-1949.R.Chattopadhyay,Z.Wang,W.Fan,I.Davidson,S.Panchanathan,J.Ye.Batchmodeactivesamplingbasedonmarginalprobabilitydistributionmatching.In:Proceedingsofthe18thACMSIGKDDInternationalConferenceonKnowledgeDiscoveryandDataMining,Beijing,China,2012,741-749.S.-J.Huang,S.Chen,Z.-H.Zhou.Multi-labelactivelearning:Querytypematters.In:Proceedingsofthe24thInternationalJointConferenceonArtificialIntelligence,BuenosAires,ArgentinaP.Donmez,J.Carbonell,J.Schneider.Efficientlylearningtheaccuracyoflabelingsourcesforselectivesampling.In:Proceedingsofthe15thACMSIGKDDInternationalConferenceonKnowledgeDiscoveryandDataMining,Paris,France,2009,259–268.D.Margineantu.Activecost-sensitivelearning.In:Proceedingsofthe19thInternationalJointConferenceonArtificialIntelligence,Edinburgh,UK,2005,1622–1623.R.S.Sutton,A.G.Barto.ReinforcementLearning:AnIntroduction.Cambridge,MA:MITPress,1998.P.Abbeel,A.Coates,M.Quigley,A.Y.Ng.Anapplicationofreinforcementlearningtoaerobatichelicopterflight.In:AdvancesinNeuralInformationProcessingSystems19,Cambridge,MA:MITPress,2007,1-8.Y.C.Wang,J.M.Usher.Applicationofreinforcementlearningforagent-basedproductionscheduling.EngineeringApplicationsofArtificialIntelligence,2005,18(1):73-82.J.J.Choi,D.Laibson,B.C.Madrian,A.Metrick.Reinforcementlearningandsavingsbehavior.TheJournalofFinance,2009,64(6):2515-2534.J.A.Boyan,M.L.Littman.Packetroutingindynamicallychangingnetworks:Areinforcementlearningapproach.In:AdvancesinNeuralInformationProcessingSystems6,Burlington,MA:MorganKaufmann,1994,671-671.J.Frank,L.C.Seeberger,R.C.O'Reilly.Bycarrotorbystick:CognitivereinforcementlearninginParkinsonism.Science,2004,306(5703):1940-1943.K.Samejima,Y.Ueda,K.Doya,M.Kimura.Representationofaction-specificrewardvaluesinthestriatum.Science,2005,310(5752):1337-1340.T.G.Dietterich.Machinelearningresearch:Fourcurrentdirections.AIMagazine,1997,18(4),97-136.C.H.Watkins.Learningfromdelayedrewards.Ph.D.Thesis,KingsCollege,UniversityofCambridge,1989.P.L.Bartlett,J.Baxter.Infinite-horizonpolicy-gradientestimation.JournalofArtificialIntelligenceResearch,2001,15:319-350.G.Rummery,M.Niranjan.On-lineQ-learningusingconnectionistsystems.TechnicalReport,UniversityofCambridge,1994.R.J.Williams.Simplestatisticalgradient-followingalgorithmsforconnectionistreinforcementlearning.MachineLearning,1992,8(3):229–256.G.Konidaris,S.Osentoski,P.Thomas.ValuefunctionapproximationinreinforcementlearningusingtheFourierbasis.In:Proceedingsofthe25thAAAIConferenceonArtificialIntelligence,SanFrancisco,CA,2011,380-385.M.Bellemare,J.Veness,M.Bowling.Sketch-basedlinearvaluefunctionapproximation.In:AdvancesinNeuralInformationProcessingSystems25,Cambridge,MA:MITPress,2012,2222-2230.X.Xu,D.Hu,X.Lu.Kernel-basedleastsquarespolicyiterationforreinforcementlearning.IEEETransactionsonNeuralNetworks,2007,18(4):973-992.V.Mnih,K.Kavukcuoglu,D.Silver,A.A.Rusu,J.Veness,M.G.Bellemare,A.Graves,M.Riedmiller,A.K.Fidjeland,G.Ostrovski,S.Petersen,C.Beattie,A.Sadik,I.Antonoglou,H.King,D.Kumaran,D.Wierstra,S.Legg,D.Hassabis.Human-levelcontrolthroughdeepreinforcementlearning.Nature,2015,518:529–533.S.Mannor,R.Y.Rubinstein,Y.Gat.Thecrossentropymethodforfastpolicysearch.In:Proceedingsofthe30thInternationalConferenceonMachineLearning,Atlanta,GA,2013,512-519.I.Szita,A.Lörincz.Learningtetrisusingthenoisycross-entropymethod.NeuralComputation,2006,18(12):2936-2941.S.Schaal.Isimitationlearningtheroutetohumanoidrobots.TrendsinCognitive Sciences.1999,3(6):233-242.C.Atkeson,S.Schaal.Robotlearningfromdemonstration.In:Proceedingsofthe14thInternationalConferenceonMachineLearning,SanFrancisco,CA,1997,12-20.P.Abbeel,A.Y.Ng.Apprenticeshiplearningviainversereinforcementlearning.In:Proceedingsofthe21stInternationalConferenceonMachineLearning,Banff,Canada,2004,1-8.B.Ziebart,A.Maas,J.Bagnell,A.Dey.Maximumentropyinversereinforcementlearning.In:Proceedingsofthe23thAAAIConferenceonArtificialIntelligence,Chicago,IL,2008,1433-1438.A.Y.Ng,S.J.Russell.Algorithmsforinversereinforcementlearning.In:Proceedingsofthe17thInternationalConferenceonMachineLearning,Stanford,CA,2000,663–670.P.Abbeel,D.Dolgo,A.Y.Ng,S.Thrun.Apprenticeshiplearningformotionplanningwithapplicationtoparkinglotnavigation.In:ProceedingsoftheIEEE/RSJInternationalConferenceonIntelligentRobotsandSystems,Nice,France,2008,1083–1090.M.E.Taylor,P.Stone.Transferlearningforreinforcementlearningdomains:Asurvey.JournalofMachineLearningResearch,2009,10:1633–1685.M.E.Taylor,G.Kuhlmann,P.Stone.Autonomoustransferforreinforcementlearning.In:Proceedingsofthe7thInternationalConferenceonAutonomousAgentsandMultiagentSystems,Estoril,Portugal,2008,283–290.B.DaSilva,G.Konidaris,A.Barto.Learningparameterizedskills.In:Proceedingsofthe29thInternationalConferenceonMachineLearning,Edinburgh,UK,2012,1679-1686.W.B.Knox,P.Stone.Framingreinforcementlearningfromhumanreward:Rewardpositivity,temporaldiscounting,episodicity,andperformance.ArtificialIntelligence,2015,225:24-50.S.J.Pan,Q.Yang.Asurveyontransferlearning.IEEETransactiononDataEngineering,2010.22(10):1345-1359.J.Jiang,C.X.Zhai.Atwo-stageapproachtodomainadaptationforstatisticalclassifiers.In:Proceedingsofthe16thACMConferenceonInformationandKnowledgeManagement,Lisbon,Portugal,2007,401-410.W.Y.Dai,G.R.Xue,Q.Yang,Y.Yu.Co-clusteringbasedclassificationforout-of-domaindocuments.In:Proceedingsofthe13thACMSIGKDDInternationalConferenceonKnowledgeDiscoveryandDataMining,SanJose,CA,2007,210-219.M.Fang,J.Yin,X.Q.Zhu.Transferlearningacrossnetworksforcollectiveclassification.In:Proceedingsofthe13thIEEEInternationalConferenceonDataMining,Dallas,TX,2013,161-170.S.J.Pan,J.T.Kwok,Q.Yang.Transferlearningviadimensionalityreduction.In:Proceedingsofthe23rdNationalConferenceonArtificialIntelligence,Chicago,IL,2008,677-682.J.Blitzer,R.McDonald,F.Pereira.Domainadaptationwithstructuralcorrespondencelearning.In:ProceedingsoftheInternationalConferenceonEmpiricalMethodsinNaturalLanguageProcessing,Sydney,Australia,2006,120-128.Y.Yeh,C.Huang,Y.Wang.Heterogeneousdomainadaptationandclassificationbyexploitingthecorrelationsubspace.IEEETransactionsonImageProcessing,2013,23(5):2009-2018.J.Jiang,C.X.Zhai.InstanceweightingfordomainadaptationinNLP.In:Proceedingsofthe45thAnnualMeetingoftheAssociationforComputationalLinguistics,Prague,CzechRepublic,2007,264-271.W.Y.Dai,Q.Yang,G.R.Xue,Y.Yu.Boostingfortransferlearning.In:Proceedingsofthe24thInternationalConferenceonMachineLearning,Corvallis,ORJ.Gao,W.Fan,Y.Z.Sun,J.Han.Heterogeneoussourceconsensuslearningviadecisionpropagationandnegotiation.In:Proceedingsofthe13thACMSIGKDDInternationalConferenceonKnowledgeDiscoveryandDataMining,Paris,France,2009,339-348.F.Z.Zhuang,P.Luo,H.Xiong,Y.Xiong,Q.He,Z.Shi.Cross-domainlearningfrommultiplesources:Aconsensusregularizationperspective.IEEETransactionsonKnowledgeandDataEngineering,2010,22(12):1664-1678.F.Z.Zhuang,
X.Cheng,
P.Luo,
S.J.Pan,
Q.He.Supervisedrepresentationlearning:Transferlearningwithdeepautoencoders.
In:Proceedingsofthe24thInternationalJointConferenceonArtificialIntelligence,BuenosAires,Argentina,2015,4119-4125.F.Z.Zhuang,
X.Cheng,
S.J.Pan,
W.Yu,
Q.He,
Z.Shi.Transferlearningwithmultiplesourcesviaconsensusregularizedautoencoders.
In:LectureNotesinComputerScience8726,Berlin:Springer,2014,417-431.Q.Q.Gu,J.Zhou.Learningthesharedsubspaceformulti-taskclusteringandtransductivetransferclassification.In:Proceedingsofthe9thIEEEInternationalConferenceonDataMining,Miami,FL,2009,159-168.M.Kan,J.Wu,S.Shan,X.Chen.Domainadaptationforfacerecognition:Targetizesourcedomainbridgedbycommonsubspace.InternationalJournalofComputerVision,2014,109(1):94-109.W.Pan,E.W.Xiang,Q.Yang.Transferlearningincollaborativefilteringwithuncertainratings.In:Proceedingsofthe26thAAAIConferenceonArtificialIntelligence,Toronto,Canada,2012,662-668.G.E.Hinton,R.R.Salakhutdinov.Reducingthedimensionalityofdatawithneuralnetwork.Science,2006,313(5786):504-507.G.Dahl,D.Yu,L.Deng,A.Acero.Context-dependentpre-traineddeepneuralnetworksforlargevocabularyspeechrecognition.IEEETransactionsonAudio,Speech,andLanguageProcessing,2012,20(1):30-42.A.Hannun,C.Case,J.Casper,B.Catanzaro,G.Diamos,E.Elsen,R.Prenger,S.Satheesh,S.Sengupta,A.CoatesandA.Y.Ng.DeepSpeech:Scalingupend-to-endspeechrecognition.arXiv:1412.5567,2014.D.C.Ciresan,U.Meier,L.M.Gambardella,J.Schmidhuber.Deepbigsimpleneuralnetsexcelonhandwrittendigitrecognition.arXiv:1003.0358,2010.A.Krizhevsky,I.Sutskever,G.E.Hinton.Imagenetclassificationwithdeepconvolutionalneuralnetworks.In:AdvancesinNeuralInformationProcessingSystems25,Cambridge,MA:MITPress,2012,1097-1105.C.Szegedy,W.Liu,Y.Jia,P.Sermanet,S.Reed,D.Anguelov,D.Erhan,V.Vanhocke,A.Rabinovich.Goingdeeperwithconvolutions.arXiv:1409.4842,2014.R.Collobert,J.Weston.Aunifiedarchitecturefornaturallanguageprocessing:Deepneuralnetworkswithmultitasklearning.In:Proceedingsofthe25thInternationalConferenceonMachineLearning,Helsinki,Finland,2008,160-167.A.Mnih,G.Hinton.Threenewgraphicalmodelsforstatisticallanguagemodeling.In:Proceedingsofthe24thInternationalConferenceonMachineLearning,Corvallis,OR,2007,641-648.A.Mnih,G.Hinton.Ascalablehierarchicaldistributedlanguagemodel.In:AdvancesinNeuralInformationProcessingSystems21,Cambridge,MA:MITPress,2009,1081-1088.M.K.Leung,H.Y.Xiong,L.J.Lee,B.J.Frey.Deeplearningofthetissue-regulatedsplicin
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 全国安全教育培训感言课件
- 《税法》第8章:特点目的税法
- 建筑学专业就业前景
- 全员培训课件手机播放
- 宝鸡职业发展规划指南
- 人工智能训练方法
- 安全督导新举措讲解
- 贷款业务话术书籍
- 动物医学就业前景分析
- 光电工厂安全培训内容课件
- 2026年安全员考试题库300道附完整答案【必刷】
- 销售行业合同范本
- 2026年民用无人机操控员执照(CAAC)考试复习重点题库标准卷
- 英语试卷+答案黑龙江省哈三中2025-2026学年上学期高二学年12月月考(12.11-12.12)
- 中北大学2025年招聘编制外参编管理人员备考题库(一)参考答案详解
- 中华联合财产保险股份有限公司2026年校园招聘备考题库及一套完整答案详解
- 诗经中的爱情课件
- 2025年烟花爆竹经营单位安全管理人员考试试题及答案
- 2025天津大学管理岗位集中招聘15人参考笔试试题及答案解析
- 2025年云南省人民检察院聘用制书记员招聘(22人)考试笔试参考题库及答案解析
- TCAMET02002-2019城市轨道交通预埋槽道及套筒技术规范
评论
0/150
提交评论