Copyright Paula Matuszek 2017
CMSC 671: Principles of Artificial Intelligence
Python for AI
Dr. Paula Matuszek
Paula.Matuszek@
(610) 647-9789
Why Python for AI?
The traditional AI languages are Lisp and Prolog.
When performance is a serious issue, commonly used languages include C++, sometimes Java, and more recently GPU-based languages.
So why Python?
Lightweight startup, interpreter, IDLE.
Object-oriented.
Useful built-in data structures and operations for symbolic processing: dictionaries, lists, sets, strings.
Strong numeric processing for statistical processing: matrix operations, etc.
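As a small illustration of the built-in structures listed above, the sketch below counts words using a string, a list, a set, and a dictionary. The sentence and the variable names are made up for this example.

```python
# Illustrative only: the sentence and names here are invented for this sketch.
text = "the cat sat on the mat and the dog sat too"

words = text.split()        # string -> list of words
vocabulary = set(words)     # set -> the distinct words

counts = {}                 # dictionary: word -> number of occurrences
for word in words:
    counts[word] = counts.get(word, 0) + 1

# The word with the highest count
most_common = max(counts, key=counts.get)
print(len(words), len(vocabulary), most_common, counts[most_common])
```

No imports are needed; everything used here is part of the core language.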
Python for AI
In addition to the built-in capabilities, Python gets much of its power from libraries that can be imported to add capabilities.
A couple of basic ones you will want (and probably have):
NumPy: math functions for arrays and matrices
matplotlib: plotting library for Python and NumPy
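To give a feel for the array and matrix functions NumPy adds, here is a minimal sketch that solves a small linear system and takes column means. The matrix values are arbitrary and chosen only for illustration.

```python
import numpy as np

# Arbitrary 2x2 system A @ x = b, for illustration only
A = np.array([[2.0, 1.0],
              [1.0, 3.0]])
b = np.array([3.0, 5.0])

x = np.linalg.solve(A, b)      # solve the linear system
product = A @ x                # matrix-vector product, should reproduce b
col_means = A.mean(axis=0)     # mean of each column

print(x, product, col_means)
```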
Poole and Mackworth have a large set of Python tools which implement many basic AI functions. They are starting points, not polished code.
/AIPython/
The PDF /AIPython/aipython.pdf gives a discussion of Python and of the tools.
More Specific AI Libraries
As you get into more advanced topics, for projects or later in the course, you will reach some areas where you don’t want to implement from scratch. There are a number of more specific libraries that let you concentrate on a broader problem.
Some of the best-known are:
Natural Language Toolkit (NLTK)
Scikit-learn
TensorFlow
NLTK
What is NLTK?
Natural Language Toolkit
Set of modules for Python
Large number of tools for processing natural text
A large set of relevant data
Widely used for natural language recognition, speech processing, text mining, speech translation.
Starting point is /.
Some NLTK Modules
NLTK is a toolkit for processing text
Text is treated as a list of words
Modules include
Stemmers, Tokenizers, Parsers
Part of Speech and Named Entity Taggers
N-Grams, Frequency Distributions, other statistical tools
Classifying documents into predefined groups
Clustering documents into groups
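A few of the modules above can be tried without downloading any NLTK data. The sketch below assumes NLTK is installed and uses a regex-based tokenizer, the Porter stemmer, a frequency distribution, and bigrams. The sample sentence is invented for this example.

```python
from nltk import FreqDist, bigrams
from nltk.stem import PorterStemmer
from nltk.tokenize import wordpunct_tokenize

# Invented sample text, for illustration only
text = "The cats keep running. The dogs keep running too."

# Tokenizer: text becomes a list of words (punctuation dropped afterwards)
tokens = wordpunct_tokenize(text.lower())
words = [t for t in tokens if t.isalpha()]

# Stemmer: reduce each word to its stem
stemmer = PorterStemmer()
stems = [stemmer.stem(w) for w in words]

# Frequency distribution over stems, and bigrams (2-grams) over words
freq = FreqDist(stems)
pairs = list(bigrams(words))

print(freq.most_common(3))
print(pairs[:3])
```

Taggers, parsers, and some other tokenizers need trained models or corpora that are fetched separately with `nltk.download()`.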
NLTK Corpora
NLTK also has a large set of corpora: existing text bodies that can readily be processed.
Brown: About 500 English documents, about a million words, compiled from a variety of sources. First general computer corpus available.
Reuters: about 800,000 news articles
Gutenberg: 18 works from 12 authors
Shakespeare: 8 of his plays
Current list at /nltk_data/
It also includes dictionaries, gazetteers, trained models.
Example
Simple classifier, trying to classify names into male and female by the last letter.
/~matuszek/fall2013/
Sep11Classify.py
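The course file itself is not reproduced here. The following is only a sketch of the same idea, assuming NLTK's `NaiveBayesClassifier` and a tiny hand-made list of names. The names, the labels, and the helper `last_letter_feature` are all hypothetical. A realistic version would train on a much larger list, such as the names corpus that ships with the NLTK data.

```python
import nltk

def last_letter_feature(name):
    """Hypothetical feature extractor: only the last letter of the name."""
    return {"last_letter": name[-1].lower()}

# Tiny hand-made training set, for illustration only (not real course data)
labeled_names = [
    ("Anna", "female"), ("Maria", "female"), ("Julia", "female"),
    ("Sophia", "female"), ("Emma", "female"),
    ("John", "male"), ("Kevin", "male"), ("Martin", "male"),
    ("Ethan", "male"), ("Mark", "male"), ("Jack", "male"),
]

# NLTK classifiers train on (feature dictionary, label) pairs
train_set = [(last_letter_feature(n), label) for n, label in labeled_names]
classifier = nltk.NaiveBayesClassifier.train(train_set)

guess_a = classifier.classify(last_letter_feature("Olivia"))
guess_n = classifier.classify(last_letter_feature("Brian"))
print(guess_a, guess_n)
```

With so little training data the result mostly reflects the toy list, which is the point of the exercise: a single feature already separates many names, and it is easy to find names it gets wrong.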
Scikit-learn
SciPy is “a Python-based ecosystem of open-source software for mathematics, science, and engineering.” (https:// /)
NumPy and matplotlib are from here.
Scikit-learn is a set of machine learning modules built on top of SciPy. (/stable/)
Scikit-learn modules
Scikit-learn modules are grouped into six kinds:
Classification
Regression
Clustering
Dimensionality Reduction
Data Preparation
Model Selection
Classification
Deciding what group or class an object belongs to
Algorithms:
Decision Trees
K Nearest Neighbor
Naive Bayes
Support Vector Machines
Examples: male vs female :-). Spam detection, loan applications, college admissions, image identification
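As a rough sketch of how two of the algorithms above are used in scikit-learn, the example below trains a decision tree and a k-nearest-neighbor classifier. The choice of the bundled iris dataset, the 70/30 split, and `n_neighbors=5` are assumptions made for this illustration.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

# Iris ships with scikit-learn: 150 flowers, 4 measurements, 3 classes
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

models = {
    "decision tree": DecisionTreeClassifier(random_state=0),
    "k nearest neighbor": KNeighborsClassifier(n_neighbors=5),
}

# Every scikit-learn classifier follows the same fit / score pattern
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = model.score(X_test, y_test)  # accuracy on held-out data

print(scores)
```

Because all classifiers share the `fit` and `score` interface, swapping in Naive Bayes or a support vector machine is a change to the `models` dictionary only.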
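Regression
Predicting a continuous value (rather than a class) for an object
Algorithms:
Least squares linear regression
Logistic Regression (despite its name, normally used for classification, since it predicts a class probability)
Examples: What will this house sell for? What GPA can we expect this student to achieve? What temperature will it be tomorrow?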
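The house-price question can be sketched with least squares linear regression. The data below is synthetic: the floor areas, the "true" price rule, and the noise level are all invented for this example.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic data, for illustration only: price is roughly 1000 * area + 20000
rng = np.random.default_rng(0)
area = rng.uniform(50, 200, size=100)
price = 1000 * area + 20000 + rng.normal(0, 5000, size=100)

# scikit-learn expects a 2-D feature matrix, hence the reshape
model = LinearRegression()
model.fit(area.reshape(-1, 1), price)

# Predict a continuous value for a new, unseen house
predicted = model.predict(np.array([[120.0]]))[0]
print(model.coef_[0], model.intercept_, predicted)
```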
Clustering: grouping similar objects without any a priori definition of groups
Algorithms:
K-Means
Agglomerative clustering
Hierarchical clustering
Examples: What are the news topics in these articles? How many different things do I have images of? What are the kinds of calls to the help desk we are getting?
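A minimal K-Means sketch, assuming two well-separated synthetic blobs of points. The blob centers, spread, and the choice of two clusters are made up for this example. Note that no labels are given to the algorithm.

```python
import numpy as np
from sklearn.cluster import KMeans

# Two synthetic blobs of 2-D points, for illustration only
rng = np.random.default_rng(0)
blob_a = rng.normal(loc=[0, 0], scale=0.5, size=(50, 2))
blob_b = rng.normal(loc=[5, 5], scale=0.5, size=(50, 2))
points = np.vstack([blob_a, blob_b])

# Ask for 2 clusters; the algorithm only sees the points, not their origin
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
labels = kmeans.fit_predict(points)

print(labels[:5], labels[-5:])
print(kmeans.cluster_centers_)
```

Which blob ends up as cluster 0 and which as cluster 1 is arbitrary; only the grouping is meaningful.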
Dimensionality Reduction
Reducing the total number of variables for each individual without losing explanatory power
Algorithms:
Principal Component Analysis
Factor Analysis
Singular Value Decomposition
Examples: Reducing text vector lengths, eliminating some of the redundancy in natural language. Reducing image vectors, eliminating irrelevant variation from lighting.
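A short Principal Component Analysis sketch. The use of the bundled iris dataset and the reduction from four variables to two are choices made for this illustration.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

# 150 samples, 4 variables each
X, _ = load_iris(return_X_y=True)

# Keep only the 2 directions that carry the most variance
pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X)

# Fraction of the original variance retained by the 2 components
explained = pca.explained_variance_ratio_.sum()
print(X_reduced.shape, explained)
```

The retained-variance figure is one way to check the "without losing explanatory power" part: if it is close to 1, little information was dropped.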
The Rest
Preprocessing. Data cleanup
Algorithms: feature extraction and normalization
Examples: Using weight and height in the same classification, turning a 10-point movie rating into “yes, no”.
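Both preprocessing examples can be sketched in a few lines. The weights, heights, ratings, and the cutoff of 6 for "yes" are all invented for this illustration.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Made-up data: columns are weight in kg and height in cm
people = np.array([[60.0, 165.0],
                   [75.0, 180.0],
                   [90.0, 175.0],
                   [55.0, 160.0]])

# Normalization: rescale each column to mean 0 and standard deviation 1,
# so that neither variable dominates just because of its units
scaled = StandardScaler().fit_transform(people)

# Turning a 10-point rating into yes/no; the cutoff of 6 is an assumption
ratings = np.array([2, 7, 9, 5, 10])
liked = ratings >= 6

print(scaled.round(2))
print(liked)
```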
Model Selection. Checking usefulness of models, comparing different approaches and parameters.
Examples: Each of the above methods produces a mathematical model or formula which can then be applied to new data. And each has many parameters which can be tweaked. These tools help decide among the options and determine whether they are actually good.
All of this is well documented at the scikit-learn site.
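As a sketch of the model selection tools, the example below uses cross-validation to estimate how good one model is, then a grid search to pick a parameter value. The dataset, the classifier, and the candidate values for `n_neighbors` are assumptions made for this illustration.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)

# Is the model actually good? 5-fold cross-validation gives 5 accuracy scores
cv_scores = cross_val_score(KNeighborsClassifier(n_neighbors=5), X, y, cv=5)

# Which parameter value is best? Try each candidate with cross-validation
search = GridSearchCV(
    KNeighborsClassifier(),
    {"n_neighbors": [1, 3, 5, 7, 9]},
    cv=5,
)
search.fit(X, y)
best_k = search.best_params_["n_neighbors"]

print(cv_scores.mean(), best_k, search.best_score_)
```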
And others
TensorFlow is Google’s deep learning framework; it is written mostly in C++, but the API is Python based. It’s relatively new, not as wid