Copyright Paula Matuszek 2017
CMSC 671: Principles of Artificial Intelligence
Python for AI
Dr. Paula Matuszek
Paula.Matuszek@
(610)647-9789
Why Python for AI?
The traditional AI languages are Lisp and Prolog.
When performance is a serious issue, commonly used languages include C++, sometimes Java, and more recently GPU-based languages.
So why Python?
Lightweight startup, interpreter, IDLE.
Object-oriented.
Useful built-in data structures and operations for symbolic processing: dictionaries, lists, sets, strings.
Strong numeric processing for statistical processing: matrix operations, etc.
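As a rough illustration of the built-in data structures listed above, the sketch below counts words in a sentence using only the standard library. The sample sentence and variable names are made up for this example.

```python
from collections import Counter

# Hypothetical sample sentence, used only for illustration.
text = "the cat sat on the mat and the dog sat on the rug"

# A string splits into a list of word tokens.
words = text.split()

# A Counter (a dict subclass) maps each word to how often it occurs.
counts = Counter(words)

# A set keeps only the distinct words and supports fast membership tests.
vocabulary = set(words)

print(counts.most_common(2))
print(len(words), "tokens,", len(vocabulary), "distinct words")
print("dog" in vocabulary)
```

Nothing here needs an import beyond the standard library, which is part of what makes Python convenient for quick symbolic-processing experiments.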
Python for AI
In addition to the built-in capabilities, Python gets much of its power from libraries that can be imported to add capabilities.
A couple of basic ones you will want (and probably have):
NumPy: math functions for arrays and matrices.
matplotlib: plotting library for Python and NumPy.
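To give a feel for the array and matrix functions NumPy adds, here is a minimal sketch. The matrix and vector values are arbitrary, chosen only so the results are easy to check by hand.

```python
import numpy as np

# Hypothetical 2x2 matrix and vector, chosen only for illustration.
A = np.array([[2.0, 1.0],
              [1.0, 3.0]])
b = np.array([3.0, 5.0])

product = A @ A                 # matrix multiplication
transpose = A.T                 # transpose
x = np.linalg.solve(A, b)       # solve the linear system A x = b
column_means = A.mean(axis=0)   # mean of each column

print(product)
print(x)
print(column_means)
```

The same arrays can be handed directly to matplotlib for plotting, which is why the two libraries are usually installed together.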
Poole and Mackworth have a large set of Python tools which implement many basic AI functions. They are starting points, not polished code.
/AIPython/
The PDF /AIPython/aipython.pdf gives a discussion of Python and of the tools.
More Specific AI Libraries
As you get into more advanced topics, for projects or later in the course, you will reach some areas where you don’t want to implement from scratch. There are a number of more specific libraries that let you concentrate on a broader problem.
Some of the best-known are:
Natural Language Toolkit (NLTK)
Scikit-learn
TensorFlow
NLTK
What is NLTK?
Natural Language ToolKit
Set of modules for Python
Large number of tools for processing natural text
A large set of relevant data
Widely used for natural language recognition, speech processing, text mining, speech translation.
Starting point is /.
Some NLTK Modules
NLTK is a toolkit for processing text.
Text is treated as a list of words.
Modules include:
Stemmers, Tokenizers, Parsers
Part of Speech and Named Entity Taggers
N-Grams, Frequency Distributions, other statistical tools
Classifying documents into predefined groups
Clustering documents into groups
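A few of the modules listed above can be tried without downloading any NLTK data. The sketch below assumes NLTK is installed and uses a made-up sentence. It splits on whitespace with `str.split` as a stand-in for an NLTK tokenizer, because tokenizers such as `word_tokenize` need a separately downloaded model.

```python
from nltk.probability import FreqDist
from nltk.stem import PorterStemmer
from nltk.util import ngrams

# Hypothetical sample text, treated as a list of words.
# str.split stands in for an NLTK tokenizer here.
words = "the dogs are running and the cats are running too".split()

# Stemmer: reduce each word to its stem.
stemmer = PorterStemmer()
stems = [stemmer.stem(w) for w in words]

# N-grams: every pair of adjacent words (bigrams).
bigrams = list(ngrams(words, 2))

# Frequency distribution: how often each word occurs.
freq = FreqDist(words)

print(stems)
print(bigrams[:3])
print(freq.most_common(3))
```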
NLTK Corpora
NLTK also has a large set of corpora: existing text bodies that can readily be processed.
Brown: about 500 English documents, about a million words, compiled from a variety of sources. First general computer corpus available.
Reuters: about 800,000 news articles
Gutenberg: 18 works from 12 authors
Shakespeare: 8 of his plays
Current list at /nltk_data/
It also includes dictionaries, gazetteers, trained models.
Example
Simple classifier, trying to classify names into male and female by the last letter.
/~matuszek/fall2013/
Sep11Classify.py
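The referenced script is not reproduced in these slides, so the following is only a sketch of what a last-letter name classifier might look like, not the course's actual code. It assumes NLTK is installed, uses its Naive Bayes classifier, and trains on a tiny hand-written list of names (hypothetical data) rather than a real names corpus.

```python
from nltk.classify import NaiveBayesClassifier


def last_letter_feature(name):
    """Represent a name by a single feature: its final letter."""
    return {"last_letter": name[-1].lower()}


# Hypothetical, hand-written training data; far too small for real use.
labeled_names = [
    ("Anna", "female"), ("Maria", "female"), ("Julia", "female"),
    ("Emma", "female"), ("Olivia", "female"), ("Sophia", "female"),
    ("John", "male"), ("Kevin", "male"), ("Brian", "male"),
    ("Peter", "male"), ("Mark", "male"), ("David", "male"),
]

# NLTK classifiers train on (feature dict, label) pairs.
train_set = [(last_letter_feature(name), gender) for name, gender in labeled_names]
classifier = NaiveBayesClassifier.train(train_set)

print(classifier.classify(last_letter_feature("Laura")))
print(classifier.classify(last_letter_feature("Ethan")))
```

With a realistic corpus the same structure applies; only the training list changes, and accuracy would normally be measured on names held out from training.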
Scikit-learn
SciPy is “a Python-based ecosystem of open-source software for mathematics, science, and engineering.” (https:// /)
NumPy and matplotlib are from here.
Scikit-learn is a set of machine learning modules built on top of SciPy. ( /stable/)
Scikit-learn modules
Scikit-learn modules are grouped into six kinds:
Classification
Regression
Clustering
Dimensionality Reduction
Data Preparation
Model Selection
Classification
Deciding what group or class an object belongs to
Algorithms:
Decision Trees
K Nearest Neighbor
Naive Bayes
Support Vector Machines
Examples: male vs. female :-), spam detection, loan applications, college admissions, image identification
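As one possible illustration of classification, the sketch below uses scikit-learn's k-nearest-neighbor classifier on a toy spam-detection problem. The two features (number of links, number of exclamation marks) and all the data values are invented for this example.

```python
from sklearn.neighbors import KNeighborsClassifier

# Hypothetical training data: [number of links, number of exclamation marks].
X_train = [[0, 0], [1, 0], [0, 1], [1, 1],
           [8, 9], [9, 8], [8, 8], [9, 9]]
y_train = ["ham", "ham", "ham", "ham",
           "spam", "spam", "spam", "spam"]

# Each new message gets the majority label of its 3 nearest training points.
classifier = KNeighborsClassifier(n_neighbors=3)
classifier.fit(X_train, y_train)

predictions = classifier.predict([[0.5, 0.5], [8.5, 8.5]])
print(predictions)
```

The other algorithms on this slide follow the same `fit` and `predict` pattern in scikit-learn, so swapping in a decision tree or naive Bayes model changes only the line that constructs the classifier.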
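Regression
Predicting a continuous value (rather than a class) for an object
Algorithms:
Least squares linear regression
Logistic Regression (despite the name, scikit-learn treats it as a classifier: it predicts the probability of a class)
Examples: What will this house sell for? What GPA can we expect this student to achieve? What temperature will it be tomorrow?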
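A minimal sketch of least squares linear regression with scikit-learn follows. The house sizes and prices are made up and lie exactly on the line price = 2 * size + 10, so the fitted coefficients are easy to verify.

```python
from sklearn.linear_model import LinearRegression

# Hypothetical data: house size (one feature) and price, with price = 2 * size + 10.
sizes = [[50], [80], [100], [120], [150]]
prices = [110, 170, 210, 250, 310]

model = LinearRegression()
model.fit(sizes, prices)

slope = model.coef_[0]
intercept = model.intercept_
predicted_price = model.predict([[200]])[0]

print(slope, intercept, predicted_price)
```

Real data will not fall exactly on a line; the fit then minimizes the sum of squared differences between predicted and observed values.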
Clustering
Grouping similar objects without any a priori definition of groups
Algorithms:
K-Means
Agglomerative clustering
Hierarchical clustering
Examples: What are the news topics in these articles? How many different things do I have images of? What are the kinds of calls to the help desk we are getting?
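To make the idea of grouping without predefined labels concrete, here is a sketch using scikit-learn's K-Means on six invented 2-D points that form two obvious groups. The number of clusters (2) is an assumption supplied by hand; K-Means does not discover it.

```python
import numpy as np
from sklearn.cluster import KMeans

# Hypothetical 2-D points: three near (1, 1) and three near (8, 8).
points = np.array([[1.0, 1.0], [1.2, 0.8], [0.9, 1.1],
                   [8.0, 8.0], [8.2, 7.9], [7.9, 8.1]])

# No labels are given; the algorithm only sees the points themselves.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
labels = kmeans.fit_predict(points)

print(labels)
print(kmeans.cluster_centers_)
```

Which group is numbered 0 and which is numbered 1 is arbitrary; only the grouping itself is meaningful.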
Dimensionality Reduction
Reducing the total number of variables for each individual without losing explanatory power
Algorithms:
Principal Component Analysis
Factor Analysis
Singular Value Decomposition
Examples: Reducing text vector lengths, eliminating some of the redundancy in natural language. Reducing image vectors, eliminating irrelevant variation from lighting.
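The sketch below shows Principal Component Analysis reducing two variables to one with scikit-learn. The data is invented so that the second variable is roughly twice the first, which means a single component should retain nearly all of the variance.

```python
import numpy as np
from sklearn.decomposition import PCA

# Hypothetical data: the second column is roughly twice the first,
# so the two variables are largely redundant.
data = np.array([[1.0, 2.1],
                 [2.0, 3.9],
                 [3.0, 6.2],
                 [4.0, 7.8],
                 [5.0, 10.1]])

# Keep a single component instead of the original two variables.
pca = PCA(n_components=1)
reduced = pca.fit_transform(data)

retained = pca.explained_variance_ratio_[0]
print(reduced.shape)
print(retained)
```

The explained variance ratio is one way to check how much explanatory power survives the reduction.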
The Rest
Preprocessing. Data cleanup
Algorithms: feature extraction and normalization
Examples: Using weight and height in the same classification, turning a 10-point movie rating into “yes, no”.
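Both preprocessing examples above can be sketched in a few lines. The heights, weights, and ratings are invented, and the cutoff of 6 for turning a rating into yes/no is an arbitrary assumption made for this illustration.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Hypothetical people: [height in cm, weight in kg]. The two columns are on
# different scales, which can distort distance-based classifiers.
people = np.array([[150.0, 50.0],
                   [160.0, 60.0],
                   [170.0, 70.0],
                   [180.0, 80.0],
                   [190.0, 90.0]])

# Normalization: rescale each column to mean 0 and standard deviation 1.
scaled = StandardScaler().fit_transform(people)

# Turning a 10-point rating into yes/no; the cutoff of 6 is an assumption.
ratings = np.array([2, 9, 6, 4, 10])
liked = ratings >= 6

print(scaled)
print(liked)
```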
Model Selection. Checking usefulness of models, comparing different approaches and parameters.
Examples: Each of the above methods produces a mathematical model or formula which can then be applied to new data. And each has many parameters which can be tweaked. These tools help decide among the options and determine whether they are actually good.
All of this is well documented at the scikit-learn site.
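One common way to compare approaches is cross-validation. The sketch below scores two candidate classifiers on the iris dataset bundled with scikit-learn. The choice of dataset, candidates, and parameter values is an assumption made only for illustration.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

# The iris dataset ships with scikit-learn, so no download is needed.
X, y = load_iris(return_X_y=True)

# Two hypothetical candidates to compare.
candidates = {
    "3 nearest neighbors": KNeighborsClassifier(n_neighbors=3),
    "decision tree": DecisionTreeClassifier(max_depth=3, random_state=0),
}

# 5-fold cross-validation: train on 4/5 of the data, test on the rest, 5 times.
mean_scores = {
    name: cross_val_score(model, X, y, cv=5).mean()
    for name, model in candidates.items()
}

best = max(mean_scores, key=mean_scores.get)
print(mean_scores)
print("best:", best)
```

Because every score comes from data the model did not see during training, this gives a more honest estimate of how well each option would do on new data than accuracy on the training set.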
And others
TensorFlow is Google’s deep learning framework; it is written mostly in C++, but the API is Python based. It’s relatively new, not as wid