版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
Unit16TextBDigitalSignalProcessorDigitalSignalProcessorAdigitalsignalprocessor(DSP)isaspecializedmicroprocessor(oraSIPblock),withitsarchitectureoptimizedfortheoperationalneedsofdigitalsignalprocessing.TextTextWordsNotesDiscussionHomeThegoalofDSPsisusuallytomeasure,filterand/orcompresscontinuousreal-worldanalogsignals.Mostgeneral-purposemicroprocessorscanalsoexecutedigitalsignalprocessingalgorithmssuccessfully,butdedicatedDSPsusuallyhavebetterpowerefficiencythustheyaremoresuitableinportabledevicessuchasmobilephonesbecauseofpowerconsumptionconstraints.DSPsoftenusespecialmemoryarchitecturesthatareabletofetchmultipledataand/orinstructionsatthesametime.TextTextWordsNotesDiscussionHome1.OverviewDigitalsignalprocessingalgorithmstypicallyrequirealargenumberofmathematicaloperationstobeperformedquicklyandrepeatedlyonaseriesofdatasamples.Signals(perhapsfromaudioorvideosensors)areconstantlyconvertedfromanalogtodigital,manipulateddigitally,andthenconvertedbacktoanalogform.ManyDSPapplicationshaveconstraintsonlatency;thatis,forthesystemtowork,theDSPoperationmustbecompletedwithinsomefixedtime,anddeferred(orbatch)processingisnotviable.TextTextWordsNotesDiscussionHomeMostgeneral-purposemicroprocessorsandoperatingsystemscanexecuteDSPalgorithmssuccessfully,butarenotsuitableforuseinportabledevicessuchasmobilephonesandPDAsbecauseofpowerefficiencyconstraints.Aspecializeddigitalsignalprocessor,however,willtendtoprovidealower-costsolution,withbetterperformance,lowerlatency,andnorequirementsforspecializedcoolingorlargebatteries.TextTextWordsNotesDiscussionHomeThearchitectureofadigitalsignalprocessorisoptimizedspecificallyfordigitalsignalprocessing.Mostalsosupportsomeofthefeaturesasanapplicationsprocessorormicrocontroller,sincesignalprocessingisrarelytheonlytaskofasystem.SomeusefulfeaturesforoptimizingDSPalgorithmsareoutlinedbelow.TextTextWordsNotesDiscussionHome2.Architecture(1)SoftwareBythestandardsofgeneral-purposeprocessors,DSPinstructionsetsareoftenhighlyirregular;whiletraditionalinstructionsetsaremadeupofmoregeneralinstructionsthatallowthemtoperformawidervarietyofoperations,instructionsetsoptimizedfordigitalsignalprocessingcontaininstructionsforcommonmathematicaloperationsthatoccurfrequentlyinDSPcalculations.BothtraditionalandDSP-optimizedinstructionsetsareabletocomputeanyarbitraryoperationbutanoperationthatmightrequiremultipleARMorx86instructionstocomputemightrequireonlyoneinstructioninaDSPoptimizedinstructionset.TextTextWordsNotesDiscussionHomeOneimplicationforsoftwarearchitectureisthathand-optimizedassembly-coderoutinesarecommonlypackagedintolibrariesforre-use,insteadofrelyingonadvancedcompilertechnologiestohandleessentialalgorithms.Evenwithmoderncompileroptimizationshand-optimizedassemblycodeismoreefficientandmanycommonalgorithmsinvolvedinDSPcalculationsarehand-writteninordertotakefulladvantageofthearchitecturaloptimizations.TextTextWordsNotesDiscussionHome1)Instructionsets·multiply–accumulates(MACs,includingfusedmultiply–add,FMA)operations·usedextensivelyinallkindsofmatrixoperations·convolutionforfiltering·dotproduct·polynomialevaluation·FundamentalDSPalgorithmsdependheavilyonmultiply–accumulateperformance·FIRfiltersTextTextWordsNotesDiscussionHome·FastFouriertransform(FFT)·Instructionstoincreaseparallelism:·SIMD[1]·VLIW[2]·superscalararchitecture·Specializedinstructionsformoduloaddressinginringbuffersandbit-reversedaddressingmodeforFFTcross-referencingTextTextWordsNotesDiscussionHome·Digitalsignalprocessorssometimesusetime-stationaryencodingtosimplifyhardwareandincreasecodingefficiency.·Multiplearithmeticunitsmayrequirememoryarchitecturestosupportseveralaccessesperinstructioncycle·Specialloopcontrols,suchasarchitecturalsupportforexecutingafewinstructionwordsinaverytightloopwithoutoverheadforinstructionfetchesorexittestingTextTextWordsNotesDiscussionHome2)Datainstructions·Saturationarithmetic,inwhichoperationsthatproduceoverflowswillaccumulateatthemaximum(orminimum)valuesthattheregistercanholdratherthanwrappingaround(maximum+1doesn'toverflowtominimumasinmanygeneral-purposeCPUs,insteaditstaysatmaximum).Sometimesvariousstickybitsoperationmodesareavailable.·Fixed-pointarithmeticisoftenusedtospeeduparithmeticprocessing·Single-cycleoperationstoincreasethebenefitsofpipeliningTextTextWordsNotesDiscussionHome3)Programflow·Floating-pointunitintegrateddirectlyintothedatapath·Pipelinedarchitecture·Highlyparallelmultiplier–accumulators(MACunits)·Hardware-controlledlooping,toreduceoreliminatetheoverheadrequiredforloopingoperationsTextTextWordsNotesDiscussionHome(2)Hardware1)MemoryarchitectureDSPsareusuallyoptimizedforstreamingdataandusespecialmemoryarchitecturesthatareabletofetchmultipledataand/orinstructionsatthesametime,suchastheHarvardarchitectureorModifiedvonNeumannarchitecture,whichuseseparateprogramanddatamemories(sometimesevenconcurrentaccessonmultipledatabuses).TextTextWordsNotesDiscussionHome2)AddressingandvirtualmemoryDSPsfrequentlyusemulti-taskingoperatingsystems,buthavenosupportforvirtualmemoryormemoryprotection.Operatingsystemsthatusevirtualmemoryrequiremoretimeforcontextswitchingamongprocesses,whichincreaseslatency.TextTextWordsNotesDiscussionHome·Hardwaremoduloaddressing·Allowscircularbufferstobeimplementedwithouthavingtotestforwrapping·Bit-reversedaddressing,aspecialaddressingmode·usefulforcalculatingFFTs·Exclusionofamemorymanagementunit·Memory-addresscalculationunitTextTextWordsNotesDiscussionHome3.HistoryPriortotheadventofstand-aloneDSPchipsdiscussedbelow,mostDSPapplicationswereimplementedusingbit-sliceprocessors.TheAMD2901bit-slicechipwithitsfamilyofcomponentswasaverypopularchoice.TherewerereferencedesignsfromAMD,butveryoftenthespecificsofaparticulardesignwereapplicationspecific.Thesebitslicearchitectureswouldsometimesincludeaperipheralmultiplierchip.ExamplesofthesemultiplierswereaseriesfromTRWincludingtheTDC1008andTDC1010,someofwhichincludedanaccumulator,providingtherequisitemultiply–accumulate(MAC)function.TextTextWordsNotesDiscussionHomeIn1976,RichardWigginsproposedtheSpeak&SpellconcepttoPaulBreedlove,LarryBrantingham,andGeneFrantzatTexasInstrument'sDallasresearchfacility.Twoyearslaterin1978theyproducedthefirstSpeak&Spell,withthetechnologicalcenterpiecebeingtheTMS5100,theindustry'sfirstdigitalsignalprocessor.Italsosetothermilestones,beingthefirstchiptouseLinearpredictivecodingtoperformspeechsynthesis.TextTextWordsNotesDiscussionHomeIn1978,Intelreleasedthe2920asan"analogsignalprocessor".Ithadanon-chipADC/DACwithaninternalsignalprocessor,butitdidn'thaveahardwaremultiplierandwasnotsuccessfulinthemarket.In1979,AMIreleasedtheS2811.Itwasdesignedasamicroprocessorperipheral,andithadtobeinitializedbythehost.TheS2811waslikewisenotsuccessfulinthemarket.TextTextWordsNotesDiscussionHomeIn1980thefirststand-alone,completeDSPs–theNECµPD7720andAT&TDSP1–werepresentedattheInternationalSolid-StateCircuitsConference'80.BothprocessorswereinspiredbytheresearchinPSTN[3]telecommunications.TextTextWordsNotesDiscussionHomeTheAltamiraDX-1wasanotherearlyDSP,utilizingquadintegerpipelineswithdelayedbranchesandbranchprediction.TextTextWordsNotesDiscussionHomeAnotherDSPproducedbyTexasInstruments(TI)[4],theTMS32010presentedin1983,provedtobeanevenbiggersuccess.ItwasbasedontheHarvardarchitecture,andsohadseparateinstructionanddatamemory.Italreadyhadaspecialinstructionset,withinstructionslikeload-and-accumulateormultiply-and-accumulate.Itcouldworkon16-bitnumbersandneeded390nsforamultiply–addoperation.TIisnowthemarketleaderingeneral-purposeDSPs.TextTextWordsNotesDiscussionHomeAboutfiveyearslater,thesecondgenerationofDSPsbegantospread.Theyhad3memoriesforstoringtwooperandssimultaneouslyandincludedhardwaretoacceleratetightloops,theyalsohadanaddressingunitcapableofloop-addressing.Someofthemoperatedon24-bitvariablesandatypicalmodelonlyrequiredabout21nsforaMAC.MembersofthisgenerationwereforexampletheAT&TDSP16AortheMotorola56000.TextTextWordsNotesDiscussionHomeThemainimprovementinthethirdgenerationwastheappearanceofapplication-specificunitsandinstructionsinthedatapath,orsometimesascoprocessors.Theseunitsalloweddirecthardwareaccelerationofveryspecificbutcomplexmathematicalproblems,liketheFourier-transformormatrixoperations.Somechips,liketheMotorolaMC68356,evenincludedmorethanoneprocessorcoretoworkinparallel.OtherDSPsfrom1995aretheTITMS320C541ortheTMS320C80.TextTextWordsNotesDiscussionHomeThefourthgenerationisbestcharacterizedbythechangesintheinstructionsetandtheinstructionencoding/decoding.SIMDextensionswereadded,VLIWandthesuperscalararchitectureappeared.Asalways,theclock-speedshaveincreased,a3nsMACnowbecamepossible.TextTextWordsNotesDiscussionHome4.ModernDSPsModernsignalprocessorsyieldgreaterperformance;thisisdueinparttobothtechnologicalandarchitecturaladvancementslikelowerdesignrules,fast-accesstwo-levelcache,(E)DMAcircuitryandawiderbussystem.NotallDSPsprovidethesamespeedandmanykindsofsignalprocessorsexist,eachoneofthembeingbettersuitedforaspecifictask,ranginginpricefromaboutUS$1.50toUS$300.TexasInstrumentsproducestheC6000seriesDSPs,whichhaveclockspeedsof1.2GHzandimplementseparateinstructionanddatacaches.(tobecontinued)TextTextWordsNotesDiscussionHome(Continued)Theyalsohavean8MiB2ndlevelcacheand64EDMAchannels.Thetopmodelsarecapableofasmanyas8000MIPS(instructionspersecond),useVLIW(verylonginstructionword),performeightoperationsperclock-cycleandarecompatiblewithabroadrangeofexternalperipheralsandvariousbuses(PCI/serial/etc).TMS320C6474chipseachhavethreesuchDSPs,andthenewestgenerationC6000chipssupportfloatingpointaswellasfixedpointprocessing.TextTextWordsNotesDiscussionHomeFreescale[5]producesamulti-coreDSPfamily,theMSC81xx.TheMSC81xxisbasedonStarCoreArchitectureprocessorsandthelatestMSC8144DSPcombinesfourprogrammableSC3400StarCoreDSPcores.EachSC3400StarCoreDSPcorehasaclockspeedof1GHz.TextTextWordsNotesDiscussionHomeXMOSproducesamulti-coremulti-threadedlineofprocessorwellsuitedtoDSPoperations.Theycomeinvariousspeedsrangingfrom400to1600MIPS.Theprocessorshaveamulti-threadedarchitecturethatallowsupto8real-timethreadspercore,meaningthata4coredevicewouldsupportupto32realtimethreads.Threadscommunicatebetweeneachotherwithbufferedchannelsthatarecapableofupto80Mbit/s.ThedevicesareeasilyprogrammableinCandaimatbridgingthegapbetweenconventionalmicro-controllersandFPGAs.TextTextWordsNotesDiscussionHomeCEVA,Inc.producesandlicensesthreedistinctfamiliesofDSPs.PerhapsthebestknownandmostwidelydeployedistheCEVA-TeakLiteDSPfamily,aclassicmemory-basedarchitecture,with16-bitor32-bitword-widthsandsingleordualMACs.TheCEVA-XDSPfamilyoffersacombinationofVLIWandSIMDarchitectures,withdifferentmembersofthefamilyofferingdualorquad16-bitMACs.TheCEVA-XCDSPfamilytargetsSoftware-definedRadio(SDR)modemdesignsandleveragesauniquecombinationofVLIWandVectorarchitectureswith3216-bitMACs.TextTextWordsNotesDiscussionHomeAnalogDevicesproducetheSHARC-basedDSPandrangeinperformancefrom66MHz/198MFLOPS(millionfloating-pointoperationspersecond)to400MHz/2400MFLOPS.SomemodelssupportmultiplemultipliersandALUs,SIMDinstructionsandaudioprocessing-specificcomponentsandperipherals.TheBlackfinfamilyofembeddeddigitalsignalprocessorscombinethefeaturesofaDSPwiththoseofageneraluseprocessor.Asaresult,theseprocessorscanrunsimpleoperatingsystemslikeµCLinux,velOSityandNucleusRTOSwhileoperatingonreal-timedata.TextTextWordsNotesDiscussionHomelatencyn.潜伏;潜在因素deferv.使推迟;使延期;延期;服从arbitraryadj.[数]任意的;武断的;专制的assemblyn.装配;集会,集合compilern.[计]编译程序;汇编者matrixn.[数]矩阵;模型WordsWordsTextWordsNotesDiscussionHomeconvolutionn.[数]卷积;回旋;盘旋;卷绕polynomialn.[数]多项式superscalarn.超标量体系结构saturationn.饱和;色饱和度;磁化饱和moduloprep.&adv.以…为模;按模计算peripheraladj.&n.外围的;次要的;外部设备WordsTextWordsNotesDiscussionHomemilestonen.里程碑,划时代的事件operandn.[计]操作数;[计]运算对象leveragen.&v.杠杆作用;利用;举债经营WordsTextWordsNotesDiscussionHome[1]SIMD(SingleInstructionMultipleData),单指令多数据流,能够复制多个操作数,并把它们打包在大型寄存器的一组指令集。TextWordsNotesDiscussionNotesHome[2]VLIW(VeryLongInstructionWord)是一种非常长的指令组合,它把许多条指令连在一起,增加了运算的速度。VLIW体系结构是美国Multiflow和Cydrome公司于20世纪80年代设计的体系结构,EPIC体系结构就是从VLIW中衍生出来的。TextWordsNotesDiscussionHome[3]PSTN(PublicSwitchedTelephoneNetwork):公共交换电话网络,一种常用旧式电话系统,即我们日常生活中常用的电话网。TextWordsNotesDiscussionHome[4]Texa
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2026年辽宁科技学院教师招聘考试参考题库及答案解析
- 水电安全培训内容记录
- 企业市场进入策略制定方案
- 2025-2030年健身社区挑战积分商城行业深度调研及发展战略咨询报告
- 2026年安徽警官职业学院教师招聘考试备考试题及答案解析
- 2025-2030年手持式便捷植保设备企业制定与实施新质生产力战略分析研究报告
- 2025-2030年社会组织教育培训服务行业数字营销策略分析研究报告
- 2025-2030年游戏化营销平台企业制定与实施新质生产力战略分析研究报告
- 安全培训视屏内容
- 企业利润最大化管理策略
- 2026浙江台州市椒江区科技事业中心招聘编制外工作人员1人备考题库附答案详解(a卷)
- 2026国家电投集团二级单位总会计师选聘笔试历年参考题库附带答案详解
- 任务7.1 智能饲喂控制系统的PLC控制
- 2026中国太平洋人寿保险股份有限公司校园招聘考试参考题库及答案解析
- 2026湖南益阳市中心医院人才引进67人考试参考试题及答案解析
- 2026春季新人教版历史八年级下册全册教案11新版
- 成都兴蓉环境股份有限公司笔试题
- 排水沟施工方案
- 校长在2026年学校读书节启动仪式上的讲话动员:鼓励阅读营造书香氛围提升人文素养
- 2025年河南农业职业学院单招职业技能考试题库附答案解析
- 2026年及未来5年中国共享住宿未来发展预测及投资方向研究报告
评论
0/150
提交评论