30-Structural aspects of the System 360 Model 85, Part II The cache.pdf_第1页
30-Structural aspects of the System 360 Model 85, Part II The cache.pdf_第2页
30-Structural aspects of the System 360 Model 85, Part II The cache.pdf_第3页
30-Structural aspects of the System 360 Model 85, Part II The cache.pdf_第4页
30-Structural aspects of the System 360 Model 85, Part II The cache.pdf_第5页
已阅读5页,还剩2页未读 继续免费阅读

付费下载

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

Thecache,ahigh-speedbufferestablishingastoragehierarchyintheModel85,isdiscussedindepthinthispart,sinceitrepresentsthebasicorganizationaldeparturefromotherSYSTEM/BOcomputers.Discussedareorganizationandoperationofthecache,includingthemechanismsusedtolocateandretrievedataneededbytheprocessor.Theinternalperformancestudiesthatledtouseofthecachearede-scribed,andsimulatedperformanceofthechosenconfigurationiscomparedwiththatofatheoreticalsystemhavinganentire80-nano-secondmainstorage.Finally,theeffectsofvaryingcacheparametersarediscussedandtabulated.StructuralaspectsoftheSystem/360Model8511ThecachebyJ.S.LiptayAmongtheobjectivesoftheModel85isthatofprovidingaSYSTEM/compatibleprocessorwithbothhighperformanceandhighthroughput.Oneoftheimportantingredientsofhighthrough-putisalargemainstoragecapacity(seetheaccompanyingarticleinPartI).However,itisnotfeasibletoprovidealargemainstor-agewithanaccesstimecommensuratewiththe80-nanosecondprocessorcycleoftheModel85.Alongeraccesstimecanbepar-tiallycompensatedforbyanincreaseinoverlap,greaterbuffering,deeperstorageinterleaving,moresophisticationinthehandlingofbranches,andotherimprovementsintheprocessor.Allofthesefactorsonlypartiallycompensatefortheslowerstorage,and,there-fore,wedecidedtouseastoragehierarchyinstead.Thestoragehierarchyconsistsofa1.04-microsecondmainstor-ageandasmall,faststorecalledacache,lwhichisintegratedintotheCPU.Thecacheisnotaddressablebyaprogram,butratherisusedtoholdthecontentsofthoseportionsofmainstoragethatarecurrentlybeingused.Mostprocessorfetchescanthenbehandledbyreferringtothecache,sothatmostofthetimetheprocessorhasashortaccesstime.Whentheprogramstartsoperat-ingondatainadifferentportionofmainstorage,thedatainthatportionmustbeloadedintothecacheandthedatafromsomeotherportionremoved.Thisactivitymusttakeplacewithoutprogramassistance,sincetheModel85mustbecompatiblewiththerestofThispaperdiscussesorganizationofthecacheandthestudiesthatledtoitsuseintheModel85andtoselectingofvaluesforitsparameters.theSYSTEM/360line.ZBMSYSTEMSJOURNALVOL.7NO.1196815Figure1AssignmentofcachesectorstomainstoragesectorsMAINSTORAGECacheorganizationThemainstorageunitsthatcanbeusedontheModel85aretheIBM2365-5andthe2385.Theyhavea1.04-microsecondcycletimeandmakeavailablecapacitiesfrom512Kbytesto4096Kbytes(K=1024).Thecacheisa16K-byteintegratedstorage,whichiscapableofoperatingeveryprocessorcycle.Optionally,itcanbeexpandedto24Kbytesor32Kbytes.Boththecacheandmainstoragearelogicallydividedintosec-tors,eachconsistingof1Kcontiguousbytesstartingon1K-byteboundaries.Duringoperation,acorrespondenceissetupbetweencachesectorsandmainstoragesectorsinwhicheachcachesectorisassignedtoasingledifferentmainstoragesector.However,be-causeofthelimitednumberofcachesectors,mostmainstoragesectorsdonothaveanycachesectorsassignedtothem(seeFigure1).Eachofthecachesectorshasa14-bitsectoraddressregister,whichholdstheaddressofthemainstoragesectortowhichitisassigned.Theassignmentofcachesectorsisdynamicallyadjustedduringassigningoperation,sothattheyareassignedtothemainstoragesectorsthatcachearecurrentlybeingusedbytheprogram.Iftheprogramcausesasectorsfetchfromamainstoragesectorthatdoesnothaveacachesectorassignedtoit,oneofthecachesectorsisthenreassignedtothatmainstoragesector.Tomakeagoodselectionofacachesectortoreassign,enoughinformationismaintainedtoorderthecachesec-torsintoanactivitylist.Thesectoratthetopofthelististheonethatwasmostrecentlyreferredto,thesecondoneisthenextmostrecentlyreferredto,andsoforth.Whenacachesectorisreferredto,itismovedtothetopofthelist,andtheinterveningonesaremoveddownoneposition.Thisisnotmeanttoimplyanactualmovementofsectorswithinthecache,butratherreferstoalogical16J.8.LIPTAYorderingofthesectors.Whenitisnecessarytoreassignasector,theoneselectedistheoneatthebottomoftheactivitylist.Thiscachesectoristheonethathasgonethelongestwithoutbeingreferredto.Whenacachesectorisassignedtoadifferentmainstoragesec-tor,thecontentsofallofthe1Kbyteslocatedinthatmainstoragesectorarenotloadedintothecacheatonce.Rather,eachsectorisdividedinto16blocksof64bytes,andtheblocksareloadedonademandbasis.Whenacachesectorisreassigned,theonlyblockthatisloadedistheonethatwasreferredto.Iftheyarerequired,theremainingblocksareloadedlater,oneatatime.Eachblockinthecachehasabitassociatedwithittorecordwhetherithasbeenloaded.This“validitybit”isturnedonwhentheblockisloadedandoffwhenthesectorisreassigned.Storeoperationsalwayscausemainstoragetobeupdated.Ifthemainstoragesectorbeingchangedhasacachesectorassignedtoit,thecacheisalsoupdated;otherwise,noactivityrelatedtothecachetakesplace.Therefore,storeoperationscannotcauseacachesectortobereassigned,ablocktobeloaded,ortheactivitylisttoberevised.Sinceallofthedatainthecacheisalsoinmainstorage,itisnotnecessaryonacachesectorreassignmenttomoveanydatafromthecachetomainstorage.Allthatisrequiredistochangethesectoraddressregister,resetthevaliditybits,andinitiateIoadingofablock.Theprocessoriscapableofbufferingoneinstructionrequestingthestoringofinformationinmainstorage,sothatitcanproceedwithsubsequentinstructionsevenifexecutionofthestoreinstructioncannotbeinitiatedimmediately.Twoprocessorcyclesarerequiredtofetchdatathatisinthecache.Thefirstcycleisusedtoexaminethesectoraddressregistersandthevaliditybitstodetermineifthedataisinthecache.Thesecondcycleisthenusedtoreadthedataoutofthecache.How-ever,requestscannormallybeoverlapped,sothatonerequestcanbeprocessedeverycycle.Ifthedataisnotpresentinthecache,additionalcyclesarerequiredwhiletheblockisloadedintothecachefrommainstorage.ThestoragewordsizeonwhichtheModel85operatesinternallyis16bytes.Thisisthewidthofthedatapathstoandfromthestorageunits,andistheamounttheprocessorcanstoreorfetchwithasinglerequest.Becauseasingle2365-5storageunitoperatesonan8-byte-wideinterface,twounitsarepairedtogetherandoperatedsimultaneously.Exceptforthe512Kconfiguration,mainstorageisinterleavedfourways.Sinceablockis64bytes,fourfetchestomainstoragearerequiredtoloadoneblockintothecache.Withfour-wayinterleaving,thismeansonerequesttoeachbasicstoragemodule.Toimproveperformance,thefirstbasicstoragemodulereferredtoduringeachblockloadistheonecon-tainingthe16byteswantedbytheprocessor.Inadditiontobeingloadedintothecache,thedataissentdirectlytotheprocessor,sothatexecutioncanproceedassoonaspossible(seeFigure2).OntheModel85,channelsstoreandfetchdatabywayoftheMODEL85CACHEFigure2TimingforablockloadMAINSTORAGENttUtUBSMZACCESSTIMEIBSMPIBSMOACCESSTIMEwBSM-BASICSTORAGEMODULEprocessor.Channelfetchesareprocessedby-gettingtherequireddatafrommainstoragewithoutreferringtothecache.Channelstoresarehandledthesamewayasprocessorstores.Inthisway,ifachannelchangesdatathatisinthecache,thecacheisupdatedbutthechannelsdonothaveanypartofthecachedevotedtothem.PerformancestudiesAmongthequestionsthathadtobeansweredtodeterminewhetherthecacheapproachshouldbetakenwere:(1)howeffectiveisit,and(2)doesitseffectivenessvarysubstantiallyfromoneprogramtoanother?TheprincipaltoolsusedtoanswerthesequestionsarethetracingandtimingtechniquesreferredtoinPartI.Thetracingtechniqueproducesaninstruction-by-instructiontraceofapro-gramoperatingundertheSYSTEM/OOperatingSystem.Theoutputisasequenceof(tracetapes,”whichcontaineveryinstructionexecuted,whetherintheproblemprogramortheoperatingsystem,andthenecessaryinformationtodeterminehowlongittakestobeexecuted.Thesetracetapescontainabout250,000instructionseachandareusedasinputtoatimingprogram,whichdetermines,cycle-by-cycle,howtheModel85wouldexecutethatsequenceofinstruc-tions.Thesetechniquesareintendedtodetermineinternalper-formanceanddonotprovideanyinformationconcerningthrough-put.Anintensiveinvestigationprecededselectionoftheprogramsusedinthisstudy.Inordertomeasuretheeffectivenessofthecache,wepostulatedcacheasystemidenticaltotheModel85exceptthatthestoragehierarchyeffectivenessisreplacedbyasingle-levelstorageoperatingatcachespeed.TheperformanceofsuchasystemisthatwhichwouldbeachievedbytheModel85ifitalwaysfoundthedataitwantedinthecacheandifitneverencounteredinterferenceinmainstorageduetostores.Therefore,itrepresentsanupperlimitontheperformanceoftheModel85;howclosetheModel85approachesthisidealcanserveasameasureofhoweffectivethecacheis.NineteentracetapesI18J.s.LIPTAYFigure3Model85performancerelativetosingle-levelstorageoperatingatcachespeedMEAN=81%75.7980-8490.94PERCENTAGEOFIDEALPERFORMANCEFlgure4ProbabilityoffindingfetcheddataincachePROBABILITYweretimedforboththeModel85andthepostulatedsystem,andtheperformanceoftheModel85wasexpressedasapercentageoftheperformanceoftheidealsystem.Figure3showsthedistribu-tionofperformancedataobtained,Theaveragewas81percentoftheperformanceoftheidealsystem,witharangebetween66and94percent.Animportantstatisticrelatedtocacheoperationistheprob-abilityoffindingthedatawantedforafetchinthecache.Figure4showsthedistributionofthisprobabilityforthesame19tracetapesusedforFigure3.Theaverageprobabilitywas0.968.Itisworthnotingthat,iftheaddressesgeneratedbyaprogramwererandom,theprobabilityoffindingthedatawantedinthecachewouldbemuchlessthan0.01.Therefore,itcanbesaidthatwhatmakesthecacheworkisthefactthatrealprogramsarenotrandomintheiraddressingpatterns.MODEL85CACHETable2Averageperformancerelativetoanidealsystemwithcachesizeandnum-berofbytesperblockvaried-Numberofsectors=16NumberofNumberofbytesperblockcachebytes64128266,8K0.74416K0.8250.8100.78132K0.8910.8850.870Table3Comparativeperformanceusingdifferentcachesectorreplacementalgorithmsalgorithmperformance1partition*1.0002partitions0.9904partitions0.9878partitions0.97916partitions0.933usagebits0.931*ReplacementalgorithmchosenfortheModel85Table3summarizestheresultsobtained.Thechoiceoftheactivitylistwas

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

评论

0/150

提交评论