




全文预览已结束
下载本文档
版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
Review 11. Pipeline 特点Pipelining doesnt help latency of single task, it helps throughput of entire workload Pipeline rate limited by slowest pipeline stageMultiple tasks operating simultaneouslyPotential speedup = Number pipe stagesUnbalanced lengths of pipe stages reduces speedupTime to “fill” pipeline and time to “drain” it reduces speedup2. RISC MIPS 5 steps of MIPS datapath IF ID EXE MA WB3. Three Hazards structural不能同时运作 data之前结果 control 跳转4. One memory port two different cache entries holding data for the same physical address!for update: must update all cache entries with same physical addressor memory becomes inconsistent3. TLBs:A way to speed up translation is to use a special cache of recently used page table entries - this has many names, but the most frequently used is Translation Lookaside Buffer or TLB Virtual Address Physical Address Dirty Ref Valid Access4. P408 计算加速比 命中性能5. SPEC: System Performance Evaluation Cooperative6. Moores Law: the number of transistors in a dense integrated circuit doubles approximately 18 months. 摩尔定律指出集成电路上可容纳的晶体管数目,约每隔18个月便会增加一倍,性能也将提升一倍。7. Performance Summary needs good benchmarks and good ways to summarize performance.8. AMAT = Average Memory Access Time 例:Suppose a processor executes at Clock Rate = 200 MHz (5 ns per cycle), Ideal (no misses) CPI = 1.1 50% arith/logic, 30% ld/st, 20% control Suppose that 10% of memory operations get 50 cycle miss penalty Suppose that 1% of instructions get same miss penalty CPI = ideal CPI + average stalls per instruction1.1(cycles/ins) + 0.30 (DataMops/ins) x 0.10 (miss/DataMop) x 50 (cycle/miss) + 1 (InstMop/ins) x 0.01 (miss/InstMop) x 50 (cycle/miss) = (1.1 + 1.5 + .5) cycle/ins = 3.1 58% of the time the proc is stalled waiting for memory!AMAT=(1/1.3)x1+0.01x50+(0.3/1.3)x1+0.1x50=2.549. 冯诺依曼和哈佛结构性能呢对比: 16KB I&D: Inst miss rate=0.64%, Data miss rate=6.47% 32KB unified: Aggregate miss rate=1.99%Assume 33% data ops 75% accesses from instructions (1.0/1.33)hit time=1, miss time=50Note that data hit has 1 stall for unified cache (only one port)AMATHarvard=75%x(1+0.64%x50)+25%x(1+6.47%x50) = 2.05AMATUnified=75%x(1+1.99%x50)+25%x(1+1+1.99%x50)= 2.2410.write through(a valid bit) write back(dirty bit and valid bit)Write Allocate vs Non-Allocate 写入缺失时做法,先读到缓存中在写,和直接写磁盘11. Improving Cache Performance P426 Reduce the miss rate 3Cs n-way 1-way(size x) 2-way(size x/2)Reduce Misses via Larger Block Size (因空间局部性会降低强制缺失,可能增大冲突缺失,若容量小可能增大容量缺失)提高了缺失代价 Reduce Misses via Higher Associativity 2:1 Cache Rule 会延长命中时间 AMAT Reducing Misses via a“Victim Cache” Add buffer to place data discarded from cache Reducing Misses via “Pseudo-Associativity”Reducing Misses by Hardware Prefetching of Instructions & Datals Reducing Misses by Software Prefetching Data Prefetching comes in two flavors: Binding prefetch: Requests load directly into register. Must be correct address and register! Non-Binding prefetch: Load into cache. Can be incorrect. Frees HW/SW to guess!Reducing Misses by Compiler Optimizations(merging arrays loop interchange loop fusion blocking) Reduce the miss penaltyRead Priority over Write on Miss(读取缺失优先级高于写入缺失)让读取缺失一直等待到写入缓冲区为空为止Reduce Miss Penalty: Early Restart and Critical Word FirstDont wait for full block to be loaded before restarting CPUNon-blocking Caches to reduce stalls on misses:Add a second-l
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 社区共享菜园管理办法
- 《安全生产法》考试题库多选题(含答案)
- 基本公共卫生练习题库及参考答案
- 全国税法知识竞赛题库含答案
- 城镇供热管网改造及换热站提升改造项目可行性研究报告
- 消防国考试题及答案
- 水产养殖虚拟现实技术培训创新创业项目商业计划书
- 物联网在智慧水务的应用拓展创新创业项目商业计划书
- 智能车队管理与调度系统创新创业项目商业计划书
- 观沧海中考试题及答案
- 医院培训课件:《肺源性心脏病》
- 2025年承包学校食堂餐饮废弃物处理合同
- 部编版道德与法治小学四年级上册期末复习专练试题及答案(全套)
- 2025-2026人教版(2024)一年级上册数学教学计划
- 二零二五年度炉渣资源化利用项目合作协议书
- 2025-2026学年鲁科版(五四学制)(2024)初中生物六年级上册教学计划及进度表
- 2025年事业单位招聘考试综合类专业知识试卷(环境工程知识)2025年试题集
- 2025年湖南省教师招聘考试(公共基础知识)历年参考题库含答案详解(5卷)
- 施工进度计划管理制度
- 以工代赈务工协议书
- 2025年三级仓储管理员(图书管理)职业技能鉴定《理论知识》考试真题(后附答案及解析)
评论
0/150
提交评论