下载本文档
版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、Beijing Jiaotong University Final ExaminationCourse:Computer Architecture Lecturer: Ai Lihua,Wang Bing(NOTE:4 parts in 2 hours: total 100 points)Part No.Part1Part2Part3Part4Part5TotalScoreExaminerPlease show your work CLEARLY for all problems. I hope you enjoy the test!Part1Mark only one answer for
2、each question 10 points1. A superscalar processor has ( )(a) multiple functional units (b) a high clock speed (c) a large amount of RAM (d) many I/O ports2. On-chip cache has ( )(a) lower access time than RAM(b) larger capacity than off chip cache (c) its own data bus (d) become obsolete3. ( ) data
3、hazards are not possible in the DLX in-order instruction issue and in-order execution multicycle pipeline?(a) WAR(b) WAW (c)RAW (d) RAR4.Pipelining improves CPU performance due to ( )(a) reduced memory access time (b) increased clock speed (c) the introduction of parallellism (d) additional function
4、al units5. Cache memory enhances ( )(a) memory capacity (b) memory access time (c) secondary storage capacity (d) secondary storage access time6. RISC machines typically ( )(a) have high capacity on-chip cache memories (b) have fewer registers than CISC machines (c) are less reliable than CISC machi
5、nes (d) execute 1 instruction per clock cycle.7. Which of the following is NOT a computer performance metric: ( )(a) MIPS(b) FLOPS(c) SPECbenchmark, (d) RISC8.Given a 5 stage pipeline with stages taking 1, 2, 3, 1, 1 units of time, the clock period of the pipeline is: ( )(a) 8 (b) 1/8 (c) 1/3 (d) 39
6、. The average memory access time for a machine with a cache hit rate of 90% where the cache access time is 10ns and the memory access time is 100ns is ( )(a) 55ns (b) 45ns(c) 90ns(d) 19ns10. Delayed branching is used ( )(a) to introduce delays in program execution (b) in pipelining(c) in cache memor
7、y (d) decoding instructionsPart2Fundamentals of Computer Design 10 points1. 10 points In many practical applications that demand a real-time response, the computational workload W is often fixed. As the number of processors increases in a parallel computer, the fixed workload is distributed to more
8、processors for parallel execution. Assume 20 percent of W must be executed sequentially, and 80 percent can be executed by 4 nodes simultaneously. What is a fixed-load speedup?Part 3Instruction Set Architecture 20 points2. 8 points Suppose the variable x of type int and at address 0x100 has a hexade
9、cimal value 0x. The ordering of the bytes within the address range 0x100 through 0x103 depends on the type of machine. What will be arranged in memory 0x1000x103 according to Little Endian and Big Endian? 3. 12 points A model machine has 7 instructions, which frequencies are 43%, 21%, 12%, 8%, 6%, 6
10、%, and 4% respectively.3.1 Encoding operator with the minimum average code length.3.2According to 5.1, give the value of the minimum average code length.Part 4Pipelining 35 points4. 6 points Why would a designer sometimes allow structural hazards?5. 21 points Using the following code fragment:LOOP:L
11、W R1 , 0(R2);load R1 from address 0+R2ADDI R1, R1, #1; R1= R1+1 SW 0(R2), R1;store R1 at address 0+ R2ADDI R2, R2, #4; R2= R2+4SUB R4, R3, R2; R4= R3-R2BNEZ R4, LOOP;branch to LOOP if R4!=0Assume the initial value of R3 is R2+200. They run on a pipelined machine like DLX.5.1.According to the followi
12、ng format, show the timing of this instruction sequence with normal forwarding and bypassing hardware. Also assuming a register read and a write in the same clock cycle “forwards” through the register file. instructionClock123456789101112131415LW R1,0(R2)IFIDEXEMEMWBADDI R1, R1, #1SW 0(R2), R1ADDI R
13、2, R2, #4SUBR4, R3, R2BNEZ R4, LOOP 5.2.Assume that the branch is handled by predicting it as not taken. If all memory references take 1 cycle, how many cycles does this loop take to execute?5.3.In order to reduce the total cycles of 8.2, what measures could be taken? Give an explanation.6. 8 points
14、 For two-level branch prediction strategy with (2,2) predictor, how many bits does the branch prediction buffer need for 2K branch instructions?Part 5Memory Hierarchy 25points7. 7 points Cache design: Give short answers to the following questions.7.1Cache miss rates decrease with larger cache block
15、sizes due to what kind of locality?7.2How many sets in fully associative cache with 64 cache blocks?7.3 How many sets in a direct-mapped cache with 128 cache blocks?7.4How to keep the main memory and the cache consistent?8.18 points Assume a four blocks direct-mapped cache, two words per block, LRU replacement. The ca
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2025年江苏省靖江市高二生物下册期末考试考试卷含完整答案(典优)
- 2025年辽宁省北票市高二生物下册期末考试检测卷【夺冠系列】附答案
- 2026年贵州省都匀市高二生物下册期末考试试卷附参考答案【黄金题型】
- 2025年青海省玉树市高二生物下册期末考试模拟卷附参考答案【黄金题型】
- 2025年云南省芒市高二生物下册期末考试试卷附参考答案(达标题)
- 2026年河南省济源市高二生物下册期末考试检测卷重点附答案
- 2026年浙江省慈溪市高二生物下册期末考试模拟卷【各地真题】附答案
- 2026年湖南省临湘市高二生物下册期末考试试卷及答案(各地真题)
- 2025年黑龙江省抚远市高二生物下册期末考试测试卷及完整答案【典优】
- 2026年湖南省武冈市高二生物下册期末考试检测卷【B卷】附答案
- 江苏省无锡市2025-2026学年五年级下学期6月数学期末调研试题(试卷+答案)
- 湖南省2026年全省政工专业知识考试(政治+中国近现代史)试题解析及核心考点
- 广东省中山市2024-2025学年高一地理上学期期末考试试试题含解析
- 天津市南开区2023-2024学年四年级下学期6月期末数学试题
- 2024年四川省凉山彝族自治州中考道德与法治真题(原卷版+解析版)
- 质量信得过班组汇报材料课件
- 学院监察工作试点的实施方案
- 中学数学教学设计研究的开题报告
- APD自动化腹膜透析机的使用
- 220kv输电线路工程建设监理实施细则
- 体育概论全部课件
评论
0/150
提交评论