北交模拟试卷_第1页
北交模拟试卷_第2页
北交模拟试卷_第3页
全文预览已结束

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

1、Beijing Jiaotong University Final ExaminationCourse:Computer Architecture Lecturer: Ai Lihua,Wang Bing(NOTE:4 parts in 2 hours: total 100 points)Part No.Part1Part2Part3Part4Part5TotalScoreExaminerPlease show your work CLEARLY for all problems. I hope you enjoy the test!Part1Mark only one answer for

2、each question 10 points1. A superscalar processor has ( )(a) multiple functional units (b) a high clock speed (c) a large amount of RAM (d) many I/O ports2. On-chip cache has ( )(a) lower access time than RAM(b) larger capacity than off chip cache (c) its own data bus (d) become obsolete3. ( ) data

3、hazards are not possible in the DLX in-order instruction issue and in-order execution multicycle pipeline?(a) WAR(b) WAW (c)RAW (d) RAR4.Pipelining improves CPU performance due to ( )(a) reduced memory access time (b) increased clock speed (c) the introduction of parallellism (d) additional function

4、al units5. Cache memory enhances ( )(a) memory capacity (b) memory access time (c) secondary storage capacity (d) secondary storage access time6. RISC machines typically ( )(a) have high capacity on-chip cache memories (b) have fewer registers than CISC machines (c) are less reliable than CISC machi

5、nes (d) execute 1 instruction per clock cycle.7. Which of the following is NOT a computer performance metric: ( )(a) MIPS(b) FLOPS(c) SPECbenchmark, (d) RISC8.Given a 5 stage pipeline with stages taking 1, 2, 3, 1, 1 units of time, the clock period of the pipeline is: ( )(a) 8 (b) 1/8 (c) 1/3 (d) 39

6、. The average memory access time for a machine with a cache hit rate of 90% where the cache access time is 10ns and the memory access time is 100ns is ( )(a) 55ns (b) 45ns(c) 90ns(d) 19ns10. Delayed branching is used ( )(a) to introduce delays in program execution (b) in pipelining(c) in cache memor

7、y (d) decoding instructionsPart2Fundamentals of Computer Design 10 points1. 10 points In many practical applications that demand a real-time response, the computational workload W is often fixed. As the number of processors increases in a parallel computer, the fixed workload is distributed to more

8、processors for parallel execution. Assume 20 percent of W must be executed sequentially, and 80 percent can be executed by 4 nodes simultaneously. What is a fixed-load speedup?Part 3Instruction Set Architecture 20 points2. 8 points Suppose the variable x of type int and at address 0x100 has a hexade

9、cimal value 0x. The ordering of the bytes within the address range 0x100 through 0x103 depends on the type of machine. What will be arranged in memory 0x1000x103 according to Little Endian and Big Endian? 3. 12 points A model machine has 7 instructions, which frequencies are 43%, 21%, 12%, 8%, 6%, 6

10、%, and 4% respectively.3.1 Encoding operator with the minimum average code length.3.2According to 5.1, give the value of the minimum average code length.Part 4Pipelining 35 points4. 6 points Why would a designer sometimes allow structural hazards?5. 21 points Using the following code fragment:LOOP:L

11、W R1 , 0(R2);load R1 from address 0+R2ADDI R1, R1, #1; R1= R1+1 SW 0(R2), R1;store R1 at address 0+ R2ADDI R2, R2, #4; R2= R2+4SUB R4, R3, R2; R4= R3-R2BNEZ R4, LOOP;branch to LOOP if R4!=0Assume the initial value of R3 is R2+200. They run on a pipelined machine like DLX.5.1.According to the followi

12、ng format, show the timing of this instruction sequence with normal forwarding and bypassing hardware. Also assuming a register read and a write in the same clock cycle “forwards” through the register file. instructionClock123456789101112131415LW R1,0(R2)IFIDEXEMEMWBADDI R1, R1, #1SW 0(R2), R1ADDI R

13、2, R2, #4SUBR4, R3, R2BNEZ R4, LOOP 5.2.Assume that the branch is handled by predicting it as not taken. If all memory references take 1 cycle, how many cycles does this loop take to execute?5.3.In order to reduce the total cycles of 8.2, what measures could be taken? Give an explanation.6. 8 points

14、 For two-level branch prediction strategy with (2,2) predictor, how many bits does the branch prediction buffer need for 2K branch instructions?Part 5Memory Hierarchy 25points7. 7 points Cache design: Give short answers to the following questions.7.1Cache miss rates decrease with larger cache block

15、sizes due to what kind of locality?7.2How many sets in fully associative cache with 64 cache blocks?7.3 How many sets in a direct-mapped cache with 128 cache blocks?7.4How to keep the main memory and the cache consistent?8.18 points Assume a four blocks direct-mapped cache, two words per block, LRU replacement. The ca

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

评论

0/150

提交评论