Advanced Architecture: Final Exam and Homework Exercise Collection

Homework 1

1 Explain the concepts: computer architecture (CA), advanced computer architecture (Advanced CA), Amdahl's law, scalar processing, look-ahead, PVP, SMP, MPP, DSM, COW, GCE, CISC, RISC, VMM, supercomputer, SVM, mainframe, computer system on chip, parallel architecture on a single chip, Moore's law, UMA, NUMA, COMA, CC-NUMA, NORMA, shell architecture, PRAM, BSP, LogP.
2 Explain Flynn's classification and its semantics.
3 Where do the difficulties of parallel processing lie? What do "parallel", "concurrent", and "simultaneous" each mean?
4 Assume that a system component is sped up by a factor of 10 and that its processing time previously accounted for 40% of the system's. By how much does the system performance improve?
5 Draw the memory hierarchy chart.

Homework 2

1 Explain the concepts: RISC, CISC, VLIW, superscalar, superpipeline, superscalar-superpipeline, IPC, single issue, multiple issue, OOO, multithreading.
2 For the ideal case, give the performance of superscalar, superpipeline, and superscalar-superpipeline processors, with examples for N=8 and their average IPC.
3 Describe the CPU technology of recent years, and give an example that illustrates its technical parameters.
4 Give the computing trace of $a_{i+7} = b_{j+1} + c_{k+8} + d_{m+10}$ for the T9000.
5 How can the multicore memory wall problem be solved?
6 Simple computer design test: suppose there are a simple CPU with A15-A0, D7-D0, read, and write signals; a register and tri-state buffers for an I/O device; an 8K×8 ROM with A12-A0, D7-D0, CE, and OE; and an 8K×8 RAM with A12-A0, D7-D0, CE, RD, and WR. Design the circuit of a simple computer.

Homework

1) A program runs on a 40 MHz processor. It executes 45000 integer arithmetic instructions at 1 clock each, 32000 instructions with data operations at 2 clocks each, 15000 floating-point instructions at 2 clocks each, and 8000 instructions such as JMP at 2 clocks each. Give its CPI, its MIPS rate, and the CPU execution time.
2) If the communication cost in an SP2 machine is $t(m) = 46 + 0.035m$, give its asymptotic bandwidth r and its half-peak message length $m_{1/2}$.
3) If an N×N matrix A is multiplied by an N×N matrix B, the time is $T_1 = CN^3$ s, but the execution time on a machine with n nodes is $T_n = (CN^3/n + bN^2/\sqrt{N})$ s, where C, N, and b are constants. Give the speedup under fixed workload, fixed time, and fixed memory usage.
4) (Isoefficiency) See: Kumar V, Rao V N. Parallel depth first search, part II: Analysis. Int'l J. of Parallel Programming, 1987, 16(6): 501-519.
5) (Isospeed) See: Sun X H, et al. Scalability of parallel algorithm-machine combinations. IEEE Transactions on Parallel and Distributed Systems, 1994, 5(6): 519-613.
6) (Isolatency) See: Zhang X D, et al. Latency metric: an experimental method for measuring and evaluating parallel program and architecture scalability. J. of Parallel and Distributed Computing, 1994, 22: 392-410.

Homework

1 Explain the concepts: static network, dynamic network, crossbar, multiple buses.
2 Draw a mesh with N=25.
3 Give the interconnection-function expression for simulating a single-stage cube network with a shuffle-exchange network.
4 How many kinds of bus arbitration algorithms are there, and what are the advantages and disadvantages of each?

Homework

1 Given P1, P2, P3, P4 and M1, M2, M3, M4, use 2×2 crossbar switch boxes to design a connection network for P1 to M2, P2 to M4, P3 to M1, and P4 to M3.
2 What is DS-Link?
3 What is the difference between message passing and shared memory?
4 What is wormhole communication, and what is its performance?
5 Explain Myrinet, HiPPI, FDDI, ATM, SCI, and 100BaseT.
6 In a hypercube multicomputer that uses wormhole routing, if every two neighbouring nodes are connected by a pair of unidirectional channels in opposite directions, prove that routing under the cube encoding on this system cannot deadlock.

Free choice (extracurricular project)

Choose a representative small benchmark program. Requirements:
1) Analyze its source program and write out its data execution flow (including data types, concrete values, and timing relationships).
2) Building on 1), perform a data-flow analysis of its object code and write out its data execution flow (including data types, concrete values, and timing relationships); observe and tabulate the differences from the intermediate results of 1).
3) Building on 1) and 2), apply data-prefetching optimization to the data execution flow and run it on a real computer with a cache. Give the specific prefetching optimization method and a table of the measured timing improvements.
(Worth 5-10 bonus points.)

Using the PVM or MPI parallel software tools, write a parallel program in which at least 2 or 3 parallel tasks cooperate to solve some problem.

For each of the following: What is its architecture? What are its key technologies and theory? What problems does it face?
1 Earth Simulator
2 Blue Gene
3 Beowulf with PoPC cluster
4 Grid computing
5 Pervasive computing
6 P2P computing
7 Special PoPC clusters such as a Web cache cluster

Homework 1

1 H0(n)=nmH0(-n)/(-)+n(1-m)H0; Hc(n)=nmH0(-n)/(-)+n(1-m)H0. Please delete the and by using , then plot the function when m=0.5, =0.2.

Homework 2

MESI protocol: can you fill in the states?

On checkpointing for availability: CHECKPOINT (a, b, c) can take place at three levels: kernel, library, and application.

[Figure: processes P, Q, R exchanging messages x, y, z, with labels a, b, c, d. Checkpoint consistency snapshot: (a) consistent; (b) not consistent.]

Consistent snapshot: if there is no message that one process's checkpoint has already received while the sending process's checkpoint has not yet sent it, the related checkpoints form a consistent snapshot.

[Figure: processes P, Q, R with messages x, y, z and snapshots a, b. C ?]

Homework

1. What are the differences between MPP and SMP?
Answer:
MPP
  A large parallel processing system with a shared-nothing architecture
  Consists of several hundred nodes with a high-speed interconnection network/switch
  Each node consists of a main memory and one or more processors
  Runs a separate
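
The consistent-snapshot definition given earlier can be checked mechanically. The following is only a minimal sketch under stated assumptions: each checkpoint is modelled as a local event index per process, and every message carries the local index at which it was sent and received. The `Message` class, the function names, and all the numeric values are hypothetical illustrations, since the original figure is not available.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Message:
    name: str
    sender: str
    send_time: int   # local event index at the sender (assumed model)
    receiver: str
    recv_time: int   # local event index at the receiver (assumed model)


def orphan_messages(checkpoints, messages):
    """Return messages that a checkpoint records as received
    although the sender's checkpoint has not yet sent them."""
    return [
        m for m in messages
        if m.recv_time <= checkpoints[m.receiver]
        and m.send_time > checkpoints[m.sender]
    ]


def is_consistent(checkpoints, messages):
    """A snapshot is consistent when it contains no orphan message."""
    return not orphan_messages(checkpoints, messages)


# Hypothetical message timings for processes P, Q, R.
messages = [
    Message("x", "P", 2, "Q", 3),
    Message("y", "Q", 5, "R", 4),
    Message("z", "R", 6, "P", 7),
]

# Hypothetical snapshots: the checkpoint position of each process.
snapshot_a = {"P": 4, "Q": 6, "R": 5}
snapshot_b = {"P": 4, "Q": 4, "R": 5}

print(is_consistent(snapshot_a, messages))
print(is_consistent(snapshot_b, messages))
print([m.name for m in orphan_messages(snapshot_b, messages)])
```

In `snapshot_b`, R's checkpoint has already received y while Q's checkpoint precedes the send of y, which is exactly the situation the definition rules out.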

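The arithmetic behind two of the earlier numeric problems (the 40 MHz processor problem and the component sped up 10 times) can be sketched as follows. This is a hedged illustration and not the course's official solution: it assumes that "clock" in the problem statement means clock cycles per instruction, and the function and variable names are made up for this example.

```python
CLOCK_HZ = 40e6  # 40 MHz processor from the problem statement

# (instruction count, assumed cycles per instruction) for each class
instruction_mix = {
    "integer arithmetic": (45000, 1),
    "data operations": (32000, 2),
    "floating point": (15000, 2),
    "control (JMP etc.)": (8000, 2),
}


def cpu_metrics(mix, clock_hz):
    """Return (CPI, MIPS, execution time in seconds) for an instruction mix."""
    instructions = sum(count for count, _ in mix.values())
    cycles = sum(count * cpi for count, cpi in mix.values())
    cpi = cycles / instructions
    mips = clock_hz / (cpi * 1e6)
    exec_time = cycles / clock_hz
    return cpi, mips, exec_time


def amdahl_speedup(fraction, factor):
    """Overall speedup when `fraction` of the time is sped up by `factor`."""
    return 1.0 / ((1.0 - fraction) + fraction / factor)


cpi, mips, exec_time = cpu_metrics(instruction_mix, CLOCK_HZ)
print(f"CPI = {cpi:.2f}, MIPS = {mips:.2f}, time = {exec_time * 1e3:.3f} ms")
print(f"Amdahl speedup = {amdahl_speedup(0.4, 10):.4f}")
```

Under these assumptions the program executes 100000 instructions in 155000 cycles, so the CPI is 1.55, and the 40% component sped up tenfold gives an overall speedup of 1/0.64.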