版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、 基于 nmon 的 PowerHA 宕机故障分析 NMONmem选项NMONmemnew选项NMONmemuse选项NMONnet选项NMONpage选项NMONtopas选项Interpreting the %Processor by PID chart概述本分析文档是基于nmon命令抓取的日志来进行的,从0点开始,每两分钟抓取一次,共抓取24小时。本文档采用了3月31日和4月1日的日志内容,主机为jddbs02NMONmem选项选项官方注释The main graph on this sheet shows the amount of Real Free memory in Mbytes
2、 by time of day. This would be the same as dividing the fre values reported by vmstat over the same interval by 256. The small graph shows the amount of real memory. This is useful in determining if dynamic reconfiguration has been used during the collection period.For AIX, other columns on the shee
3、t are as follows:Real Free the percentage of real pages on the free list.Virtual Free the percentage of unallocated virtual slots on the paging spaces.Real Free (MB) the amount of memory on the free list in Mbytes.Virtual Free (MB) the amount of unallocated space on the paging spaces.Real Total (MB)
4、 the total amount of memory available to AIX.Virtual Total (MB) the total amount of space allocated for paging spaces.Note: you can calculate the amount of memory used during an interval simply by subtracting theReal Free (MB)valuefrom theReal Total (MB)value.This will, however, include file pages.T
5、he graph on the MEMUSE sheet gives a more accurate assessment of memory used by programs (computational pages).分项解析空闲内存率自16点左右,开始大幅度下滑自16:02开始,实内存空闲率逐步下滑,到50%左右,自此后再没有增加小结自16点开始,内存空闲率在持续降低,也就是说,内存使用率在增长。NMONmemnew选项选项官方解释The graph shows the allocation of memory split into the three major categories:
6、 pages used by user processes, file system cache, and pages used by the system (kernel).Process% the percentage of real memory allocated to user processesFSCache% the percentage of real memory allocated to file system cacheSystem% the percentage of real memory used by system segmentsFree% the percen
7、tage of unallocated real memoryUser% the percentage of real memory used by non-system segments分项解析整体上内存的使用率趋于缓和趋势,system项(系统占用)没有增长,process项有平缓的增长趋于。自16点左右开始,用户使用内存比在持续增加,其他选项未发生变化。小结增加的内存使用中,主要是用户类型进程在持续增长。NMONmemuse选项选项官方解释Except for %comp, the values on this sheet are the same as would be reporte
8、d by the vmtune command.%numperm the percentage of real memory allocated to file pages.%minperm value specified on the vmtune command or system default of 20%. This will normally be constant for the run unless the vmtune or rmss commands are used during collection.%maxperm value specified on the vmt
9、une command or system default of 80%. This will normally be constant for the run unless the vmtune or rmss commands are used during collection.minfree the minimum number of pages AIX is to keep on the free list. Specified on the vmtune command or system default of maxfree - 8.maxfree the maximum num
10、ber of pages AIX is steal in order to replenish the free list. Specified on the vmtune command or system default.%comp The percentage of real memory allocated to computational pages. NMON_Analyser calculates this value. Computational pages are those backed by page space and include working storage a
11、nd program text segments. They exclude data, executable and shared library files.The Analyser generates two graphs.The first shows the split between computational and file pages by time of day.The second plots the values of%numperm, %minperm, %maxpermand %comp.If%numpermfalls below%minpermthen compu
12、tational pages will be stolen.If%numpermrises above%maxpermthen computational pages cannot be stolen.Low values for both%minpermand%maxpermindicate that the system has been tuned for a database server.You may also want to check the setting of STRICT_MAXPERM on the BBBP sheet (if present).分项解析从图中可看出,
13、计算内存为发生明显变化,永久内存在16点后开始持续增长,并且增长的内存一直未释放。如图所示,%numberper和%numclient类型内存,在16点后开始增长,到16点半左右趋于稳定,且后续增长的内存一直未释放。小结自16点开始持续增长的内存应是用于了持久性内存,且为网络文件系统类型的(根据numclient得出)NMON-net选项选项官方解释This sheet shows the data rates, in Kbytes/sec, for each network adapter in the system (including SP switch if present).This
14、 is the same as produced by the netpmon O dd command.NMON_Analyser adds one column for each adapter showing the total data rate (read + write) and two columns showing Total Read and Total Write.Note that the Total Write is calculated as a negative number for graphing.The Analyser generates three gra
15、phs.The first graph shows total network traffic broken down as Total-Read and Total-Write.The writes are shown below the X-Axis.Note the area chart can be easily converted to a line chart if required.Simply right click on the white space within the chart area, then select Chart TypeLineOK.分项解析如图所示,在
16、16点左右的时间段en4网卡产生了大量的读流量小结可从侧面印证numclient类型内存的大量使用。NMONpage选项选项官方解释This sheet has the paging statistics as recorded by NMON.faults the number of page faults per second. This is not a count of page faults that generate I/O, because some page faults can be resolved without I/O.pgin the total rate/sec o
17、f in operations to both paging space and file systems during the interval.pgout the total rate/sec of out operations to both paging space and file systems during the interval.pgsin the rate/sec of in operations from paging space during the interval. This is the same as the pi value reported by vmsta
18、t. If pgsin is consistently higher than pgsout this may indicate thrashing. pgsout the rate/sec of out operations to paging space during the interval. This is the same as the po value reported by vmstat.reclaims from NMON 10 onwards this field is the same as thefrvalue reported byvmstatand represent
19、s the number of pages/sec freed by the replacement routine.scans the number of pages/sec examined by the page replacement routine. This is the same as the sr value reported by vmstat. Page replacement is initiated when the number of free pages falls below minfree and stops when the number of free pa
20、ges exceeds maxfree.cycles the number of times/sec the page replacement routine had to scan the entire Page Frame Table in order to replenish the free list. This is the same as the cy value reported by vmstat but note that vmstat reports this number as an integer whereas nmon reports it as a real nu
21、mber.fsin calculated by the Analyser as pgin-pgsin for graphingfsout calculated by the Analyser as pgout-pgsout for graphingsr/fr calculated by the Analyser as scans/reclaims for graphingNMON_Analyser produces two graphs.The first shows paging operations to/from paging space.The ideal here would be
22、no more than 5 operations/sec per page space (see the BBBC sheet for details).The second graph shows the scan:free rate.Memory may be over-committed when this figure is 4 although you also need to examine the MEM and PAGE sheets as well.分项解析在16点时间段,有基于文件系统的密集换出操作在16点左右pgout和fsout的密集性换出操作,在整体的时间段faul
23、ts值很高。小结16点左右的page io可以是由于在这个时段有大量的numclient内存动作,可以解释。而faults动作暂时保留NMON-topas选项选项官方解释This sheet is only generated if you specify the -t flag on the NMON command line. The output is similar to that produced using the ps v command.Note that, because of the limitation of having only 65,000 lines on a s
24、ingle sheet, some data may be omitted for very large files and this may mean that entire PIDs or even commands may be missing from the analysis.Note that data are only present for processes that consumed a significant amount of CPU during an interval.The TOP sheet does not represent a complete view
25、of the system.NMON_Analyser does the following:reorders the columns for easier processing. Sorts the data on the sheet into COMMAND name order - using TIME as a minor sort key. Creates a table at the end of the sheet summarising the data by command name and used for graphing.You can see the detail s
26、ection by scrolling to the top of the sheet. The summary table is largely obscured by the graphs and so you will need to move (or delete) them for easier viewing.PID in the detail section this is the process ID of a specific invocation of a command. In the summary table this is the command name.%CPU
27、 in the detail section this is the utilisation of a single processor (rather than of the system) by that PID during the interval. In the summary table this is the average amount of CPU used by all invocations of the command during the collection period.%Usr in the detail section this is the average
28、amount of User-mode CPU used by that PID during the interval.%Sys in the detail section this is the average amount of Kernel-mode CPU used by that PID during the interval.Threads the number of (software) threads being used by this command.Size the average amount of paging space (in Kbytes) allocated
29、 for the data section (private segment + shared library data pages) for one invocation of this command. This is the same as the SIZE figure on the ps v command. Note that if Size is greater than ResData it means some working segment pages are currently paged out.ResText the average amount of real me
30、mory (in Kbytes) used for the code segments of one invocation of this command. Note that multiple concurrent invocations will normally share these pages.ResData the average amount of real memory (in Kbytes) used for the data segments of one invocation of this command. A method of calculating real me
31、mory usage for a command is ResText + (ResData * N).CharIO this is the count of bytes/sec being passed via the read and write system calls. The bulk of this is reading and writing to disks but also includes data to/from terminals, sockets and pipes. Use this to work out which processes are doing the
32、 I/O.%RAM this is an indication of what percentage of real memory this command is using. This is (ResText + ResData) / Real Mem; it is the same as the %MEM value on theps v command. Due to rounding/truncation, and the large amounts of memory in modern systems, this is usually 0. Paging sum of all pa
33、ge faults for this process. Use this to identify which process is causing paging but note that the figure includes asynchronous I/O and can be misleading.Command name of the commandWLMClass name of the Workload Partition or Workload Manager superclass to which this command has been allocated (64-bit
34、 kernel only). IntervalCPU generated by the Analyser. In the detail section this shows the total amount of CPU used by all invocations of a command in the time interval. It is calculated as the sum of CPU used by all PIDS running the same command divided by the number of active processors (physical
35、cores) available during the interval. In the summary section this is broken down as Average, Weighted Average and Maximum and is used to generate the graph.WSet generated by the Analyser. In the detail section this shows the total amount of memory used by all invocations of a command recorded in the
36、 time interval. It is calculated as ResText + (ResData * N) (where “N” is the number of copies of this command running concurrently during the interval). In the summary section this is broken down as Minimum, Average and Maximum and is used to generate the graph.User generated by the Analyser if a U
37、ARG sheet is present. This contains the name of the user running the process. Arg generated by the Analyser if a UARG sheet is present. This contains the complete argument string entered for the command.The Analyser generates four graphs using data in the generated table:A graph showing Average, Wei
38、ghted Average and Maximum CPU Utilisation by commandA graph showing Minimum, Average and Maximum Memory Utilisation by commandA graph showing Average, Weighted Average and Maximum CHARIO by commandA graph showing the CPU utilisation for each PID for each interval as a scatter chart. Note that this chart is only produced if there are fewer than 32,000 lines on the TOP sheet. See below for notes on interpreting this chart.Interpreting the %Processor by PID chartThe purpose of the chart
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- (编制说明)《养殖水体水质参数遥感监测技术规程》(征求意见稿)
- 集字练习:学然后知不足说课稿2025学年小学书法人美版六年级上册-人美版
- (秋季版)七年级道德与法治下册 第三单元 主动了解社会 第13课 善用网络与媒 第2课时 我的网络我做主教学设计 陕教版
- 2026年亚马逊英国上线测试题及答案
- 2026年京东运营助理测试题及答案
- 2026年中医诊断测试题及答案
- 2026年体系文件宣贯测试题及答案
- 2026年民法总则培训测试题及答案
- 2026年数学广角集合测试题及答案
- 2026年空气悬架测试题及答案
- 危险物品管理:安全储存和处理氧化剂和还原剂的方法和要求
- 2022年保育师理论知识考试题库(含答案)
- JCT908-2013 人造石的标准
- 【基于PLC的交通信号灯控制系统设计7000字(论文)】
- 施工图出图计划
- 园林植物病虫害防治高职全套完整教学课件
- 医用内窥镜冷光源产品技术要求深圳迈瑞
- 热控专业试题-热工试题
- GB/T 10857-2005S型和C型钢制滚子链条、附件和链轮
- 高大支模架工程监理实施细则
- 科技论文写作与学术规范
评论
0/150
提交评论