您好,欢迎访问三七文档
该问题目前的分析:1、9312-A主板(1/13)忽然出现硬件故障,导致该单板不停复位。Jan19201214:29:07Quidway%%01CSSM/4/STACKBACKUP(l)[33]:ThisclusterCSScompeteresultisbackup.Jan19201214:29:15Quidway%%01ALML/4/CLOCKFAULT(l)[50]:TheCLK_33M_CHKsensor15ofMPUboard[1/13]detectclocksignalfaultJan19201214:29:15Quidway%%01ALML/4/CLOCKFAULT(l)[51]:TheCLK_125M_CHKsensor16ofMPUboard[1/13]detectclocksignalfaultJan19201214:29:15Quidway%%01ALML/4/CLOCKFAULT_RESUME(l)[55]:TheCLK_125M_CHKsensor16ofMPUboard[1/13]detectclocksignalfaultresumeJan19201214:29:15Quidway%%01ALML/4/CLOCKFAULT(l)[56]:TheCLK_125M_CHKsensor16ofMPUboard[1/13]detectclocksignalfaultJan19201214:29:15Quidway%%01ALML/3/CPU_RESET(l)[57]:ThecanbusnodeofMPUboard[1/13]detectsthatCPUwasreset.2、由于该单板的复位导致9312-A备板(1/14)也出现异常复位,应该是由于1/13单板复位导致,怀疑是1/13板一直复位,自动回退到了老的版本,此时出现主备板版本不一致引发。V1R6后续版本已经解决该问题。Jan19201214:29:41Quidway%%01ALML/4/ENTRESET(l):MPUframe[1]board[14]isreset,Thereasonis:VRPresetselfboardbecauseoffindexception.3、此时1框的两块主控都复位了,导致堆叠分裂。分裂之后,1/14单板启动,启动完成之后又会堆叠合并。合并的过程会出现2号框的整框复位,这个是堆叠机制要求的。Jan19201214:39:05Quidway%%01ALML/4/ENTRESET(l):LPUframe[2]board[5]isreset,Thereasonis:ResetforCSSmanagement.Jan19201214:39:05Quidway%%01ALML/4/ENTRESET(l):LPUframe[2]board[8]isreset,Thereasonis:ResetforCSSmanagement.Jan19201214:39:05Quidway%%01ALML/4/ENTRESET(l):MPUframe[2]board[14]isreset,Thereasonis:ResetforCSSmanagement.Jan19201214:39:06Quidway%%01ALML/4/ENTRESET(l):MPUframe[2]board[13]isreset,Thereasonis:ResetforCSSmanagement.4、1/13故障之后引发了1/14单板的复位,同时1/14的复位引发了2框的复位。5、升级到V1R6之后,应该可以解决上诉问题。但是日志里分别在01:23:21才使能了两框的堆叠,但是01:49:17、02:21:42、02:12:29和02:32:05都有电源的告警,怀疑是人为手动整框下电,在02:29:13的时候去使能了堆叠,之后就一直没有再使能堆叠,一直处于单框工作状态。详细分析如下:B单框直到201:23才开始有堆叠Jan19201221:13:18Quidway%%01SHELL/5/CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=startupsystem-softwarecfcard:/s9300v100r006c00spc800.ccslave-board)Jan19201221:13:22Quidway%%01SHELL/6/DISPLAY_CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=displaystartup)Jan19201221:13:35Quidway%%01SHELL/6/DISPLAY_CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=displaycurrent-configuration)Jan19201221:13:40Quidway%%01SHELL/5/CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=system-view)A框20号01:08:53才开始使能堆叠an19201222:01:01QuidwayBASETRAP/4/CPUUSAGERESUME:OID1.3.6.1.4.1.2011.5.25.129.2.4.2CPUutilizationresumedfromexceedingthepre-alarmthreshold.(Index=70516745,BaseUsagePhyIndex=0,UsageType=1,UsageIndex=0,Severity=6,ProbableCause=154,EventType=4,PhysicalName=MPUBoard13,RelativeResource=,UsageValue=73,UsageUnit=1,UsageThreshold=80)Jan19201222:01:07Quidway%%01SHELL/5/CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=getS9300V100R006C00SPC800.CC)Jan19201222:02:31Quidway%%01SHELL/5/CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=quit)Jan19201222:02:32Quidway%%01SHELL/5/CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=dir)A日志Jan20201201:04:46QuidwayBASETRAP/4/CPUUSAGERESUME:OID1.3.6.1.4.1.2011.5.25.129.2.4.2CPUutilizationrecoveredtothenormalrange.(Index=68419593,BaseUsagePhyIndex=0,UsageType=1,UsageIndex=0,Severity=6,ProbableCause=154,EventType=4,PhysicalName=LPUBoard5,RelativeResource=,UsageValue=27,UsageUnit=1,UsageThreshold=80)Jan20201201:08:19Quidway%%01SHELL/6/DISPLAY_CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=displaydevice)Jan20201201:08:47Quidway%%01SHELL/5/CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=system-view)Jan20201201:08:53Quidway%%01SHELL/5/CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=cssenable)B日志Jan20201201:23:21SwitchB%%01SHELL/5/CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=save)Jan20201201:23:22SwitchB%%01HWCM/5/TRAPLOG(l):OID1.3.6.1.4.1.2011.6.10.2.1configurechanged.(EventIndex=9,CommandSource=1,ConfigSource=2,ConfigDestination=4)Jan20201201:23:26SwitchB%%01SHELL/5/CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=system-view)Jan20201201:23:28SwitchB%%01SHELL/5/CMDRECORD(l):Recordcommandinformation.(Task=co0,Ip=**,User=**,Command=cssenable)Jan20201201:23:29SwitchB%%01VFS/5/DEV_UNREG(l):Deviceslave#flash:unregistrationfinished.Jan20201201:23:29SwitchB%%01VFS/5/DEV_UNREG(l):Deviceslave#cfcard:unregistrationfinished.B日志Jan20201201:26:08SwitchA%%01CSSM/4/STACKBACKUP(l)[326]:ThisclusterCSScompeteresultisbackup.选为备框Jan20201201:56:59SwitchB%%01CSSM/4/STACKMASTER(l):ThisclusterCSScompeteresultismaster.A日志Jan20201201:25:15SwitchA%%01CSSM/4/STACKMASTER(l):ThisclusterCSScompeteresultismaster.选为主框Selfslot:25,CSSstatus:masterMatser:[1,25],backup:[2,27]1:49分25掉电了。主备切换。B为主框Jan20201201:49:17SwitchA%%01ALML/4/IOFAULT(l):TheACMODEPROTECsensor3of[FRAME1/PWR1]detectsafault.Jan20201201:49:17SwitchA%%01ALML/4/IOFAULT(l):TheACMODEPROTECsensor3of[FRAME1/PWR2]detectsafault.Jan20201201:49:18SwitchA%%01ALML/4/IOFAULT(l):TheACMODEPROTECsensor3of[FRAME1/PWR3]detectsafault.%2012-Jan-2001:56:29.790.2SwitchA01SOURCE/6/TASKREGSUC(D)[64]:Succeedt
本文标题:错误日志分析
链接地址:https://www.777doc.com/doc-1974411 .html