注册 登录  
 加关注
   显示下一条  |  关闭
温馨提示!由于新浪微博认证机制调整,您的新浪微博帐号绑定已过期,请重新绑定!立即重新绑定新浪微博》  |  关闭

欢迎光临我的博客

 
 
 

日志

 
 

2月3日440问题  

2009-02-05 13:34:52|  分类: unix |  标签: |举报 |字号 订阅

  下载LOFTER 我的照片书  |

     2月3日以来440频繁启动,查看/var/adm/messages日志发现主要有几方面的错误,一是内存超限,一个是一个部件意外错误,一个是硬盘错误。以前就有硬盘放面的错误2月三号出现了五次重启,上午3次,下午2次,平均在两个小时左右重启一次,晚上关闭了服务器,关闭时间比较长,在服务器关闭之后,我发现从下往上数的第二快硬盘在别的硬盘灯都灭的时候,它还一直在亮,时间很长,4号早上来的时候发现第二块硬盘灯也熄灭了,本来想着拔掉第一块硬盘让启动一下,看看是否做了ride 5,如果做了ride 5的话,系统能启动起来,可以排除第一块硬盘的问题,考虑到安全问题没有做这一步。只是连接上了串口想看一下信息,也没有搜集到串口的信息,原因是那根串口线可能有问题,以前就用串口收集过一次信息,没收到,换了一根线就好了。4号服务器重启了五六次,在晚上关机的时候没有正常关机,出现5号早上ping 10.0.1.66能通,但是不能telnet 的问题,开机又看了一下日志信息,主要是内存错误信息,以前的错误信息如下:

1  部件意外出错

Feb  3 09:01:07 v440 EVENT-TIME: Tue Feb  3 09:01:07 GMT 2009
Feb  3 09:01:07 v440 PLATFORM: SUNW,Sun-Fire-V440, CSN: -, HOSTNAME: v440
Feb  3 09:01:07 v440 SOURCE: fmd-self-diagnosis, REV: 1.0
Feb  3 09:01:07 v440 EVENT-ID: db69c5e6-dfce-cca4-8a55-d825b39f3b9d
Feb  3 09:01:07 v440 DESC: The Solaris Fault Manager received an event from a component to which no automated diagnosis software is currently subscribed. Refer to http://sun.com/msg/FMD-8000-0W for more information.
Feb  3 09:01:07 v440 AUTO-RESPONSE: Error reports from the component will be logged for examination by Sun.
Feb  3 09:01:07 v440 IMPACT: Automated diagnosis and response for these events will not occur.
Feb  3 09:01:07 v440 REC-ACTION: Run pkgchk -n SUNWfmd to ensure that fault management software is installed properly.  Contact Sun for support.
Feb  3 09:01:07 v440 fmd: [ID 441519 daemon.error] SUNW-MSG-ID: FMD-8000-0W, TYPE: Defect, VER: 1, SEVERITY: Minor

2 硬盘错误

Feb  3 14:33:22 v440 scsi: [ID 107833 kern.warning] WARNING: /pci@1f,700000/scsi@2/sd@0,0 (sd15):
Feb  3 14:33:22 v440  Error for Command: read(10)                Error Level: Retryable
Feb  3 14:33:22 v440 scsi: [ID 107833 kern.notice]  Requested Block: 6394976                   Error Block: 6394979
Feb  3 14:33:22 v440 scsi: [ID 107833 kern.notice]  Vendor: SEAGATE                            Serial Number: 0446B9J8CF 
Feb  3 14:33:22 v440 scsi: [ID 107833 kern.notice]  Sense Key: Media Error
Feb  3 14:33:22 v440 scsi: [ID 107833 kern.notice]  ASC: 0x11 (unrecovered read error), ASCQ: 0x0, FRU: 0xf
Feb  3 14:33:23 v440 scsi: [ID 107833 kern.warning] WARNING: /pci@1f,700000/scsi@2/sd@0,0 (sd15):
Feb  3 14:33:23 v440  Error for Command: read(10)                Error Level: Retryable

3  内存错误

Feb  3 16:00:15 v440 EVENT-TIME: Tue Feb  3 16:00:15 GMT 2009
Feb  3 16:00:15 v440 PLATFORM: SUNW,Sun-Fire-V440, CSN: -, HOSTNAME: v440
Feb  3 16:00:15 v440 SOURCE: cpumem-diagnosis, REV: 1.3
Feb  3 16:00:15 v440 EVENT-ID: b7686528-30ea-429b-ecda-e98064ab7318
Feb  3 16:00:15 v440 DESC: The number of errors associated with this memory module has exceeded acceptable levels.  Refer to http://sun.com/msg/SUN4U-8000-2S for more information.
Feb  3 16:00:15 v440 AUTO-RESPONSE: Pages of memory associated with this memory module are being removed from service as errors are reported.
Feb  3 16:00:15 v440 IMPACT: Total system memory capacity will be reduced as pages are retired.
Feb  3 16:00:15 v440 REC-ACTION: Schedule a repair procedure to replace the affected memory module.  Use fmdump -v -u <EVENT_ID> to identify the module.

然后查了一下错误的详细信息

$ prtdiag -v
System Configuration: Sun Microsystems  sun4u Sun Fire V440
System clock frequency: 183 MHZ
Memory size: 16GB

==================================== CPUs ====================================
               E$          CPU                    CPU
CPU  Freq      Size        Implementation         Mask    Status      Location
---  --------  ----------  ---------------------  -----   ------      --------
0    1281 MHz  1MB         SUNW,UltraSPARC-IIIi    3.4    on-line      -
1    1281 MHz  1MB         SUNW,UltraSPARC-IIIi    3.4    on-line      -
2    1281 MHz  1MB         SUNW,UltraSPARC-IIIi    2.4    on-line      -
3    1281 MHz  1MB         SUNW,UltraSPARC-IIIi    2.4    on-line      -

================================= IO Devices =================================
Bus     Freq  Slot +      Name +
Type    MHz   Status      Path                          Model
------  ----  ----------  ----------------------------  --------------------
pci     66    MB          pci108e,abba (network)        SUNW,pci-ce
              okay        /pci@1c,600000/network@2

pci     66    PCI2        fibre-channel-pci10df,f900
              okay        /pci@1d,700000/fibre-channel@2

pci     33    MB          isa/su (serial)
              okay        /pci@1e,600000/isa@7/serial

pci     33    MB          isa/su (serial)
              okay        /pci@1e,600000/isa@7/serial

pci     33    MB          isa/rmc-comm-rmc_comm (seria+
              okay        /pci@1e,600000/isa@7/rmc-comm@0,3e8

pci     33    PCI0        SUNW,XVR-100 (display)        SUNW,375-3181
              okay        /pci@1e,600000/SUNW,XVR-100@2

pci     33    MB          pci10b9,5229 (ide)
              okay        /pci@1e,600000/ide@d

pci     66    MB          pci108e,abba (network)        SUNW,pci-ce
              okay        /pci@1f,700000/network@1

pci     66    MB          scsi-pci1000,30 (scsi-2)      LSI,1030
              okay        /pci@1f,700000/scsi@2

pci     66    MB          scsi-pci1000,30 (scsi-2)      LSI,1030
              okay        /pci@1f,700000/scsi@2,1


============================ Memory Configuration ============================
Segment Table:
-----------------------------------------------------------------------
Base Address       Size       Interleave Factor  Contains
-----------------------------------------------------------------------
0x0                4GB               16          BankIDs 0,1,2,3,4,5,6,7,8,9,10,
11,12,13,14,15
0x1000000000       4GB               16          BankIDs 16,17,18,19,20,21,22,23
,24,25,26,27,28,29,30,31
0x2000000000       4GB               16          BankIDs 32,33,34,35,36,37,38,39
,40,41,42,43,44,45,46,47
0x3000000000       4GB               16          BankIDs 48,49,50,51,52,53,54,55
,56,57,58,59,60,61,62,63

Bank Table:
-----------------------------------------------------------
           Physical Location
ID       ControllerID  GroupID   Size       Interleave Way
-----------------------------------------------------------
0        0             0         256MB           0,1,2,3,4,5,6,7,8,9,10,11,12,13
,14,15
1        0             0         256MB
2        0             1         256MB
3        0             1         256MB
4        0             0         256MB
5        0             0         256MB
6        0             1         256MB
7        0             1         256MB
8        0             1         256MB
9        0             1         256MB
10       0             0         256MB
11       0             0         256MB
12       0             1         256MB
13       0             1         256MB
14       0             0         256MB
15       0             0         256MB
16       1             0         256MB           0,1,2,3,4,5,6,7,8,9,10,11,12,13
,14,15
17       1             0         256MB
18       1             1         256MB
19       1             1         256MB
20       1             0         256MB
21       1             0         256MB
22       1             1         256MB
23       1             1         256MB
24       1             1         256MB
25       1             1         256MB
26       1             0         256MB
27       1             0         256MB
28       1             1         256MB
29       1             1         256MB
30       1             0         256MB
31       1             0         256MB
32       2             0         256MB           0,1,2,3,4,5,6,7,8,9,10,11,12,13
,14,15
33       2             0         256MB
34       2             1         256MB
35       2             1         256MB
36       2             0         256MB
37       2             0         256MB
38       2             1         256MB
39       2             1         256MB
40       2             1         256MB
41       2             1         256MB
42       2             0         256MB
43       2             0         256MB
44       2             1         256MB
45       2             1         256MB
46       2             0         256MB
47       2             0         256MB
48       3             0         256MB           0,1,2,3,4,5,6,7,8,9,10,11,12,13
,14,15
49       3             0         256MB
50       3             1         256MB
51       3             1         256MB
52       3             0         256MB
53       3             0         256MB
54       3             1         256MB
55       3             1         256MB
56       3             1         256MB
57       3             1         256MB
58       3             0         256MB
59       3             0         256MB
60       3             1         256MB
61       3             1         256MB
62       3             0         256MB
63       3             0         256MB

Memory Module Groups:
--------------------------------------------------
ControllerID   GroupID  Labels         Status
--------------------------------------------------
0              0        C0/P0/B0/D0
0              0        C0/P0/B0/D1
0              1        C0/P0/B1/D0
0              1        C0/P0/B1/D1
1              0        C1/P0/B0/D0
1              0        C1/P0/B0/D1
1              1        C1/P0/B1/D0
1              1        C1/P0/B1/D1
2              0        C2/P0/B0/D0
2              0        C2/P0/B0/D1
2              1        C2/P0/B1/D0
2              1        C2/P0/B1/D1
3              0        C3/P0/B0/D0
3              0        C3/P0/B0/D1
3              1        C3/P0/B1/D0
3              1        C3/P0/B1/D1

=============================== usb Devices ===============================

Name          Port#
------------  -----
keyboard        1

=============================== usb Devices ===============================

Name          Port#
------------  -----
mouse           1

============================ Environmental Status ============================
Fan Status:
-------------------------------------------
Location             Sensor          Status
-------------------------------------------
FT0/F0               TACH            okay
FT1/F0               TACH            okay
FT1/F1               TACH            okay
PS0                  FF_PDCT_FAN     okay
PS1                  FF_PDCT_FAN     okay

Temperature sensors:
-----------------------------------------
Location       Sensor              Status
-----------------------------------------
C0/P0          T_CORE              okay
C1/P0          T_CORE              okay
C2/P0          T_CORE              okay
C3/P0          T_CORE              okay
C0             T_AMB               okay
C1             T_AMB               okay
C2             T_AMB               okay
C3             T_AMB               okay
SCSIBP         T_AMB               okay
MB             T_AMB               okay
------------------------------------
Current sensors:
----------------------------------------
Location             Sensor       Status
----------------------------------------
MB                   FF_SCSIA     okay
MB                   FF_SCSIB     okay
MB                   FF_POK       okay
C0/P0                FF_POK       okay
C1/P0                FF_POK       okay
C2/P0                FF_POK       okay
C3/P0                FF_POK       okay
------------------------------------
Voltage sensors:
-----------------------------------
Location       Sensor        Status
-----------------------------------
MB             V_+1V5        okay
MB             V_VCCTM       okay
MB             V_NET0_1V2D   okay
MB             V_NET1_1V2D   okay
MB             V_NET0_1V2A   okay
MB             V_NET1_1V2A   okay
MB             V_+3V3        okay
MB             V_+3V3STBY    okay
MB/BAT         V_BAT         okay
MB             V_SCSI_CORE   okay
MB             V_+5V         okay
MB             V_+12V        okay
MB             V_-12V        okay
PS0            P_PWR         okay
PS0            FF_POK        okay
PS1            P_PWR         okay
PS1            FF_POK        okay
-----------------------------------------
Keyswitch:
-----------------------------------------
Location       Keyswitch   State
-----------------------------------------
SYS            SYSCTRL     NORMAL
--------------------------------------------------
Led State:
--------------------------------------------------------------
Location               Led                   State       Color
--------------------------------------------------------------
SYS                    ACT                   on          green
SYS                    SERVICE               off         amber
SYS                    LOCATE                off         white
PS0                    POK                   on          green
PS0                    STBY                  on          green
PS0                    SERVICE               off         amber
PS0                    OK2RM                 off         blue
PS1                    POK                   on          green
PS1                    STBY                  on          green
PS1                    SERVICE               off         amber
PS1                    OK2RM                 off         blue
HDD0                   SERVICE               off         amber
HDD0                   OK2RM                 off         blue
HDD1                   SERVICE               off         amber
HDD1                   OK2RM                 off         blue
HDD2                   SERVICE               off         amber
HDD2                   OK2RM                 off         blue
HDD3                   SERVICE               off         amber
HDD3                   OK2RM                 off         blue

=========================== FRU Operational Status ===========================
---------------------------------
Fru Operational Status:
---------------------------------
Location                Status
---------------------------------
SC                      okay
HDD0                    present
HDD1                    present
HDD2                    present
HDD3                    present
PS0                     okay
PS1                     okay

================================ HW Revisions ================================
ASIC Revisions:
-------------------------------------------------------------------
Path                   Device           Status             Revision
-------------------------------------------------------------------
/pci@1c,600000         pci108e,a801     okay               4
/pci@1d,700000         pci108e,a801     okay               4
/pci@1e,600000         pci108e,a801     okay               4
/pci@1f,700000         pci108e,a801     okay               4

System PROM revisions:
----------------------
OBP 4.13.2 2004/03/29 10:11 Sun Fire V440,Netra 440
OBDIAG 4.13.2 2004/03/29 10:12
$
# crontab -l
#ident  "@(#)root       1.21    04/03/23 SMI"
#
# The root crontab should be used to perform accounting data collection.
#
#
10 3 * * * /usr/sbin/logadm
25 1 * * * /usr/lib/patch/swupAuto > /dev/null 2>&1
30 3 * * * [ -x /usr/lib/gss/gsscred_clean ] && /usr/lib/gss/gsscred_clean
# 10 3 * * * /usr/lib/krb5/kprop_script ___slave_kdcs___
#30 5 * * * /cold_back.sh
#50 12 * * 0,1,3,5 /cperp2bak.sh

$ pkgchk -n SUNWfmd
NOTE: Couldn't lock the package database.

$ fmdump -v -u db69c5e6-dfce-cca4-8a55-d825b39f3b9d
TIME                 UUID                                 SUNW-MSG-ID
Feb 03 09:01:07.0010 db69c5e6-dfce-cca4-8a55-d825b39f3b9d FMD-8000-0W
  100%  defect.sunos.fmd.nosub
         FRU: -
        rsrc: -

$ fmdump -v -u b7686528-30ea-429b-ecda-e98064ab7318
TIME                 UUID                                 SUNW-MSG-ID
Jan 24 06:22:35.8016 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 24 06:58:51.1336 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 24 10:55:58.8971 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 24 12:59:31.0947 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 24 16:46:30.9119 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 24 23:14:01.0402 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 25 05:53:31.9847 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 25 23:43:09.0255 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 26 03:45:07.0486 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 26 06:26:41.2695 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 26 07:15:21.6349 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 26 23:03:50.3002 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 27 06:00:33.5067 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 27 06:26:27.8257 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 27 17:32:00.4125 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 28 01:15:24.1093 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 28 10:56:51.9351 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 29 04:13:40.1503 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 29 08:55:29.1142 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 29 12:34:58.1105 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 29 18:33:14.5206 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 30 16:09:17.8136 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 31 02:21:42.7862 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 31 14:01:27.4265 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 01 10:04:42.4945 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 01 12:58:50.8575 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 01 17:56:04.8340 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 02 07:48:24.6332 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 02 08:59:58.4107 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 03 14:36:38.2864 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 03 16:00:15.3604 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 03 17:25:45.7261 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 04 08:05:29.3068 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 04 10:00:15.5272 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 04 11:11:45.7342 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 04 14:21:57.6630 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 04 16:11:34.9218 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 04 17:36:12.5517 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 05 08:16:41.4779 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

$
$ fmdump -v -u e4794211-4548-46eb-ea09-97cf87fffd95
TIME                 UUID                                 SUNW-MSG-ID
Feb 03 14:47:04.5263 e4794211-4548-46eb-ea09-97cf87fffd95 FMD-8000-2K
  100%  defect.sunos.fmd.module
         FRU: -
        rsrc: fmd:///module/cpumem-diagnosis

$fmdump -v -u b7686528-30ea-429b-ecda-e98064ab7318
Jan 29 18:33:14.5206 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 30 16:09:17.8136 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 31 02:21:42.7862 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 31 14:01:27.4265 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 01 10:04:42.4945 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 01 12:58:50.8575 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 01 17:56:04.8340 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 02 07:48:24.6332 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 02 08:59:58.4107 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 03 14:36:38.2864 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 03 16:00:15.3604 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 03 17:25:45.7261 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 04 08:05:29.3068 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 04 10:00:15.5272 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 04 11:11:45.7342 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 04 14:21:57.6630 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 04 16:11:34.9218 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 04 17:36:12.5517 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Feb 05 08:16:41.4779 b7686528-30ea-429b-ecda-e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

$ fmdump -v -u e4794211-4548-46eb-ea09-97cf87fffd95
TIME                 UUID                                 SUNW-MSG-ID
Feb 03 14:47:04.5263 e4794211-4548-46eb-ea09-97cf87fffd95 FMD-8000-2K
  100%  defect.sunos.fmd.module
         FRU: -
        rsrc: fmd:///module/cpumem-diagnosis

$

之后又参考了网站信息

想先通过软件方式能不能解决问题,执行的操作如下;

# fmadm config
MODULE                   VERSION STATUS  DESCRIPTION
cpumem-diagnosis         1.3     failed  UltraSPARC-III

CPU/Memory Diagnosis
cpumem-retire            1.0     active  CPU/Memory Retire Agent
eft                      1.12    active  eft diagnosis engine
fmd-self-diagnosis       1.0     active  Fault Manager Self-

Diagnosis
io-retire                1.0     active  I/O Retire Agent
syslog-msgs              1.0     active  Syslog Messaging Agent
#
# fmdump
TIME                 UUID                                 SUNW-MSG-ID
Jan 24 06:22:35.8016 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
Jan 24 06:58:51.1336 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
Jan 24 10:55:58.8971 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
Jan 24 12:59:31.0947 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
Jan 24 16:46:30.9119 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
Jan 24 23:14:01.0402 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
Jan 25 05:53:31.9847 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
Jan 25 23:43:09.0255 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
Jan 26 03:45:07.0486 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
Jan 26 06:26:41.2695 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
Jan 26 07:15:21.6349 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
^C#
# fmdump -v -u b7686528-30ea-429b-ecda-e98064ab7318
TIME                 UUID                                 SUNW-MSG-ID
Jan 24 06:22:35.8016 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

Jan 24 06:58:51.1336 b7686528-30ea-429b-ecda-

e98064ab7318 SUN4U-8000-2S
   95%  fault.memory.dimm
         FRU: mem:///component=C1/P0/B1/D0:B1/D0
        rsrc: mem:///component=C1/P0/B1/D0:B1/D0

^C
# #
# fmadm faulty
   STATE RESOURCE / UUID
-------- ---------------------------------------------------------------
degraded fmd:///module/cpumem-diagnosis
         70070f1b-ae48-ebd5-c357-a0a0a6b21122
-------- ---------------------------------------------------------------
degraded mem:///component=C1/P0/B1/D0:B1/D0
         b7686528-30ea-429b-ecda-e98064ab7318
-------- ---------------------------------------------------------------
# fmadm repair mem:///component=C1/P0/B1/D0:B1/D0
fmadm: recorded repair to

mem:///component=C1/P0/B1/D0:B1/D0
## fmadm faulty
   STATE RESOURCE / UUID
-------- -----------------------------------------------------------------
degraded fmd:///module/cpumem-diagnosis
         70070f1b-ae48-ebd5-c357-a0a0a6b21122
-------- -----------------------------------------------------------------
# fmdump -v -u 70070f1b-ae48-ebd5-c357-a0a0a6b21122
TIME                 UUID                                 SUNW-MSG-ID
Jan 29 08:06:03.6999 70070f1b-ae48-ebd5-c357-

a0a0a6b21122 FMD-8000-2K
  100%  defect.sunos.fmd.module
         FRU: -
        rsrc: fmd:///module/cpumem-diagnosis
# pkgchk -n SUNWfmd
#
# fmadm config
MODULE                   VERSION STATUS  DESCRIPTION
cpumem-diagnosis         1.3     active  UltraSPARC-III

CPU/Memory Diagnosis
cpumem-retire            1.0     active  CPU/Memory Retire Agent
eft                      1.12    active  eft diagnosis engine
fmd-self-diagnosis       1.0     active  Fault Manager Self-

Diagnosis
io-retire                1.0     active  I/O Retire Agent
syslog-msgs              1.0     active  Syslog Messaging Agent

之后又从新启动了服务器,服务器没有关机重启,又变成了早上的状态,能ping通,telnet不通,启动不到10分钟,服务器又重启了,启动起来的时候发现内存的那个日志信息没有了,但是不知道什么原因又重启了,这次要看能坚持多长时间了


 

 

  评论这张
 
阅读(992)| 评论(0)
推荐 转载

历史上的今天

在LOFTER的更多文章

评论

<#--最新日志,群博日志--> <#--推荐日志--> <#--引用记录--> <#--博主推荐--> <#--随机阅读--> <#--首页推荐--> <#--历史上的今天--> <#--被推荐日志--> <#--上一篇,下一篇--> <#-- 热度 --> <#-- 网易新闻广告 --> <#--右边模块结构--> <#--评论模块结构--> <#--引用模块结构--> <#--博主发起的投票-->
 
 
 
 
 
 
 
 
 
 
 
 
 
 

页脚

网易公司版权所有 ©1997-2017