[SunHELP] Problems with an U2/2x300MHz

Michael Karl sunhelp at sunhelp.org
Tue Dec 18 19:10:43 CST 2001


Hi all,

since summer I have a problem with an U2 with 2x 300MHz CPUs and Sol-2.6.
This three year old machine crashed several times without any messages.
Only powerdown and poweron has helped.

Two month ago I decided to upgrade to Solaris 8 with ufs-logging.
(Since 10 days it has the latest recommended patch-cluster)

Today this machine crashed two times.

Please have a look to the following /var/adm/messages

****************************************************************************
Dec 18 14:55:14 SUN-Server savecore: [ID 570001 auth.error] reboot after
panic: [AFT1] errID 0x0000a876.04420107 UE Error(s)
Dec 18 14:55:14 SUN-Server     See previous message(s) for details
****************************************************************************

No previous messages ... :-(

Some minutes later ... :-(((
 
****************************************************************************
Dec 18 15:16:26 SUN-Server SUNW,UltraSPARC-II: [ID 155710 kern.warning]
WARNING: [AFT1] Uncorrectable Memory Error on CPU1 Data access at TL>0,
errID 0x00000141.cc396c07
Dec 18 15:16:26 SUN-Server     AFSR 0x00000000.80200000<PRIV,UE> AFAR
0x00000000.3f217eb8
Dec 18 15:16:26 SUN-Server     AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00
Fault_PC 0x100bdbc4
Dec 18 15:16:26 SUN-Server     UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0203<UE>
UDBL.ESYND 0x03
Dec 18 15:16:26 SUN-Server     UDBL Syndrome 0x3 Memory Module U0702 U0602
Dec 18 15:16:26 SUN-Server SUNW,UltraSPARC-II: [ID 579265 kern.warning]
WARNING: [AFT1] errID 0x00000141.cc396c07 Syndrome 0x3 indicates that this
may not be a memory module problem
Dec 18 15:16:26 SUN-Server SUNW,UltraSPARC-II: [ID 811682 kern.info] [AFT2]
errID 0x00000141.cc396c07 PA=0x00000000.3f217eb8
Dec 18 15:16:26 SUN-Server     E$tag 0x00000000.0a4007e4 E$State: Shared
E$parity 0x05 
Dec 18 15:16:26 SUN-Server SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x00): 0x00000300.01b7c678
Dec 18 15:16:26 SUN-Server SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x08): 0x00000300.00410480
Dec 18 15:16:26 SUN-Server SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x10): 0x00000300.003d4c00
Dec 18 15:16:26 SUN-Server SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x18): 0x00000300.00412280
Dec 18 15:16:26 SUN-Server SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x20): 0x00000000.026cc000
Dec 18 15:16:26 SUN-Server SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x28): 0x00000000.00000000
Dec 18 15:16:26 SUN-Server SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x30): 0x00000300.01b7c678
Dec 18 15:16:26 SUN-Server SUNW,UltraSPARC-II: [ID 989652 kern.info] [AFT2]
E$Data (0x38): 0x00000300.001d5110 *Bad* PSYND=0x00ff
Dec 18 15:16:26 SUN-Server unix: [ID 836849 kern.notice]
Dec 18 15:16:26 SUN-Server panic[cpu1]/thread=300019ce800:
Dec 18 15:16:26 SUN-Server unix: [ID 825528 kern.notice] [AFT1] errID
0x00000141.cc396c07 UE Error(s)
Dec 18 15:16:26 SUN-Server     See previous message(s) for details
Dec 18 15:16:26 SUN-Server unix: [ID 100000 kern.notice]
Dec 18 15:16:26 SUN-Server genunix: [ID 723222 kern.notice] 000002a100464ec0
SUNW,UltraSPARC-II:cpu_aflt_log+4e0 (2a100464f7e, 1, 10146a98, 2a100465108,
2a100464fcb, 10146ac0)
Dec 18 15:16:26 SUN-Server genunix: [ID 179002 kern.notice]   %l0-3:
0000000000000000 000002a1004651d0 0000000000000003 0000000000000010
Dec 18 15:16:26 SUN-Server   %l4-7: 0000000000041473 00000000ffbeb2f4
0000000000000000 00000000000da00c
Dec 18 15:16:26 SUN-Server genunix: [ID 723222 kern.notice] 000002a100465110
SUNW,UltraSPARC-II:cpu_async_error+868 (104597b0, 2a1004651d0, 80200000, 0,
4650180080200000, 2a100465390)
Dec 18 15:16:26 SUN-Server genunix: [ID 179002 kern.notice]   %l0-3:
000000001040dae4 0000000000000032 0000000000000203 0000000000000000
Dec 18 15:16:26 SUN-Server   %l4-7: 000000003f217e80 0000000000200000
0000000000200000 0000000000000001
Dec 18 15:16:26 SUN-Server genunix: [ID 723222 kern.notice] 000002a1004652e0
unix:prom_rtt+0 (10441ff0, 0, 2000, 0, 216d8, 77ff4)
Dec 18 15:16:26 SUN-Server genunix: [ID 179002 kern.notice]   %l0-3:
0000000000000006 0000000000001400 0000000000001605 000000001013e7d4
Dec 18 15:16:26 SUN-Server   %l4-7: 0000000000000000 0000000000000000
0000000000000000 000002a100465390
Dec 18 15:16:26 SUN-Server genunix: [ID 723222 kern.notice] 000002a100465430
genunix:segmap_hashout+48 (34e, 1013abd8, 2000, 10441ff0, 30000406a00,
30000410488)
Dec 18 15:16:26 SUN-Server genunix: [ID 179002 kern.notice]   %l0-3:
0000030000410480 000002a100465ba0 00000300019c2a90 0000000010032e04
Dec 18 15:16:26 SUN-Server   %l4-7: 00000000104336e0 0000030000a10e40
00000300019ce800 000002a100465ba0
Dec 18 15:16:26 SUN-Server genunix: [ID 723222 kern.notice] 000002a1004654e0
genunix:grab_smp+2c (30000406a00, 3000221a478, 5c000, ffbeb124, 208f80, 0)
Dec 18 15:16:26 SUN-Server genunix: [ID 179002 kern.notice]   %l0-3:
0000000010442000 00000000001f8f10 00000000001743c8 0000000000002000
Dec 18 15:16:26 SUN-Server   %l4-7: 00000000ffbeb124 0000000000000000
0000000000000000 00000000000da00c
Dec 18 15:16:26 SUN-Server genunix: [ID 723222 kern.notice] 000002a100465590
genunix:get_free_smp+1ac (10441e60, 1042fb28, 20, 10442190, 1042f8c8, a)
Dec 18 15:16:26 SUN-Server genunix: [ID 179002 kern.notice]   %l0-3:
000002a100465708 0000030000069998 0000000010441d80 0000030000406a00
Dec 18 15:16:26 SUN-Server   %l4-7: 0000030000069990 0000030000069988
0000030000406a00 00000000000da00c
Dec 18 15:16:26 SUN-Server genunix: [ID 723222 kern.notice] 000002a100465640
genunix:segmap_getmapflt+1c8 (10442180, 30001b7c678, 28e8000, 2e0, 10441d80,
10442060)
Dec 18 15:16:26 SUN-Server genunix: [ID 179002 kern.notice]   %l0-3:
00000000028e8000 00000000028f8000 0000030001b7c678 0000000060004b6c
Dec 18 15:16:26 SUN-Server   %l4-7: 000000000000005c 0000000000000000
00000000028e8000 0000000000000000
Dec 18 15:16:26 SUN-Server genunix: [ID 723222 kern.notice] 000002a100465710
ufs:rdip+2ec (28e8000, 0, 30001b7c730, 2a100465988, 0, 30001b7c728)
Dec 18 15:16:26 SUN-Server genunix: [ID 179002 kern.notice]   %l0-3:
0000000000000000 0000030000ee8000 0000000000000001 0000000000000001
Dec 18 15:16:26 SUN-Server   %l4-7: 0000030001b7c5e8 000002a100465978
0000000000002000 000000001041a7e0
Dec 18 15:16:26 SUN-Server genunix: [ID 723222 kern.notice] 000002a100465800
ufs:ufs_read+ec (3000052df28, 2a100465978, 30000ebdc48, 30001b7c5e8, 0, 0)
Dec 18 15:16:26 SUN-Server genunix: [ID 179002 kern.notice]   %l0-3:
0000030001b7c728 0000000000174234 0000000000000008 0000000000050000
Dec 18 15:16:26 SUN-Server   %l4-7: 0000000000041473 00000000ffbeb2f4
0000000000000000 00000000000da00c
Dec 18 15:16:26 SUN-Server genunix: [ID 723222 kern.notice] 000002a1004658c0
genunix:read+25c (28e8000, 0, 3, 30001938978, 7, 2000)
Dec 18 15:16:26 SUN-Server genunix: [ID 179002 kern.notice]   %l0-3:
000000001015f9cc 0000000000002000 0000030001b7c678 00000000001e6a38
Dec 18 15:16:26 SUN-Server   %l4-7: 0000000000000001 00000000001e48d8
0000000000050000 00000000000da00c
Dec 18 15:16:26 SUN-Server genunix: [ID 723222 kern.notice] 000002a100465a40
genunix:read32+30 (7, 1743c8, 2000, 0, 216d8, 77ff4)
Dec 18 15:16:26 SUN-Server genunix: [ID 179002 kern.notice]   %l0-3:
00000300019c8058 0000000000000007 0000000000000000 0000000000000000
Dec 18 15:16:26 SUN-Server   %l4-7: 0000000000000000 0000000000000000
0000000000000000 00000000000da00c
Dec 18 15:16:26 SUN-Server unix: [ID 100000 kern.notice]
Dec 18 15:16:26 SUN-Server genunix: [ID 672855 kern.notice] syncing file
systems...
Dec 18 15:16:26 SUN-Server genunix: [ID 433738 kern.notice]  [3]
Dec 18 15:16:26 SUN-Server genunix: [ID 733762 kern.notice]  19
.
.
.
Dec 18 15:16:26 SUN-Server genunix: [ID 433738 kern.notice]  [3]
Dec 18 15:16:26 SUN-Server genunix: [ID 733762 kern.notice]  12
Dec 18 15:16:26 SUN-Server genunix: [ID 616637 kern.notice]  cannot sync --
giving up
Dec 18 15:16:26 SUN-Server genunix: [ID 353387 kern.notice] dumping to
/dev/dsk/c1t0d0s1, offset 209715200
Dec 18 15:16:26 SUN-Server genunix: [ID 409368 kern.notice] 100% done: 10588
pages dumped, compression ratio 2.98,
Dec 18 15:16:26 SUN-Server genunix: [ID 851671 kern.notice] dump succeeded
****************************************************************************

I don't find any hints on sunsolve.sun.com or access1.sun.com.

Could it be a CPU-modul-problem ? ... or memory ? ... or damaged U2 ?

Thanks in advance,

Michael



More information about the SunHELP mailing list