--- Begin Message ---
- To: Debian Bug Tracking System <submit@bugs.debian.org>
- Subject: base: I receive CPU type error traps from the kernel from time to time and with alarmingly greater frequency when I try to start X11 using startx
- From: Paul Llanyod <donallys1@gmail.com>
- Date: Tue, 04 Feb 2014 12:53:44 +0200
- Message-id: <20140204105344.4811.81221.reportbug@debian.sandolien.net>
Package: base
Severity: normal
-- System Information:
Debian Release: 7.3
APT prefers stable-updates
APT policy: (500, 'stable-updates'), (500, 'stable')
Architecture: sparc (sparc64)
Kernel: Linux 3.2.0-4-sparc64
Locale: LANG=en_ZA.UTF-8, LC_CTYPE=en_ZA.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
I receive the following error or trap from time to time on my server (a Sun Blade 2000), which is running in "headless mode". When I try to start the
GUI using startx, the error occurs so frequently that I cannot use the console screen. My hardware otherwise seems to be functioning perfectly, but I have not
succeeded in starting the GUI (which I would like to do).
The error, shown below, repeats for a while once it occurs. If I type startx from the command line, the error does not stop as readily and I have
to reboot the machine to return it to a stable state.
Jan 30 12:11:42 debian kernel: [ 952.957402] ERROR(0): Cheetah error trap taken afsr[000000020000005b] afar[000000006cfd2e90] TL1(0)
Jan 30 12:11:42 debian kernel: [ 952.957956] ERROR(0): TPC[5d1e90] TNPC[5d1e94] O7[4c1c8c] TSTATE[11001601]
Jan 30 12:11:42 debian kernel: [ 952.958380] ERROR(0): TPC<U3copy_to_user+0x190/0x500>
Jan 30 12:11:42 debian kernel: [ 952.958724] ERROR(0): M_SYND(0), E_SYND(5b)
Jan 30 12:11:42 debian kernel: [ 952.959003] ERROR(0): Highest priority error (0000000200000000) "HW corrected system bus data ECC error for read"
Jan 30 12:11:42 debian kernel: [ 952.959617] ERROR(0): AFAR E-syndrome [J0304, pin 19]
Jan 30 12:11:42 debian kernel: [ 952.959946] ERROR(0): D-cache idx[ee90] tag[000000000006cc1f] utag[0000000000004f07] stag[000000000006cc1e]
Jan 30 12:11:42 debian kernel: [ 952.960531] ERROR(0): D-cache data0[b646cadc1a96df61] data1[0800000204020042] data2[fc0e771fb5c70a95] data3[00000000ff82ede4]
Jan 30 12:11:42 debian kernel: [ 952.961204] ERROR(0): I-cache idx[0] tag[0000000000000000] utag[0000000000000000] stag[0000000000000000] u[0000000000000000] l[0000000000000000]
Jan 30 12:11:42 debian kernel: [ 952.962387] ERROR(0): I-cache INSN0[0000000000000000] INSN1[0000000000000000] INSN2[0000000000000000] INSN3[0000000000000000]
Jan 30 12:11:42 debian kernel: [ 952.963058] ERROR(0): I-cache INSN4[0000000000000000] INSN5[0000000000000000] INSN6[0000000000000000] INSN7[0000000000000000]
Jan 30 12:11:42 debian kernel: [ 952.963727] ERROR(0): E-cache idx[6cfd2e80] tag[00000000000000e7]
Jan 30 12:11:42 debian kernel: [ 952.964109] ERROR(0): E-cache data0[bca1876cc1e4352a] data1[67fe78d2554431b2] data2[091f48b54a9e7b15] data3[a596f28215531bba]
Jan 30 12:11:42 debian kernel: [ 953.010971] ERROR(0): Cheetah error trap taken afsr[000000020000005b] afar[000000006cfeee90] TL1(0)
Jan 30 12:11:42 debian kernel: [ 953.011524] ERROR(0): TPC[5d1e90] TNPC[5d1e94] O7[4c1c8c] TSTATE[11001601]
Jan 30 12:11:42 debian kernel: [ 953.011948] ERROR(0): TPC<U3copy_to_user+0x190/0x500>
Jan 30 12:11:42 debian kernel: [ 953.012292] ERROR(0): M_SYND(0), E_SYND(5b)
Jan 30 12:11:42 debian kernel: [ 953.012572] ERROR(0): Highest priority error (0000000200000000) "HW corrected system bus data ECC error for read"
Jan 30 12:11:42 debian kernel: [ 953.013185] ERROR(0): AFAR E-syndrome [J0304, pin 19]
Jan 30 12:11:42 debian kernel: [ 953.013514] ERROR(0): D-cache idx[ee90] tag[000000000006cc1f] utag[0000000000004f07] stag[000000000006cc1e]
Jan 30 12:11:42 debian kernel: [ 953.014098] ERROR(0): D-cache data0[b646cadc1a96df61] data1[0800000204020042] data2[fc0e771fb5c70a95] data3[00000000ff82ede4]
Jan 30 12:11:42 debian kernel: [ 953.014772] ERROR(0): I-cache idx[0] tag[0000000000000000] utag[0000000000000000] stag[0000000000000000] u[0000000000000000] l[0000000000000000]
Jan 30 12:11:42 debian kernel: [ 953.015953] ERROR(0): I-cache INSN0[0000000000000000] INSN1[0000000000000000] INSN2[0000000000000000] INSN3[0000000000000000]
Jan 30 12:11:42 debian kernel: [ 953.016625] ERROR(0): I-cache INSN4[0000000000000000] INSN5[0000000000000000] INSN6[0000000000000000] INSN7[0000000000000000]
Jan 30 12:11:42 debian kernel: [ 953.017294] ERROR(0): E-cache idx[6cfeee80] tag[0000000000000090]
Jan 30 12:11:42 debian kernel: [ 953.017678] ERROR(0): E-cache data0[820461d8c4066008] data1[8728a002c2004003] data2[83286018c227bf58] data3[8338601880a06065]
Jan 30 12:11:54 debian kernel: [ 964.663357] /pci@8,700000: Correctable Error, primary error type[DMA Read]
Jan 30 12:11:54 debian kernel: [ 964.664223] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
Jan 30 12:11:54 debian kernel: [ 964.664648] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [ 964.665124] /pci@8,700000: CE AFAR [000000006ccf2e80]
Jan 30 12:11:54 debian kernel: [ 964.665449] /pci@8,700000: CE Secondary errors [(none)]
Jan 30 12:11:54 debian kernel: [ 964.665825] /pci@8,700000: Correctable Error, primary error type[DMA Read]
Jan 30 12:11:54 debian kernel: [ 964.666251] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
Jan 30 12:11:54 debian kernel: [ 964.666672] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [ 964.667137] /pci@8,700000: CE AFAR [000000006cd0ee80]
Jan 30 12:11:54 debian kernel: [ 964.667457] /pci@8,700000: CE Secondary errors [(DMA)]
Jan 30 12:11:54 debian kernel: [ 964.667870] /pci@8,700000: Correctable Error, primary error type[DMA Read]
Jan 30 12:11:54 debian kernel: [ 964.668306] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
Jan 30 12:11:54 debian kernel: [ 964.668729] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [ 964.669204] /pci@8,700000: CE AFAR [000000006ce0ee80]
Jan 30 12:11:54 debian kernel: [ 964.669530] /pci@8,700000: CE Secondary errors [(none)]
Jan 30 12:11:54 debian kernel: [ 964.669909] /pci@8,700000: Correctable Error, primary error type[DMA Read]
Jan 30 12:11:54 debian kernel: [ 964.670342] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
Jan 30 12:11:54 debian kernel: [ 964.670764] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [ 964.671237] /pci@8,700000: CE AFAR [000000006ce82e80]
Jan 30 12:11:54 debian kernel: [ 964.671558] /pci@8,700000: CE Secondary errors [(DMA)]
Jan 30 12:11:54 debian kernel: [ 964.671920] /pci@8,700000: Correctable Error, primary error type[DMA Read]
Jan 30 12:11:54 debian kernel: [ 964.672344] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
Jan 30 12:11:54 debian kernel: [ 964.672758] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [ 964.673223] /pci@8,700000: CE AFAR [000000006cfa2e80]
Jan 30 12:11:54 debian kernel: [ 964.673542] /pci@8,700000: CE Secondary errors [(DMA)]
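To make the log above easier to compare across occurrences, here is a minimal sketch in Python that pulls the fault addresses (afar / CE AFAR) and the ECC syndromes (E_SYND / ecc_sync) out of kernel log text. The function name summarize and the SAMPLE string are hypothetical; the sample lines are copied from the log above, and in practice the text would come from wherever the kernel log is stored on the machine.

```python
import re
from collections import Counter

# A few lines copied from the kernel log above. SAMPLE is only an
# illustration; real input would be the full kernel log text.
SAMPLE = """\
Jan 30 12:11:42 debian kernel: [ 952.957402] ERROR(0): Cheetah error trap taken afsr[000000020000005b] afar[000000006cfd2e90] TL1(0)
Jan 30 12:11:42 debian kernel: [ 952.958724] ERROR(0): M_SYND(0), E_SYND(5b)
Jan 30 12:11:54 debian kernel: [ 964.664648] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [ 964.665124] /pci@8,700000: CE AFAR [000000006ccf2e80]
"""

# Fault address: "afar[...]" in the CPU trap lines, "CE AFAR [...]" in the PCI lines.
AFAR_RE = re.compile(r"afar\[([0-9a-f]+)\]|CE AFAR \[([0-9a-f]+)\]")
# ECC syndrome: "E_SYND(..)" in the CPU trap lines, "ecc_sync[..]" in the PCI lines.
SYND_RE = re.compile(r"E_SYND\(([0-9a-f]+)\)|ecc_sync\[([0-9a-f]+)\]")


def summarize(text):
    """Return (list of fault addresses as ints, Counter of syndrome strings)."""
    afars = []
    syndromes = Counter()
    for line in text.splitlines():
        m = AFAR_RE.search(line)
        if m:
            afars.append(int(m.group(1) or m.group(2), 16))
        m = SYND_RE.search(line)
        if m:
            syndromes[m.group(1) or m.group(2)] += 1
    return afars, syndromes


if __name__ == "__main__":
    afars, syndromes = summarize(SAMPLE)
    print("syndromes:", dict(syndromes))
    print("fault addresses:", [hex(a) for a in afars])
```

Run against the full log, a summary like this shows at a glance whether the CPU traps and the PCI correctable errors report the same syndrome and how the fault addresses are spread.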
I would really appreciate it if you could let me know whether there is a workaround for this problem.
The peripherals identified by lspci are:
uname -a: Linux debian 3.2.0-4-sparc64 #1 Debian 3.2.51-1 sparc64 GNU/Linux
lspci -knn: 0000:00:01.0 VGA compatible controller [0300]: Advanced Micro Devices [AMD] nee ATI Rage XL [1002:4752] (rev 27)
lspci -knn: Kernel driver in use: atyfb
lspci -knn: 0000:00:05.0 Bridge [0680]: Oracle/SUN RIO EBUS [108e:1100] (rev 01)
lspci -knn: 0000:00:05.1 Ethernet controller [0200]: Oracle/SUN RIO 10/100 Ethernet [eri] [108e:1101] (rev 01)
lspci -knn: Kernel driver in use: gem
lspci -knn: 0000:00:05.2 FireWire (IEEE 1394) [0c00]: Oracle/SUN RIO 1394 [108e:1102] (rev 01)
lspci -knn: 0000:00:05.3 USB controller [0c03]: Oracle/SUN RIO USB [108e:1103] (rev 01)
lspci -knn: Kernel driver in use: ohci_hcd
lspci -knn: 0000:00:06.0 SCSI storage controller [0100]: LSI Logic / Symbios Logic 53c875 [1000:000f] (rev 37)
lspci -knn: Kernel driver in use: sym53c8xx
lspci -knn: 0000:00:06.1 SCSI storage controller [0100]: LSI Logic / Symbios Logic 53c875 [1000:000f] (rev 37)
lspci -knn: Kernel driver in use: sym53c8xx
lspci -knn: 0001:00:04.0 SCSI storage controller [0100]: QLogic Corp. QLA2200 64-bit Fibre Channel Adapter [1077:2200] (rev 05)
lspci -knn: Kernel driver in use: qla2xxx
/pci@8,700000 is the address of the VGA card:
/proc/iomem: 7fe00000000-7feffffffff : /pci@8,700000
/proc/iomem: 7fe000a0000-7fe000bffff : Video RAM area
/proc/iomem: 7fe000c0000-7fe000c7fff : Video ROM
/proc/iomem: 7fe000f0000-7fe000fffff : System ROM
/proc/iomem: 7fe00100000-7fe0011ffff : sungem
/proc/iomem: 7fe00124000-7fe00125fff : sym53c8xx
/proc/iomem: 7fe00126000-7fe00127fff : sym53c8xx
/proc/iomem: 7fe00128000-7fe00129fff : sym53c8xx
/proc/iomem: 7fe0012a000-7fe0012bfff : sym53c8xx
/proc/iomem: 7fe01000000-7fe01ffffff : ohci_hcd
/proc/iomem: 7fe02000000-7fe02ffffff : atyfb
/proc/iomem: 7fe7e400000-7fe7e40003f : sab
/proc/iomem: 7fe7e400040-7fe7e40007f : sab
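To check which of the regions listed above a given physical address falls into, a small lookup can be sketched in Python as follows. The helper names parse_iomem and find_regions are hypothetical, and the sample assumes the plain "start-end : name" layout of /proc/iomem with the "/proc/iomem: " prefix removed.

```python
# Two entries copied from the listing above, without the "/proc/iomem: " prefix.
IOMEM_SAMPLE = """\
7fe00000000-7feffffffff : /pci@8,700000
7fe02000000-7fe02ffffff : atyfb
"""


def parse_iomem(text):
    """Parse 'start-end : name' lines into (start, end, name) tuples."""
    regions = []
    for line in text.splitlines():
        if " : " not in line:
            continue  # skip blank or unrecognised lines
        span, name = line.split(" : ", 1)
        start, end = (int(x, 16) for x in span.strip().split("-"))
        regions.append((start, end, name.strip()))
    return regions


def find_regions(regions, addr):
    """Return the names of all regions whose inclusive range contains addr."""
    return [name for start, end, name in regions if start <= addr <= end]


if __name__ == "__main__":
    regions = parse_iomem(IOMEM_SAMPLE)
    # An address inside the atyfb window, and one fault address from the log.
    for addr in (0x7FE02000010, 0x6CFD2E90):
        print(hex(addr), "->", find_regions(regions, addr) or "no listed region")
```

With these two sample entries, an address inside the atyfb window matches both the bus range and the framebuffer range, while the fault address 0x6cfd2e90 from the log matches neither of them.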
--- End Message ---