reassign 737616 src:linux
retitle 737616 sparc: CPU type error traps from the kernel from time to time and with alarmingly greater frequency when I try to start X11 using startx
thanks

On Tuesday, 4 February 2014, Paul Llanyod wrote:
> Package: base
> Severity: normal
>
> -- System Information:
> Debian Release: 7.3
> APT prefers stable-updates
> APT policy: (500, 'stable-updates'), (500, 'stable')
> Architecture: sparc (sparc64)
>
> Kernel: Linux 3.2.0-4-sparc64
> Locale: LANG=en_ZA.UTF-8, LC_CTYPE=en_ZA.UTF-8 (charmap=UTF-8)
> Shell: /bin/sh linked to /bin/dash
>
> I receive the following error or trap from time to time on my server (a
> Sunblade 2000) which is running in "headless mode". When I try to start
> the GUI using startx the error occurs so frequently that I cannot use the
> console screen. My hardware seems to be functioning perfectly - I have not
> succeeded in starting the GUI (which I would like to do).
>
> The error repeats for a while once it occurs and is shown below. If I type
> startx from the command line the error does not stop as readily and I have
> to reboot the machine to put it in a stable state.
>
> Jan 30 12:11:42 debian kernel: [ 952.957402] ERROR(0): Cheetah error trap taken afsr[000000020000005b] afar[000000006cfd2e90] TL1(0)
> Jan 30 12:11:42 debian kernel: [ 952.957956] ERROR(0): TPC[5d1e90] TNPC[5d1e94] O7[4c1c8c] TSTATE[11001601]
> Jan 30 12:11:42 debian kernel: [ 952.958380] ERROR(0): TPC<U3copy_to_user+0x190/0x500>
> Jan 30 12:11:42 debian kernel: [ 952.958724] ERROR(0): M_SYND(0), E_SYND(5b)
> Jan 30 12:11:42 debian kernel: [ 952.959003] ERROR(0): Highest priority error (0000000200000000) "HW corrected system bus data ECC error for read"
> Jan 30 12:11:42 debian kernel: [ 952.959617] ERROR(0): AFAR E-syndrome [J0304, pin 19]
> Jan 30 12:11:42 debian kernel: [ 952.959946] ERROR(0): D-cache idx[ee90] tag[000000000006cc1f] utag[0000000000004f07] stag[000000000006cc1e]
> Jan 30 12:11:42 debian kernel: [ 952.960531] ERROR(0): D-cache data0[b646cadc1a96df61] data1[0800000204020042] data2[fc0e771fb5c70a95] data3[00000000ff82ede4]
> Jan 30 12:11:42 debian kernel: [ 952.961204] ERROR(0): I-cache idx[0] tag[0000000000000000] utag[0000000000000000] stag[0000000000000000] u[0000000000000000] l[0000000000000000]
> Jan 30 12:11:42 debian kernel: [ 952.962387] ERROR(0): I-cache INSN0[0000000000000000] INSN1[0000000000000000] INSN2[0000000000000000] INSN3[0000000000000000]
> Jan 30 12:11:42 debian kernel: [ 952.963058] ERROR(0): I-cache INSN4[0000000000000000] INSN5[0000000000000000] INSN6[0000000000000000] INSN7[0000000000000000]
> Jan 30 12:11:42 debian kernel: [ 952.963727] ERROR(0): E-cache idx[6cfd2e80] tag[00000000000000e7]
> Jan 30 12:11:42 debian kernel: [ 952.964109] ERROR(0): E-cache data0[bca1876cc1e4352a] data1[67fe78d2554431b2] data2[091f48b54a9e7b15] data3[a596f28215531bba]
> Jan 30 12:11:42 debian kernel: [ 953.010971] ERROR(0): Cheetah error trap taken afsr[000000020000005b] afar[000000006cfeee90] TL1(0)
> Jan 30 12:11:42 debian kernel: [ 953.011524] ERROR(0): TPC[5d1e90] TNPC[5d1e94] O7[4c1c8c] TSTATE[11001601]
> Jan 30 12:11:42 debian kernel: [ 953.011948] ERROR(0): TPC<U3copy_to_user+0x190/0x500>
> Jan 30 12:11:42 debian kernel: [ 953.012292] ERROR(0): M_SYND(0), E_SYND(5b)
> Jan 30 12:11:42 debian kernel: [ 953.012572] ERROR(0): Highest priority error (0000000200000000) "HW corrected system bus data ECC error for read"
> Jan 30 12:11:42 debian kernel: [ 953.013185] ERROR(0): AFAR E-syndrome [J0304, pin 19]
> Jan 30 12:11:42 debian kernel: [ 953.013514] ERROR(0): D-cache idx[ee90] tag[000000000006cc1f] utag[0000000000004f07] stag[000000000006cc1e]
> Jan 30 12:11:42 debian kernel: [ 953.014098] ERROR(0): D-cache data0[b646cadc1a96df61] data1[0800000204020042] data2[fc0e771fb5c70a95] data3[00000000ff82ede4]
> Jan 30 12:11:42 debian kernel: [ 953.014772] ERROR(0): I-cache idx[0] tag[0000000000000000] utag[0000000000000000] stag[0000000000000000] u[0000000000000000] l[0000000000000000]
> Jan 30 12:11:42 debian kernel: [ 953.015953] ERROR(0): I-cache INSN0[0000000000000000] INSN1[0000000000000000] INSN2[0000000000000000] INSN3[0000000000000000]
> Jan 30 12:11:42 debian kernel: [ 953.016625] ERROR(0): I-cache INSN4[0000000000000000] INSN5[0000000000000000] INSN6[0000000000000000] INSN7[0000000000000000]
> Jan 30 12:11:42 debian kernel: [ 953.017294] ERROR(0): E-cache idx[6cfeee80] tag[0000000000000090]
> Jan 30 12:11:42 debian kernel: [ 953.017678] ERROR(0): E-cache data0[820461d8c4066008] data1[8728a002c2004003] data2[83286018c227bf58] data3[8338601880a06065]
> Jan 30 12:11:54 debian kernel: [ 964.663357] /pci@8,700000: Correctable Error, primary error type[DMA Read]
> Jan 30 12:11:54 debian kernel: [ 964.664223] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
> Jan 30 12:11:54 debian kernel: [ 964.664648] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
> Jan 30 12:11:54 debian kernel: [ 964.665124] /pci@8,700000: CE AFAR [000000006ccf2e80]
> Jan 30 12:11:54 debian kernel: [ 964.665449] /pci@8,700000: CE Secondary errors [(none)]
> Jan 30 12:11:54 debian kernel: [ 964.665825] /pci@8,700000: Correctable Error, primary error type[DMA Read]
> Jan 30 12:11:54 debian kernel: [ 964.666251] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
> Jan 30 12:11:54 debian kernel: [ 964.666672] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
> Jan 30 12:11:54 debian kernel: [ 964.667137] /pci@8,700000: CE AFAR [000000006cd0ee80]
> Jan 30 12:11:54 debian kernel: [ 964.667457] /pci@8,700000: CE Secondary errors [(DMA)]
> Jan 30 12:11:54 debian kernel: [ 964.667870] /pci@8,700000: Correctable Error, primary error type[DMA Read]
> Jan 30 12:11:54 debian kernel: [ 964.668306] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
> Jan 30 12:11:54 debian kernel: [ 964.668729] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
> Jan 30 12:11:54 debian kernel: [ 964.669204] /pci@8,700000: CE AFAR [000000006ce0ee80]
> Jan 30 12:11:54 debian kernel: [ 964.669530] /pci@8,700000: CE Secondary errors [(none)]
> Jan 30 12:11:54 debian kernel: [ 964.669909] /pci@8,700000: Correctable Error, primary error type[DMA Read]
> Jan 30 12:11:54 debian kernel: [ 964.670342] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
> Jan 30 12:11:54 debian kernel: [ 964.670764] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
> Jan 30 12:11:54 debian kernel: [ 964.671237] /pci@8,700000: CE AFAR [000000006ce82e80]
> Jan 30 12:11:54 debian kernel: [ 964.671558] /pci@8,700000: CE Secondary errors [(DMA)]
> Jan 30 12:11:54 debian kernel: [ 964.671920] /pci@8,700000: Correctable Error, primary error type[DMA Read]
> Jan 30 12:11:54 debian kernel: [ 964.672344] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
> Jan 30 12:11:54 debian kernel: [ 964.672758] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
> Jan 30 12:11:54 debian kernel: [ 964.673223] /pci@8,700000: CE AFAR [000000006cfa2e80]
> Jan 30 12:11:54 debian kernel: [ 964.673542] /pci@8,700000: CE Secondary errors [(DMA)]
>
> I would really appreciate it if you could let me know if there is a work
> around for this problem?
>
> The peripherals identified in lspci are
>
> uname -a: Linux debian 3.2.0-4-sparc64 #1 Debian 3.2.51-1 sparc64 GNU/Linux
> lspci -knn: 0000:00:01.0 VGA compatible controller [0300]: Advanced Micro Devices [AMD] nee ATI Rage XL [1002:4752] (rev 27)
> lspci -knn: Kernel driver in use: atyfb
> lspci -knn: 0000:00:05.0 Bridge [0680]: Oracle/SUN RIO EBUS [108e:1100] (rev 01)
> lspci -knn: 0000:00:05.1 Ethernet controller [0200]: Oracle/SUN RIO 10/100 Ethernet [eri] [108e:1101] (rev 01)
> lspci -knn: Kernel driver in use: gem
> lspci -knn: 0000:00:05.2 FireWire (IEEE 1394) [0c00]: Oracle/SUN RIO 1394 [108e:1102] (rev 01)
> lspci -knn: 0000:00:05.3 USB controller [0c03]: Oracle/SUN RIO USB [108e:1103] (rev 01)
> lspci -knn: Kernel driver in use: ohci_hcd
> lspci -knn: 0000:00:06.0 SCSI storage controller [0100]: LSI Logic / Symbios Logic 53c875 [1000:000f] (rev 37)
> lspci -knn: Kernel driver in use: sym53c8xx
> lspci -knn: 0000:00:06.1 SCSI storage controller [0100]: LSI Logic / Symbios Logic 53c875 [1000:000f] (rev 37)
> lspci -knn: Kernel driver in use: sym53c8xx
> lspci -knn: 0001:00:04.0 SCSI storage controller [0100]: QLogic Corp. QLA2200 64-bit Fibre Channel Adapter [1077:2200] (rev 05)
> lspci -knn: Kernel driver in use: qla2xxx
>
> PCI@8,700000 is the address of the vga card
>
> /proc/iomem: 7fe00000000-7feffffffff : /pci@8,700000
> /proc/iomem: 7fe000a0000-7fe000bffff : Video RAM area
> /proc/iomem: 7fe000c0000-7fe000c7fff : Video ROM
> /proc/iomem: 7fe000f0000-7fe000fffff : System ROM
> /proc/iomem: 7fe00100000-7fe0011ffff : sungem
> /proc/iomem: 7fe00124000-7fe00125fff : sym53c8xx
> /proc/iomem: 7fe00126000-7fe00127fff : sym53c8xx
> /proc/iomem: 7fe00128000-7fe00129fff : sym53c8xx
> /proc/iomem: 7fe0012a000-7fe0012bfff : sym53c8xx
> /proc/iomem: 7fe01000000-7fe01ffffff : ohci_hcd
> /proc/iomem: 7fe02000000-7fe02ffffff : atyfb
> /proc/iomem: 7fe7e400000-7fe7e40003f : sab
> /proc/iomem: 7fe7e400040-7fe7e40007f : sab
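
Both the CPU traps (E_SYND) and the /pci@8,700000 reports (ecc_sync) in the quoted log carry the value 5b, and a tally makes that kind of repetition easier to spot in a longer log. The following is only a rough sketch and not part of the original report: the function name summarize, the regular expressions, and the SAMPLE excerpt are assumptions made for illustration, written against the message formats quoted above. It counts and groups values and does not attempt to interpret them.

```python
import re
from collections import Counter

# A few lines copied from the quoted log; a real run would read the
# kernel log file instead (its path is not given in the report).
SAMPLE = """\
Jan 30 12:11:42 debian kernel: [ 952.957402] ERROR(0): Cheetah error trap taken afsr[000000020000005b] afar[000000006cfd2e90] TL1(0)
Jan 30 12:11:42 debian kernel: [ 952.958724] ERROR(0): M_SYND(0), E_SYND(5b)
Jan 30 12:11:42 debian kernel: [ 953.010971] ERROR(0): Cheetah error trap taken afsr[000000020000005b] afar[000000006cfeee90] TL1(0)
Jan 30 12:11:42 debian kernel: [ 953.012292] ERROR(0): M_SYND(0), E_SYND(5b)
Jan 30 12:11:54 debian kernel: [ 964.664648] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [ 964.665124] /pci@8,700000: CE AFAR [000000006ccf2e80]
"""

# Hypothetical patterns, derived only from the message text quoted above.
TRAP_RE = re.compile(r"Cheetah error trap taken afsr\[([0-9a-f]+)\] afar\[([0-9a-f]+)\]")
CPU_SYND_RE = re.compile(r"E_SYND\(([0-9a-f]+)\)")
PCI_SYND_RE = re.compile(r"ecc_sync\[([0-9a-f]+)\]")
PCI_AFAR_RE = re.compile(r"CE AFAR \[([0-9a-f]+)\]")


def summarize(text):
    """Count CPU traps and collect the addresses and syndromes seen in text."""
    traps = TRAP_RE.findall(text)  # list of (afsr, afar) pairs
    syndromes = Counter(CPU_SYND_RE.findall(text) + PCI_SYND_RE.findall(text))
    return {
        "cpu_traps": len(traps),
        "cpu_afars": sorted({afar for _afsr, afar in traps}),
        "pci_afars": sorted(set(PCI_AFAR_RE.findall(text))),
        "syndromes": syndromes,
    }


if __name__ == "__main__":
    for key, value in summarize(SAMPLE).items():
        print(key, value)
```

To use it on a complete log, pass the file contents to summarize() in place of SAMPLE.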