
Re: base: I receive CPU type error traps from the kernel from time to time and with alarmingly greater frequency when I try to start X11 using startx



reassign 737616 src:linux
retitle 737616 sparc: CPU type error traps from the kernel from time to time and with alarmingly 
greater frequency when I try to start X11 using startx
thanks

On Tuesday, 4 February 2014, Paul Llanyod wrote:
> Package: base
> Severity: normal
> 
> 
> 
> -- System Information:
> Debian Release: 7.3
>   APT prefers stable-updates
>   APT policy: (500, 'stable-updates'), (500, 'stable')
> Architecture: sparc (sparc64)
> 
> Kernel: Linux 3.2.0-4-sparc64
> Locale: LANG=en_ZA.UTF-8, LC_CTYPE=en_ZA.UTF-8 (charmap=UTF-8)
> Shell: /bin/sh linked to /bin/dash
> 
> I receive the following error or trap from time to time on my server (a
> Sunblade 2000) which is running in "headless mode". When I try to start
> the GUI using startx the error occurs so frequently that I cannot use the
> console screen. My hardware seems to be functioning perfectly - I have not
> succeeded in starting the GUI (which I would like to do).
> 
> The error repeats for a while once it occurs and is shown below. If I type
> startx from the command line the error does not stop as readily and I have
> to reboot the machine to put it in a stable state.
> 
> Jan 30 12:11:42 debian kernel: [  952.957402] ERROR(0): Cheetah error trap taken afsr[000000020000005b] afar[000000006cfd2e90] TL1(0)
> Jan 30 12:11:42 debian kernel: [  952.957956] ERROR(0): TPC[5d1e90] TNPC[5d1e94] O7[4c1c8c] TSTATE[11001601]
> Jan 30 12:11:42 debian kernel: [  952.958380] ERROR(0): TPC<U3copy_to_user+0x190/0x500>
> Jan 30 12:11:42 debian kernel: [  952.958724] ERROR(0): M_SYND(0),  E_SYND(5b)
> Jan 30 12:11:42 debian kernel: [  952.959003] ERROR(0): Highest priority error (0000000200000000) "HW corrected system bus data ECC error for read"
> Jan 30 12:11:42 debian kernel: [  952.959617] ERROR(0): AFAR E-syndrome [J0304, pin  19]
> Jan 30 12:11:42 debian kernel: [  952.959946] ERROR(0): D-cache idx[ee90] tag[000000000006cc1f] utag[0000000000004f07] stag[000000000006cc1e]
> Jan 30 12:11:42 debian kernel: [  952.960531] ERROR(0): D-cache data0[b646cadc1a96df61] data1[0800000204020042] data2[fc0e771fb5c70a95] data3[00000000ff82ede4]
> Jan 30 12:11:42 debian kernel: [  952.961204] ERROR(0): I-cache idx[0] tag[0000000000000000] utag[0000000000000000] stag[0000000000000000] u[0000000000000000] l[0000000000000000]
> Jan 30 12:11:42 debian kernel: [  952.962387] ERROR(0): I-cache INSN0[0000000000000000] INSN1[0000000000000000] INSN2[0000000000000000] INSN3[0000000000000000]
> Jan 30 12:11:42 debian kernel: [  952.963058] ERROR(0): I-cache INSN4[0000000000000000] INSN5[0000000000000000] INSN6[0000000000000000] INSN7[0000000000000000]
> Jan 30 12:11:42 debian kernel: [  952.963727] ERROR(0): E-cache idx[6cfd2e80] tag[00000000000000e7]
> Jan 30 12:11:42 debian kernel: [  952.964109] ERROR(0): E-cache data0[bca1876cc1e4352a] data1[67fe78d2554431b2] data2[091f48b54a9e7b15] data3[a596f28215531bba]
> Jan 30 12:11:42 debian kernel: [  953.010971] ERROR(0): Cheetah error trap taken afsr[000000020000005b] afar[000000006cfeee90] TL1(0)
> Jan 30 12:11:42 debian kernel: [  953.011524] ERROR(0): TPC[5d1e90] TNPC[5d1e94] O7[4c1c8c] TSTATE[11001601]
> Jan 30 12:11:42 debian kernel: [  953.011948] ERROR(0): TPC<U3copy_to_user+0x190/0x500>
> Jan 30 12:11:42 debian kernel: [  953.012292] ERROR(0): M_SYND(0),  E_SYND(5b)
> Jan 30 12:11:42 debian kernel: [  953.012572] ERROR(0): Highest priority error (0000000200000000) "HW corrected system bus data ECC error for read"
> Jan 30 12:11:42 debian kernel: [  953.013185] ERROR(0): AFAR E-syndrome [J0304, pin  19]
> Jan 30 12:11:42 debian kernel: [  953.013514] ERROR(0): D-cache idx[ee90] tag[000000000006cc1f] utag[0000000000004f07] stag[000000000006cc1e]
> Jan 30 12:11:42 debian kernel: [  953.014098] ERROR(0): D-cache data0[b646cadc1a96df61] data1[0800000204020042] data2[fc0e771fb5c70a95] data3[00000000ff82ede4]
> Jan 30 12:11:42 debian kernel: [  953.014772] ERROR(0): I-cache idx[0] tag[0000000000000000] utag[0000000000000000] stag[0000000000000000] u[0000000000000000] l[0000000000000000]
> Jan 30 12:11:42 debian kernel: [  953.015953] ERROR(0): I-cache INSN0[0000000000000000] INSN1[0000000000000000] INSN2[0000000000000000] INSN3[0000000000000000]
> Jan 30 12:11:42 debian kernel: [  953.016625] ERROR(0): I-cache INSN4[0000000000000000] INSN5[0000000000000000] INSN6[0000000000000000] INSN7[0000000000000000]
> Jan 30 12:11:42 debian kernel: [  953.017294] ERROR(0): E-cache idx[6cfeee80] tag[0000000000000090]
> Jan 30 12:11:42 debian kernel: [  953.017678] ERROR(0): E-cache data0[820461d8c4066008] data1[8728a002c2004003] data2[83286018c227bf58] data3[8338601880a06065]
> Jan 30 12:11:54 debian kernel: [  964.663357] /pci@8,700000: Correctable Error, primary error type[DMA Read]
> Jan 30 12:11:54 debian kernel: [  964.664223] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
> Jan 30 12:11:54 debian kernel: [  964.664648] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
> Jan 30 12:11:54 debian kernel: [  964.665124] /pci@8,700000: CE AFAR [000000006ccf2e80]
> Jan 30 12:11:54 debian kernel: [  964.665449] /pci@8,700000: CE Secondary errors [(none)]
> Jan 30 12:11:54 debian kernel: [  964.665825] /pci@8,700000: Correctable Error, primary error type[DMA Read]
> Jan 30 12:11:54 debian kernel: [  964.666251] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
> Jan 30 12:11:54 debian kernel: [  964.666672] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
> Jan 30 12:11:54 debian kernel: [  964.667137] /pci@8,700000: CE AFAR [000000006cd0ee80]
> Jan 30 12:11:54 debian kernel: [  964.667457] /pci@8,700000: CE Secondary errors [(DMA)]
> Jan 30 12:11:54 debian kernel: [  964.667870] /pci@8,700000: Correctable Error, primary error type[DMA Read]
> Jan 30 12:11:54 debian kernel: [  964.668306] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
> Jan 30 12:11:54 debian kernel: [  964.668729] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
> Jan 30 12:11:54 debian kernel: [  964.669204] /pci@8,700000: CE AFAR [000000006ce0ee80]
> Jan 30 12:11:54 debian kernel: [  964.669530] /pci@8,700000: CE Secondary errors [(none)]
> Jan 30 12:11:54 debian kernel: [  964.669909] /pci@8,700000: Correctable Error, primary error type[DMA Read]
> Jan 30 12:11:54 debian kernel: [  964.670342] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
> Jan 30 12:11:54 debian kernel: [  964.670764] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
> Jan 30 12:11:54 debian kernel: [  964.671237] /pci@8,700000: CE AFAR [000000006ce82e80]
> Jan 30 12:11:54 debian kernel: [  964.671558] /pci@8,700000: CE Secondary errors [(DMA)]
> Jan 30 12:11:54 debian kernel: [  964.671920] /pci@8,700000: Correctable Error, primary error type[DMA Read]
> Jan 30 12:11:54 debian kernel: [  964.672344] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
> Jan 30 12:11:54 debian kernel: [  964.672758] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
> Jan 30 12:11:54 debian kernel: [  964.673223] /pci@8,700000: CE AFAR [000000006cfa2e80]
> Jan 30 12:11:54 debian kernel: [  964.673542] /pci@8,700000: CE Secondary errors [(DMA)]
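
For anyone triaging a longer log of this kind, here is a minimal Python sketch that pulls the afsr/afar pairs and the PCI "CE AFAR" addresses out of the quoted text, even when the mail client has wrapped the lines. The bit masks are assumptions chosen to agree with what the log itself prints (E_SYND in the low 9 bits, M_SYND in bits 16-19, 0x0000000200000000 as the corrected-ECC flag); the function names are hypothetical and not taken from any existing tool.

```python
import re

# Assumed bit layout, chosen to match the values printed in the log above.
AFSR_CE = 0x0000000200000000      # shown as "Highest priority error"
E_SYND_MASK = 0x1FF               # low 9 bits, printed as E_SYND(5b)
M_SYND_SHIFT, M_SYND_MASK = 16, 0xF

TRAP_RE = re.compile(r"afsr\[([0-9a-f]+)\]\s+afar\[([0-9a-f]+)\]")
PCI_CE_RE = re.compile(r"CE\s+AFAR\s+\[([0-9a-f]+)\]")


def unquote(text):
    """Drop '> ' quote prefixes and join wrapped lines into one string."""
    lines = [re.sub(r"^>\s?", "", line) for line in text.splitlines()]
    return " ".join(lines)


def decode_afsr(afsr):
    """Split an AFSR value into the fields the kernel prints."""
    return {
        "ce": bool(afsr & AFSR_CE),
        "e_synd": afsr & E_SYND_MASK,
        "m_synd": (afsr >> M_SYND_SHIFT) & M_SYND_MASK,
    }


def summarize(text):
    """Return ([(afsr, afar), ...], [pci_ce_afar, ...]) as integers."""
    flat = unquote(text)
    traps = [(int(s, 16), int(a, 16)) for s, a in TRAP_RE.findall(flat)]
    pci = [int(a, 16) for a in PCI_CE_RE.findall(flat)]
    return traps, pci


# A few wrapped lines copied from the report, used only as sample input.
SAMPLE = """\
> Jan 30 12:11:42 debian kernel: [  952.957402] ERROR(0): Cheetah error trap
> taken afsr[000000020000005b] afar[000000006cfd2e90] TL1(0) Jan 30 12:11:42
> kernel: [  964.665124] /pci@8,700000: CE AFAR [000000006ccf2e80] Jan 30
> Jan 30 12:11:54 debian kernel: [  964.667137] /pci@8,700000: CE AFAR
> [000000006cd0ee80] Jan 30 12:11:54 debian kernel: [  964.667457]
"""

if __name__ == "__main__":
    traps, pci = summarize(SAMPLE)
    for afsr, afar in traps:
        print(hex(afar), decode_afsr(afsr))
    print([hex(a) for a in pci])
```

Feeding the whole syslog through `summarize` gives a quick list of the affected addresses, which may help show whether the errors cluster in one region of memory.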
> 
> I would really appreciate it if you could let me know whether there is a
> workaround for this problem.
> 
> The peripherals identified in lspci are
> 
> uname -a: Linux debian 3.2.0-4-sparc64 #1 Debian 3.2.51-1 sparc64 GNU/Linux
> lspci -knn: 0000:00:01.0 VGA compatible controller [0300]: Advanced Micro Devices [AMD] nee ATI Rage XL [1002:4752] (rev 27)
> lspci -knn: 	Kernel driver in use: atyfb
> lspci -knn: 0000:00:05.0 Bridge [0680]: Oracle/SUN RIO EBUS [108e:1100] (rev 01)
> lspci -knn: 0000:00:05.1 Ethernet controller [0200]: Oracle/SUN RIO 10/100 Ethernet [eri] [108e:1101] (rev 01)
> lspci -knn: 	Kernel driver in use: gem
> lspci -knn: 0000:00:05.2 FireWire (IEEE 1394) [0c00]: Oracle/SUN RIO 1394 [108e:1102] (rev 01)
> lspci -knn: 0000:00:05.3 USB controller [0c03]: Oracle/SUN RIO USB [108e:1103] (rev 01)
> lspci -knn: 	Kernel driver in use: ohci_hcd
> lspci -knn: 0000:00:06.0 SCSI storage controller [0100]: LSI Logic / Symbios Logic 53c875 [1000:000f] (rev 37)
> lspci -knn: 	Kernel driver in use: sym53c8xx
> lspci -knn: 0000:00:06.1 SCSI storage controller [0100]: LSI Logic / Symbios Logic 53c875 [1000:000f] (rev 37)
> lspci -knn: 	Kernel driver in use: sym53c8xx
> lspci -knn: 0001:00:04.0 SCSI storage controller [0100]: QLogic Corp. QLA2200 64-bit Fibre Channel Adapter [1077:2200] (rev 05)
> lspci -knn: 	Kernel driver in use: qla2xxx
> 
> PCI@8,700000 is the address of the vga card
> 
> /proc/iomem: 7fe00000000-7feffffffff : /pci@8,700000
> /proc/iomem:   7fe000a0000-7fe000bffff : Video RAM area
> /proc/iomem:   7fe000c0000-7fe000c7fff : Video ROM
> /proc/iomem:   7fe000f0000-7fe000fffff : System ROM
> /proc/iomem:   7fe00100000-7fe0011ffff : sungem
> /proc/iomem:   7fe00124000-7fe00125fff : sym53c8xx
> /proc/iomem:   7fe00126000-7fe00127fff : sym53c8xx
> /proc/iomem:   7fe00128000-7fe00129fff : sym53c8xx
> /proc/iomem:   7fe0012a000-7fe0012bfff : sym53c8xx
> /proc/iomem:   7fe01000000-7fe01ffffff : ohci_hcd
> /proc/iomem:   7fe02000000-7fe02ffffff : atyfb
> /proc/iomem:   7fe7e400000-7fe7e40003f : sab
> /proc/iomem:   7fe7e400040-7fe7e40007f : sab
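
To check which of the regions listed above a reported address belongs to, a small Python sketch along these lines may be useful. It parses `/proc/iomem`-style lines (with or without the `> /proc/iomem:` prefix used in the report) and returns the names of every range containing a given address. The helper names are hypothetical. For the AFAR values in the log, such as 0x6cfd2e90, it returns nothing, since they lie below the start of the /pci@8,700000 window; that would be consistent with addresses in main memory, although this is an interpretation and not something the log states.

```python
import re

# Matches "start-end : name" anywhere in a line, so quote markers and the
# "/proc/iomem:" prefix from the report are simply skipped over.
RANGE_RE = re.compile(r"([0-9a-f]+)-([0-9a-f]+) : (.+)$")


def parse_iomem(text):
    """Return a list of (start, end, name) tuples from iomem-style text."""
    ranges = []
    for line in text.splitlines():
        m = RANGE_RE.search(line)
        if m:
            start, end = int(m.group(1), 16), int(m.group(2), 16)
            ranges.append((start, end, m.group(3).strip()))
    return ranges


def owners(addr, ranges):
    """Names of all ranges (outermost first) that contain addr."""
    return [name for start, end, name in ranges if start <= addr <= end]


# Three lines copied from the report, used only as sample input.
SAMPLE = """\
> /proc/iomem: 7fe00000000-7feffffffff : /pci@8,700000
> /proc/iomem:   7fe000a0000-7fe000bffff : Video RAM area
> /proc/iomem:   7fe02000000-7fe02ffffff : atyfb
"""

if __name__ == "__main__":
    ranges = parse_iomem(SAMPLE)
    print(owners(0x7FE02000010, ranges))  # an address inside the atyfb range
    print(owners(0x6CFD2E90, ranges))     # first AFAR from the Cheetah trap
```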




