
Bug#737616: marked as done (sparc: CPU type error traps from the kernel from time to time and with greater frequency when starting X11 using startx)



Your message dated Sat, 24 Apr 2021 12:21:14 -0700 (PDT)
with message-id <60846faa.1c69fb81.ab66f.53a8@mx.google.com>
and subject line Closing this bug (BTS maintenance for src:linux bugs)
has caused the Debian Bug report #737616,
regarding sparc: CPU type error traps from the kernel from time to time and with greater frequency when starting X11 using startx
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact owner@bugs.debian.org
immediately.)


-- 
737616: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=737616
Debian Bug Tracking System
Contact owner@bugs.debian.org with problems
--- Begin Message ---
Package: base
Severity: normal



-- System Information:
Debian Release: 7.3
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'stable')
Architecture: sparc (sparc64)

Kernel: Linux 3.2.0-4-sparc64
Locale: LANG=en_ZA.UTF-8, LC_CTYPE=en_ZA.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

I receive the following error trap from time to time on my server (a Sun Blade 2000), which runs in "headless mode". When I try to start the
GUI with startx, the error occurs so frequently that I cannot use the console screen. My hardware otherwise seems to be functioning perfectly,
but I have not succeeded in starting the GUI (which I would like to do).

Once the error occurs, it repeats for a while; the output is shown below. If I type startx at the command line, the error does not stop as
readily, and I have to reboot the machine to return it to a stable state.

Jan 30 12:11:42 debian kernel: [  952.957402] ERROR(0): Cheetah error trap taken afsr[000000020000005b] afar[000000006cfd2e90] TL1(0)
Jan 30 12:11:42 debian kernel: [  952.957956] ERROR(0): TPC[5d1e90] TNPC[5d1e94] O7[4c1c8c] TSTATE[11001601]
Jan 30 12:11:42 debian kernel: [  952.958380] ERROR(0): TPC<U3copy_to_user+0x190/0x500>
Jan 30 12:11:42 debian kernel: [  952.958724] ERROR(0): M_SYND(0),  E_SYND(5b)
Jan 30 12:11:42 debian kernel: [  952.959003] ERROR(0): Highest priority error (0000000200000000) "HW corrected system bus data ECC error for read"
Jan 30 12:11:42 debian kernel: [  952.959617] ERROR(0): AFAR E-syndrome [J0304, pin  19]
Jan 30 12:11:42 debian kernel: [  952.959946] ERROR(0): D-cache idx[ee90] tag[000000000006cc1f] utag[0000000000004f07] stag[000000000006cc1e]
Jan 30 12:11:42 debian kernel: [  952.960531] ERROR(0): D-cache data0[b646cadc1a96df61] data1[0800000204020042] data2[fc0e771fb5c70a95] data3[00000000ff82ede4]
Jan 30 12:11:42 debian kernel: [  952.961204] ERROR(0): I-cache idx[0] tag[0000000000000000] utag[0000000000000000] stag[0000000000000000] u[0000000000000000] l[0000000000000000]
Jan 30 12:11:42 debian kernel: [  952.962387] ERROR(0): I-cache INSN0[0000000000000000] INSN1[0000000000000000] INSN2[0000000000000000] INSN3[0000000000000000]
Jan 30 12:11:42 debian kernel: [  952.963058] ERROR(0): I-cache INSN4[0000000000000000] INSN5[0000000000000000] INSN6[0000000000000000] INSN7[0000000000000000]
Jan 30 12:11:42 debian kernel: [  952.963727] ERROR(0): E-cache idx[6cfd2e80] tag[00000000000000e7]
Jan 30 12:11:42 debian kernel: [  952.964109] ERROR(0): E-cache data0[bca1876cc1e4352a] data1[67fe78d2554431b2] data2[091f48b54a9e7b15] data3[a596f28215531bba]
Jan 30 12:11:42 debian kernel: [  953.010971] ERROR(0): Cheetah error trap taken afsr[000000020000005b] afar[000000006cfeee90] TL1(0)
Jan 30 12:11:42 debian kernel: [  953.011524] ERROR(0): TPC[5d1e90] TNPC[5d1e94] O7[4c1c8c] TSTATE[11001601]
Jan 30 12:11:42 debian kernel: [  953.011948] ERROR(0): TPC<U3copy_to_user+0x190/0x500>
Jan 30 12:11:42 debian kernel: [  953.012292] ERROR(0): M_SYND(0),  E_SYND(5b)
Jan 30 12:11:42 debian kernel: [  953.012572] ERROR(0): Highest priority error (0000000200000000) "HW corrected system bus data ECC error for read"
Jan 30 12:11:42 debian kernel: [  953.013185] ERROR(0): AFAR E-syndrome [J0304, pin  19]
Jan 30 12:11:42 debian kernel: [  953.013514] ERROR(0): D-cache idx[ee90] tag[000000000006cc1f] utag[0000000000004f07] stag[000000000006cc1e]
Jan 30 12:11:42 debian kernel: [  953.014098] ERROR(0): D-cache data0[b646cadc1a96df61] data1[0800000204020042] data2[fc0e771fb5c70a95] data3[00000000ff82ede4]
Jan 30 12:11:42 debian kernel: [  953.014772] ERROR(0): I-cache idx[0] tag[0000000000000000] utag[0000000000000000] stag[0000000000000000] u[0000000000000000] l[0000000000000000]
Jan 30 12:11:42 debian kernel: [  953.015953] ERROR(0): I-cache INSN0[0000000000000000] INSN1[0000000000000000] INSN2[0000000000000000] INSN3[0000000000000000]
Jan 30 12:11:42 debian kernel: [  953.016625] ERROR(0): I-cache INSN4[0000000000000000] INSN5[0000000000000000] INSN6[0000000000000000] INSN7[0000000000000000]
Jan 30 12:11:42 debian kernel: [  953.017294] ERROR(0): E-cache idx[6cfeee80] tag[0000000000000090]
Jan 30 12:11:42 debian kernel: [  953.017678] ERROR(0): E-cache data0[820461d8c4066008] data1[8728a002c2004003] data2[83286018c227bf58] data3[8338601880a06065]
Jan 30 12:11:54 debian kernel: [  964.663357] /pci@8,700000: Correctable Error, primary error type[DMA Read]
Jan 30 12:11:54 debian kernel: [  964.664223] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
Jan 30 12:11:54 debian kernel: [  964.664648] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [  964.665124] /pci@8,700000: CE AFAR [000000006ccf2e80]
Jan 30 12:11:54 debian kernel: [  964.665449] /pci@8,700000: CE Secondary errors [(none)]
Jan 30 12:11:54 debian kernel: [  964.665825] /pci@8,700000: Correctable Error, primary error type[DMA Read]
Jan 30 12:11:54 debian kernel: [  964.666251] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
Jan 30 12:11:54 debian kernel: [  964.666672] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [  964.667137] /pci@8,700000: CE AFAR [000000006cd0ee80]
Jan 30 12:11:54 debian kernel: [  964.667457] /pci@8,700000: CE Secondary errors [(DMA)]
Jan 30 12:11:54 debian kernel: [  964.667870] /pci@8,700000: Correctable Error, primary error type[DMA Read]
Jan 30 12:11:54 debian kernel: [  964.668306] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
Jan 30 12:11:54 debian kernel: [  964.668729] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [  964.669204] /pci@8,700000: CE AFAR [000000006ce0ee80]
Jan 30 12:11:54 debian kernel: [  964.669530] /pci@8,700000: CE Secondary errors [(none)]
Jan 30 12:11:54 debian kernel: [  964.669909] /pci@8,700000: Correctable Error, primary error type[DMA Read]
Jan 30 12:11:54 debian kernel: [  964.670342] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
Jan 30 12:11:54 debian kernel: [  964.670764] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [  964.671237] /pci@8,700000: CE AFAR [000000006ce82e80]
Jan 30 12:11:54 debian kernel: [  964.671558] /pci@8,700000: CE Secondary errors [(DMA)]
Jan 30 12:11:54 debian kernel: [  964.671920] /pci@8,700000: Correctable Error, primary error type[DMA Read]
Jan 30 12:11:54 debian kernel: [  964.672344] /pci@8,700000: bytemask[0000] qword_offset[1] SAFARI_AID[08]
Jan 30 12:11:54 debian kernel: [  964.672758] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [  964.673223] /pci@8,700000: CE AFAR [000000006cfa2e80]
Jan 30 12:11:54 debian kernel: [  964.673542] /pci@8,700000: CE Secondary errors [(DMA)]
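
The CPU traps and the PCI correctable errors above can be compared mechanically. The following is a minimal Python sketch, not a kernel tool: the names decode_afsr and summarize are hypothetical, and the syndrome masks (low 9 bits for E_SYND, bits 16-19 for M_SYND) are assumptions chosen to agree with the "M_SYND(0), E_SYND(5b)" line in the log. It uses three lines copied from the log as sample input.

```python
import re

# Three lines copied verbatim from the log above: one CPU trap and
# one PCI correctable-error record (syndrome line plus AFAR line).
SAMPLE_LOG = """\
Jan 30 12:11:42 debian kernel: [  952.957402] ERROR(0): Cheetah error trap taken afsr[000000020000005b] afar[000000006cfd2e90] TL1(0)
Jan 30 12:11:54 debian kernel: [  964.664648] /pci@8,700000: partial[0] owned_in[0] mtag[0] mtag_synd[0] ecc_sync[5b]
Jan 30 12:11:54 debian kernel: [  964.665124] /pci@8,700000: CE AFAR [000000006ccf2e80]
"""

# Assumed field layout (see lead-in): E_SYND in bits 0-8, M_SYND in bits 16-19.
E_SYND_MASK = 0x1FF
M_SYND_MASK = 0xF << 16

CPU_TRAP = re.compile(r"afsr\[([0-9a-f]+)\] afar\[([0-9a-f]+)\]")
PCI_SYND = re.compile(r"ecc_sync\[([0-9a-f]+)\]")
PCI_AFAR = re.compile(r"CE AFAR \[([0-9a-f]+)\]")


def decode_afsr(afsr):
    """Split an AFSR value into (status bits, M syndrome, E syndrome)."""
    e_synd = afsr & E_SYND_MASK
    m_synd = (afsr & M_SYND_MASK) >> 16
    status = afsr & ~(E_SYND_MASK | M_SYND_MASK)
    return status, m_synd, e_synd


def summarize(log):
    """Collect the distinct ECC syndromes and all fault addresses in a log."""
    syndromes, afars = set(), []
    for line in log.splitlines():
        m = CPU_TRAP.search(line)
        if m:
            syndromes.add(decode_afsr(int(m.group(1), 16))[2])
            afars.append(int(m.group(2), 16))
            continue
        m = PCI_SYND.search(line)
        if m:
            syndromes.add(int(m.group(1), 16))
            continue
        m = PCI_AFAR.search(line)
        if m:
            afars.append(int(m.group(1), 16))
    return syndromes, afars


if __name__ == "__main__":
    syndromes, afars = summarize(SAMPLE_LOG)
    print("ECC syndromes seen:", sorted(hex(s) for s in syndromes))
    print("fault addresses:", [hex(a) for a in afars])
```

Run against the full log, this kind of summary makes it easy to see whether the CPU and the PCI controller report the same syndrome and nearby addresses, which is the case in the excerpt above (syndrome 5b in both).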

I would really appreciate it if you could let me know whether there is a workaround for this problem.

The peripherals identified by lspci are:

uname -a: Linux debian 3.2.0-4-sparc64 #1 Debian 3.2.51-1 sparc64 GNU/Linux
lspci -knn: 0000:00:01.0 VGA compatible controller [0300]: Advanced Micro Devices [AMD] nee ATI Rage XL [1002:4752] (rev 27)
lspci -knn: 	Kernel driver in use: atyfb
lspci -knn: 0000:00:05.0 Bridge [0680]: Oracle/SUN RIO EBUS [108e:1100] (rev 01)
lspci -knn: 0000:00:05.1 Ethernet controller [0200]: Oracle/SUN RIO 10/100 Ethernet [eri] [108e:1101] (rev 01)
lspci -knn: 	Kernel driver in use: gem
lspci -knn: 0000:00:05.2 FireWire (IEEE 1394) [0c00]: Oracle/SUN RIO 1394 [108e:1102] (rev 01)
lspci -knn: 0000:00:05.3 USB controller [0c03]: Oracle/SUN RIO USB [108e:1103] (rev 01)
lspci -knn: 	Kernel driver in use: ohci_hcd
lspci -knn: 0000:00:06.0 SCSI storage controller [0100]: LSI Logic / Symbios Logic 53c875 [1000:000f] (rev 37)
lspci -knn: 	Kernel driver in use: sym53c8xx
lspci -knn: 0000:00:06.1 SCSI storage controller [0100]: LSI Logic / Symbios Logic 53c875 [1000:000f] (rev 37)
lspci -knn: 	Kernel driver in use: sym53c8xx
lspci -knn: 0001:00:04.0 SCSI storage controller [0100]: QLogic Corp. QLA2200 64-bit Fibre Channel Adapter [1077:2200] (rev 05)
lspci -knn: 	Kernel driver in use: qla2xxx

/pci@8,700000 is the PCI bus node that hosts the VGA card (atyfb):

/proc/iomem: 7fe00000000-7feffffffff : /pci@8,700000
/proc/iomem:   7fe000a0000-7fe000bffff : Video RAM area
/proc/iomem:   7fe000c0000-7fe000c7fff : Video ROM
/proc/iomem:   7fe000f0000-7fe000fffff : System ROM
/proc/iomem:   7fe00100000-7fe0011ffff : sungem
/proc/iomem:   7fe00124000-7fe00125fff : sym53c8xx
/proc/iomem:   7fe00126000-7fe00127fff : sym53c8xx
/proc/iomem:   7fe00128000-7fe00129fff : sym53c8xx
/proc/iomem:   7fe0012a000-7fe0012bfff : sym53c8xx
/proc/iomem:   7fe01000000-7fe01ffffff : ohci_hcd
/proc/iomem:   7fe02000000-7fe02ffffff : atyfb
/proc/iomem:   7fe7e400000-7fe7e40003f : sab
/proc/iomem:   7fe7e400040-7fe7e40007f : sab
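
One way to check whether a reported fault address lies inside a device window is to look it up in the /proc/iomem listing. The sketch below is an illustration only: parse_iomem and owners are hypothetical helper names, and the input is a three-line subset of the listing above, including the "/proc/iomem:" prefix from the paste.

```python
# Subset of the /proc/iomem listing above, copied with its paste prefix.
IOMEM = """\
/proc/iomem: 7fe00000000-7feffffffff : /pci@8,700000
/proc/iomem:   7fe000a0000-7fe000bffff : Video RAM area
/proc/iomem:   7fe02000000-7fe02ffffff : atyfb
"""

PREFIX = "/proc/iomem:"


def parse_iomem(text):
    """Return a list of (start, end, name) tuples, one per listed region."""
    regions = []
    for line in text.splitlines():
        if line.startswith(PREFIX):
            line = line[len(PREFIX):]
        line = line.strip()
        if not line:
            continue
        span, _, name = line.partition(" : ")
        start, _, end = span.partition("-")
        regions.append((int(start, 16), int(end, 16), name))
    return regions


def owners(addr, regions):
    """Names of every region whose inclusive range contains addr."""
    return [name for start, end, name in regions if start <= addr <= end]


if __name__ == "__main__":
    regions = parse_iomem(IOMEM)
    # First "CE AFAR" value from the kernel log in this report.
    print(owners(0x6CCF2E80, regions))
    # An address inside the atyfb window, for comparison.
    print(owners(0x7FE02000000, regions))
```

With this subset, the first CE AFAR from the log (6ccf2e80) matches none of the listed windows, which would be consistent with the "DMA Read" error type referring to an address in system memory rather than on the card. That reading is an inference from the log, not something the report confirms.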

--- End Message ---
--- Begin Message ---
Hi

This bug was filed against a very old kernel, or the bug itself is old
and has had no resolution.

If you can reproduce it with

- the current version in unstable/testing
- the latest kernel from backports

please reopen the bug, see https://www.debian.org/Bugs/server-control
for details.

Regards,
Salvatore

--- End Message ---
