[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#854855: linux-image-4.9.0-1-amd64: GPU hang (after resuming from hibernation) in Xorg since 4.9.0 [with patch]



Package: src:linux
Version: 4.9.6-3
Severity: normal

Hello,

Since kernel 4.9.0 replaced 4.8.0 in stretch, my notebook was unable
to resume from hibernation. Hardware is a Lenovo Thinkpad X1 Carbon
Gen4. The issue first came up with 4.9.2, but is still present in the 4.9.6 version of linux-image-4.9.0-1-amd64.

While the system appears to succesfully return from hibernation, soon as the system switch to X it starts freezing and locks more and more.

At the same time, the following messages appear in the kernel ring buffer:

[   62.815665] [drm] GPU HANG: ecode 9:1:0xc28d04b4, in Xorg [1830],
reason: Hang on blitter ring, action: reset
[   62.815675] [drm] GPU hangs can indicate a bug anywhere in the entire
gfx stack, including userspace.
[   62.815680] [drm] Please file a _new_ bug report on
bugs.freedesktop.org against DRI -> DRM/Intel
[   62.815683] [drm] drm/i915 developers can then reassign to the right
component if it's not a kernel issue.
[   62.815687] [drm] The gpu crash dump is required to analyze gpu
hangs, so please always attach it.
[   62.815691] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[   63.575242] drm/i915: Resetting chip after gpu hang
[   63.575437] [drm] RC6 on
[   63.591843] [drm] GuC firmware load skipped


As being adviced by the message, I have opened an upstream bug at [1],
got then redirected to a known bug at [2].

And indeed, rebuilding a custom ⒋.9.6 kernel and applying the patch
from [3] to intel_lrc.c resolves my problem.

Then my system resumes successfully from hibernation without these
freezes.

Cheers,
Andreas

[1] https://bugs.freedesktop.org/show_bug.cgi?id=99545
[2] https://bugs.freedesktop.org/show_bug.cgi?id=96526
[3] https://patchwork.freedesktop.org/patch/111587/


-- Package-specific info:
** Version:
Linux version 4.9.0-1-amd64 (debian-kernel@lists.debian.org) (gcc version 6.3.0 20170124 (Debian 6.3.0-5) ) #1 SMP Debian 4.9.6-3 (2017-01-28)

** Command line:
BOOT_IMAGE=/vmlinuz-4.9.0-1-amd64 root=/dev/mapper/carbon-root ro resume=UUID=b1a930e8-b6f2-481e-8478-cddd551b795f intel_iommu=igfx_off

** Not tainted

** Kernel log:
[ 62.815665] [drm] GPU HANG: ecode 9:1:0xc28d04b4, in Xorg [1830], reason: Hang on blitter ring, action: reset [ 62.815675] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace. [ 62.815680] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel [ 62.815683] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue. [ 62.815687] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[   62.815691] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[   63.575242] drm/i915: Resetting chip after gpu hang
[   63.575437] [drm] RC6 on
[   63.591843] [drm] GuC firmware load skipped

** Model information
sys_vendor: LENOVO
product_name: 20FB003RGE
product_version: ThinkPad X1 Carbon 4th
chassis_vendor: LENOVO
chassis_version: None
bios_vendor: LENOVO
bios_version: N1FET43W (1.17 )
board_vendor: LENOVO
board_name: 20FB003RGE
board_version: SDK0J40705 WIN

** Loaded modules:
fuse
rfcomm
ip6table_mangle
nf_log_ipv6
nf_conntrack_ipv6
nf_defrag_ipv6
xt_connmark
iptable_mangle
xt_helper
nf_log_ipv4
nf_log_common
xt_LOG
xt_limit
nf_conntrack_ipv4
nf_defrag_ipv4
xt_tcpudp
xt_addrtype
xt_conntrack
nf_conntrack_sip
nf_conntrack_ftp
nf_conntrack_irc
nf_conntrack_pptp
nf_conntrack_proto_gre
nf_conntrack
ctr
ccm
ebtable_filter
ebtables
ip6table_filter
ip6_tables
iptable_filter
snd_hrtimer
snd_seq
snd_seq_device
cpufreq_userspace
cpufreq_powersave
cmac
cpufreq_conservative
bnep
binfmt_misc
algif_skcipher
af_alg
dm_crypt
hid_sensor_accel_3d
hid_sensor_trigger
hid_sensor_iio_common
industrialio_triggered_buffer
kfifo_buf
arc4
industrialio
acer_wmi
sparse_keymap
ext4
jbd2
fscrypto
ecb
mbcache
intel_rapl
x86_pkg_temp_thermal
intel_powerclamp
kvm_intel
kvm
irqbypass
crct10dif_pclmul
crc32_pclmul
iwlmvm
ghash_clmulni_intel
snd_hda_codec_hdmi
intel_cstate
intel_uncore
mac80211
snd_hda_codec_conexant
snd_hda_codec_generic
snd_soc_skl
intel_rapl_perf
snd_soc_skl_ipc
snd_soc_sst_ipc
snd_soc_sst_dsp
efi_pstore
snd_hda_ext_core
snd_soc_sst_match
snd_soc_core
snd_compress
snd_hda_intel
serio_raw
pcspkr
snd_hda_codec
efivars
iwlwifi
snd_hda_core
iTCO_wdt
snd_hwdep
iTCO_vendor_support
snd_pcm
rtsx_pci_ms
cfg80211
snd_timer
memstick
sg
joydev
uvcvideo
shpchp
videobuf2_vmalloc
videobuf2_memops
videobuf2_v4l2
cdc_mbim
videobuf2_core
cdc_wdm
qcserial
cdc_ncm
videodev
usb_wwan
btusb
usbnet
btrtl
mii
btbcm
usbserial
btintel
media
mei_me
mei
bluetooth
intel_pch_thermal
crc16
hid_sensor_hub
thinkpad_acpi
nvram
snd
soundcore
rfkill
battery
ac
wmi
evdev
tpm_tis
tpm_tis_core
tpm
auth_rpcgss
sunrpc
idma64
virt_dma
coretemp
efivarfs
ip_tables
x_tables
autofs4
xfs
libcrc32c
btrfs
crc32c_generic
xor
raid6_pq
nvme
nvme_core
hid_generic
usbhid
dm_mod
sd_mod
intel_ishtp_hid
hid
rtsx_pci_sdmmc
mmc_core
crc32c_intel
rtsx_pci
i915
mfd_core
aesni_intel
e1000e
aes_x86_64
glue_helper
lrw
gf128mul
ablk_helper
cryptd
psmouse
ahci
libahci
i2c_i801
i2c_smbus
libata
xhci_pci
xhci_hcd
ptp
i2c_algo_bit
pps_core
drm_kms_helper
usbcore
scsi_mod
drm
usb_common
intel_ish_ipc
intel_ishtp
thermal
video
button
fjes

** PCI devices:
00:00.0 Host bridge [0600]: Intel Corporation Skylake Host Bridge/DRAM Registers [8086:1904] (rev 08)
	Subsystem: Lenovo Skylake Host Bridge/DRAM Registers [17aa:2238]
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ >SERR- <PERR- INTx-
	Latency: 0
	Capabilities: <access denied>
	Kernel driver in use: skl_uncore

00:02.0 VGA compatible controller [0300]: Intel Corporation HD Graphics 520 [8086:1916] (rev 07) (prog-if 00 [VGA controller])
	Subsystem: Lenovo HD Graphics 520 [17aa:2238]
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 126
	Region 0: Memory at e0000000 (64-bit, non-prefetchable) [size=16M]
	Region 2: Memory at c0000000 (64-bit, prefetchable) [size=512M]
	Region 4: I/O ports at e000 [size=64]
	[virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
	Capabilities: <access denied>
	Kernel driver in use: i915
	Kernel modules: i915

00:08.0 System peripheral [0880]: Intel Corporation Skylake Gaussian Mixture Model [8086:1911]
	Subsystem: Lenovo Skylake Gaussian Mixture Model [17aa:2238]
Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Interrupt: pin A routed to IRQ 255
Region 0: Memory at e124a000 (64-bit, non-prefetchable) [disabled] [size=4K]
	Capabilities: <access denied>

00:13.0 Non-VGA unclassified device [0000]: Intel Corporation Device [8086:9d35] (rev 21)
	Subsystem: Lenovo Device [17aa:2238]
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 20
	Region 0: Memory at e124b000 (64-bit, non-prefetchable) [size=4K]
	Capabilities: <access denied>
	Kernel driver in use: intel_ish_ipc
	Kernel modules: intel_ish_ipc

00:14.0 USB controller [0c03]: Intel Corporation Sunrise Point-LP USB 3.0 xHCI Controller [8086:9d2f] (rev 21) (prog-if 30 [XHCI])
	Subsystem: Lenovo Sunrise Point-LP USB 3.0 xHCI Controller [17aa:2238]
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 123
	Region 0: Memory at e1220000 (64-bit, non-prefetchable) [size=64K]
	Capabilities: <access denied>
	Kernel driver in use: xhci_hcd
	Kernel modules: xhci_pci

00:14.2 Signal processing controller [1180]: Intel Corporation Sunrise Point-LP Thermal subsystem [8086:9d31] (rev 21)
	Subsystem: Lenovo Sunrise Point-LP Thermal subsystem [17aa:2238]
Control: I/O- Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Interrupt: pin C routed to IRQ 18
	Region 0: Memory at e124c000 (64-bit, non-prefetchable) [size=4K]
	Capabilities: <access denied>
	Kernel driver in use: intel_pch_thermal
	Kernel modules: intel_pch_thermal

00:16.0 Communication controller [0780]: Intel Corporation Sunrise Point-LP CSME HECI #1 [8086:9d3a] (rev 21)
	Subsystem: Lenovo Sunrise Point-LP CSME HECI [17aa:2238]
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 125
	Region 0: Memory at e124d000 (64-bit, non-prefetchable) [size=4K]
	Capabilities: <access denied>
	Kernel driver in use: mei_me
	Kernel modules: mei_me

00:17.0 SATA controller [0106]: Intel Corporation Sunrise Point-LP SATA Controller [AHCI mode] [8086:9d03] (rev 21) (prog-if 01 [AHCI 1.0])
	Subsystem: Lenovo Sunrise Point-LP SATA Controller [AHCI mode] [17aa:2238]
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 124
	Region 0: Memory at e1248000 (32-bit, non-prefetchable) [size=8K]
	Region 1: Memory at e1250000 (32-bit, non-prefetchable) [size=256]
	Region 2: I/O ports at e080 [size=8]
	Region 3: I/O ports at e088 [size=4]
	Region 4: I/O ports at e060 [size=32]
	Region 5: Memory at e124e000 (32-bit, non-prefetchable) [size=2K]
	Capabilities: <access denied>
	Kernel driver in use: ahci
	Kernel modules: ahci

00:1c.0 PCI bridge [0604]: Intel Corporation Device [8086:9d10] (rev f1) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 16
	Bus: primary=00, secondary=02, subordinate=02, sec-latency=0
	Memory behind bridge: e1100000-e11fffff
Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ <SERR- <PERR-
	BridgeCtl: Parity- SERR- NoISA- VGA- MAbort- >Reset- FastB2B-
		PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
	Capabilities: <access denied>
	Kernel driver in use: pcieport
	Kernel modules: shpchp

00:1c.2 PCI bridge [0604]: Intel Corporation Device [8086:9d12] (rev f1) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin C routed to IRQ 18
	Bus: primary=00, secondary=04, subordinate=04, sec-latency=0
	Memory behind bridge: e1000000-e10fffff
Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ <SERR- <PERR-
	BridgeCtl: Parity- SERR- NoISA- VGA- MAbort- >Reset- FastB2B-
		PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
	Capabilities: <access denied>
	Kernel driver in use: pcieport
	Kernel modules: shpchp

00:1f.0 ISA bridge [0601]: Intel Corporation Sunrise Point-LP LPC Controller [8086:9d48] (rev 21)
	Subsystem: Lenovo Sunrise Point-LP LPC Controller [17aa:2238]
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0

00:1f.2 Memory controller [0580]: Intel Corporation Sunrise Point-LP PMC [8086:9d21] (rev 21)
	Subsystem: Lenovo Sunrise Point-LP PMC [17aa:2238]
Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Region 0: Memory at e1244000 (32-bit, non-prefetchable) [disabled] [size=16K]

00:1f.3 Audio device [0403]: Intel Corporation Sunrise Point-LP HD Audio [8086:9d70] (rev 21)
	Subsystem: Lenovo Sunrise Point-LP HD Audio [17aa:2238]
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 64
	Interrupt: pin A routed to IRQ 127
	Region 0: Memory at e1240000 (64-bit, non-prefetchable) [size=16K]
	Region 4: Memory at e1230000 (64-bit, non-prefetchable) [size=64K]
	Capabilities: <access denied>
	Kernel driver in use: snd_hda_intel
	Kernel modules: snd_hda_intel, snd_soc_skl

00:1f.4 SMBus [0c05]: Intel Corporation Sunrise Point-LP SMBus [8086:9d23] (rev 21)
	Subsystem: Lenovo Sunrise Point-LP SMBus [17aa:2238]
Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Interrupt: pin A routed to IRQ 16
	Region 0: Memory at e124f000 (64-bit, non-prefetchable) [size=256]
	Region 4: I/O ports at efa0 [size=32]
	Kernel driver in use: i801_smbus
	Kernel modules: i2c_i801

00:1f.6 Ethernet controller [0200]: Intel Corporation Ethernet Connection I219-V [8086:1570] (rev 21)
	Subsystem: Lenovo Ethernet Connection I219-V [17aa:2233]
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 129
	Region 0: Memory at e1200000 (32-bit, non-prefetchable) [size=128K]
	Capabilities: <access denied>
	Kernel driver in use: e1000e
	Kernel modules: e1000e

02:00.0 Unassigned class [ff00]: Realtek Semiconductor Co., Ltd. RTS525A PCI Express Card Reader [10ec:525a] (rev 01)
	Subsystem: Lenovo RTS525A PCI Express Card Reader [17aa:2238]
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 122
	Region 1: Memory at e1100000 (32-bit, non-prefetchable) [size=4K]
	Capabilities: <access denied>
	Kernel driver in use: rtsx_pci
	Kernel modules: rtsx_pci


-- System Information:
Debian Release: 9.0
  APT prefers testing
  APT policy: (500, 'testing')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 4.9.0-1-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)

Versions of packages linux-image-4.9.0-1-amd64 depends on:
ii  initramfs-tools [linux-initramfs-tool]  0.127
ii  kmod                                    23-2
ii  linux-base                              4.5

Versions of packages linux-image-4.9.0-1-amd64 recommends:
ii  firmware-linux-free  3.4
ii  irqbalance           1.1.0-2.2

Versions of packages linux-image-4.9.0-1-amd64 suggests:
pn  debian-kernel-handbook  <none>
ii  grub-efi-amd64          2.02~beta3-4
pn  linux-doc-4.9           <none>

Versions of packages linux-image-4.9.0-1-amd64 is related to:
ii  firmware-iwlwifi          20161130-2
ii  firmware-misc-nonfree     20161130-2

-- no debconf information


Reply to: