Bug#600656: linux-image-2.6.32-5-amd64: Crash after nullpointer dereference during gparted reading a disk

To: Ben Hutchings <ben@decadent.org.uk>
Cc: 600656@bugs.debian.org
Subject: Bug#600656: linux-image-2.6.32-5-amd64: Crash after nullpointer dereference during gparted reading a disk
From: Andreas Feldner <pelzi@flying-snail.de>
Date: Wed, 10 Nov 2010 00:41:36 +0200
Message-id: <[🔎] 201011092341.39105.pelzi@flying-snail.de>
Reply-to: Andreas Feldner <pelzi@flying-snail.de>, 600656@bugs.debian.org
In-reply-to: <1287452085.20865.161.camel@localhost>
References: <20101018214306.3643.64059.reportbug@localhost.localdomain> <1287452085.20865.161.camel@localhost>

Hi Ben,

somehow netconsole doesn't do anything for me. But it turned out the the 
system behaviour is not exactly reproducible and anyway I can see the messages 
on screen if X is not started (because nvidia module banned ;-) ).

So, here I have the following error message, hope that helps!

Best regards,
Andreas.

Oct 20 01:11:46 athlon1 kernel: [10695.728746] BUG: unable to handle kernel 
NULL pointer dereference at 0000000000000020
Oct 20 01:11:46 athlon1 kernel: [10695.729011] IP: [<ffffffff8110f31f>] 
block_invalidatepage+0x32/0xa8
Oct 20 01:11:46 athlon1 kernel: [10695.729011] PGD bcfe2067 PUD bcef1067 PMD 0 
Oct 20 01:11:46 athlon1 kernel: [10695.729011] Oops: 0000 [#1] SMP 
Oct 20 01:11:46 athlon1 kernel: [10695.729011] last sysfs file: 
/sys/devices/platform/it87.656/temp2_input
Oct 20 01:11:46 athlon1 kernel: [10695.729011] CPU 1 
Oct 20 01:11:46 athlon1 kernel: [10695.729011] Modules linked in: tcp_diag 
inet_diag tun netconsole configfs cpufreq_conservative cpufreq_stats ppdev lp 
capifs binfmt_misc uinput fuse ext4 jbd2 crc16 it87 hwmon_vid eeprom 
cpufreq_userspace cpufreq_powersave powernow_k8 psmouse ide_generic 
ide_gd_mod ide_cd_mod ide_core dm_crypt dm_mod em28xx_alsa tuner_xc2028 tuner 
tvp5150 em28xx v4l2_common videodev v4l1_compat 
v4l2_compat_ioctl32 snd_intel8x0 ir_common videobuf_vmalloc snd_ac97_codec 
videobuf_core tveeprom ac97_bus snd_pcm_oss snd_mixer_oss joydev 
snd_pcm snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq snd_timer 
snd_seq_device snd i2c_nforce2 edac_core parport_pc soundcore evdev 
k8temp edac_mce_amd snd_page_alloc i2c_core parport button pcspkr processor 
serio_raw ext3 jbd mbcache hid_cherry sg usbhid hid sr_mod cdrom 
sd_mod crc_t10dif ata_generic thermal firewire_ohci pata_amd ohci_hcd floppy 
thermal_sys firewire_core crc_itu_t sata_nv forcedeth libata ehci_hcd 
scsi_mod usbcore nls_base [last unloaded: net
Oct 20 01:11:46 athlon1 kernel: onsole]
Oct 20 01:11:46 athlon1 kernel: [10695.729011] Pid: 3821, comm: gpartedbin Not 
tainted 2.6.32-5-amd64 #1  
Oct 20 01:11:46 athlon1 kernel: [10695.729011] RIP: 0010:[<ffffffff8110f31f>]  
[<ffffffff8110f31f>] block_invalidatepage+0x32/0xa8
Oct 20 01:11:46 athlon1 kernel: [10695.729011] RSP: 0018:ffff8800a9a15d78  
EFLAGS: 00010207
Oct 20 01:11:46 athlon1 kernel: [10695.729011] RAX: 0000000000fabcf0 RBX: 
0000000000000000 RCX: ffff8800bf2d0878
Oct 20 01:11:46 athlon1 kernel: [10695.729011] RDX: 0000000000000002 RSI: 
ffff880016b53c30 RDI: ffff8800016b3a48
Oct 20 01:11:46 athlon1 kernel: [10695.729011] RBP: ffffea0000a38550 R08: 
0000000000000000 R09: 0000000000000000
Oct 20 01:11:46 athlon1 kernel: [10695.729011] R10: ffff8800bf2d0878 R11: 
ffffffff8110f2ed R12: ffff880016b53cb0
Oct 20 01:11:46 athlon1 kernel: [10695.729011] R13: 0000000000fabcf0 R14: 
0000000000000000 R15: 0000000000000000
Oct 20 01:11:46 athlon1 kernel: [10695.729011] FS:  00007fd036d1d710(0000) 
GS:ffff880001900000(0000) knlGS:0000000000000000
Oct 20 01:11:46 athlon1 kernel: [10695.729011] CS:  0010 DS: 0000 ES: 0000 
CR0: 000000008005003b
Oct 20 01:11:46 athlon1 kernel: [10695.729011] CR2: 0000000000000020 CR3: 
00000000bc693000 CR4: 00000000000006e0
Oct 20 01:11:46 athlon1 kernel: [10695.729011] DR0: 0000000000000000 DR1: 
0000000000000000 DR2: 0000000000000000
Oct 20 01:11:46 athlon1 kernel: [10695.729011] DR3: 0000000000000000 DR6: 
00000000ffff0ff0 DR7: 0000000000000400
Oct 20 01:11:46 athlon1 kernel: [10695.729011] Process gpartedbin (pid: 3821, 
threadinfo ffff8800a9a14000, task ffff8800bce60000)
Oct 20 01:11:46 athlon1 kernel: [10695.729011] Stack:
Oct 20 01:11:46 athlon1 kernel: [10695.729011]  ffff8800bf2d0878 
ffffea0000a38550 ffff8800bf2d0878 0000000001381801
Oct 20 01:11:46 athlon1 kernel: [10695.729011] <0> 000000000000000d 
ffff8800a9a15e80 0000000000000000 ffffffff810bc8b6
Oct 20 01:11:46 athlon1 kernel: [10695.729011] <0> 0000000000000000 
ffffea0000a38550 0000000001381b2f ffffffff810bc99f
Oct 20 01:11:46 athlon1 kernel: [10695.729011] Call Trace:
Oct 20 01:11:46 athlon1 kernel: [10695.729011]  [<ffffffff810bc8b6>] ? 
truncate_inode_page+0x45/0x84
Oct 20 01:11:46 athlon1 kernel: [10695.729011]  [<ffffffff810bc99f>] ? 
truncate_inode_pages_range+0xaa/0x2b0
Oct 20 01:11:46 athlon1 kernel: [10695.729011]  [<ffffffff81111c53>] ? 
__blkdev_put+0x75/0x14c
Oct 20 01:11:46 athlon1 kernel: [10695.729011]  [<ffffffff810ef421>] ? 
__fput+0x100/0x1af
Oct 20 01:11:46 athlon1 kernel: [10695.729011]  [<ffffffff810ec85e>] ? 
filp_close+0x5b/0x62
Oct 20 01:11:46 athlon1 kernel: [10695.729011]  [<ffffffff810ec8f9>] ? 
sys_close+0x94/0xcd
Oct 20 01:11:46 athlon1 kernel: [10695.729011]  [<ffffffff81010b42>] ? 
system_call_fastpath+0x16/0x1b
Oct 20 01:11:46 athlon1 kernel: [10695.729011] Code: 41 55 41 54 55 48 89 fd 
53 48 83 ec 08 48 8b 07 a8 01 75 04 0f 0b eb fe f6 c4 08 74 77 4c 8b 67 10 31 
c0 4c 89 e3 41 89 c5 89 c0 <44> 03 6b 20 49 39 c7 4c 8b 73 08 77 36 48 89 df 
e8 22 e8 ff ff 
Oct 20 01:11:46 athlon1 kernel: [10695.729011] RIP  [<ffffffff8110f31f>] 
block_invalidatepage+0x32/0xa8
Oct 20 01:11:46 athlon1 kernel: [10695.729011]  RSP <ffff8800a9a15d78>
Oct 20 01:11:46 athlon1 kernel: [10695.729011] CR2: 0000000000000020
Oct 20 01:11:46 athlon1 kernel: [10696.430344] ---[ end trace bfde34ae8c0233a0 
]---


Am Dienstag 19. Oktober 2010, 03:34:45 schrieben Sie:
> On Mon, 2010-10-18 at 23:43 +0200, Andreas Feldner wrote:
> > Package: linux-2.6
> > Version: 2.6.32-23
> > Severity: important
> > 
> > My system crashes reproducibly during the gparted operation "move
> > /dev/sda1 to the right" in the read-only test stage. It is very hard to
> > track down as the system freezes (ping is working, no other networking
> > stuff like sshd, screen is black, keyboard doesn't react, not even the
> > shift indicator LEDs). After reboot, no indication of the problem can be
> > found in the log files anymore.
> > 
> > Running tail -f /var/log/kern.log on a different machine, and running
> > gparted with remote X11 did the trick to get to some information. The
> > following lines were the last words to hear from the machine:
> > 
> > Oct 18 21:44:10 xxxxxxx kernel: [88819.864398] BUG: unable to handle
> > kernel NULL pointer dereference at (null) Oct 18 21:44:10 xxxxxxx
> > kernel: [88819.864413] IP: [<ffffffff8110c456>] drop_buffers+0x23/0x9d
> > Oct 18 21:44:10 xxxxxxx kernel: [88819.864429] PGD 7d9c4067 PUD 1f542067
> > PMD 0 Oct 18 21:44:10 xxxxxxx kernel: [88819.864439] Oops: 0000 [#1] SMP
> > 
> > gparted was attempting to read a 511.94 GiB sized partition on a SATA
> > disk with a block size of 16.00 MiB. The crash orccured at 365.60 GiB
> > (in case that matters). The file system on the partition in question is
> > reported OK by fsck. A test with dd if=/dev/sda1 of=/dev/null bs=4096
> > worked out OK (didn't try with 16M block size). SMART status if the hard
> > disk is passed.
> > 
> > I suspect some rare freezes of the machine to come from the same origin,
> > though I didn't find another way to actually reproduce the problem than
> > running the described gparted operation.
> > 
> > Unfortunately, I have no idea if this bug report contains any useful
> > information, nor how to provide any additional.
> 
> [...]
> 
> It really isn't enough information.  Please use netconsole or a serial
> console to capture the full 'oops' message.  Also please test whether
> this is reproducible if you don't use the nvidia proprietary driver.
> 
> Ben.


-- 
Dr. Andreas Feldner
Neufeldstraße 7b
81243 München

Reply to:

Follow-Ups:
- Bug#600656: linux-image-2.6.32-5-amd64: Crash after nullpointer dereference during gparted reading a disk
  - From: Ben Hutchings <ben@decadent.org.uk>

Prev by Date: Bug#602966: linux-source-2.6.36: Oops while unmouning an USB key with FAT filesystem
Next by Date: Processed: notfound 602945 in 2.6.32-9, found 602945 in 2.6.32-27
Previous by thread: Bug#602966: linux-source-2.6.36: Oops while unmouning an USB key with FAT filesystem
Next by thread: Bug#600656: linux-image-2.6.32-5-amd64: Crash after nullpointer dereference during gparted reading a disk
Index(es):
- Date
- Thread