[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)



On Mon, Sep 27, 2010 at 09:29:28AM -0700, Vagrant Cascadian wrote:
> On Tue, Sep 21, 2010 at 01:37:42PM -0700, Vagrant Cascadian wrote:
> > On Mon, Sep 20, 2010 at 02:04:23AM +0100, Ben Hutchings wrote:
> > > Please add 'memory_corruption_check=1' to the kernel parameters and
> > > report whether that fixes the problem and whether it results in any new
> > > log messages.
> > 
> > added this now, will reboot and see if that fixes it... presuming it just
> > doesn't hide itself for another several weeks.
> 
> ok, didn't have to wait for weeks, apparently.

still getting crashes every week or two, but not sure if they're related to the
initial problem...

so i've been running with memory_corruption_check=1:

cat /proc/cmdline
BOOT_IMAGE=/vmlinuz-2.6.32-5-686 root=/dev/mapper/mneme-rwt ro quiet memory_corruption_check=1


Oct  5 19:14:30 mneme kernel: [ 8525.133061] BUG: unable to handle kernel NULL pointer dereference at (null)
Oct  5 19:14:30 mneme kernel: [ 8525.133072] IP: [<c113a802>] strlen+0x8/0x11
Oct  5 19:14:30 mneme kernel: [ 8525.133083] *pde = 00000000
Oct  5 19:14:30 mneme kernel: [ 8525.133088] Oops: 0000 [#1] SMP
Oct  5 19:14:30 mneme kernel: [ 8525.133094] last sysfs file: /sys/devices/pci0000:00/0000:00:02.1/resource
Oct  5 19:14:30 mneme kernel: [ 8525.133099] Modules linked in: usbhid hid tun ip6table_filter ip6_tables iptable_filter ip_tables x_tables sco bridge stp bnep
rfcomm l2cap crc16 bluetooth acpi_cpufreq parport_pc ppdev lp cpufreq_stats cpufreq_powersave parport cpufreq_conservative cpufreq_userspace kvm_intel kvm uinpu
t fuse dm_snapshot firewire_sbp2 loop snd_hda_codec_idt snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_midi snd_rawmidi snd_seq
_midi_event i915 snd_seq arc4 drm_kms_helper ecb snd_timer b43 snd_seq_device drm snd yenta_socket rsrc_nonstatic i2c_i801 i2c_algo_bit soundcore mac80211 rng_c
ore snd_page_alloc dell_laptop joydev i2c_core cfg80211 rfkill video battery processor button output dcdbas psmouse ac evdev serio_raw ext3 jbd mbcache sha256_g
eneric aes_i586 aes_generic cbc dm_crypt dm_mod sd_mod crc_t10dif ata_generic ata_piix sdhci_pci sdhci thermal ssb firewire_ohci pcmcia libata mmc_core tg3 uhci
_hcd firewire_core crc_itu_t pcmcia_core scsi_mod led_class ehci_h
Oct  5 19:14:30 mneme kernel: cd thermal_sys libphy usbcore nls_base [last unloaded: scsi_wait_scan]
Oct  5 19:14:30 mneme kernel: [ 8525.133242]
Oct  5 19:14:30 mneme kernel: [ 8525.133248] Pid: 10491, comm: Xorg Not tainted (2.6.32-5-686 #1) Latitude D420
Oct  5 19:14:30 mneme kernel: [ 8525.133254] EIP: 0060:[<c113a802>] EFLAGS: 00213246 CPU: 0
Oct  5 19:14:30 mneme kernel: [ 8525.133259] EIP is at strlen+0x8/0x11
Oct  5 19:14:30 mneme kernel: [ 8525.133263] EAX: 00000000 EBX: 00000fd4 ECX: ffffffff EDX: 00000005
Oct  5 19:14:30 mneme kernel: [ 8525.133268] ESI: 00000000 EDI: 00000000 EBP: f6f533f0 ESP: d5871f4c
Oct  5 19:14:30 mneme kernel: [ 8525.133272]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
Oct  5 19:14:30 mneme kernel: [ 8525.133278] Process Xorg (pid: 10491, ti=d5870000 task=f335d540 task.ti=d5870000)
Oct  5 19:14:30 mneme kernel: [ 8525.133282] Stack:
Oct  5 19:14:30 mneme kernel: [ 8525.133285]  f0790e00 c10f2aee c10bd768 d5871f90 00000000 c128167c f0790e00 e0537d88
Oct  5 19:14:30 mneme kernel: [ 8525.133295] <0> e0537e00 c10bd9fb d5871f90 c10bd768 fffffff7 f0790e00 00000000 00008000
Oct  5 19:14:30 mneme kernel: [ 8525.133306] <0> c10bda8b 0a0882a8 0a088288 00007d28 ffffffea 0000000c 0a087fd0 b76afff4
Oct  5 19:14:30 mneme kernel: [ 8525.133318] Call Trace:
Oct  5 19:14:30 mneme kernel: [ 8525.133328]  [<c10f2aee>] ? sysfs_readdir+0xe0/0x13a
Oct  5 19:14:30 mneme kernel: [ 8525.133336]  [<c10bd768>] ? filldir64+0x0/0xc5
Oct  5 19:14:30 mneme kernel: [ 8525.133342]  [<c10bd9fb>] ? vfs_readdir+0x62/0x8c
Oct  5 19:14:30 mneme kernel: [ 8525.133349]  [<c10bd768>] ? filldir64+0x0/0xc5
Oct  5 19:14:30 mneme kernel: [ 8525.133355]  [<c10bda8b>] ? sys_getdents64+0x66/0xa5
Oct  5 19:14:30 mneme kernel: [ 8525.133363]  [<c10030fb>] ? sysenter_do_call+0x12/0x28
Oct  5 19:14:30 mneme kernel: [ 8525.133367] Code: eb 04 19 c0 0c 01 5e 5f c3 56 89 c6 89 d0 88 c4 ac 38 e0 74 09 84 c0 75 f7 be 01 00 00 00 89 f0 48 5e c3 57 8
3 c9 ff 89 c7 31 c0 <f2> ae f7 d1 49 89 c8 5f c3 57 31 ff 85 c9 74 0e 89 c7 89 d0 f2
Oct  5 19:14:30 mneme kernel: [ 8525.133427] EIP: [<c113a802>] strlen+0x8/0x11 SS:ESP 0068:d5871f4c
Oct  5 19:14:30 mneme kernel: [ 8525.133434] CR2: 0000000000000000
Oct  5 19:14:30 mneme kernel: [ 8525.133439] ---[ end trace e4f8adeee260d138 ]---

and again today:

Oct  6 08:20:00 mneme kernel: [ 5643.886857] BUG: unable to handle kernel NULL pointer dereference at 00000010
Oct  6 08:20:00 mneme kernel: [ 5643.887055] IP: [<c108cf4b>] file_ra_state_init+0x3/0x18
Oct  6 08:20:00 mneme kernel: [ 5643.890573] *pde = 00000000 
Oct  6 08:20:00 mneme kernel: [ 5643.890573] Oops: 0000 [#1] SMP 
Oct  6 08:20:00state_init+0x3/0x18
Oct  6 08:20:00 mneme kernel: [ 5643.890573] EAX: ef4e1fc8 EBX: 00000000 ECX: f3710800 EDX: 00000000
Oct  6 08:20:00 mneme kernel: [ 5643.890573] ESI: ef4e1f80 EDI: 00000000 EBP: f6f73b18 ESP: f3725ea0
Oct  6 08:20:00 mneme kernel: [ 5643.890573]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
Oct  6 08:20:00 mneme kernel: [ 5643.890573] Process screen (pid: 2119, ti=f3724000 task=f32b5100 task.ti=f3724000)
Oct  6 08:20:00 mneme kernel: [ 5643.890573] Stack:
Oct  6 08:20:00 mneme kernel: [ 5643.890573]  c10b1367 f6f28300 f68bd7f8 f3725f00 ef4e1f80 f3725f00 f3725f00 00000003
Oct  6 08:20:00 mneme kernel: [ 5643.890573] <0> c10b14c8 ef4e1f80 c10b4e5c f3606d80 00000000 f3725f00 c10bb470 00000000
Oct  6 08:20:00 mneme kernel: [ 5643.890573] <0> 00000002 f3501000 ffffff9c 0000fe01 00000000 00000000 000175ac 000081ed
Oct  6 08:20:00 mneme kernel: [ 5643.890573] Call Trace:
Oct  6 08:20:00 mneme kernel: [ 5643.890573]  [<c10b1367>] ? __dentry_open+0x176/0x246
Oct  6 08:20:00 mneme kernel: [ 5643.890573]  [<c10b14c8>] ? nameidata_to_filp+0x29/0x3c
Oct  6 08:20:00 mneme kernel: [ 5643.890573]  [<c10b4e5c>] ? chrdev_open+0x0/0x116
Oct  6 08:20:00 mneme kernel: [ 5643.890573]  [<c10bb470>] ? do_filp_open+0x43f/0x802
Oct  6 08:20:00 mneme kernel: [ 5643.890573]  [<c10c3181>] ? alloc_fd+0x52/0xb7
Oct  6 08:20:00 mneme kernel: [ 5643.890573]  [<c10b10ff>] ? do_sys_open+0x4c/0xdf
Oct  6 08:20:00 mneme kernel: [ 5643.890573]  [<c10b11d6>] ? sys_open+0x1e/0x23
Oct  6 08:20:00 mneme kernel: [ 5643.890573]  [<c10030fb>] ? sysenter_do_call+0x12/0x28
Oct  6 08:20:00 mneme kernel: [ 5643.890573] Code: c3 53 89 d3 ff 74 24 0c ff 74 24 0c e8 fa 92 fa ff 5a 59 85 c0 75 0e 85 db 74 0a c7 05 34 0c 4b c1 00 00 00 00 5b c3 90 8b 52 40 <8b> 52 10 c7 40 14 ff ff ff ff c7 40 18 ff ff ff ff 89 50 0c c3 
Oct  6 08:20:00 mneme kernel: [ 5643.890573] EIP: [<c108cf4b>] file_ra_state_init+0x3/0x18 SS:ESP 0068:f3725ea0
Oct  6 08:20:00 mneme kernel: [ 5643.890573] CR2: 0000000000000010
Oct  6 08:20:00 mneme kernel: [ 5644.125222] ---[ end trace e48111b1d9cacfd3 ]--
Oct  6 08:21:26 mneme anacron[6198]: Job `cron.daily' started
Oct  6 08:21:26 mneme anacron[6816]: Updated timestamp for job `cron.daily' to 2010-10-06
Oct  6 08:21:58 mneme shutdown[6821]: shutting down for system reboot
Oct  6 08:21:59 mneme init: Switching to runlevel: 6
Oct  6 08:22:00 mneme kernel: [ 5764.683835] BUG: unable to handle kernel NULL pointer dereference at 00000010
Oct  6 08:22:00 mneme kernel: [ 5764.684997] IP: [<c108cf4b>] file_ra_state_init+0x3/0x18
Oct  6 08:22:00 mneme kernel: [ 5764.684997] *pde = 00000000 
Oct  6 08:22:00 mneme kernel: [ 5764.702263] Oops: 0000 [#2] SMP 
Oct  6 08:22:00 mneme kernel: [ 5764.702263] last sysfs file: /sys/devices/system/cpu/cpu1/cache/index2/shared_cpu_map
Oct  6 08:22:00 mneme kernel: [ 5764.702263] Modules linked in: tun ip6table_filter ip6_tables iptable_filter ip_tables x_tables sco bridge stp bnep parport_pc ppdev lp parport l2cap crc16 bluetooth acpi_cpufreq cpufreq_stats cpufreq_powersave cpufreq_conservative cpufreq_userspace kvm_intel kvm uinput fuse dm_snapshot firewire_sbp2 loop snd_hda_codec_idt snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_midi arc4 snd_rawmidi ecb snd_seq_midi_event snd_seq snd_timer b43 joydev snd_seq_device i915 yenta_socket drm_kms_helper snd rsrc_nonstatic soundcore mac80211 drm psmouse i2c_algo_bit i2c_i801 snd_page_alloc rng_core i2c_core dell_laptop serio_raw cfg80211 video battery processor button ac evdev output dcdbas rfkill ext3 jbd mbcache sha256_generic aes_i586 aes_generic cbc usbhid hid dm_crypt dm_mod sd_mod crc_t10dif ata_generic ssb thermal sdhci_pci sdhci ata_piix uhci_hcd firewire_ohci pcmcia mmc_core libata tg3 thermal_sys firewire_core crc_itu_t pcmcia_core led_class ehci_hcd s
Oct  6 08:22:00 mneme kernel: csi_mod libphy usbcore nls_base [last unloaded: scsi_wait_scan]
Oct  6 08:22:00 mneme kernel: [ 5764.702263] 
Oct  6 08:22:00 mneme kernel: [ 5764.702263] Pid: 6836, comm: startpar Tainted: G      D    (2.6.32-5-686 #1) Latitude D420                   
Oct  6 08:22:00 mneme kernel: [ 5764.702263] EIP: 0060:[<c108cf4b>] EFLAGS: 00010202 CPU: 1
Oct  6 08:22:00 mneme kernel: [ 5764.702263] EIP is at file_ra_state_init+0x3/0x18
Oct  6 08:22:00 mneme kernel: [ 5764.702263] EAX: f3703d48 EBX: 00000000 ECX: f4fd3800 EDX: 00000000
Oct  6 08:22:00 mneme kernel: [ 5764.702263] ESI: f3703d00 EDI: 00000000 EBP: f6f73b18 ESP: f3613ea0
Oct  6 08:22:00 mneme kernel: [ 5764.702263]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
Oct  6 08:22:00 mneme kernel: [ 5764.702263] Process startpar (pid: 6836, ti=f3612000 task=ef508440 task.ti=f3612000)
Oct  6 08:22:00 mneme kernel: [ 5764.702263] Stack:
Oct  6 08:22:00 mneme kernel: [ 5764.702263]  c10b1367 f6f28300 f68bd7f8 f3613f00 f3703d00 f3613f00 f3613f00 00000103
Oct  6 08:22:00 mneme kernel: [ 5764.702263] <0> c10b14c8 f3703d00 c10b4e5c f3497100 00000000 f3613f00 c10bb470 c13b3d00
Oct  6 08:22:00 mneme kernel: [ 5764.702263] <0> 00000102 f358c000 ffffff9c 00000079 b76d6000 00000000 b765d000 00000000
Oct  6 08:22:00 mneme kernel: [ 5764.702263] Call Trace:
Oct  6 08:22:00 mneme kernel: [ 5764.702263]  [<c10b1367>] ? __dentry_open+0x176/0x246
Oct  6 08:22:00 mneme kernel: [ 5764.702263]  [<c10b14c8>] ? nameidata_to_filp+0x29/0x3c
Oct  6 08:22:00 mneme kernel: [ 5764.702263]  [<c10b4e5c>] ? chrdev_open+0x0/0x116
Oct  6 08:22:00 mneme kernel: [ 5764.702263]  [<c10bb470>] ? do_filp_open+0x43f/0x802
Oct  6 08:22:00 mneme kernel: [ 5764.702263]  [<c109b2af>] ? handle_mm_fault+0x48f/0x959
Oct  6 08:22:00 mneme kernel: [ 5764.702263]  [<c10a4b09>] ? free_pages_and_swap_cache+0x3b/0x48
Oct  6 08:22:00 mneme kernel: [ 5764.702263]  [<c10c3181>] ? alloc_fd+0x52/0xb7
Oct  6 08:22:00 mneme kernel: [ 5764.702263]  [<c10b10ff>] ? do_sys_open+0x4c/0xdf
Oct  6 08:22:00 mneme kernel: [ 5764.702263]  [<c10b11d6>] ? sys_open+0x1e/0x23
Oct  6 08:22:00 mneme kernel: [ 5764.702263]  [<c10030fb>] ? sysenter_do_call+0x12/0x28
Oct  6 08:22:00 mneme kernel: [ 5764.702263] Code: c3 53 89 d3 ff 74 24 0c ff 74 24 0c e8 fa 92 fa ff 5a 59 85 c0 75 0e 85 db 74 0a c7 05 34 0c 4b c1 00 00 00 00 5b c3 90 8b 52 40 <8b> 52 10 c7 40 14 ff ff ff ff c7 40 18 ff ff ff ff 89 50 0c c3 
Oct  6 08:22:00 mneme kernel: [ 5764.702263] EIP: [<c108cf4b>] file_ra_state_init+0x3/0x18 SS:ESP 0068:f3613ea0
Oct  6 08:22:00 mneme kernel: [ 5764.702263] CR2: 0000000000000010
Oct  6 08:22:00 mneme kernel: [ 5764.940095] ---[ end trace e48111b1d9cacfd4 ]---


anything else that could possibly be of use in troubleshooting this?

live well,
  vagrant



Reply to: