[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#528028: BUG: Bad page state in process md?_resync pfn:<hex>



Package: linux-image-2.6.29-1-686
Version: 2.6.29-3
Justification: breaks the whole system
Severity: critical

I wonder if this is the same thing Neil Brown refers to, here:

	http://www.spinics.net/lists/raid/msg23053.html

The syslog shows first 97 of these:

,----
| May 10 02:10:05 kernel: [107468.160593] BUG: Bad page state in process md9_resync  pfn:36243
| May 10 02:10:05 kernel: [107468.160602] page:c16c4860 flags:40000000 count:-1 mapcount:0 mapping:(null) index:24
| May 10 02:10:05 kernel: [107468.160610] Pid: 31239, comm: md9_resync Tainted: P    B      2.6.29-1-686 #1
| May 10 02:10:05 kernel: [107468.160614] Call Trace:
| May 10 02:10:05 kernel: [107468.160626]  [<c016a17c>] bad_page+0xc6/0xda
| May 10 02:10:05 kernel: [107468.160633]  [<c016afbc>] get_page_from_freelist+0x31b/0x3f3
| May 10 02:10:05 kernel: [107468.160641]  [<c016b2e0>] __alloc_pages_internal+0x98/0x398
| May 10 02:10:05 kernel: [107468.160676]  [<f8ca0b1d>] r1buf_pool_alloc+0x8f/0x164 [raid1]
| May 10 02:10:05 kernel: [107468.160684]  [<c016885f>] mempool_alloc+0x27/0xcb
| May 10 02:10:05 kernel: [107468.160691]  [<c01a49cc>] bio_add_page+0x28/0x2e
| May 10 02:10:05 kernel: [107468.160707]  [<f8ca0e54>] sync_request+0x1fd/0x5ee [raid1]
| May 10 02:10:05 kernel: [107468.160739]  [<f8c83d5b>] md_do_sync+0x6d4/0xb60 [md_mod]
| May 10 02:10:05 kernel: [107468.160753]  [<c0136d9e>] autoremove_wake_function+0x0/0x2d
| May 10 02:10:05 kernel: [107468.160775]  [<f8c845e3>] md_thread+0xdd/0xf4 [md_mod]
| May 10 02:10:05 kernel: [107468.160796]  [<f8c84506>] md_thread+0x0/0xf4 [md_mod]
| May 10 02:10:05 kernel: [107468.160803]  [<c0136ad7>] kthread+0x38/0x5d
| May 10 02:10:05 kernel: [107468.160808]  [<c0136a9f>] kthread+0x0/0x5d
| May 10 02:10:05 kernel: [107468.160815]  [<c0103d13>] kernel_thread_helper+0x7/0x10
`----

and then continues and alternates with 81 of these:

,----
| May 10 02:10:14 kernel: [107477.308920] BUG: Bad page state in process find  pfn:04322
| May 10 02:10:14 kernel: [107477.308928] page:c1086440 flags:40000000 count:-1 mapcount:0 mapping:(null) index:1
| May 10 02:10:14 kernel: [107477.308933] Pid: 32146, comm: find Tainted: P    B      2.6.29-1-686 #1
| May 10 02:10:14 kernel: [107477.308937] Call Trace:
| May 10 02:10:14 kernel: [107477.308946]  [<c016a17c>] bad_page+0xc6/0xda
| May 10 02:10:14 kernel: [107477.308952]  [<c016afbc>] get_page_from_freelist+0x31b/0x3f3
| May 10 02:10:14 kernel: [107477.308957]  [<c016b2e0>] __alloc_pages_internal+0x98/0x398
| May 10 02:10:14 kernel: [107477.308962]  [<c0167365>] find_or_create_page+0x3e/0x76
| May 10 02:10:14 kernel: [107477.308968]  [<c01a182f>] __getblk+0x137/0x2df
| May 10 02:10:14 kernel: [107477.308973]  [<c01a263f>] ll_rw_block+0xb5/0xc6
| May 10 02:10:14 kernel: [107477.309013]  [<f8d5725e>] search_by_key+0x69/0x560 [reiserfs]
| May 10 02:10:14 kernel: [107477.309036]  [<f8d57649>] search_by_key+0x454/0x560 [reiserfs]
| May 10 02:10:14 kernel: [107477.309042]  [<c01a16ee>] __find_get_block+0x168/0x172
| May 10 02:10:14 kernel: [107477.309047]  [<c016b2e0>] __alloc_pages_internal+0x98/0x398
| May 10 02:10:14 kernel: [107477.309052]  [<c01a171f>] __getblk+0x27/0x2df
| May 10 02:10:14 kernel: [107477.309073]  [<f8d4a151>] reiserfs_read_locked_inode+0x64/0x50e [reiserfs]
| May 10 02:10:14 kernel: [107477.309093]  [<f8d46d1d>] search_by_entry_key+0x14/0x184 [reiserfs]
| May 10 02:10:14 kernel: [107477.309114]  [<f8d48b00>] make_cpu_key+0x1c/0x38 [reiserfs]
| May 10 02:10:14 kernel: [107477.309134]  [<f8d48b1c>] reiserfs_init_locked_inode+0x0/0xe [reiserfs]
| May 10 02:10:14 kernel: [107477.309154]  [<f8d4a642>] reiserfs_iget+0x47/0x78 [reiserfs]
| May 10 02:10:14 kernel: [107477.309173]  [<f8d47574>] reiserfs_lookup+0xa7/0xf6 [reiserfs]
| May 10 02:10:14 kernel: [107477.309188]  [<c018ed0d>] do_lookup+0xa6/0x116
| May 10 02:10:14 kernel: [107477.309193]  [<c018f678>] __link_path_walk+0x523/0x630
| May 10 02:10:14 kernel: [107477.309198]  [<c0199ab9>] mntput_no_expire+0x1a/0xfe
| May 10 02:10:14 kernel: [107477.309203]  [<c0194d82>] dput+0x1a/0x108
| May 10 02:10:14 kernel: [107477.309207]  [<c018fb87>] path_walk+0x4f/0xa3
| May 10 02:10:14 kernel: [107477.309212]  [<c01908cb>] do_path_lookup+0x132/0x178
| May 10 02:10:14 kernel: [107477.309216]  [<c0191159>] getname+0x5e/0xb0
| May 10 02:10:14 kernel: [107477.309221]  [<c0191930>] user_path_at+0x37/0x5f
| May 10 02:10:14 kernel: [107477.309227]  [<c018b6a7>] vfs_lstat_fd+0x12/0x38
| May 10 02:10:14 kernel: [107477.309231]  [<c018b7b1>] sys_fstatat64+0x28/0x48
| May 10 02:10:14 kernel: [107477.309237]  [<c010343b>] sysenter_do_call+0x12/0x2f
`----

Continues with 151 of these:

,----
| May 10 02:20:35 kernel: [108097.888004] BUG: soft lockup - CPU#0 stuck for 61s! [pdflush:28304]
| May 10 02:20:35 kernel: [108097.888004] Modules linked in: iptable_nat nf_nat iptable_mangle ebtable_filter ebtables hid_gaff hid_pl hid_cypress hid_zpff hid_gyration hid_sony hid_ntrig hid_samsung hid_microsoft hid_tmff hid_monterey hid_ezkey hid_apple hid_a4tech hid_logitech ff_memless hid_cherry hid_sunplus hid_petalynx hid_belkin hid_chicony usbhid hid nvidia(P) xt_multiport binfmt_misc ppdev parport_pc lp parport ipt_ULOG inet_diag acpi_cpufreq cpufreq_powersave cpufreq_stats cpufreq_userspace cpufreq_conservative nf_conntrack_ftp nf_conntrack_irc ipt_LOG xt_limit xt_tcpudp ipt_REJECT xt_state nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 iptable_filter ip_tables x_tables ipv6 fuse dm_crypt eeprom lm85 hwmon_vid i2c_dev snd_emu10k1 snd_ac97_codec ac97_bus snd_pcm_oss snd_mixer_oss snd_pcm snd_page_alloc snd_util_mem snd_hwdep snd_seq_dummy snd_seq_oss snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq snd_timer snd_seq_device snd serio_raw i2c_i801 soundcore pcspkr p!
 smouse evdev i2c_core rng_core bu
| May 10 02:20:35 kernel: tton reiserfs dm_mirror dm_region_hash dm_log dm_snapshot dm_mod raid1 md_mod sd_mod crc_t10dif ide_cd_mod cdrom ata_generic ide_pci_generic ata_piix uhci_hcd libata piix ehci_hcd scsi_mod firewire_ohci firewire_core crc_itu_t ide_core usbcore e1000e intel_agp agpgart thermal processor fan thermal_sys
| May 10 02:20:35 kernel: [108097.888004]
| May 10 02:20:35 kernel: [108097.888004] Pid: 28304, comm: pdflush Tainted: P    B      (2.6.29-1-686 #1)
| May 10 02:20:35 kernel: [108097.888004] EIP: 0060:[<c0166724>] EFLAGS: 00000282 CPU: 0
| May 10 02:20:35 kernel: [108097.888004] EIP is at find_get_pages_tag+0x5f/0xdb
| May 10 02:20:35 kernel: [108097.888004] EAX: ffffffff EBX: 00000000 ECX: c127a040 EDX: c127a044
| May 10 02:20:35 kernel: [108097.888004] ESI: 00000000 EDI: 00000000 EBP: d731be8c ESP: d731be0c
| May 10 02:20:35 kernel: [108097.888004]  DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
| May 10 02:20:35 kernel: [108097.888004] CR0: 8005003b CR2: b7f27000 CR3: 22152000 CR4: 000006d0
| May 10 02:20:35 kernel: [108097.888004] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
| May 10 02:20:35 kernel: [108097.888004] DR6: ffff0ff0 DR7: 00000400
| May 10 02:20:35 kernel: [108097.888004] Call Trace:
| May 10 02:20:35 kernel: [108097.888004]  [<c016d167>] ? pagevec_lookup_tag+0x1b/0x22
| May 10 02:20:35 kernel: [108097.888004]  [<c016bc5b>] ? write_cache_pages+0x117/0x29a
| May 10 02:20:35 kernel: [108097.888004]  [<c016b6ac>] ? __writepage+0x0/0x22
| May 10 02:20:35 kernel: [108097.888004]  [<c016bdde>] ? generic_writepages+0x0/0x21
| May 10 02:20:35 kernel: [108097.888004]  [<c016bdf8>] ? generic_writepages+0x1a/0x21
| May 10 02:20:35 kernel: [108097.888004]  [<c016be1f>] ? do_writepages+0x20/0x30
| May 10 02:20:35 kernel: [108097.888004]  [<c019d8f6>] ? __writeback_single_inode+0x18e/0x368
| May 10 02:20:35 kernel: [108097.888004]  [<c019deb6>] ? generic_sync_sb_inodes+0x25b/0x3b8
| May 10 02:20:35 kernel: [108097.888004]  [<c019e194>] ? writeback_inodes+0x69/0xad
| May 10 02:20:35 kernel: [108097.888004]  [<c016bf1c>] ? wb_kupdate+0x7a/0xdf
| May 10 02:20:35 kernel: [108097.888004]  [<c016cb54>] ? pdflush+0x137/0x1db
| May 10 02:20:35 kernel: [108097.888004]  [<c016bea2>] ? wb_kupdate+0x0/0xdf
| May 10 02:20:35 kernel: [108097.888004]  [<c016ca1d>] ? pdflush+0x0/0x1db
| May 10 02:20:35 kernel: [108097.888004]  [<c0136ad7>] ? kthread+0x38/0x5d
| May 10 02:20:35 kernel: [108097.888004]  [<c0136a9f>] ? kthread+0x0/0x5d
| May 10 02:20:35 kernel: [108097.888004]  [<c0103d13>] ? kernel_thread_helper+0x7/0x10
`----

last one at May 10 05:04:19.  A few seconds later: collapse.


-- Package-specific info:

-- System Information:
Debian Release: squeeze/sid
  APT prefers unstable
  APT policy: (500, 'unstable'), (500, 'testing')
Architecture: i386 (i686)

Kernel: Linux 2.6.26-2-686 (SMP w/2 CPU cores)
Locale: LANG=C, LC_CTYPE=C (charmap=ANSI_X3.4-1968)
Shell: /bin/sh linked to /bin/bash

Versions of packages linux-image-2.6.29-1-686 depends on:
ii  debconf [debconf-2.0]         1.5.26     Debian configuration management sy
ii  initramfs-tools [linux-initra 0.93.2     tools for generating an initramfs
ii  module-init-tools             3.7-pre9-1 tools for managing Linux kernel mo

Versions of packages linux-image-2.6.29-1-686 recommends:
ii  libc6-i686                    2.9-12     GNU C Library: Shared libraries [i

Versions of packages linux-image-2.6.29-1-686 suggests:
ii  grub                          0.97-53    GRand Unified Bootloader (Legacy v
pn  linux-doc-2.6.29              <none>     (no description available)

-- debconf information:
  linux-image-2.6.29-1-686/preinst/abort-overwrite-2.6.29-1-686:
  linux-image-2.6.29-1-686/preinst/elilo-initrd-2.6.29-1-686: true
  linux-image-2.6.29-1-686/preinst/bootloader-initrd-2.6.29-1-686: true
  linux-image-2.6.29-1-686/postinst/depmod-error-initrd-2.6.29-1-686: false
  linux-image-2.6.29-1-686/postinst/old-system-map-link-2.6.29-1-686: true
  linux-image-2.6.29-1-686/postinst/kimage-is-a-directory: shared/kernel-image/really-run-bootloader: true
  linux-image-2.6.29-1-686/preinst/overwriting-modules-2.6.29-1-686: true
  linux-image-2.6.29-1-686/preinst/initrd-2.6.29-1-686:
  linux-image-2.6.29-1-686/postinst/bootloader-test-error-2.6.29-1-686:
  linux-image-2.6.29-1-686/prerm/removing-running-kernel-2.6.29-1-686: true
  linux-image-2.6.29-1-686/preinst/lilo-has-ramdisk:
  linux-image-2.6.29-1-686/postinst/old-dir-initrd-link-2.6.29-1-686: true
  linux-image-2.6.29-1-686/preinst/lilo-initrd-2.6.29-1-686: true
  linux-image-2.6.29-1-686/postinst/depmod-error-2.6.29-1-686: false
  linux-image-2.6.29-1-686/postinst/bootloader-error-2.6.29-1-686:
  linux-image-2.6.29-1-686/preinst/failed-to-move-modules-2.6.29-1-686:
  linux-image-2.6.29-1-686/postinst/create-kimage-link-2.6.29-1-686: true
  linux-image-2.6.29-1-686/prerm/would-invalidate-boot-loader-2.6.29-1-686: true
  linux-image-2.6.29-1-686/preinst/abort-install-2.6.29-1-686:
  linux-image-2.6.29-1-686/postinst/old-initrd-link-2.6.29-1-686: true


Cheers,

-- 
Cristian


Reply to: