Bug#656899: bug in md



Hello.

Yesterday one of our Debian 6.0.4 servers running software RAID froze.

Packages:

mdadm 3.1.4-1+8efb9d1+squeeze1
linux-image-2.6.32-5-686 2.6.32-41

Log:

Mar  4 00:57:01 xxfw kernel: [810540.404471] md: data-check of RAID array md0
Mar  4 00:57:01 xxfw kernel: [810540.404477] md: minimum _guaranteed_ speed: 1000 KB/sec/disk.
Mar  4 00:57:01 xxfw kernel: [810540.404481] md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for data-check.
Mar  4 00:57:01 xxfw kernel: [810540.404488] md: using 128k window, over a total of 1951744 blocks.
Mar  4 00:57:58 xxfw kernel: [810597.523705] md: md0: data-check done.
Mar  4 01:00:02 xxfw kernel: [810720.628033] kthreadd D cf2a42a0 0 2 0 0x00000000
Mar  4 01:00:02 xxfw kernel: [810720.628042] ce828440 00000046 ce82b300 cf2a42a0 00000246 c141d100 c141d100 c14186ac
Mar  4 01:00:02 xxfw kernel: [810720.628055] ce8285fc c1808100 00000000 a8ea713c 0002e12e cf2a42ac cf2a42a8 00000000
Mar  4 01:00:02 xxfw kernel: [810720.628066] 00000001 ce8285fc 0c12d5b8 00000292 00000000 00000003 ce831aec ce9ce9d8
Mar  4 01:00:02 xxfw kernel: [810720.628078] Call Trace:
Mar  4 01:00:02 xxfw kernel: [810720.628119] [<d0340a26>] ? md_write_start+0x12d/0x140 [md_mod]
Mar  4 01:00:02 xxfw kernel: [810720.628133] [<c1044342>] ? autoremove_wake_function+0x0/0x2d
Mar  4 01:00:02 xxfw kernel: [810720.628144] [<d0364d9d>] ? make_request+0x27/0x6dd [raid1]
Mar  4 01:00:02 xxfw kernel: [810720.628154] [<c106f881>] ? __rcu_process_callbacks+0x164/0x227
Mar  4 01:00:02 xxfw kernel: [810720.628161] [<c106f881>] ? __rcu_process_callbacks+0x164/0x227
Mar  4 01:00:02 xxfw kernel: [810720.628169] [<c10aeb9c>] ? kmem_cache_free+0x78/0xaf
Mar  4 01:00:02 xxfw kernel: [810720.628175] [<c106f881>] ? __rcu_process_callbacks+0x164/0x227
Mar  4 01:00:02 xxfw kernel: [810720.628182] [<c106f789>] ? __rcu_process_callbacks+0x6c/0x227
Mar  4 01:00:02 xxfw kernel: [810720.628194] [<d03422f0>] ? md_make_request+0xa4/0xd8 [md_mod]
Mar  4 01:00:02 xxfw kernel: [810720.628201] [<c106f977>] ? rcu_process_callbacks+0x33/0x39
Mar  4 01:00:02 xxfw kernel: [810720.628211] [<c1035b6d>] ? __do_softirq+0x115/0x156
Mar  4 01:00:02 xxfw kernel: [810720.628219] [<c1128d19>] ? generic_make_request+0x266/0x2b4
Mar  4 01:00:02 xxfw kernel: [810720.628228] [<c108982a>] ? mempool_alloc+0x3b/0xdd
Mar  4 01:00:02 xxfw kernel: [810720.628235] [<c1128e23>] ? submit_bio+0xbc/0xd6
Mar  4 01:00:02 xxfw kernel: [810720.628244] [<c108cc21>] ? test_set_page_writeback+0xc7/0xd0
Mar  4 01:00:02 xxfw kernel: [810720.628250] [<c10a5680>] ? swap_writepage+0x82/0x89
Mar  4 01:00:02 xxfw kernel: [810720.628258] [<c10900d1>] ? shrink_page_list+0x32d/0x585
Mar  4 01:00:02 xxfw kernel: [810720.628264] [<c108f2aa>] ? isolate_pages_global+0x159/0x1bc
Mar  4 01:00:02 xxfw kernel: [810720.628270] [<c10906b7>] ? shrink_list+0x38e/0x61b
Mar  4 01:00:02 xxfw kernel: [810720.628291] [<d00bb6b5>] ? scsi_next_command+0x25/0x2f [scsi_mod]
Mar  4 01:00:02 xxfw kernel: [810720.628301] [<c112cdaf>] ? blk_done_softirq+0x53/0x5f
Mar  4 01:00:02 xxfw kernel: [810720.628307] [<c1090b69>] ? shrink_zone+0x225/0x2c8
Mar  4 01:00:02 xxfw kernel: [810720.628315] [<c109170b>] ? try_to_free_pages+0x1f6/0x31a
Mar  4 01:00:02 xxfw kernel: [810720.628321] [<c108f151>] ? isolate_pages_global+0x0/0x1bc
Mar  4 01:00:02 xxfw kernel: [810720.628327] [<c108c81c>] ? __alloc_pages_nodemask+0x302/0x4d9
Mar  4 01:00:02 xxfw kernel: [810720.628334] [<c108c9ff>] ? __get_free_pages+0xc/0x17
Mar  4 01:00:02 xxfw kernel: [810720.628342] [<c102f31c>] ? copy_process+0xb7/0xf28
Mar  4 01:00:02 xxfw kernel: [810720.628352] [<c1025134>] ? update_curr+0x106/0x1b3
Mar  4 01:00:02 xxfw kernel: [810720.628358] [<c10302c7>] ? do_fork+0x13a/0x2bc
Mar  4 01:00:02 xxfw kernel: [810720.628364] [<c102b8d6>] ? finish_task_switch+0x34/0x95
Mar  4 01:00:02 xxfw kernel: [810720.628372] [<c1001e39>] ? kernel_thread+0x85/0x8d
Mar  4 01:00:02 xxfw kernel: [810720.628379] [<c10440af>] ? kthread+0x0/0x66
Mar  4 01:00:02 xxfw kernel: [810720.628384] [<c10440af>] ? kthread+0x0/0x66
Mar  4 01:00:02 xxfw kernel: [810720.628390] [<c1003d40>] ? kernel_thread_helper+0x0/0x10
Mar  4 01:00:02 xxfw kernel: [810720.628396] [<c1044080>] ? kthreadd+0x8c/0xbb
Mar  4 01:00:02 xxfw kernel: [810720.628401] [<c1043ff4>] ? kthreadd+0x0/0xbb
Mar  4 01:00:02 xxfw kernel: [810720.628407] [<c1003d47>] ? kernel_thread_helper+0x7/0x10
Mar  4 01:00:02 xxfw kernel: [810720.628422] bdi-default D c141d100 0 13 2 0x00000000
Mar  4 01:00:02 xxfw kernel: [810720.628430] ce82b300 00000046 c141d100 c141d100 c1808100 c141d100 c141d100 00000000
Mar  4 01:00:02 xxfw kernel: [810720.628441] ce82b4bc c1808100 00000000 34259b7b 0002e12f 0002e12f 342595fa c1808138
Mar  4 01:00:02 xxfw kernel: [810720.628453] 00000e5b ce82b4bc 021e93a2 0000010c 00000002 00000000 c1808138 ce82846c

The boot log contains many of these messages:

Mar  5 09:06:42 xxfw kernel: [ 4.478620] md: raid1 personality registered for level 1
Mar  5 09:06:42 xxfw kernel: [ 4.512732] mdadm: sending ioctl 800c0910 to a partition!
Mar  5 09:06:42 xxfw kernel: [ 4.512808] mdadm: sending ioctl 800c0910 to a partition!

cat /proc/mdstat
Personalities : [raid1]
md4 : active raid1 sda7[0] sdb7[1]
      108422528 blocks [2/2] [UU]

md3 : active raid1 sda6[0] sdb6[1]
      4883648 blocks [2/2] [UU]

md2 : active raid1 sda5[0] sdb5[1]
      979840 blocks [2/2] [UU]

md1 : active (auto-read-only) raid1 sda2[0] sdb2[1]
      979840 blocks [2/2] [UU]

md0 : active raid1 sda1[0] sdb1[1]
      1951744 blocks [2/2] [UU]
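
A minimal sketch of how the /proc/mdstat output above could be checked from a script. It assumes the two-line layout shown here (an "mdN : ..." line followed by a "blocks [n/m] [UU]" line). The names parse_mdstat and degraded are hypothetical helpers, and SAMPLE is a shortened copy of the output above.

```python
import re

# Shortened copy of the /proc/mdstat output quoted in this report.
SAMPLE = """\
Personalities : [raid1]
md1 : active (auto-read-only) raid1 sda2[0] sdb2[1]
      979840 blocks [2/2] [UU]

md0 : active raid1 sda1[0] sdb1[1]
      1951744 blocks [2/2] [UU]
"""

ARRAY_RE = re.compile(r"^(md\d+) : (\S+)(?: \(([^)]*)\))? (\S+) (.*)$")
STATUS_RE = re.compile(r"\[(\d+)/(\d+)\] \[([U_]+)\]")


def parse_mdstat(text):
    """Return a dict keyed by array name (hypothetical helper)."""
    arrays = {}
    current = None
    for line in text.splitlines():
        m = ARRAY_RE.match(line)
        if m:
            current = m.group(1)
            arrays[current] = {
                "state": m.group(2),      # e.g. "active"
                "flag": m.group(3),       # e.g. "auto-read-only" or None
                "level": m.group(4),      # e.g. "raid1"
                "members": m.group(5).split(),
                "status": None,           # filled from the next line
            }
            continue
        m = STATUS_RE.search(line)
        if m and current is not None:
            arrays[current]["status"] = m.group(3)
    return arrays


def degraded(arrays):
    """Names of arrays whose status string contains a missing member ('_')."""
    return [n for n, a in arrays.items() if a["status"] and "_" in a["status"]]


if __name__ == "__main__":
    parsed = parse_mdstat(SAMPLE)
    for name, info in parsed.items():
        print(name, info["level"], info["status"], info["flag"])
    print("degraded:", degraded(parsed))
```

On the output quoted here every array reports [UU], so the list of degraded arrays is empty; the hang is therefore not explained by a failed member.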

cat /proc/cpuinfo
processor	: 0
vendor_id	: GenuineIntel
cpu family	: 15
model		: 2
model name	: Intel(R) Pentium(R) 4 CPU 2.00GHz
stepping	: 4
cpu MHz		: 1993.980
cache size	: 512 KB
fdiv_bug	: no
hlt_bug		: no
f00f_bug	: no
coma_bug	: no
fpu		: yes
fpu_exception	: yes
cpuid level	: 2
wp		: yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm up pebs bts
bogomips	: 3987.96
clflush size	: 64
cache_alignment	: 128
address sizes	: 36 bits physical, 32 bits virtual

lspci
00:00.0 Host bridge: Intel Corporation 82845G/GL[Brookdale-G]/GE/PE DRAM Controller/Host-Hub Interface (rev 01)
00:02.0 VGA compatible controller: Intel Corporation 82845G/GL[Brookdale-G]/GE Chipset Integrated Graphics Device (rev 01)
00:1d.0 USB Controller: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) USB UHCI Controller #1 (rev 01)
00:1d.1 USB Controller: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) USB UHCI Controller #2 (rev 01)
00:1d.7 USB Controller: Intel Corporation 82801DB/DBM (ICH4/ICH4-M) USB2 EHCI Controller (rev 01)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev 81)
00:1f.0 ISA bridge: Intel Corporation 82801DB/DBL (ICH4/ICH4-L) LPC Interface Bridge (rev 01)
00:1f.1 IDE interface: Intel Corporation 82801DB (ICH4) IDE Controller (rev 01)
00:1f.3 SMBus: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) SMBus Controller (rev 01)
00:1f.5 Multimedia audio controller: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) AC'97 Audio Controller (rev 01)
05:08.0 Ethernet controller: Intel Corporation 82801DB PRO/100 VM (LOM) Ethernet Controller (rev 81)
05:09.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8139/8139C/8139C+ (rev 10)

/etc/mdadm/mdadm.conf
DEVICE partitions
CREATE owner=root group=disk mode=0660 auto=yes
HOMEHOST <system>
MAILADDR root
ARRAY /dev/md0 level=raid1 num-devices=2 UUID=00884240:e0988a25:15e30bc4:8db560df
ARRAY /dev/md1 level=raid1 num-devices=2 UUID=0246224e:9c60f949:de7ef7d6:860b05ae
ARRAY /dev/md2 level=raid1 num-devices=2 UUID=8fd5519b:03b4e7d3:bf6ff4dc:2cac279e
ARRAY /dev/md3 level=raid1 num-devices=2 UUID=64abe774:520fc0c1:1642e5bb:29471201
ARRAY /dev/md4 level=raid1 num-devices=2 UUID=fddaf625:75249f33:7b4ae07f:ad9e9a69

/etc/default/mdadm
INITRDSTART='/dev/md0'
AUTOSTART=true
AUTOCHECK=true
START_DAEMON=true
DAEMON_OPTIONS="--syslog"
VERBOSE=false

/proc/partitions
major minor  #blocks  name

   8        0  117220824 sda
   8        1    1951866 sda1
   8        2     979965 sda2
   8        3          1 sda3
   8        5     979933 sda5
   8        6    4883728 sda6
   8        7  108422653 sda7
   8       16  117220824 sdb
   8       17    1951866 sdb1
   8       18     979965 sdb2
   8       19          1 sdb3
   8       21     979933 sdb5
   8       22    4883728 sdb6
   8       23  108422653 sdb7
   9        0    1951744 md0
   9        1     979840 md1
   9        2     979840 md2
   9        3    4883648 md3
   9        4  108422528 md4
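
As a consistency check on the sizes above, here is a rough sketch that compares each partition with the array built on it. It rests on an assumption not stated in the report: that the arrays use the old 0.90 superblock format, which places the superblock in the last 64 KiB-aligned 64 KiB block of the device, so the usable size is the partition size rounded down to a multiple of 64 KiB, minus 64 KiB. The function name expected_md_size and the tables PARTS and ARRAYS are hypothetical; the numbers are copied from /proc/partitions.

```python
# Sizes in 1 KiB blocks, copied from the /proc/partitions listing above.
PARTS = {
    "sda1": 1951866,
    "sda2": 979965,
    "sda5": 979933,
    "sda6": 4883728,
    "sda7": 108422653,
}

# Array name -> (first member partition, reported array size in KiB).
ARRAYS = {
    "md0": ("sda1", 1951744),
    "md1": ("sda2", 979840),
    "md2": ("sda5", 979840),
    "md3": ("sda6", 4883648),
    "md4": ("sda7", 108422528),
}

# Assumption: 0.90 metadata, which reserves a 64 KiB-aligned 64 KiB block
# at the end of each member device.
RESERVED_KIB = 64


def expected_md_size(part_kib):
    """Usable size in KiB of a member device under the 0.90 layout."""
    return (part_kib // RESERVED_KIB) * RESERVED_KIB - RESERVED_KIB


if __name__ == "__main__":
    for md, (part, reported) in ARRAYS.items():
        expected = expected_md_size(PARTS[part])
        verdict = "match" if expected == reported else "MISMATCH"
        print(f"{md}: {part}={PARTS[part]} expected={expected} "
              f"reported={reported} {verdict}")
```

Under that assumption all five arrays come out at exactly the reported size, so the partition table and the array geometry look consistent with each other.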


Regards,

J.


