
Bug#647185: linux-2.6: kernel null pointer dereference while adding SAN path



Hi Ben!


>>> removing paths to our SAN and adding them back results in
>> [...]
>>
>> Does the attached patch help?  Instructions for building a patched
>> kernel can be found at:
>>
>> http://kernel-handbook.alioth.debian.org/ch-common-tasks.html#s-common-official
> 
> Sorry, you'll need this patch as well.

Thanks for the patches! After applying them we run into the following oops instead.
Please note that
- this only seems to happen when there is a partition on the LUN; using LVM on the
  DM device directly doesn't seem to trigger the bug
- you'll have to fix the vserver patch once the two dm patches are applied
  (not the issue here, but I wanted to mention it before you stumble over it)


[ 2000.379681] device-mapper: multipath: Failing path 8:32.
[ 2000.380022] device-mapper: multipath: Failing path 8:48.
[ 2000.381502] device-mapper: table: 254:2: multipath: error getting device
[ 2000.381533] device-mapper: ioctl: error adding target to table
[ 2000.382355] device-mapper: table: 254:2: multipath: error getting device
[ 2000.382385] device-mapper: ioctl: error adding target to table
[ 2000.411143] general protection fault: 0000 [#1] SMP 
[ 2000.411175] last sysfs file: /sys/devices/pci0000:00/0000:00:07.0/0000:0e:00.0/host0/rport-0:0-3/target0:0:3/0:0:3:0/block/sdd/uevent
[ 2000.411229] CPU 4 
[ 2000.411251] Modules linked in: 8021q garp stp ext4 jbd2 crc16 dm_round_robin dm_multipath scsi_dh bonding ipmi_devintf ipmi_si ipmi_msghandler ohci_hcd snd_pcm snd_timer radeon snd ttm soundcore drm_kms_helper drm i2c_algo_bit i2c_core hpilo snd_page_alloc hpwdt joydev pcspkr evdev button power_meter processor container psmouse serio_raw ext3 jbd mbcache dm_mod sd_mod crc_t10dif sg sr_mod cdrom ata_generic usbhid hid qla2xxx hpsa scsi_transport_fc uhci_hcd thermal scsi_tgt ata_piix ehci_hcd libata bnx2 qlcnic usbcore cciss nls_base scsi_mod thermal_sys [last unloaded: scsi_wait_scan]
[ 2000.411579] Pid: 8402, comm: multipath Not tainted 2.6.32-5-amd64 #1 ProLiant DL380 G7
[ 2000.411623] RIP: 0010:[<ffffffff8117629b>]  [<ffffffff8117629b>] elv_drain_elevator+0x13/0x5a
[ 2000.411674] RSP: 0018:ffff880e1b2cfd18  EFLAGS: 00010002
[ 2000.411700] RAX: ffff880719b0cd80 RBX: ffff880719a291a0 RCX: 0000000000000000
[ 2000.411729] RDX: 0000000000000002 RSI: 0000000000000001 RDI: ffff880719a291a0
[ 2000.411758] RBP: ffff880719a291a0 R08: ffff88071a65be70 R09: ffff88071a701840
[ 2000.411787] R10: 0000000100067c84 R11: ffff880713a8a780 R12: ffff880719a291a0
[ 2000.411816] R13: 0000000000000002 R14: ffff880719707160 R15: ffff880719707044
[ 2000.411845] FS:  00007f3b1d07a7a0(0000) GS:ffff88001a440000(0000) knlGS:0000000000000000
[ 2000.411889] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 2000.411916] CR2: 00000000025ea210 CR3: 0000000e1a53f000 CR4: 00000000000006e0
[ 2000.411945] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 2000.411974] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 2000.418573] Process multipath (pid: 8402, threadinfo ffff880e1b2ce000, task ffff880e1b93e9f0)
[ 2000.418618] Stack:
[ 2000.418637]  ffff880248cc0018 ffffffff81176c40 ffff880248cc0018 ffff880248cc0018
[ 2000.418674] <0> ffff880719a291a0 0000000000000096 ffff880719707000 ffffffff8117dec9
[ 2000.418726] <0> ffff880248cc0018 ffffc9000ca8b040 ffff880719a2b4e0 ffffffffa019492b
[ 2000.418795] Call Trace:
[ 2000.418818]  [<ffffffff81176c40>] ? elv_insert+0x91/0x260
[ 2000.418847]  [<ffffffff8117dec9>] ? blk_insert_cloned_request+0x4f/0x67
[ 2000.418879]  [<ffffffffa019492b>] ? dm_dispatch_request+0x33/0x59 [dm_mod]
[ 2000.418912]  [<ffffffffa0195ef7>] ? dm_request_fn+0x121/0x1a2 [dm_mod]
[ 2000.418941]  [<ffffffff8117eef6>] ? __blk_run_queue+0x35/0x66
[ 2000.418970]  [<ffffffffa0194a43>] ? dm_resume+0xb5/0x123 [dm_mod]
[ 2000.419001]  [<ffffffffa0199071>] ? dev_suspend+0x0/0x196 [dm_mod]
[ 2000.419032]  [<ffffffffa01991d0>] ? dev_suspend+0x15f/0x196 [dm_mod]
[ 2000.419063]  [<ffffffffa0199c24>] ? ctl_ioctl+0x1c6/0x20e [dm_mod]
[ 2000.419092]  [<ffffffffa0199c7a>] ? dm_ctl_ioctl+0xe/0x12 [dm_mod]
[ 2000.419124]  [<ffffffff810fab66>] ? vfs_ioctl+0x21/0x6c
[ 2000.419150]  [<ffffffff810fb0b4>] ? do_vfs_ioctl+0x48d/0x4cb
[ 2000.419178]  [<ffffffff810d066d>] ? remove_vma+0x6b/0x72
[ 2000.419205]  [<ffffffff810d1782>] ? do_munmap+0x307/0x329
[ 2000.419231]  [<ffffffff810fb143>] ? sys_ioctl+0x51/0x70
[ 2000.419258]  [<ffffffff81010b42>] ? system_call_fastpath+0x16/0x1b
[ 2000.419285] Code: 41 0f 18 09 75 bb 48 8b 02 48 89 70 08 48 89 06 48 89 56 08 48 89 32 c3 53 48 89 fb 48 8b 43 18 be 01 00 00 00 48 89 df 48 8b 00 <ff> 50 20 85 c0 75 ea 8b 8b b0 03 00 00 85 c9 74 34 8b 15 ca ea 
[ 2000.419478] RIP  [<ffffffff8117629b>] elv_drain_elevator+0x13/0x5a
[ 2000.419507]  RSP <ffff880e1b2cfd18>
[ 2000.419759] ---[ end trace cff8452e221a0978 ]---
[ 2130.432222] qla2xxx 0000:0e:00.0: LIP reset occurred (f700).
[ 2130.647148] qla2xxx 0000:0e:00.1: LOOP DOWN detected (2 3 0).
[ 2130.808220] qla2xxx 0000:0e:00.0: LIP occurred (f700).
[ 2130.808342] qla2xxx 0000:0e:00.0: LIP reset occurred (f7f7).
[ 2130.857976] qla2xxx 0000:0e:00.0: LOOP UP detected (8 Gbps).
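
In case it helps when comparing this oops with the earlier one, here is a minimal sketch for pulling the function names out of the "Call Trace:" section of a saved log. The helper name `call_trace_frames` and the regular expression are only illustrative assumptions based on the line format shown above; this is not an existing tool.

```python
import re

# Matches call-trace entries in the format seen in the log above, e.g.
#   [ 2000.418818]  [<ffffffff81176c40>] ? elv_insert+0x91/0x260
#   [ 2000.418879]  [<ffffffffa019492b>] ? dm_dispatch_request+0x33/0x59 [dm_mod]
# The pattern is an assumption derived from those lines only.
FRAME_RE = re.compile(
    r"\[<(?P<addr>[0-9a-f]{16})>\]\s+(?:\?\s+)?"
    r"(?P<sym>[A-Za-z0-9_.]+)\+0x(?P<off>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
    r"(?:\s+\[(?P<mod>[A-Za-z0-9_]+)\])?"
)


def call_trace_frames(log_text):
    """Return a list of (symbol, module) pairs for the frames that follow
    the first 'Call Trace:' line. module is None for frames without a
    trailing [module] tag."""
    frames = []
    in_trace = False
    for line in log_text.splitlines():
        if not in_trace:
            # Skip everything (registers, stack dump) before the trace starts.
            if "Call Trace:" in line:
                in_trace = True
            continue
        match = FRAME_RE.search(line)
        if match is None:
            # The first line that is not a frame (here the "Code:" line)
            # ends the trace.
            break
        frames.append((match.group("sym"), match.group("mod")))
    return frames
```

Fed the oops above, this should return `elv_insert` as the first frame and `system_call_fastpath` as the last, with the dm_mod frames tagged accordingly.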


Cheers,

Bernd

-- 
Kind regards,

-------------------------------------------------
Bernd Zeimetz
Systems Engineer

conova communications GmbH

web   |  www.conova.com
mail  |  b.zeimetz@conova.com

ZENTRALE SALZBURG
Karolingerstraße 36A
A - 5020 Salzburg

tel   |  +43/(0)662 2200-313
fax   |  +43/(0)662 2200-209
------------------------------------------------


