[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#587915: marked as done (ocfs2: inode errors when exporting via nfs)



Your message dated Sun, 2 Dec 2012 13:29:23 -0800
with message-id <20121202212922.GA16024@elie.Belkin>
and subject line Re: ocfs2: inode errors when exporting via nfs
has caused the Debian Bug report #587915,
regarding ocfs2: inode errors when exporting via nfs
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact owner@bugs.debian.org
immediately.)


-- 
587915: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=587915
Debian Bug Tracking System
Contact owner@bugs.debian.org with problems
--- Begin Message ---
Package: linux-modules-2.6.26-2-xen-amd64
Version: 2.6.26-24
Severity: critical
Justification: breaks the whole system


We're using a two node cluster with drbd and ocfs2 as filesystem, exported 
via NFS to some webnodes.  The nfs clients use the nordirplus mount option.
We had the reboot one of the cluster nodes due to the following kernel BUG.  
Looks like debian bug #506950 to me.  Upstream has a fix:

http://www.mail-archive.com/ocfs2-users@oss.oracle.com/msg03855.html

Here's the BUG:

[140488.568363] (5423,0):ocfs2_inode_lock_update:1977 ERROR: bug expression: inode->i_generation != le32_to_cpu(fe->i_generation)
[140488.568422] (5423,0):ocfs2_inode_lock_update:1977 ERROR: Invalid dinode 130037 disk generation: 3505630863 inode->i_generation: 3505492205
[140488.568500] ------------[ cut here ]------------
[140488.568527] kernel BUG at fs/ocfs2/dlmglue.c:1977!
[140488.568553] invalid opcode: 0000 [1] SMP 
[140488.568595] CPU 0 
[140488.570872] Modules linked in: ipmi_watchdog ipmi_devintf ocfs2 sha1_generic drbd cn xt_physdev bridge netloop ipt_REJECT ipt_LOG xt_tcpudp nf_conntrack_ipv4 nf_conntrack iptable_filter ip_tables x_tables ipv6 ocfs2_dlmfs ocfs2_stack_o2cb ocfs2_dlm ocfs2_nodemanager ocfs2_stackglue
 configfs nfsd auth_rpcgss exportfs nfs lockd nfs_acl sunrpc ext2 loop ipmi_si ipmi_msghandler hpilo psmouse serio_raw pcspkr container button evdev ext3 jbd mbcache dm_mirror dm_log dm_snapshot dm_mod sg sr_mod cdrom ide_pci_generic ide_core ata_generic usbhid hid ff_memless ata_piix libata dock ehci_hcd bnx2 fir
mware_class uhci_hcd e1000e cciss scsi_mod thermal processor fan thermal_sys [last unloaded: scsi_wait_scan]
[140488.571316] Pid: 5423, comm: nfsd Not tainted 2.6.26-2-xen-amd64 #1
[140488.571345] RIP: e030:[<ffffffffa04258d7>]  [<ffffffffa04258d7>] :ocfs2:ocfs2_inode_lock_full+0x8f5/0xd88
[140488.571409] RSP: e02b:ffff880b43c51cf0  EFLAGS: 00010286
[140488.571436] RAX: 0000000000000092 RBX: ffff88083ade5000 RCX: 0000565600013ef9
[140488.571481] RDX: ffffffffff5f7000 RSI: 0000000000000001 RDI: ffffffff805aaab0
[140488.571526] RBP: ffff880b37c854a8 R08: 000000000000008f R09: ffff880b43c51800
[140488.571571] R10: 0000000000000000 R11: 00000143f7bb211e R12: 0000000100080000
[140488.571616] R13: ffff880b37c850c0 R14: 0000000000000000 R15: ffff880b37c851d8
[140488.571663] FS:  00007f664454a6e0(0000) GS:ffffffff8053a000(0000) knlGS:0000000000000000
[140488.571710] CS:  e033 DS: 0000 ES: 0000
[140488.571734] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[140488.571780] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[140488.571825] Process nfsd (pid: 5423, threadinfo ffff880b43c50000, task ffff880fbe57f940)
[140488.571872] Stack:  ffff880bd0f3ae8f ffff880bd0f190ed 000000003b0bf568 0000000000000000
[140488.571951]  0000000100000000 ffff880b3b0b7490 ffff8807f09fd3a0 ffff880b37c854a8
[140488.572006]  ffff880b37c85410 ffff880b43c51db0 ffff880fbe83b800 ffff880b43b38000
[140488.572044] Call Trace:
[140488.572094]  [<ffffffffa043458a>] ? :ocfs2:ocfs2_inode_revalidate+0x171/0x24b
[140488.572148]  [<ffffffffa04300e6>] ? :ocfs2:ocfs2_getattr+0x78/0x167
[140488.572186]  [<ffffffffa028b697>] ? :nfsd:encode_post_op_attr+0x3e/0x84
[140488.572220]  [<ffffffff8029ac98>] ? dput+0x21/0x13e
[140488.572253]  [<ffffffffa0284239>] ? :nfsd:nfsd_lookup+0xa1/0xae
[140488.572291]  [<ffffffffa028ca79>] ? :nfsd:nfs3svc_encode_diropres+0x32/0x62
[140488.572327]  [<ffffffffa027f2cf>] ? :nfsd:nfsd_dispatch+0x168/0x1b6
[140488.572370]  [<ffffffffa01f08e9>] ? :sunrpc:svc_process+0x405/0x6e4
[140488.572402]  [<ffffffff804365d6>] ? __down_read+0x15/0x110
[140488.572436]  [<ffffffffa027f67c>] ? :nfsd:nfsd+0x0/0x2f0
[140488.572469]  [<ffffffffa027f837>] ? :nfsd:nfsd+0x1bb/0x2f0
[140488.572499]  [<ffffffff802282ec>] ? schedule_tail+0x27/0x5c
[140488.572529]  [<ffffffff8020be28>] ? child_rip+0xa/0x12
[140488.572561]  [<ffffffffa027f67c>] ? :nfsd:nfsd+0x0/0x2f0
[140488.572594]  [<ffffffffa027f67c>] ? :nfsd:nfsd+0x0/0x2f0
[140488.572628]  [<ffffffffa027f67c>] ? :nfsd:nfsd+0x0/0x2f0
[140488.572656]  [<ffffffff8020be1e>] ? child_rip+0x0/0x12
[140488.572684] 
[140488.572704] 
[140488.572724] Code: 14 25 24 00 00 00 48 c7 c1 e0 06 45 a0 89 d2 48 c7 c7 df 5b 45 a0 89 44 24 08 8b 43 08 89 04 24 4d 8b 4d 00 31 c0 e8 3f 7d e0 df <0f> 0b eb fe 48 83 7b 48 00 75 0a f6 43 2c 01 0f 85 b5 00 00 00 
[140488.572979] RIP  [<ffffffffa04258d7>] :ocfs2:ocfs2_inode_lock_full+0x8f5/0xd88
[140488.573050]  RSP <ffff880b43c51cf0>
[140488.573428] ---[ end trace 5eface76cbedf781 ]---




-- System Information:
Debian Release: 5.0.5
  APT prefers stable
  APT policy: (500, 'stable')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.26-2-xen-amd64 (SMP w/16 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages linux-modules-2.6.26-2-xen-amd64 depends on:
ii  module-init-tools             3.4-1      tools for managing Linux kernel mo

linux-modules-2.6.26-2-xen-amd64 recommends no packages.

linux-modules-2.6.26-2-xen-amd64 suggests no packages.

-- no debconf information



--- End Message ---
--- Begin Message ---
Version: 2.6.30-1
unarchive 506950
reopen 506950
reassign 506950 src:linux-2.6 2.6.26-10
merge 587915 506950
quit

Hi,

In 2010, Joerg Schumacher wrote:

> We're using a two node cluster with drbd and ocfs2 as filesystem, exported 
> via NFS to some webnodes.  The nfs clients use the nordirplus mount option.
> We had the reboot one of the cluster nodes due to the following kernel BUG.  
> Looks like debian bug #506950 to me.

Sorry for the very slow response.  I agree that they look the same.

>                                       Upstream has a fix:
>
> http://www.mail-archive.com/ocfs2-users@oss.oracle.com/msg03855.html

Thanks for digging that up.  For reference, that's

  6ca497a83e59 ocfs2: fix rare stale inode errors when exporting via nfs

which was part of 2.6.30-rc1.  Closing optimistically.  If you learn
more, please don't hesitate to let us know.

Ciao,
Jonathan

--- End Message ---

Reply to: