[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#617666: nfs-kernel-server: Periodic nfsd failure - single nfsd process with high CPU and no mounts working



Package: nfs-kernel-server
Version: 1:1.2.2-4
Severity: grave
Justification: renders package unusable


Hi there,

appologies if this has already been reported but I couldn't see anything quite matching what I'm seeing.

I have a 26TB debian squeeze fileserver providing NFS mounts to a large number of users.  The system has been working flawlessly for a number of months but twice in the last week NFS seems to have crashed.  The first thing I noticed is that users reported being unable to access shares.  Logging into the system I see a single nfsd process taking 100% CPU with a very long run time.  Restarting nfs-kernel-server has no effect.  The process is unkillable (even with -9) and the system has required a reboot to get it usable again.  jnettop is not showing significant network traffic and lsof on /export/ (where all my NFS exports are located) shows no nfs access to any files.  

Please let me know if you need any further information.  I am going to reboot the server now, so I may not be able to reproduce the problem straight away (but as its happened twice, I am quite sure it will happen again at some point...).

Thanks in advance for your help.

Dan Tomlinson

My /etc/exports file is below:


# /etc/exports: the access control list for filesystems which may be exported
#		to NFS clients.  See exports(5).
#
# Example for NFSv2 and NFSv3:
# /srv/homes       hostname1(no_subtree_check,rw,sync,no_subtree_check) hostname2(ro,sync,no_subtree_check)
#
# Example for NFSv4:
#

# misc shares
/export/software		192.168.32.0/24(no_subtree_check,rw,sync,root_squash,insecure) 192.168.128.0/24(no_subtree_check,rw,sync,root_squash,insecure)
/export/system_tools		192.168.32.0/24(no_subtree_check,rw,sync,no_root_squash,insecure) 192.168.128.0/24(no_subtree_check,rw,sync,no_root_squash,insecure)
/export/home		192.168.32.0/24(no_subtree_check,rw,sync,root_squash,insecure) 192.168.128.0/24(no_subtree_check,rw,sync,no_root_squash,insecure)

# flychip shares
/export/flychip/archives	192.168.32.0/24(no_subtree_check,rw,sync,root_squash,insecure) 192.168.128.0/24(no_subtree_check,rw,sync,root_squash,insecure)
/export/flychip/misc		192.168.32.0/24(no_subtree_check,rw,sync,root_squash,insecure) 192.168.128.0/24(no_subtree_check,rw,sync,root_squash,insecure)
/export/flychip/production	192.168.32.0/24(no_subtree_check,rw,sync,root_squash,insecure) 192.168.128.0/24(no_subtree_check,rw,sync,root_squash,insecure)
/export/flychip/share		192.168.32.0/24(no_subtree_check,rw,sync,root_squash,insecure) 192.168.128.0/24(no_subtree_check,rw,sync,root_squash,insecure)
/export/flychip/temp		192.168.32.0/24(no_subtree_check,rw,sync,root_squash,insecure) 192.168.128.0/24(no_subtree_check,rw,sync,root_squash,insecure)

# mickelm shares
/export/micklem/releases		192.168.32.0/24(no_subtree_check,rw,sync,no_root_squash,insecure) 192.168.128.0/24(no_subtree_check,rw,sync,no_root_squash,insecure)
/export/micklem/data		192.168.32.0/24(no_subtree_check,rw,sync,no_root_squash,insecure) 192.168.128.0/24(no_subtree_check,rw,sync,no_root_squash,insecure)

# logic shares
/export/logic/data		192.168.32.0/24(no_subtree_check,rw,sync,no_root_squash,insecure) 192.168.128.0/24(no_subtree_check,rw,sync,no_root_squash,insecure)
/export/logic/webdav		192.168.32.0/24(no_subtree_check,rw,sync,no_root_squash,insecure) 192.168.128.0/24(no_subtree_check,rw,sync,no_root_squash,insecure)



-- System Information:
Debian Release: 6.0
  APT prefers stable
  APT policy: (500, 'stable')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.32-5-amd64 (SMP w/16 CPU cores)
Locale: LANG=en_GB.UTF-8, LC_CTYPE=en_GB.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages nfs-kernel-server depends on:
ii  libblkid1               2.17.2-9         block device id library
ii  libc6                   2.11.2-10        Embedded GNU C Library: Shared lib
ii  libcomerr2              1.41.12-2        common error description library
ii  libgssapi-krb5-2        1.8.3+dfsg-4     MIT Kerberos runtime libraries - k
ii  libgssglue1             0.1-4            mechanism-switch gssapi library
ii  libk5crypto3            1.8.3+dfsg-4     MIT Kerberos runtime libraries - C
ii  libkrb5-3               1.8.3+dfsg-4     MIT Kerberos runtime libraries
ii  libnfsidmap2            0.23-2           An nfs idmapping library
ii  librpcsecgss3           0.19-2           allows secure rpc communication us
ii  libwrap0                7.6.q-19         Wietse Venema's TCP wrappers libra
ii  lsb-base                3.2-23.2squeeze1 Linux Standard Base 3.2 init scrip
ii  nfs-common              1:1.2.2-4        NFS support files common to client
ii  ucf                     3.0025+nmu1      Update Configuration File: preserv

nfs-kernel-server recommends no packages.

nfs-kernel-server suggests no packages.

-- no debconf information



Reply to: