2.6.10: NFS/RPC related Kernel Oops... (with ksymoops output)
Hi, it seems I had to run the oops through ksymoops to make this report
useful, so here is the ksymoops output (produced after the reboot):
Mar 6 13:38:10 P2 kernel: Unable to handle kernel paging request at virtual
address ff2fd414
Mar 6 13:38:10 P2 kernel: c01b30ae
Mar 6 13:38:10 P2 kernel: *pde = 00000000
Mar 6 13:38:10 P2 kernel: Oops: 0002 [#1]
Mar 6 13:38:10 P2 kernel: CPU: 0
Mar 6 13:38:10 P2 kernel: EIP: 0060:[nfs3_xdr_writeres+142/224] Not
tainted VLI
Mar 6 13:38:10 P2 kernel: EFLAGS: 00010246 (2.6.10p2)
Mar 6 13:38:10 P2 kernel: eax: 00000000 ebx: d74b90e0 ecx: c456f500
edx:ff2fd414
Mar 6 13:38:10 P2 kernel: esi: c456f4a4 edi: c62fd384 ebp: 00000000
esp:d79caea4
Mar 6 13:38:10 P2 kernel: ds: 007b es: 007b ss: 0068
Mar 6 13:38:10 P2 kernel: Stack: c456f4ac c62fd384 d74b90e0 c62fd2e4
c62fd470 c456f4a0 c02a92ea d7bde0ac
Mar 6 13:38:10 P2 kernel: c456f4a0 c62fd470 00000000 c02a52e0
c62fd2e4 d7bde0f8 d7bde138 d7bde0ac
Mar 6 13:38:10 P2 kernel: c02a34c2 c62fd2e4 c01b3020 d7bde0ac
c456f4a0 c62fd470 c01b3020 c62fd2e4
Mar 6 13:38:10 P2 kernel: Call Trace:
Warning (Oops_read): Code line not seen, dumping what data is available
>>ebx; d74b90e0 <pg0+170f50e0/3fc3a400>
>>ecx; c456f500 <pg0+41ab500/3fc3a400>
>>edx; ff2fd414 <pg0+3ef39414/3fc3a400>
>>esi; c456f4a4 <pg0+41ab4a4/3fc3a400>
>>edi; c62fd384 <pg0+5f39384/3fc3a400>
>>esp; d79caea4 <pg0+17606ea4/3fc3a400>
Mar 6 13:38:10 P2 kernel: Code: 47 18 8d 46 1c 89 c1 8b 00 83 c1 04 85 c0
75 52 85 ed 75 42 8b 54 24 24 8b 01 0f c8 89 42 08 8b 44 24 24 8b 50 04 8b
41 04 0f c8 <89> 02 8b 44 24 24 8b 50 04 8b 41 08 89 42 04 8b 44 24 24 8b
50
Code; ffffffd5 <__kernel_rt_sigreturn+1b95/????>
00000000 <_EIP>:
Code; ffffffd5 <__kernel_rt_sigreturn+1b95/????>
0: 47 inc %edi
Code; ffffffd6 <__kernel_rt_sigreturn+1b96/????>
1: 18 8d 46 1c 89 c1 sbb %cl,0xc1891c46(%ebp)
Code; ffffffdc <__kernel_rt_sigreturn+1b9c/????>
7: 8b 00 mov (%eax),%eax
Code; ffffffde <__kernel_rt_sigreturn+1b9e/????>
9: 83 c1 04 add $0x4,%ecx
Code; ffffffe1 <__kernel_rt_sigreturn+1ba1/????>
c: 85 c0 test %eax,%eax
Code; ffffffe3 <__kernel_rt_sigreturn+1ba3/????>
e: 75 52 jne 62 <_EIP+0x62>
Code; ffffffe5 <__kernel_rt_sigreturn+1ba5/????>
10: 85 ed test %ebp,%ebp
Code; ffffffe7 <__kernel_rt_sigreturn+1ba7/????>
12: 75 42 jne 56 <_EIP+0x56>
Code; ffffffe9 <__kernel_rt_sigreturn+1ba9/????>
14: 8b 54 24 24 mov 0x24(%esp),%edx
Code; ffffffed <__kernel_rt_sigreturn+1bad/????>
18: 8b 01 mov (%ecx),%eax
Code; ffffffef <__kernel_rt_sigreturn+1baf/????>
1a: 0f c8 bswap %eax
Code; fffffff1 <__kernel_rt_sigreturn+1bb1/????>
1c: 89 42 08 mov %eax,0x8(%edx)
Code; fffffff4 <__kernel_rt_sigreturn+1bb4/????>
1f: 8b 44 24 24 mov 0x24(%esp),%eax
Code; fffffff8 <__kernel_rt_sigreturn+1bb8/????>
23: 8b 50 04 mov 0x4(%eax),%edx
Code; fffffffb <__kernel_rt_sigreturn+1bbb/????>
26: 8b 41 04 mov 0x4(%ecx),%eax
Code; fffffffe <__kernel_rt_sigreturn+1bbe/????>
29: 0f c8 bswap %eax
Code; 00000000 Before first symbol
2b: 89 02 mov %eax,(%edx)
Code; 00000002 Before first symbol
2d: 8b 44 24 24 mov 0x24(%esp),%eax
Code; 00000006 Before first symbol
31: 8b 50 04 mov 0x4(%eax),%edx
Code; 00000009 Before first symbol
34: 8b 41 08 mov 0x8(%ecx),%eax
Code; 0000000c Before first symbol
37: 89 42 04 mov %eax,0x4(%edx)
Code; 0000000f Before first symbol
3a: 8b 44 24 24 mov 0x24(%esp),%eax
Code; 00000013 Before first symbol
3e: 8b .byte 0x8b
Code; 00000014 Before first symbol
3f: 50 push %eax
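
For readers following the dump: the "Oops: 0002" value is the i386 page-fault error code, and the instruction marked <89> in the Code line decodes above to "mov %eax,(%edx)", with edx holding the faulting address ff2fd414. The snippet below is a minimal sketch of how that code can be read, assuming the standard x86 bit layout (bit 0 = protection fault vs. page not present, bit 1 = write vs. read, bit 2 = user vs. kernel mode). The function name decode_oops_code is made up for illustration and is not part of any kernel tool.

```python
# Sketch: decode the page-fault error code printed as "Oops: 0002".
# Assumes the standard x86 page-fault error code bits; the helper name
# is hypothetical, not taken from the kernel or from ksymoops.

def decode_oops_code(code: int) -> dict:
    """Return a human-readable view of the low three error-code bits."""
    return {
        "cause": "protection violation" if code & 0x1 else "page not present",
        "access": "write" if code & 0x2 else "read",
        "mode": "user" if code & 0x4 else "kernel",
    }


if __name__ == "__main__":
    # The oops prints the code in hexadecimal, so parse it as base 16.
    info = decode_oops_code(int("0002", 16))
    for key, value in info.items():
        print(f"{key}: {value}")
```

Read this way, the code describes a kernel-mode write to an unmapped page, which agrees with the store through edx and with "*pde = 00000000" in the log.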
> Hi,
> a few times per month, I get the following Kernel Oops on my workstation
> (an NFS client):
>
> Mar 6 13:38:10 P2 kernel: Unable to handle kernel paging request at
> virtual address ff2fd414
> Mar 6 13:38:10 P2 kernel: printing eip:
> Mar 6 13:38:10 P2 kernel: c01b30ae
> Mar 6 13:38:10 P2 kernel: *pde = 00000000
> Mar 6 13:38:10 P2 kernel: Oops: 0002 [#1]
> Mar 6 13:38:10 P2 kernel: PREEMPT
> Mar 6 13:38:10 P2 kernel: Modules linked in:
> Mar 6 13:38:10 P2 kernel: CPU: 0
> Mar 6 13:38:10 P2 kernel: EIP: 0060:[nfs3_xdr_writeres+142/224] Not
> tainted VLI
> Mar 6 13:38:10 P2 kernel: EFLAGS: 00010246 (2.6.10p2)
> Mar 6 13:38:10 P2 kernel: EIP is at nfs3_xdr_writeres+0x8e/0xe0
> Mar 6 13:38:10 P2 kernel: eax: 00000000 ebx: d74b90e0 ecx: c456f500
> edx:ff2fd414
> Mar 6 13:38:10 P2 kernel: esi: c456f4a4 edi: c62fd384 ebp: 00000000
> esp:d79caea4
> Mar 6 13:38:10 P2 kernel: ds: 007b es: 007b ss: 0068
> Mar 6 13:38:10 P2 kernel: Process rpciod (pid: 206, threadinfo=d79ca000
> task=d7b970e0)
> Mar 6 13:38:10 P2 kernel: Stack: c456f4ac c62fd384 d74b90e0 c62fd2e4
> c62fd470 c456f4a0 c02a92ea d7bde0ac
> Mar 6 13:38:10 P2 kernel: c456f4a0 c62fd470 00000000 c02a52e0
> c62fd2e4 d7bde0f8 d7bde138 d7bde0ac
> Mar 6 13:38:10 P2 kernel: c02a34c2 c62fd2e4 c01b3020 d7bde0ac
> c456f4a0 c62fd470 c01b3020 c62fd2e4
> Mar 6 13:38:10 P2 kernel: Call Trace:
> Mar 6 13:38:10 P2 kernel: [rpcauth_unwrap_resp+122/176]
> rpcauth_unwrap_resp+0x7a/0xb0
> Mar 6 13:38:10 P2 kernel: [xprt_timer+0/192] xprt_timer+0x0/0xc0
> Mar 6 13:38:10 P2 kernel: [call_decode+274/560] call_decode+0x112/0x230
> Mar 6 13:38:10 P2 kernel: [nfs3_xdr_writeres+0/224]
> nfs3_xdr_writeres+0x0/0xe0
> Mar 6 13:38:10 P2 kernel: [nfs3_xdr_writeres+0/224]
> nfs3_xdr_writeres+0x0/0xe0
> Mar 6 13:38:10 P2 kernel: [__rpc_execute+738/1008]
> __rpc_execute+0x2e2/0x3f0
> Mar 6 13:38:10 P2 kernel: [recalc_task_prio+143/400]
> recalc_task_prio+0x8f/0x190
> Mar 6 13:38:10 P2 kernel: [schedule+799/1344] schedule+0x31f/0x540
> Mar 6 13:38:10 P2 kernel: [__rpc_schedule+77/208]
> __rpc_schedule+0x4d/0xd0
> Mar 6 13:38:10 P2 kernel: [rpciod+213/752] rpciod+0xd5/0x2f0
> Mar 6 13:38:10 P2 kernel: [autoremove_wake_function+0/96]
> autoremove_wake_function+0x0/0x60
> Mar 6 13:38:10 P2 kernel: [ret_from_fork+6/20] ret_from_fork+0x6/0x14
> Mar 6 13:38:10 P2 kernel: [autoremove_wake_function+0/96]
> autoremove_wake_function+0x0/0x60
> Mar 6 13:38:10 P2 kernel: [rpciod+0/752] rpciod+0x0/0x2f0
> Mar 6 13:38:10 P2 kernel: [kernel_thread_helper+5/20]
> kernel_thread_helper+0x5/0x14
> Mar 6 13:38:10 P2 kernel: Code: 47 18 8d 46 1c 89 c1 8b 00 83 c1 04 85 c0
> 75 52 85 ed 75 42 8b 54 24 24 8b 01 0f c8 89 42 08 8b 44 24 24 8b 50 04 8b
> 41 04 0f c8 <89> 02 8b 44 24 24 8b 50 04 8b 41 08 89 42 04 8b 44 24 24 8b
> 50
>
>
> Here's my 2.6.10 kernel config:
>
> cat /boot/config-2.6.10p2 | grep -v "^#"
>
> CONFIG_X86=y
> CONFIG_MMU=y
> CONFIG_UID16=y
> CONFIG_GENERIC_ISA_DMA=y
> CONFIG_GENERIC_IOMAP=y
>
> CONFIG_EXPERIMENTAL=y
> CONFIG_CLEAN_COMPILE=y
> CONFIG_BROKEN_ON_SMP=y
> CONFIG_LOCK_KERNEL=y
>
> CONFIG_LOCALVERSION=""
> CONFIG_SWAP=y
> CONFIG_SYSVIPC=y
> CONFIG_SYSCTL=y
> CONFIG_LOG_BUF_SHIFT=14
> CONFIG_KOBJECT_UEVENT=y
> CONFIG_KALLSYMS=y
> CONFIG_FUTEX=y
> CONFIG_EPOLL=y
> CONFIG_SHMEM=y
> CONFIG_CC_ALIGN_FUNCTIONS=0
> CONFIG_CC_ALIGN_LABELS=0
> CONFIG_CC_ALIGN_LOOPS=0
> CONFIG_CC_ALIGN_JUMPS=0
>
> CONFIG_MODULES=y
> CONFIG_MODULE_UNLOAD=y
> CONFIG_MODULE_FORCE_UNLOAD=y
> CONFIG_OBSOLETE_MODPARM=y
> CONFIG_KMOD=y
>
> CONFIG_X86_PC=y
> CONFIG_MK7=y
> CONFIG_X86_CMPXCHG=y
> CONFIG_X86_XADD=y
> CONFIG_X86_L1_CACHE_SHIFT=6
> CONFIG_RWSEM_XCHGADD_ALGORITHM=y
> CONFIG_X86_WP_WORKS_OK=y
> CONFIG_X86_INVLPG=y
> CONFIG_X86_BSWAP=y
> CONFIG_X86_POPAD_OK=y
> CONFIG_X86_GOOD_APIC=y
> CONFIG_X86_INTEL_USERCOPY=y
> CONFIG_X86_USE_PPRO_CHECKSUM=y
> CONFIG_X86_USE_3DNOW=y
> CONFIG_PREEMPT=y
> CONFIG_X86_UP_APIC=y
> CONFIG_X86_UP_IOAPIC=y
> CONFIG_X86_LOCAL_APIC=y
> CONFIG_X86_IO_APIC=y
> CONFIG_X86_TSC=y
> CONFIG_X86_MCE=y
> CONFIG_X86_MCE_NONFATAL=y
>
> CONFIG_NOHIGHMEM=y
> CONFIG_MTRR=y
> CONFIG_HAVE_DEC_LOCK=y
>
> CONFIG_PM=y
>
> CONFIG_ACPI_BLACKLIST_YEAR=0
>
> CONFIG_APM=y
> CONFIG_APM_DO_ENABLE=y
> CONFIG_APM_CPU_IDLE=y
> CONFIG_APM_RTC_IS_GMT=y
>
> CONFIG_PCI=y
> CONFIG_PCI_GOANY=y
> CONFIG_PCI_BIOS=y
> CONFIG_PCI_DIRECT=y
> CONFIG_PCI_NAMES=y
>
> CONFIG_BINFMT_ELF=y
>
> CONFIG_STANDALONE=y
>
> CONFIG_BLK_DEV_RAM_COUNT=16
> CONFIG_INITRAMFS_SOURCE=""
> CONFIG_CDROM_PKTCDVD=m
> CONFIG_CDROM_PKTCDVD_BUFFERS=8
>
> CONFIG_IOSCHED_NOOP=y
> CONFIG_IOSCHED_AS=y
> CONFIG_IOSCHED_DEADLINE=y
> CONFIG_IOSCHED_CFQ=y
>
> CONFIG_IDE=y
> CONFIG_BLK_DEV_IDE=y
>
> CONFIG_BLK_DEV_IDEDISK=y
> CONFIG_IDEDISK_MULTI_MODE=y
> CONFIG_BLK_DEV_IDECD=y
>
> CONFIG_BLK_DEV_IDEPCI=y
> CONFIG_BLK_DEV_IDEDMA_PCI=y
> CONFIG_IDEDMA_PCI_AUTO=y
> CONFIG_BLK_DEV_VIA82CXXX=y
> CONFIG_BLK_DEV_IDEDMA=y
> CONFIG_IDEDMA_AUTO=y
>
> CONFIG_NET=y
>
> CONFIG_PACKET=y
> CONFIG_UNIX=y
> CONFIG_INET=y
>
> CONFIG_NETDEVICES=y
>
> CONFIG_NET_ETHERNET=y
> CONFIG_MII=y
>
> CONFIG_NET_PCI=y
> CONFIG_8139TOO=y
>
> CONFIG_INPUT=y
>
> CONFIG_INPUT_MOUSEDEV=y
> CONFIG_INPUT_MOUSEDEV_SCREEN_X=1280
> CONFIG_INPUT_MOUSEDEV_SCREEN_Y=960
>
> CONFIG_SOUND_GAMEPORT=y
> CONFIG_SERIO=y
> CONFIG_SERIO_I8042=y
> CONFIG_SERIO_SERPORT=y
>
> CONFIG_INPUT_KEYBOARD=y
> CONFIG_KEYBOARD_ATKBD=y
> CONFIG_INPUT_MOUSE=y
> CONFIG_MOUSE_PS2=y
>
> CONFIG_VT=y
> CONFIG_VT_CONSOLE=y
> CONFIG_HW_CONSOLE=y
>
> CONFIG_SERIAL_8250=y
> CONFIG_SERIAL_8250_NR_UARTS=4
>
> CONFIG_SERIAL_CORE=y
> CONFIG_UNIX98_PTYS=y
>
> CONFIG_VGA_CONSOLE=y
> CONFIG_DUMMY_CONSOLE=y
>
> CONFIG_SOUND=y
>
> CONFIG_SND=y
> CONFIG_SND_TIMER=y
> CONFIG_SND_PCM=y
> CONFIG_SND_RAWMIDI=y
>
> CONFIG_SND_MPU401_UART=y
>
> CONFIG_SND_AC97_CODEC=y
> CONFIG_SND_ICE1712=y
>
> CONFIG_USB_ARCH_HAS_HCD=y
> CONFIG_USB_ARCH_HAS_OHCI=y
>
> CONFIG_EXT3_FS=y
> CONFIG_EXT3_FS_XATTR=y
> CONFIG_JBD=y
> CONFIG_FS_MBCACHE=y
> CONFIG_DNOTIFY=y
>
> CONFIG_ISO9660_FS=y
> CONFIG_JOLIET=y
>
> CONFIG_PROC_FS=y
> CONFIG_PROC_KCORE=y
> CONFIG_SYSFS=y
> CONFIG_RAMFS=y
>
> CONFIG_NFS_FS=y
> CONFIG_NFS_V3=y
> CONFIG_LOCKD=y
> CONFIG_LOCKD_V4=y
> CONFIG_SUNRPC=y
>
> CONFIG_MSDOS_PARTITION=y
>
> CONFIG_NLS=y
> CONFIG_NLS_DEFAULT="iso8859-15"
> CONFIG_NLS_CODEPAGE_437=y
> CONFIG_NLS_CODEPAGE_850=y
> CONFIG_NLS_ISO8859_1=y
> CONFIG_NLS_ISO8859_15=y
>
> CONFIG_EARLY_PRINTK=y
> CONFIG_4KSTACKS=y
> CONFIG_X86_FIND_SMP_CONFIG=y
> CONFIG_X86_MPPARSE=y
>
> CONFIG_CRC32=y
> CONFIG_GENERIC_HARDIRQS=y
> CONFIG_GENERIC_IRQ_PROBE=y
> CONFIG_X86_BIOS_REBOOT=y
> CONFIG_PC=y
>
> I'm using the Debian 2.6.10 kernel sources.
>
> I've searched Google with keywords 'nfs3_xdr_writeres kernel oops', but I
> didn't find any relevant information.
>
> Is this a known bug? Is it possible to find out what caused this kernel
> oops? It seems to occur completely at random.
>
> Thanks for any pointers,
> Stanley
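
The quoted oops prints the faulting location in two forms: the syslog-rewritten "[nfs3_xdr_writeres+142/224]" (decimal) and "nfs3_xdr_writeres+0x8e/0xe0" (hex). The following is a small sketch of the arithmetic that ties them to the raw EIP value c01b30ae. The variable and function names are illustrative assumptions; only the numbers come from the log.

```python
# Sketch: relate the raw EIP to the symbol+offset forms in the oops.
# All numeric values are copied from the log; names are hypothetical.

def symbol_base(eip: int, offset: int) -> int:
    """Start address of the function, given EIP and the offset into it."""
    return eip - offset


eip = 0xC01B30AE                  # from "printing eip: c01b30ae"
offset_dec, size_dec = 142, 224   # from "[nfs3_xdr_writeres+142/224]"

# The decimal pair is the same offset/size as the hex pair in the log.
hex_form = f"{offset_dec:#x}/{size_dec:#x}"
func_start = symbol_base(eip, offset_dec)

if __name__ == "__main__":
    print("offset/size in hex:", hex_form)
    print("function start:", hex(func_start))
```

The computed start address, c01b3020, also appears twice in the stack dump, in line with the two "nfs3_xdr_writeres+0x0/0xe0" entries in the call trace.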