[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#267006: Bug#267014: I think it's a hardware bug



Matthew Wilcox wrote:

On Fri, Aug 20, 2004 at 05:15:36PM -0700, Mike Fedyk wrote:
Matthew Wilcox wrote:
32-bit PCI cards should work without problems in 64-bit slots, though.

That's what I thought, but I have an Adaptec 2940UW card that causes instability reliably in 64bit pci slots.

Maybe the card can't handle 66Mhz?  Does 64bit pci always mean 66Mhz?

Nope, 64-bit and 66-MHz are independent features.  Unless the card lies
(by claiming to do 66MHz when it can't), putting a 33MHz card on a 66MHz
bus will simply cause the entire bus to run at 33MHz.

This time the 32bit AHA-2940UW was removed, and I had a 64bit gigabit ethernet card in there that uses the ns83820 module.

These oopses happened overnight on "fileserver" that is a nfs, mail, samba & ldap server.

Later that day the screen kept scrolling oopses during a nfs transfer. After that the 64bit ethernet card had been removed also and the machine has been working good for a day and a half. I don't have lspci output for the card since it's not in any machines right now.

Unable to handle kernel NULL pointer dereference at virtual address 00000000
printing eip:
c020e209
*pde = 00000000
Oops: 0000 [#1]
PREEMPT
Modules linked in: loop smbfs appletalk nfsd exportfs af_packet autofs ipv6 nfs lockd sunrpc raid1 dm_mod capability commoncap eepro100 mii rtc ext3 jbd mbcache reiserfs sd_mod aic7xxx scsi_mod raid5 md xor unix font vesafb cfbcopyarea cfbimgblt cfbfil
lrect
CPU:    0
EIP:    0060:[<c020e209>]    Not tainted
EFLAGS: 00010246   (2.6.7-1-686)
EIP is at sock_recvmsg+0x29/0xc0
eax: 00000000   ebx: 00000040   ecx: f6fccce0   edx: f6fd5190
esi: f6fccce0   edi: f6fd1f10   ebp: 00000000   esp: f6fd1da0
ds: 007b   es: 007b   ss: 0068
Process named (pid: 7223, threadinfo=f6fd0000 task=f6fd5190)
Stack: f6fd1db4 f6fccce0 f6fd1f10 00001000 00000040 f89c368d 00000000 00000000 00000001 ffffffff 00000000 f6f974c8 00000000 00000000 ffffffff 00000000 f6fd5190 00000246 c0353520 f69702e0 00000000 f6970040 1f2f8547 00000040
Call Trace:
[<f89c368d>] svc_sock_enqueue+0x13d/0x2b0 [sunrpc]
[<c0248526>] tcp_send_delayed_ack+0xd6/0x100
[<c0117d18>] recalc_task_prio+0xa8/0x1a0
[<c01a5772>] copy_from_user+0x42/0x70
[<c0213a6c>] verify_iovec+0x3c/0xa0
[<c020fd76>] sys_recvmsg+0x146/0x200
[<c0279a45>] schedule_timeout+0xb5/0xc0
[<c012ebb7>] unqueue_me+0x57/0xb0
[<c012ed57>] futex_wait+0x147/0x1a0
[<c0146e89>] find_extend_vma+0x29/0x90
[<c012e5b2>] get_futex_key+0x42/0x190
[<c0123c44>] update_process_times+0x44/0x50
[<c0123ab6>] update_wall_time+0x16/0x40
[<c01a5772>] copy_from_user+0x42/0x70
[<c021008b>] sys_socketcall+0x25b/0x260
[<c010605b>] syscall_call+0x7/0xb

Code: 8b 00 00 d0 00 00 00 b8 01 00 00 00 89 44 24 20 8b 9c 24 d4
<1>Unable to handle kernel NULL pointer dereference at virtual address 00000000
printing eip:
c020e209
*pde = 00000000
Oops: 0000 [#2]
PREEMPT
Modules linked in: loop smbfs appletalk nfsd exportfs af_packet autofs ipv6 nfs lockd sunrpc raid1 dm_mod capability commoncap eepro100 mii rtc ext3 jbd mbcache reiserfs sd_mod aic7xxx scsi_mod raid5 md xor unix font vesafb cfbcopyarea cfbimgblt cfbfil
lrect
CPU:    0
EIP:    0060:[<c020e209>]    Not tainted
EFLAGS: 00010246   (2.6.7-1-686)
EIP is at sock_recvmsg+0x29/0xc0
eax: 00000000   ebx: 00000000   ecx: f7fac720   edx: f7198bd0
esi: f7fac720   edi: f7983f34   ebp: 00000000   esp: f7983dd0
ds: 007b   es: 007b   ss: 0068
Process syslogd (pid: 802, threadinfo=f7982000 task=f7198bd0)
Stack: 00000000 41fac720 00000000 00000000 00000000 00000065 3ccbf2ae 00000000 00000001 ffffffff 00000000 f793c188 00000000 00000010 00000fff f7cc0a30 f7198bd0 f7cc0a30 f7983ea8 00000000 00000010 f7983e2c f7983e4c eaf287d4
Call Trace:
[<c01390f5>] buffered_rmqueue+0xf5/0x1d0
[<c027937b>] schedule+0x2bb/0x4d0
[<c020df3c>] sockfd_lookup+0x1c/0x80
[<c020f7d2>] sys_recvfrom+0xb2/0x120
[<c0279a45>] schedule_timeout+0xb5/0xc0
[<c01666e4>] poll_freewait+0x44/0x50
[<c0166a6f>] do_select+0x1af/0x2d0
[<c020f873>] sys_recv+0x33/0x40
[<c020ff94>] sys_socketcall+0x164/0x260
[<c010605b>] syscall_call+0x7/0xb

Code: 8b 00 00 d0 00 00 00 b8 01 00 00 00 89 44 24 20 8b 9c 24 d4
<5>statd: server localhost not responding, timed out
nsm_mon_unmon: rpc failed, status=-5
lockd: cannot unmonitor 10.0.0.2




Reply to: