
Bug#251215: marked as done (kernel-image-2.6.6-1-k7: pppd locks up, cannot be killed, during ppp shutdown)



Your message dated Sun, 29 Aug 2004 12:16:53 +0200
with message-id <20040829101653.GA28909@lst.de>
and subject line really closing
has caused the attached Bug report to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what I am
talking about this indicates a serious mail system misconfiguration
somewhere.  Please contact me immediately.)

Debian bug tracking system administrator
(administrator, Debian Bugs database)

--------------------------------------
Received: (at submit) by bugs.debian.org; 27 May 2004 14:05:26 +0000
>From debian.bugs@kepier.clara.net Thu May 27 07:05:26 2004
Return-path: <debian.bugs@kepier.clara.net>
Received: from relay3.mail.uk.clara.net [80.168.70.143] 
	by spohr.debian.org with esmtp (Exim 3.35 1 (Debian))
	id 1BTLVd-0002c8-00; Thu, 27 May 2004 07:05:25 -0700
Received: from du-069-0296.access.clara.net ([217.158.145.42] helo=plinth.doesnt.exist)
	by johnnybravo.uk.clara.net with esmtp (Exim 4.22)
	id 1BTLVa-000Ezb-FC; Thu, 27 May 2004 15:05:23 +0100
Content-Type: text/plain; charset="us-ascii"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
From: Neil Pilgrim <debian.bugs@kepier.clara.net>
To: Debian Bug Tracking System <submit@bugs.debian.org>
Subject: kernel-image-2.6.6-1-k7: pppd locks up, cannot be killed, during ppp shutdown
Bcc: Neil Pilgrim <debian.bugs@kepier.clara.net>
X-Mailer: reportbug 2.60
Date: Thu, 27 May 2004 15:14:31 +0100
Message-Id: <E1BTLVa-000Ezb-FC@johnnybravo.uk.clara.net>
Delivered-To: submit@bugs.debian.org
X-Spam-Checker-Version: SpamAssassin 2.60-bugs.debian.org_2004_03_25 
	(1.212-2003-09-23-exp) on spohr.debian.org
X-Spam-Status: No, hits=-7.0 required=4.0 tests=BAYES_01,HAS_PACKAGE 
	autolearn=no version=2.60-bugs.debian.org_2004_03_25
X-Spam-Level: 

Package: kernel-image-2.6.6-1-k7
Version: 2.6.6-1
Severity: grave
Justification: renders package unusable

The new kernel appeared to be running fine for some time, until I found
this :(

I am not sure precisely why this occurs (still debugging now), but
essentially what happens is that a 'pon' works fine, and the (initial)
'net connection works OK. The problem comes when the connection is
terminated (poff, or someone picks up my phone?). The first time after
booting, this seems to work fine, but, possibly only on the second
termination (the logs seem to indicate this), I get a kernel BUG:

May 27 14:23:06 localhost pppd[26647]: Modem hangup
May 27 14:23:07 localhost pppd[26647]: Connection terminated.
May 27 14:23:07 localhost pppd[26647]: Connect time 65.7 minutes.
May 27 14:23:07 localhost pppd[26647]: Sent 465934 bytes, received 2918752 bytes.
May 27 14:23:07 localhost kernel: ------------[ cut here ]------------
May 27 14:23:07 localhost kernel: kernel BUG at net/core/dev.c:3038!
May 27 14:23:07 localhost kernel: invalid operand: 0000 [#1]
May 27 14:23:07 localhost kernel: PREEMPT
May 27 14:23:07 localhost kernel: CPU:    0
May 27 14:23:07 localhost kernel: EIP:    0060:[free_netdev+43/80]    Not tainted
May 27 14:23:07 localhost kernel: EFLAGS: 00010297   (2.6.6-1-k7)
May 27 14:23:07 localhost kernel: EIP is at free_netdev+0x2b/0x50
May 27 14:23:07 localhost kernel: eax: df128800   ebx: cf85e000   ecx: c02eada0   edx: 00000003
May 27 14:23:07 localhost kernel: esi: df128800   edi: dbff6a00   ebp: e0b542c0   esp: cf85ff48
May 27 14:23:07 localhost kernel: ds: 007b   es: 007b   ss: 0068
May 27 14:23:07 localhost kernel: Process pppd (pid: 26647, threadinfo=cf85e000 task=df023830)
May 27 14:23:07 localhost kernel: Stack: e0b5094f df128800 00000286 ce1d7600 dbff6a00 e0b4d020 dffe4840 d8d6acdc
May 27 14:23:07 localhost kernel:        e0b4d07f dbff6a00 d15f1bc0 c0152d24 d8d6acdc d15f1bc0 d8ae0648 d15f1bc0
May 27 14:23:07 localhost kernel:        00000000 de9a9ac0 cf85e000 c01513e9 d15f1bc0 de9a9ac0 de9a9ac0 d15f1bc0
May 27 14:23:07 localhost kernel: Call Trace:
May 27 14:23:07 localhost kernel:  [__crc___netdev_watchdog_up+1116592/2857393] ppp_shutdown_interface+0x7f/0xf0 [ppp_generic]
May 27 14:23:07 localhost kernel:  [__crc___netdev_watchdog_up+1101953/2857393] ppp_release+0x0/0x70 [ppp_generic]
May 27 14:23:07 localhost kernel:  [__crc___netdev_watchdog_up+1102048/2857393] ppp_release+0x5f/0x70 [ppp_generic]
May 27 14:23:07 localhost kernel:  [__fput+276/304] __fput+0x114/0x130
May 27 14:23:07 localhost kernel:  [filp_close+89/144] filp_close+0x59/0x90
May 27 14:23:07 localhost kernel:  [sys_close+97/160] sys_close+0x61/0xa0
May 27 14:23:07 localhost kernel:  [syscall_call+7/11] syscall_call+0x7/0xb
May 27 14:23:07 localhost kernel:
May 27 14:23:07 localhost kernel: Code: 0f 0b de 0b e0 75 29 c0 eb de 2b 80 04 02 00 00 89 44 24 04
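
For triage, the two lines of the trace that matter most are the
"kernel BUG at" line and the "EIP is at" line. A minimal sketch of
pulling them out of a syslog excerpt follows; the regular expressions
and the summarize_oops helper are illustrative assumptions, not part of
any kernel or Debian tool, and they only cover the i386 oops format
shown above.

```python
import re

# Hypothetical patterns, written against the log lines quoted in this report.
BUG_RE = re.compile(r"kernel BUG at (?P<file>\S+):(?P<line>\d+)!")
EIP_RE = re.compile(
    r"EIP is at (?P<symbol>\w+)\+(?P<offset>0x[0-9a-f]+)/(?P<size>0x[0-9a-f]+)"
)


def summarize_oops(lines):
    """Collect the BUG location and the faulting symbol from syslog lines."""
    summary = {}
    for text in lines:
        bug = BUG_RE.search(text)
        if bug:
            summary["file"] = bug.group("file")
            summary["line"] = int(bug.group("line"))
        eip = EIP_RE.search(text)
        if eip:
            summary["symbol"] = eip.group("symbol")
            summary["offset"] = int(eip.group("offset"), 16)
    return summary


# Two lines copied verbatim from the trace above.
sample = [
    "May 27 14:23:07 localhost kernel: kernel BUG at net/core/dev.c:3038!",
    "May 27 14:23:07 localhost kernel: EIP is at free_netdev+0x2b/0x50",
]

print(summarize_oops(sample))
```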

I've investigated this a bit, and it seems that someone had a similar
problem in 2.6.5 (Auzanneau Gregory), but he has told me that this was
fixed by a patch from Andrew Morton. From what I can understand of the
code, the bug appears to be that the net device is not unregistered
properly. free_netdev is called directly after unregister_netdev in
the ppp code, but the unregistration is put on a list which is handled
elsewhere. Could it be that the queued unregistration has not yet been
handled when free_netdev runs? This is just from a quick read, mind
you; I'm not a kernel hacker ;)
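
The ordering suspected above can be illustrated with a toy user-space
model. This is only a sketch of the hypothesis, not kernel code and not
a confirmed diagnosis: the class, the state names and the todo list are
all hypothetical stand-ins, and the assertion in free() stands in for
the BUG check that fired in free_netdev.

```python
from enum import Enum, auto


class RegState(Enum):
    """Hypothetical registration states for the toy device."""
    REGISTERED = auto()
    UNREGISTERING = auto()
    UNREGISTERED = auto()
    RELEASED = auto()


class ToyNetDevice:
    def __init__(self, name):
        self.name = name
        self.state = RegState.REGISTERED


# Deferred work list: unregistration is queued here and completed later.
todo = []


def unregister(dev):
    """Start unregistration; completion is deferred to run_todo()."""
    dev.state = RegState.UNREGISTERING
    todo.append(dev)


def run_todo():
    """Finish every queued unregistration."""
    while todo:
        todo.pop(0).state = RegState.UNREGISTERED


def free(dev):
    """Release the device; refuses unless unregistration has completed."""
    if dev.state is not RegState.UNREGISTERED:
        raise AssertionError(f"{dev.name}: freed while {dev.state.name}")
    dev.state = RegState.RELEASED


# Suspected failing order: free immediately after queuing the unregistration.
ppp0 = ToyNetDevice("ppp0")
unregister(ppp0)
try:
    free(ppp0)
except AssertionError as err:
    print("BUG:", err)

# Order that works in this model: drain the list first, then free.
run_todo()
free(ppp0)
print(ppp0.name, ppp0.state.name)
```

In this model the first free() fails because the device is still
waiting on the list, while the second succeeds once the list has been
drained. Whether the real kernel path behaves this way is exactly the
open question in the paragraph above.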

Typically the messages I get during a working pon are:

May 27 13:17:27 localhost chat[26648]: CONNECT
May 27 13:17:27 localhost chat[26648]:  -- got it
May 27 13:17:27 localhost chat[26648]: send (\d)
May 27 13:17:28 localhost pppd[26647]: Serial connection established.
May 27 13:17:28 localhost pppd[26647]: Using interface ppp0
May 27 13:17:28 localhost pppd[26647]: Connect: ppp0 <--> /dev/ttyS1
May 27 13:17:28 localhost hal.hotplug[26661]: waiting for net info at /class/net/ppp0
May 27 13:17:28 localhost hal.hotplug[26661]: Dont know how to wait for net at /class/net/ppp0; sleeping 1000 ms
May 27 13:17:40 localhost pppd[26647]: CHAP authentication succeeded

and then the IP and DNS addresses are logged, and the ppp connection is
up. After the BUG, I only get the first pppd line, i.e. 'Serial
connection established'. Any subsequent attempt at using ppp (via
pon/poff) appears to dial correctly (i.e. chat runs), but pppd doesn't
come up properly and cannot be killed, so I typically reboot.

If you need any more information, please let me know.

-- 
Neil

-- System Information:
Debian Release: testing/unstable
  APT prefers unstable
  APT policy: (500, 'unstable')
Architecture: i386 (i686)
Kernel: Linux 2.6.6-1-k7
Locale: LANG=en_GB.UTF-8, LC_CTYPE=en_GB.UTF-8

Versions of packages kernel-image-2.6.6-1-k7 depends on:
ii  coreutils [fileutils]        5.0.91-2    The GNU core utilities
ii  fileutils                    5.0.91-2    The GNU file management utilities 
ii  initrd-tools                 0.1.69      tools to create initrd image for p
ii  module-init-tools            3.0-pre10-4 tools for managing Linux kernel mo

-- no debconf information

---------------------------------------
Received: (at 251215-done) by bugs.debian.org; 29 Aug 2004 10:16:55 +0000
>From hch@lst.de Sun Aug 29 03:16:55 2004
Return-path: <hch@lst.de>
Received: from verein.lst.de (mail.lst.de) [213.95.11.210] 
	by spohr.debian.org with esmtp (Exim 3.35 1 (Debian))
	id 1C1Mk2-0003SZ-00; Sun, 29 Aug 2004 03:16:54 -0700
Received: from verein.lst.de (localhost [127.0.0.1])
	by mail.lst.de (8.12.3/8.12.3/Debian-6.6) with ESMTP id i7TAGr95028920
	(version=TLSv1/SSLv3 cipher=EDH-RSA-DES-CBC3-SHA bits=168 verify=NO)
	for <251215-done@bugs.debian.org>; Sun, 29 Aug 2004 12:16:53 +0200
Received: (from hch@localhost)
	by verein.lst.de (8.12.3/8.12.3/Debian-6.6) id i7TAGrh0028918
	for 251215-done@bugs.debian.org; Sun, 29 Aug 2004 12:16:53 +0200
Date: Sun, 29 Aug 2004 12:16:53 +0200
From: Christoph Hellwig <hch@lst.de>
To: 251215-done@bugs.debian.org
Subject: really closing
Message-ID: <20040829101653.GA28909@lst.de>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.3.28i
X-Spam-Score: -4.901 () BAYES_00
X-Scanned-By: MIMEDefang 2.39
Delivered-To: 251215-done@bugs.debian.org
X-Spam-Checker-Version: SpamAssassin 2.60-bugs.debian.org_2004_03_25 
	(1.212-2003-09-23-exp) on spohr.debian.org
X-Spam-Status: No, hits=-3.0 required=4.0 tests=BAYES_00 autolearn=no 
	version=2.60-bugs.debian.org_2004_03_25
X-Spam-Level: 
X-CrossAssassin-Score: 200094


