Your message dated Tue, 2 Feb 2010 18:02:56 +0100 with message-id <20100202170256.GT9624@baikonur.stro.at> and subject line Re: Bug#557262: Processed: Re: Bug#557262: 2.6.31+2.6.31.4: XFS - All I/O locks up to D-state after 24-48 hours (sysrq-t+w available) - root cause found = asterisk has caused the Debian Bug report #557262, regarding 2.6.31+2.6.31.4: XFS - All I/O locks up to D-state after 24-48 hours (sysrq-t+w available) - root cause found = asterisk to be marked as done. This means that you claim that the problem has been dealt with. If this is not the case it is now your responsibility to reopen the Bug report if necessary, and/or fix the problem forthwith. (NB: If you are a system administrator and have no idea what this message is talking about, this may indicate a serious mail system misconfiguration somewhere. Please contact owner@bugs.debian.org immediately.) -- 557262: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=557262 Debian Bug Tracking System Contact owner@bugs.debian.org with problems
--- Begin Message ---
- To: Dave Chinner <david@fromorbit.com>
- Cc: linux-kernel@vger.kernel.org, linux-raid@vger.kernel.org, xfs@oss.sgi.com, Alan Piszcz <ap@solarrain.com>, asterisk-users@lists.digium.com, submit@bugs.debian.org
- Subject: Re: 2.6.31+2.6.31.4: XFS - All I/O locks up to D-state after 24-48 hours (sysrq-t+w available) - root cause found = asterisk
- From: Justin Piszcz <jpiszcz@lucidpixels.com>
- Date: Fri, 20 Nov 2009 15:39:26 -0500 (EST)
- Message-id: <alpine.DEB.2.00.0911201530500.10757@p34.internal.lan>
- In-reply-to: <alpine.DEB.2.00.0910210618210.10288@p34.internal.lan>
- References: <alpine.DEB.2.00.0910171825270.16781@p34.internal.lan> <alpine.DEB.2.00.0910181607040.27363@p34.internal.lan> <20091019030456.GS9464@discord.disaster> <alpine.DEB.2.00.0910190431180.23395@p34.internal.lan> <20091020003358.GW9464@discord.disaster> <alpine.DEB.2.00.0910200431290.21878@p34.internal.lan> <alpine.DEB.2.00.0910210618210.10288@p34.internal.lan>
Package: asterisk Version: 1.6.2.0~dfsg~rc1-1 See below for issue: On Wed, 21 Oct 2009, Justin Piszcz wrote:On Tue, 20 Oct 2009, Justin Piszcz wrote:On Tue, 20 Oct 2009, Dave Chinner wrote:On Mon, Oct 19, 2009 at 06:18:58AM -0400, Justin Piszcz wrote:On Mon, 19 Oct 2009, Dave Chinner wrote:On Sun, Oct 18, 2009 at 04:17:42PM -0400, Justin Piszcz wrote:It has happened again, all sysrq-X output was saved this time...... All pointing to log IO not completing.....So far I do not have a reproducible test case,Ok. What sort of load is being placed on the machine?Hello, generally the load is low, it mainly serves out some samba shares.Yes, the XFS threads show up like this on each time the kernel crashed. So far with 2.6.30.9 after ~48hrs+ it has not crashed. So it appears to be some issue between 2.6.30.9 and 2.6.31.x when this began happening. Any recommendationsIt appears that both the xfslogd and the xfsdatad on CPU 0 are in the running state but don't appear to be consuming any significant CPU time. If they remain like this then I think that means they are stuck waiting on the run queue. Do these XFS threads always appear like this when the hang occurs? If so, is there something else that is hogging CPU 0 preventing these threads from getting the CPU?on how to catch this bug w/certain options enabled/etc?Cheers, Dave. -- Dave Chinner david@fromorbit.comUptime with 2.6.30.9: 06:18:41 up 2 days, 14:10, 14 users, load average: 0.41, 0.21, 0.07 No issues yet, so it first started happening in 2.6.(31).(x).Any further recommendations on how to debug this issue? BTW: Do you view thisas an XFS bug or MD/VFS layer issue based on the logs/output thus far? Justin.Found root cause-- root cause is asterisk PBX software. I use an SPA3102. When someone called me, they accidentally dropped the connection, I called them back in a short period. It is during this time (and the last time) this happened that the box froze under multiple(!) kernels, always when someone was calling. I have removed asterisk but this is the version I was running: ~$ dpkg -l | grep -i asterisk rc asterisk 1:1.6.2.0~dfsg~rc1-1 Open S I don't know what asterisk is doing but top did run before the crash and asterisk was using 100% CPU and as I noted before all other processes were in D-state. When this bug occurs, it freezes I/O to all devices and the only way to recover is to reboot the system. Just FYI if anyone else out there has their system crash when running asterisk.Just out of curiosity, has anyone else running asterisk had such an issue? I was not running any special VoIP PCI cards/etc.Justin.
--- End Message ---
--- Begin Message ---
- To: 557262-done@bugs.debian.org
- Subject: Re: Bug#557262: Processed: Re: Bug#557262: 2.6.31+2.6.31.4: XFS - All I/O locks up to D-state after 24-48 hours (sysrq-t+w available) - root cause found = asterisk
- From: maximilian attems <max@stro.at>
- Date: Tue, 2 Feb 2010 18:02:56 +0100
- Message-id: <20100202170256.GT9624@baikonur.stro.at>
- In-reply-to: <alpine.DEB.2.00.1002021157460.15537@p34.internal.lan>
- References: <4B2EF4AD.8020502@debian.org> <handler.s.C.126136841729362.transcript@bugs.debian.org> <20100202165015.GK28750@stro.at> <alpine.DEB.2.00.1002021157460.15537@p34.internal.lan>
On Tue, Feb 02, 2010 at 11:58:30AM -0500, Justin Piszcz wrote: > > > Hi, > > You can resolve this case as I can no longer run asterisk on that machine > due to this problem, it had crashed the host more than 10 times with the > latest kernels at the time of the reporting. > > Justin. ok assuming hardware fun and closing as reporter can no longer reproduce. thanks for the report.
--- End Message ---