Re: etch on aranym, was Re: [buildd] Etch?
On Thu, 17 Aug 2006, Petr Stehlik wrote:
> Finn Thain wrote:
> > > > difficult to reproduce the bug?
> > > It's kinda random.
> > In that case, it might be necessary to make the scheduler behave in a
> > more deterministic way (maybe realtime priority?). Single-user mode
> > would help.
> I could try upgrading sarge to etch in single-user mode to see if it
> changes anything.
Yes, but that won't really help to isolate a workload that fails every
time, since the upgrade will behave differently on a second run. I guess
you could back up the hard disk image first.
Single-user mode was just a way to eliminate non-deterministic scheduler
behaviour in the interests of repeatability, by making sure that there
were no other runnable processes in the system.
> > I'd create a script, say /root/crash.sh, make it executable, and boot
> > the kernel with "init=/root/crash.sh". In crash.sh I'd run some
> > single-threaded stress tests.
> > http://samba.org/ftp/tridge/dbench/README
> > http://weather.ou.edu/~apw/projects/stress/
> > http://www.bitmover.com/lmbench/
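For concreteness, a sketch of such a crash.sh (the stress invocation is
only an example; note that stress and its libraries have to be on the root
filesystem, since nothing else is mounted when init starts):

```shell
# Write out a minimal crash.sh for use with init=/root/crash.sh.
# The stress options below are just an example; adjust as needed.
cat > crash.sh <<'EOF'
#!/bin/sh
# Running as init: the root fs is read-only and /proc is not mounted yet.
mount -o remount,rw /
mount -t proc proc /proc
# A single worker keeps the workload close to single-threaded.
stress --cpu 1 --timeout 3600
# Drop to a shell afterwards so the machine stays usable.
exec /bin/sh
EOF
chmod +x crash.sh
```

(On the real machine the script would be saved as /root/crash.sh, of
course.)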
> FYI, I have just finished the following test:
> # stress -c 4 -i 16 -m 3 --vm-bytes 32M -d 4 --hdd-bytes 128M
> It's been running for almost 5 hours. No problem detected. On another
> console I ran "while true; do uptime; sleep 300; done" and saw a
> consistent load of 28-29.
That is a long run queue. If you did find a problem that way, it could be
very hard to reproduce because of the interactions of all the tasks.
> So the machine was busy stressing CPU, memory and disk but it didn't
> detect anything wrong.
Well, maybe we need to concentrate on I/O. I'd try continuous tripwire
checks, or a similar intrusion detection system.
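Something along these lines would do as a poor man's tripwire: checksum a
tree once, then keep re-verifying it (the directory, the file limit and
the pass count here are placeholders; a real run would loop indefinitely
over a much larger tree):

```shell
#!/bin/sh
# Poor man's tripwire: record checksums once, then keep re-verifying.
# Placeholders: the directory, the file limit and the pass count.
DIR=${1:-/usr/bin}
find "$DIR/" -type f 2>/dev/null | head -n 50 | xargs -r md5sum > baseline.md5
pass=1
while [ "$pass" -le 3 ]; do          # a real run would loop forever
    if md5sum -c --quiet baseline.md5; then
        echo "pass $pass: ok"
    else
        echo "pass $pass: MISMATCH"
        break
    fi
    pass=$((pass + 1))
done
```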
> > If you can't reproduce the problem that way, I'd try introducing more
> > context switching into the workload.
> like stress -c 1k instead of -c 4?
To get a single-threaded test, I'd be trying -c 0 -i 0 -m 0, but maybe
one fork is the minimum (?)
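Or skip stress altogether: one process doing sequential write-then-read
cycles with dd gives a strictly single-threaded disk load (the file name,
sizes and pass count below are placeholders; a real run would loop
forever):

```shell
#!/bin/sh
# Strictly single-threaded disk load: one process, sequential
# write-then-read cycles. Placeholders: file name, sizes, pass count.
FILE=stressfile
for pass in 1 2 3; do               # a real run would loop forever
    dd if=/dev/zero of="$FILE" bs=1024 count=1024 2>/dev/null
    sync
    dd if="$FILE" of=/dev/null bs=1024 2>/dev/null
done
echo "wrote and re-read $(wc -c < "$FILE") bytes per pass"
```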
> > > #!/usr/bin/perl
> > Are you sure the problem was not confined to the buffer cache?
> I am not sure at all.
If we are going to test disk I/O, we must find a way to disable the buffer
cache completely. Does anyone know how to do this?
> > Re-reading the same file after an unmount/remount would determine that.
> Will try next time.
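For reference, the unmount/remount check could go something like this (a
sketch only: /dev/sda1, /mnt/test and somefile are placeholders, and it
has to run as root on the machine under test):

```shell
# Sketch of the re-read-after-remount test. /dev/sda1, /mnt/test and
# somefile are placeholders. Must run as root on the machine under test.
cat > remount-check.sh <<'EOF'
#!/bin/sh
set -e
before=$(md5sum /mnt/test/somefile | cut -d' ' -f1)
umount /mnt/test
mount /dev/sda1 /mnt/test     # next read must come from the disk
after=$(md5sum /mnt/test/somefile | cut -d' ' -f1)
if [ "$before" = "$after" ]; then
    echo "match: any corruption was confined to the buffer cache"
else
    echo "MISMATCH: the on-disk copy differs"
fi
EOF
chmod +x remount-check.sh
```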