[kernel 2.6.8-1-686-smp  2.6.8-10] HANG : syslog ?
using pre-packaged (without any modifications/recompilation):
 kernel-image-2.6.8-1-686-smp      (2.6.8-10)
(initrd boot)
on:
DELL POWEREDGE 2650  with two  Intel Xeon HT
debian sources :
deb http://ftp.fr.debian.org/debian/ testing main contrib non-free
deb http://non-us.debian.org/debian-non-US testing/non-US main contrib non-free
deb http://security.debian.org/ testing/updates main contrib non-free
modules explicitly listed (/etc/modules) with 2.6 kernel:
ide-cd
ide-generic
sd_mod
tg3
psmouse
mousedev
BEHAVIOR:
system boots fine, but leads to daily hang !!!!!!!!! .... seems to append 
at syslog restart ?   :-(
we log another "internal service" event each minute (very fine to see HERE 
if event log works or not) ... at 6:28 the event log stops (no more event 
logged) ... and system remains in a hang state (Ctrl-Alt-Suppr don't work 
anymore (as other ALt-F1, ...) ... Hardware POWER OFF is needed to reboot !
STORY:
1) we used a 2.4.24 tailorized kernel since 1 year without Pb (and ... 
without recent updates)
2) "dselect update + install" at end december; (no kernel change: same 
2.4.24 tailorized kernel be used)
3) system hang at Jan 2, 2005 ... seems to be at 6:28 (no more syslog 
events logged) ==> reboot: OK for this week ...
4) but: we decide to install (Jan 8, 2005) a new 2.6 kernel
end of syslog.4 (normal):
Jan  9 06:24:01 zeus /USR/SBIN/CRON[6814]: (root) CMD 
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
Jan  9 06:25:01 zeus /USR/SBIN/CRON[6900]: (root) CMD (test -x 
/usr/sbin/anacron || run-parts --report /etc/cron.daily)
Jan  9 06:25:01 zeus /USR/SBIN/CRON[6903]: (root) CMD 
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
Jan  9 06:26:01 zeus /USR/SBIN/CRON[6980]: (root) CMD 
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
Jan  9 06:27:01 zeus /USR/SBIN/CRON[7041]: (root) CMD 
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
Jan  9 06:28:02 zeus /USR/SBIN/CRON[7108]: (root) CMD 
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
begin of syslog.3 (normal + hang: no additional logs after):
Jan  9 06:28:18 zeus syslogd 1.4.1#16: restart.
Jan  9 06:28:21 zeus dhcpd: DHCPREQUEST for 19.48.208.193 from 
00:06:5b:34:0f:52 via eth0
Jan  9 06:28:21 zeus dhcpd: DHCPACK on 19.48.208.193 to 00:06:5b:34:0f:52 
via eth0
==> no more "each minute" cron'ed EVAL_TOP_CLEAN event now after the 
DHCPPACK last line !
==> server doesn't react to Ctrl-Alt-Suppr  + no console access .... seems 
hang'ed ... server harware-power-off-reboot at Jan 10 09:20
end of syslog.3 (normal):
Jan 11 06:27:01 zeus /USR/SBIN/CRON[20572]: (root) CMD 
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
Jan 11 06:28:01 zeus /USR/SBIN/CRON[20610]: (root) CMD 
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
begin of syslog.2 (2 first lines - abnormal):
Jan 11 06:28:19 zeus syslogd 1.4.1#16: restart.     <==== last event logged 
by syslog ... no more events after !!!
Jan 11 08:50:40 zeus syslogd 1.4.1#16: restart.     <==== this line because 
server was  rebooted (with PREVIOUS KERNEL to avoid future daily hangs)  !!!!
==> no more "each minute" cron'ed EVAL_TOP_CLEAN event log
==> server doesn't react to Ctrl-Alt-Suppr + no console access nor 
swintching (alt F1, F2 , ...) .... seems hang'ed ... server rebooted Jan 11 
08:50:40 (see before) with previous kernel version to ensure continuous 
service (we hope !)
Any suggestion ? help ? could thes hnag be related to ACPI ?
Christian SENET
auth.log shows similar event log stop:
Jan  2 06:25:01 zeus CRON[6714]: (pam_unix) session opened for user root by 
(uid=0)
Jan  2 06:25:01 zeus CRON[6715]: (pam_unix) session opened for user root by 
(uid=0)
Jan  2 06:25:01 zeus CRON[6716]: (pam_unix) session opened for user root by 
(uid=0)
Jan  2 06:25:01 zeus CRON[6715]: (pam_unix) session closed for user root
Jan  2 06:25:01 zeus CRON[6716]: (pam_unix) session closed for user root
Jan  2 06:26:01 zeus CRON[6804]: (pam_unix) session opened for user root by 
(uid=0)
Jan  2 06:26:02 zeus CRON[6805]: (pam_unix) session opened for user root by 
(uid=0)
Jan  2 06:26:02 zeus CRON[6804]: (pam_unix) session closed for user root
Jan  2 06:26:02 zeus CRON[6805]: (pam_unix) session closed for user root
Jan  2 06:26:38 zeus CRON[6714]: (pam_unix) session closed for user root
... no more minute event logs after this last line ....
==========================================================
Laboratoire de Physique des Matériaux - UMR CNRS 7556
Faculté des Sciences
Boulevard des aiguillettes
B.P. 239
54506 VANDOEUVRE-LES-NANCY
Tél     : +33(0)3.83.68.48.21
Fax     : +33(0)3.83.68.48.01
E-mail  : Christian.Senet@lpm.u-nancy.fr
Web Site: http://www.lpm.u-nancy.fr/
==========================================================
== L'informatique doit servir l'homme et non l'asservir ==
==                            Christian SENET           ==
==========================================================
Reply to: