[kernel 2.6.8-1-686-smp 2.6.8-10] HANG : syslog ?
Using the pre-packaged kernel (without any modification or recompilation):
kernel-image-2.6.8-1-686-smp (2.6.8-10)
(initrd boot)
on:
DELL POWEREDGE 2650 with two Intel Xeon HT
Debian sources:
deb http://ftp.fr.debian.org/debian/ testing main contrib non-free
deb http://non-us.debian.org/debian-non-US testing/non-US main contrib non-free
deb http://security.debian.org/ testing/updates main contrib non-free
modules explicitly listed (/etc/modules) with 2.6 kernel:
ide-cd
ide-generic
sd_mod
tg3
psmouse
mousedev
BEHAVIOR:
The system boots fine, but then hangs every day. The hang seems to happen
when syslog restarts.
We log an "internal service" event every minute (which makes it easy to see
HERE whether event logging still works). At 6:28 the event log stops (no more
events are logged) and the system stays hung: Ctrl-Alt-Del no longer works,
nor does console switching (Alt-F1, ...). A hardware POWER OFF is needed to
reboot!
STORY:
1) We had been using a custom-built 2.4.24 kernel for 1 year without problems
(and without recent updates).
2) "dselect update + install" at the end of December (no kernel change: the
same custom-built 2.4.24 kernel was still in use).
3) System hang on Jan 2, 2005; it seems to have happened at 6:28 (no more
syslog events logged) ==> reboot: OK for that week.
4) But we decided to install a new 2.6 kernel (Jan 8, 2005).
end of syslog.4 (normal):
Jan 9 06:24:01 zeus /USR/SBIN/CRON[6814]: (root) CMD
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
Jan 9 06:25:01 zeus /USR/SBIN/CRON[6900]: (root) CMD (test -x
/usr/sbin/anacron || run-parts --report /etc/cron.daily)
Jan 9 06:25:01 zeus /USR/SBIN/CRON[6903]: (root) CMD
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
Jan 9 06:26:01 zeus /USR/SBIN/CRON[6980]: (root) CMD
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
Jan 9 06:27:01 zeus /USR/SBIN/CRON[7041]: (root) CMD
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
Jan 9 06:28:02 zeus /USR/SBIN/CRON[7108]: (root) CMD
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
beginning of syslog.3 (normal, then hang: no further logs afterwards):
Jan 9 06:28:18 zeus syslogd 1.4.1#16: restart.
Jan 9 06:28:21 zeus dhcpd: DHCPREQUEST for 19.48.208.193 from
00:06:5b:34:0f:52 via eth0
Jan 9 06:28:21 zeus dhcpd: DHCPACK on 19.48.208.193 to 00:06:5b:34:0f:52
via eth0
==> no more per-minute cron'ed EVAL_TOP_CLEAN events after the last
DHCPACK line!
==> the server doesn't react to Ctrl-Alt-Del and there is no console access;
it seems to be hung. Hardware power-off and reboot on Jan 10 at 09:20.
end of syslog.3 (normal):
Jan 11 06:27:01 zeus /USR/SBIN/CRON[20572]: (root) CMD
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
Jan 11 06:28:01 zeus /USR/SBIN/CRON[20610]: (root) CMD
(/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)
beginning of syslog.2 (first 2 lines, abnormal):
Jan 11 06:28:19 zeus syslogd 1.4.1#16: restart. <==== last event logged
by syslog; no more events after this!
Jan 11 08:50:40 zeus syslogd 1.4.1#16: restart. <==== this line is here
because the server was rebooted (with the PREVIOUS KERNEL, to avoid further
daily hangs)
==> no more per-minute cron'ed EVAL_TOP_CLEAN events logged
==> the server doesn't react to Ctrl-Alt-Del, and there is neither console
access nor console switching (Alt-F1, F2, ...); it seems to be hung. The
server was rebooted on Jan 11 at 08:50:40 (see above) with the previous
kernel version to ensure continuous service (we hope!)
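The per-minute EVAL_TOP_CLEAN entries shown above act as a heartbeat, so the
time of each hang can be located by looking for gaps between them. Here is a
minimal sketch of such a check, not something we actually ran. The names
parse_timestamp and find_gaps are hypothetical, the year has to be passed in
because syslog timestamps do not carry one, and the code assumes each entry
sits on a single line in the real log file (the excerpts above are wrapped by
the mailer). It does not handle a year rollover and expects English month
abbreviations.

```python
from datetime import datetime, timedelta


def parse_timestamp(line, year):
    """Parse the leading 'Mon D HH:MM:SS' syslog timestamp, or return None."""
    parts = line.split()
    if len(parts) < 3:
        return None
    try:
        # Syslog timestamps have no year, so the caller supplies one (assumption).
        return datetime.strptime(
            f"{year} {parts[0]} {parts[1]} {parts[2]}", "%Y %b %d %H:%M:%S"
        )
    except ValueError:
        return None


def find_gaps(lines, marker, max_gap=timedelta(minutes=2), year=2005):
    """Return (last_seen, next_seen) pairs of marker lines further apart than max_gap."""
    gaps = []
    previous = None
    for line in lines:
        if marker not in line:
            continue  # only heartbeat lines are considered
        stamp = parse_timestamp(line, year)
        if stamp is None:
            continue
        if previous is not None and stamp - previous > max_gap:
            gaps.append((previous, stamp))
        previous = stamp
    return gaps
```

It could be used along the lines of
find_gaps(open("/var/log/syslog.3"), "EVAL_TOP_CLEAN"), which would return one
(last heartbeat before, first heartbeat after) pair per interruption. The
2-minute threshold is an arbitrary choice that tolerates one missed entry.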
Any suggestions? Any help? Could these hangs be related to ACPI?
Christian SENET
auth.log shows a similar stop in event logging:
Jan 2 06:25:01 zeus CRON[6714]: (pam_unix) session opened for user root by
(uid=0)
Jan 2 06:25:01 zeus CRON[6715]: (pam_unix) session opened for user root by
(uid=0)
Jan 2 06:25:01 zeus CRON[6716]: (pam_unix) session opened for user root by
(uid=0)
Jan 2 06:25:01 zeus CRON[6715]: (pam_unix) session closed for user root
Jan 2 06:25:01 zeus CRON[6716]: (pam_unix) session closed for user root
Jan 2 06:26:01 zeus CRON[6804]: (pam_unix) session opened for user root by
(uid=0)
Jan 2 06:26:02 zeus CRON[6805]: (pam_unix) session opened for user root by
(uid=0)
Jan 2 06:26:02 zeus CRON[6804]: (pam_unix) session closed for user root
Jan 2 06:26:02 zeus CRON[6805]: (pam_unix) session closed for user root
Jan 2 06:26:38 zeus CRON[6714]: (pam_unix) session closed for user root
... no more minute event logs after this last line ....
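To see whether a cron job was still running when logging stopped, the
"session opened" and "session closed" entries above can be paired by PID.
This is only a sketch under assumptions: the function name unclosed_sessions
and the regular expression are ours, written against the line format shown in
the excerpt, and are not part of any existing tool.

```python
import re

# Matches e.g. "CRON[6714]: (pam_unix) session opened for user root"
SESSION_RE = re.compile(
    r"CRON\[(\d+)\]: \(pam_unix\) session (opened|closed) for user (\S+)"
)


def unclosed_sessions(lines):
    """Return the PIDs of CRON sessions that were opened but never closed."""
    still_open = {}
    for line in lines:
        match = SESSION_RE.search(line)
        if match is None:
            continue  # not a pam_unix CRON session line
        pid, action, _user = match.groups()
        if action == "opened":
            still_open[pid] = line
        else:
            still_open.pop(pid, None)
    # Dicts keep insertion order, so PIDs come back in the order they were opened.
    return list(still_open)
```

Applied to the Jan 2 excerpt above, every opened session has a matching close,
the last one being CRON[6714] at 06:26:38. A non-empty result would instead
point at a job that was still running when the log stopped.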
==========================================================
Laboratoire de Physique des Matériaux - UMR CNRS 7556
Faculté des Sciences
Boulevard des aiguillettes
B.P. 239
54506 VANDOEUVRE-LES-NANCY
Tél : +33(0)3.83.68.48.21
Fax : +33(0)3.83.68.48.01
E-mail : Christian.Senet@lpm.u-nancy.fr
Web Site: http://www.lpm.u-nancy.fr/
==========================================================
==    Computing must serve people, not enslave them     ==
== Christian SENET ==
==========================================================