[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

[kernel 2.6.8-1-686-smp 2.6.8-10] HANG : syslog ?



using pre-packaged (without any modifications/recompilation):
 kernel-image-2.6.8-1-686-smp      (2.6.8-10)
(initrd boot)

on:
DELL POWEREDGE 2650  with two  Intel Xeon HT

debian sources :
deb http://ftp.fr.debian.org/debian/ testing main contrib non-free
deb http://non-us.debian.org/debian-non-US testing/non-US main contrib non-free
deb http://security.debian.org/ testing/updates main contrib non-free

modules explicitly listed (/etc/modules) with 2.6 kernel:
ide-cd
ide-generic
sd_mod
tg3
psmouse
mousedev

BEHAVIOR:
system boots fine, but leads to daily hang !!!!!!!!! .... seems to append at syslog restart ? :-(

we log another "internal service" event each minute (very fine to see HERE if event log works or not) ... at 6:28 the event log stops (no more event logged) ... and system remains in a hang state (Ctrl-Alt-Suppr don't work anymore (as other ALt-F1, ...) ... Hardware POWER OFF is needed to reboot !

STORY:
1) we used a 2.4.24 tailorized kernel since 1 year without Pb (and ... without recent updates) 2) "dselect update + install" at end december; (no kernel change: same 2.4.24 tailorized kernel be used) 3) system hang at Jan 2, 2005 ... seems to be at 6:28 (no more syslog events logged) ==> reboot: OK for this week ...
4) but: we decide to install (Jan 8, 2005) a new 2.6 kernel

end of syslog.4 (normal):
Jan 9 06:24:01 zeus /USR/SBIN/CRON[6814]: (root) CMD (/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1) Jan 9 06:25:01 zeus /USR/SBIN/CRON[6900]: (root) CMD (test -x /usr/sbin/anacron || run-parts --report /etc/cron.daily) Jan 9 06:25:01 zeus /USR/SBIN/CRON[6903]: (root) CMD (/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1) Jan 9 06:26:01 zeus /USR/SBIN/CRON[6980]: (root) CMD (/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1) Jan 9 06:27:01 zeus /USR/SBIN/CRON[7041]: (root) CMD (/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1) Jan 9 06:28:02 zeus /USR/SBIN/CRON[7108]: (root) CMD (/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)

begin of syslog.3 (normal + hang: no additional logs after):
Jan  9 06:28:18 zeus syslogd 1.4.1#16: restart.
Jan 9 06:28:21 zeus dhcpd: DHCPREQUEST for 19.48.208.193 from 00:06:5b:34:0f:52 via eth0 Jan 9 06:28:21 zeus dhcpd: DHCPACK on 19.48.208.193 to 00:06:5b:34:0f:52 via eth0 ==> no more "each minute" cron'ed EVAL_TOP_CLEAN event now after the DHCPPACK last line !

==> server doesn't react to Ctrl-Alt-Suppr + no console access .... seems hang'ed ... server harware-power-off-reboot at Jan 10 09:20

end of syslog.3 (normal):
Jan 11 06:27:01 zeus /USR/SBIN/CRON[20572]: (root) CMD (/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1) Jan 11 06:28:01 zeus /USR/SBIN/CRON[20610]: (root) CMD (/users/local/EVAL_TOP_CLEAN > /dev/null 2>&1)

begin of syslog.2 (2 first lines - abnormal):
Jan 11 06:28:19 zeus syslogd 1.4.1#16: restart. <==== last event logged by syslog ... no more events after !!! Jan 11 08:50:40 zeus syslogd 1.4.1#16: restart. <==== this line because server was rebooted (with PREVIOUS KERNEL to avoid future daily hangs) !!!!

==> no more "each minute" cron'ed EVAL_TOP_CLEAN event log

==> server doesn't react to Ctrl-Alt-Suppr + no console access nor swintching (alt F1, F2 , ...) .... seems hang'ed ... server rebooted Jan 11 08:50:40 (see before) with previous kernel version to ensure continuous service (we hope !)


Any suggestion ? help ? could thes hnag be related to ACPI ?

Christian SENET

auth.log shows similar event log stop:
Jan 2 06:25:01 zeus CRON[6714]: (pam_unix) session opened for user root by (uid=0) Jan 2 06:25:01 zeus CRON[6715]: (pam_unix) session opened for user root by (uid=0) Jan 2 06:25:01 zeus CRON[6716]: (pam_unix) session opened for user root by (uid=0)
Jan  2 06:25:01 zeus CRON[6715]: (pam_unix) session closed for user root
Jan  2 06:25:01 zeus CRON[6716]: (pam_unix) session closed for user root
Jan 2 06:26:01 zeus CRON[6804]: (pam_unix) session opened for user root by (uid=0) Jan 2 06:26:02 zeus CRON[6805]: (pam_unix) session opened for user root by (uid=0)
Jan  2 06:26:02 zeus CRON[6804]: (pam_unix) session closed for user root
Jan  2 06:26:02 zeus CRON[6805]: (pam_unix) session closed for user root
Jan  2 06:26:38 zeus CRON[6714]: (pam_unix) session closed for user root
... no more minute event logs after this last line ....


==========================================================
Laboratoire de Physique des Matériaux - UMR CNRS 7556
Faculté des Sciences
Boulevard des aiguillettes
B.P. 239
54506 VANDOEUVRE-LES-NANCY

Tél     : +33(0)3.83.68.48.21
Fax     : +33(0)3.83.68.48.01
E-mail  : Christian.Senet@lpm.u-nancy.fr
Web Site: http://www.lpm.u-nancy.fr/
==========================================================
== L'informatique doit servir l'homme et non l'asservir ==
==                            Christian SENET           ==
==========================================================



Reply to: