
Random hard freezes Wheezy



Hello.

On one server I intermittently get hard freezes. The server does not respond to anything: no ping, no Ctrl+Alt+Del. Only Caps Lock and Num Lock flash. Only powering it off with the button lets me start the server again, after which it runs until the freeze happens again.

This is the trace I saw on the monitor:

__netif_receive_skb+0x3fb/0x42d
select_task_rq_fair+0x389/0x667
paravirt_read_tsc+0x5/0x8
native_sched_clock+0x27/0x2f
test_tsk_need_resched+0xa/0x13
resched_task+0x39/0x65
check_preempt_curr+0x36/0x5f
ttwu_do_wakeup+0x50/0xc4
page_fault+0x2/0x30
delayed_work_timer_fn+0xc/0x1e
run_timer_softirq+0x19a/0x261
__queue_work+0x24c/0x24c
timekeeping_get_ns+0xd/0x2a
__do_softirq+0xb9/0x177
call_softirq+0x1c/0x30
do_softirq+0x3c/0x7b
irq_exit+0x3c/0x99
smp_apic_timer_interrupt+0x74/0x82
apic_timer_interrupt+0x6e/0x80
<EOI> rcu_idle_cpu+0x80/0x1bb
mwait_idle+0x71/0xac
mwait_idle+0x72/0xac
cpu_idle+0xaf/0xf2
start_secondary+0xd5/0x1db

I copied this by hand from the monitor. I can't find this trace in any file under /var/log.
The last syslog entries I see are:
May 16 20:58:33 srv75 kernel: [964599.385413] r8169 0000:04:02.0: eth1: link down
May 16 20:58:33 srv75 kernel: [964599.453234] r8169 0000:04:02.0: eth1: link down
May 16 20:58:34 srv75 kernel: [964599.521359] r8169 0000:04:02.0: eth1: link down
May 16 20:58:34 srv75 NetworkManager[2669]: <info> (eth1): carrier now OFF (device state 10)
May 16 20:58:35 srv75 kernel: [964601.377367] r8169 0000:04:02.0: eth1: link down
May 16 20:58:35 srv75 kernel: [964601.441345] r8169 0000:04:02.0: eth1: link down
May 16 20:58:35 srv75 kernel: [964601.457347] r8169 0000:04:02.0: eth1: link down
May 16 20:58:35 srv75 kernel: [964601.481490] r8169 0000:04:02.0: eth1: link down
May 16 20:58:36 srv75 kernel: [964601.545376] r8169 0000:04:02.0: eth1: link down
May 16 20:58:36 srv75 kernel: [964601.613469] r8169 0000:04:02.0: eth1: link down
May 16 20:58:38 srv75 kernel: [964603.537367] r8169 0000:04:02.0: eth1: link down
May 16 20:58:38 srv75 kernel: [964603.601333] r8169 0000:04:02.0: eth1: link down
May 16 20:58:38 srv75 kernel: [964603.617366] r8169 0000:04:02.0: eth1: link down
May 16 20:58:38 srv75 kernel: [964603.641390] r8169 0000:04:02.0: eth1: link down
May 16 20:58:38 srv75 kernel: [964603.709262] r8169 0000:04:02.0: eth1: link down
May 16 20:58:38 srv75 kernel: [964603.773498] r8169 0000:04:02.0: eth1: link down
May 16 20:58:40 srv75 kernel: [964605.989385] r8169 0000:04:02.0: eth1: link down
May 16 20:58:40 srv75 kernel: [964606.053304] r8169 0000:04:02.0: eth1: link down
May 16 20:58:40 srv75 kernel: [964606.069368] r8169 0000:04:02.0: eth1: link down
May 16 20:58:40 srv75 kernel: [964606.093361] r8169 0000:04:02.0: eth1: link down
May 16 20:58:40 srv75 kernel: [964606.161337] r8169 0000:04:02.0: eth1: link down
May 16 20:58:40 srv75 kernel: [964606.177362] r8169 0000:04:02.0: eth1: link down
May 16 20:58:40 srv75 kernel: [964606.233433] r8169 0000:04:02.0: eth1: link down
May 16 20:58:42 srv75 kernel: [964608.317506] r8169 0000:04:02.0: eth1: link down
May 16 20:58:42 srv75 kernel: [964608.417395] r8169 0000:04:02.0: eth1: link down
May 16 20:58:42 srv75 kernel: [964608.437444] r8169 0000:04:02.0: eth1: link down
May 16 20:58:42 srv75 kernel: [964608.453374] r8169 0000:04:02.0: eth1: link down
May 16 20:58:43 srv75 kernel: [964608.509455] r8169 0000:04:02.0: eth1: link down
May 16 20:58:43 srv75 kernel: [964608.541280] r8169 0000:04:02.0: eth1: link
May 17 09:13:27 srv75 kernel: imklog 5.8.11, log source = /proc/kmsg started.
May 17 09:13:27 srv75 rsyslogd: [origin software="rsyslogd" swVersion="5.8.11" x-pid="2759" x-info="http://www.rsyslog.com"] start
May 17 09:13:27 srv75 kernel: [    0.000000] Initializing cgroup subsys cpuset
May 17 09:13:27 srv75 kernel: [    0.000000] Initializing cgroup subsys cpu
May 17 09:13:27 srv75 kernel: [    0.000000] Linux version 3.2.0-4-amd64 (debian-kernel@lists.debian.org) (gcc version 4.6.3 (Debian 4.6.3-14) ) #1 SMP Debian 3.2.54-2
May 17 09:13:27 srv75 kernel: [    0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-3.2.0-4-amd64 root=UUID=0c313e79-aa26-452a-82ec-943eba6e3cbe ro quiet
May 17 09:13:27 srv75 kernel: [    0.000000] BIOS-provided physical RAM map:
May 17 09:13:27 srv75 kernel: [    0.000000] BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
May 17 09:13:27 srv75 kernel: [    0.000000] BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
May 17 09:13:27 srv75 kernel: [    0.000000] BIOS-e820: 00000000000e4000 - 0000000000100000 (reserved)
May 17 09:13:27 srv75 kernel: [    0.000000] BIOS-e820: 0000000000100000 - 00000000bffe0000 (usable)
May 17 09:13:27 srv75 kernel: [    0.000000] BIOS-e820: 00000000bffe0000 - 00000000bffef000 (ACPI data)
May 17 09:13:27 srv75 kernel: [    0.000000] BIOS-e820: 00000000bffef000 - 00000000bfff0000 (ACPI NVS)
May 17 09:13:27 srv75 kernel: [    0.000000] BIOS-e820: 00000000bfff0000 - 00000000c0000000 (reserved)
May 17 09:13:27 srv75 kernel: [    0.000000] BIOS-e820: 00000000e0000000 - 00000000f0000000 (reserved)
May 17 09:13:27 srv75 kernel: [    0.000000] BIOS-e820: 00000000ffb00000 - 0000000100000000 (reserved)
May 17 09:13:27 srv75 kernel: [    0.000000] NX (Execute Disable) protection: active
May 17 09:13:27 srv75 kernel: [    0.000000] SMBIOS 2.3 present.
May 17 09:13:27 srv75 kernel: [    0.000000] DMI: FUJITSU SIEMENS PRIMERGY Econel200/D2020, BIOS 08.10.Rev.1100.2020 06/01/2006
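
To show how fast eth1 flaps right before the freeze, here is a minimal Python sketch that counts the r8169 "link down" messages per second. The function name count_link_down and the regular expression are my own assumptions, based only on the line format in the syslog excerpt above. It only summarizes the messages and says nothing about the cause.

```python
import re
from collections import Counter

# Assumed line format, taken from the syslog excerpt in this post:
#   May 16 20:58:33 srv75 kernel: [964599.385413] r8169 0000:04:02.0: eth1: link down
LINK_DOWN = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+ \d{2}:\d{2}:\d{2}) \S+ kernel: "
    r"\[\s*\d+\.\d+\] r8169 \S+ (?P<iface>\w+): link down"
)


def count_link_down(lines):
    """Count r8169 'link down' messages per (interface, second)."""
    counts = Counter()
    for line in lines:
        match = LINK_DOWN.match(line)
        if match:
            counts[(match.group("iface"), match.group("stamp"))] += 1
    return counts


# A few lines copied from the excerpt; the last one is the truncated
# entry written just before the freeze and is deliberately not counted.
sample = [
    "May 16 20:58:33 srv75 kernel: [964599.385413] r8169 0000:04:02.0: eth1: link down",
    "May 16 20:58:33 srv75 kernel: [964599.453234] r8169 0000:04:02.0: eth1: link down",
    "May 16 20:58:34 srv75 kernel: [964599.521359] r8169 0000:04:02.0: eth1: link down",
    "May 16 20:58:43 srv75 kernel: [964608.541280] r8169 0000:04:02.0: eth1: link",
]

for (iface, stamp), n in count_link_down(sample).items():
    print(f"{stamp} {iface}: {n} link-down message(s)")
```

The same function accepts an open file, so passing it the handle of the real syslog file would summarize the whole log instead of the sample.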

What could be causing this?

--
Mimiko desu.

