[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Network connection drops in Squeeze



On 03/09/2011 06:04 PM, David A. Parker wrote:
On 03/09/2011 05:32 PM, David A. Parker wrote:
So far, this has never happened randomly during an SSH session. It
always seems to be triggered by some sort of network activity. What's
especially frustrating is that absolutely nothing gets logged in the
system logs when this happens.

I originally suspected that the network adapter was being put to sleep,
but that doesn't explain why I can sustain a normal SSH connection for
long periods of time. I can even leave the computer on overnight and
nothing stops working. If it were a sleep or suspend issue, I would
think it would happen when the interface was idle, but I haven't ruled
this out.

Does anyone know what I should check to see if this is some sort of
power management issue? I stopped acpid, but the behavior persists.


The behavior I'm seeing is very similar to what's described here:

https://bugs.launchpad.net/ubuntu/+source/linux/+bug/404708

Could this be a driver bug, perhaps?


I triggered one of these wird lock-ups and then let it sit in the "frozen" state for several minutes. It eventually corrected the problem, and the following messages got dumped to the logs:

[  442.826956] Clocksource tsc unstable (delta = 299985606581 ns)
[  442.828411] Switching to clocksource hpet
[  442.829530] BUG: soft lockup - CPU#3 stuck for 279s! [ssh:1186]
[ 442.829566] Modules linked in: vboxnetadp vboxnetflt vboxdrv loop snd_hda_codec_realtek radeon ttm snd_hda_intel drm_kms_helper snd_hda_codec drm edac_core i2c_algo_bit snd_hwdep i2c_piix4 shpchp snd_pcm edac_mce_amd pci_hotplug i2c_ core snd_timer psmouse snd soundcore tpm_infineon snd_page_alloc wmi tpm tpm_bios pcspkr evdev button processor serio_raw ext4 mbcache jbd2 crc16 sg sr_mod cdrom sd_mod crc_t10dif usbhid hid ahci ohci_hcd libata thermal thermal_sys ehci
_hcd tg3 libphy usbcore nls_base scsi_mod [last unloaded: scsi_wait_scan]
[  442.829593] CPU 3:
[ 442.829594] Modules linked in: vboxnetadp vboxnetflt vboxdrv loop snd_hda_codec_realtek radeon ttm snd_hda_intel drm_kms_helper snd_hda_codec drm edac_core i2c_algo_bit snd_hwdep i2c_piix4 shpchp snd_pcm edac_mce_amd pci_hotplug i2c_ core snd_timer psmouse snd soundcore tpm_infineon snd_page_alloc wmi tpm tpm_bios pcspkr evdev button processor serio_raw ext4 mbcache jbd2 crc16 sg sr_mod cdrom sd_mod crc_t10dif usbhid hid ahci ohci_hcd libata thermal thermal_sys ehci
_hcd tg3 libphy usbcore nls_base scsi_mod [last unloaded: scsi_wait_scan]
[ 442.829614] Pid: 1186, comm: ssh Not tainted 2.6.32-5-amd64 #1 HP Compaq 6005 Pro MT PC [ 442.829616] RIP: 0010:[<ffffffff8119502d>] [<ffffffff8119502d>] copy_user_generic_string+0x2d/0x40
[  442.829623] RSP: 0018:ffff88021cf89b80  EFLAGS: 00000246
[ 442.829624] RAX: ffff88021cf88000 RBX: 0000000000000370 RCX: 0000000000000001 [ 442.829626] RDX: 0000000000000000 RSI: ffff88021c4f6604 RDI: 00007fffc4241358 [ 442.829628] RBP: ffffffff8101166e R08: 00000000d7f8a216 R09: 0000000000000000 [ 442.829629] R10: 0000000000000000 R11: ffff88021d2f8d00 R12: ffff88021ccc0000 [ 442.829631] R13: 0000000000000000 R14: 00000000000000db R15: ffffffff810fc797 [ 442.829632] FS: 00007fa02829f720(0000) GS:ffff880008d80000(0000) knlGS:0000000000000000
[  442.829634] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 442.829636] CR2: 00007fa028248000 CR3: 000000021d0f1000 CR4: 00000000000006e0 [ 442.829637] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ 442.829639] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[  442.829640] Call Trace:
[  442.829644]  [<ffffffff812495d0>] ? memcpy_toiovec+0x34/0x63
[  442.829646]  [<ffffffff81249edc>] ? skb_copy_datagram_iovec+0x7c/0x1ed
[  442.829649]  [<ffffffff8127e251>] ? tcp_recvmsg+0x6e5/0xa9e
[  442.829653]  [<ffffffff812425fa>] ? sock_common_recvmsg+0x30/0x45
[  442.829656]  [<ffffffff812405f8>] ? sock_aio_read+0xb9/0xc4
[  442.829658]  [<ffffffff81195030>] ? copy_user_generic_string+0x30/0x40
[  442.829661]  [<ffffffff810ee809>] ? do_sync_read+0xce/0x113
[  442.829665]  [<ffffffff8100f6c4>] ? __switch_to+0x1ad/0x297
[  442.829668]  [<ffffffff81064d2a>] ? autoremove_wake_function+0x0/0x2e
[  442.829671]  [<ffffffff81048079>] ? finish_task_switch+0x96/0xaf
[  442.829673]  [<ffffffff810ef265>] ? vfs_read+0xb9/0xff
[  442.829675]  [<ffffffff810ef367>] ? sys_read+0x45/0x6e
[  442.829677]  [<ffffffff81010b42>] ? system_call_fastpath+0x16/0x1b

Note that this was happening before I installed VirtualBox, so I don't think the vbox modules are to blame.

I tried the ethtool command that was given in that bug report, but it did not help. Any ideas?

    Thanks,
    Dave


Reply to: