[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#629253: linux-image-2.6.32-5-amd64: Squeeze amd64 PV DOMU live migration fails



Package: linux-2.6
Version: 2.6.32-34
Severity: normal

Hi,

At first I must point out:

For me Squeeze is far away from being the best Debian ever. 
I'm really disappointed from the quality of Debian Squeeze as server OS.
I began testing when Squeeze became stable. Too many problems. Much too much time spent on testing, finding errors and fixing/workaround (my boss really loves me now for the work that did not succeed during the last few months). 

I don't know where to point this problems out at a higher position, because they are more general? Is there a contact to the "Debian management" where I can ask for more stability and completeness of Debian than for new features (possibly fine for a desktop only distri)? Answering this question would be nice!

Our business must have all security fixes within 3 months after release. 
Also we have 24/7 web services, servers mostly clustered, where every minute downtime really costs money (if downtime is too long it costs existence). 
We can't stay on Lenny for very long. EOL is coming near. 
After about half a year Squeeze stable, it is far away from being ready for our production servers.
That's why we decided for now to go with a RHEL clone for our new installed productive servers. 
I will keep an eye on Debian. Once the quality comes back....who knows.
A few lines later I describe the problems I had testing Squeeze as XEN DOM0.

My install system for Windows (OPSI) our Solaris servers (Jumpstart) and RHEL based distros (Kickstart) still resides on Debian Squeeze. 
Moving them to another platform is a lot of work. That's why it would be still nice to have a live migrateable Debian Squeeze. Here we go:

Sometimes lm works but after migration machine is not reachable through network anymore. 
Somebody mentioned a cronjob with traffic on vnet-device would help keeping machine reachable through network.
Mostly lm crashes during migration.

I tested all versions from 2.6.32-30 -- -34.

Dom0 is: Choose on - I tested:

XCP 1.0
OpenSuse 11.3 with Xen 4.0.1
Debian Squeeze (4.0.1)

Squeeze worked most bad as DOM0 and should not be advertised as a working enterprise solution for XEN virtualization:
- crashes our dell r7xx servers when using multipath + iscsi or only iscsi
- live migration (with a working os's(2008R2 f.e.) as DOMU ) works only 5-6 times then it crashes.
    f.e. scripted live migration of a 2008R2 and a Lenny DOMU (at the same time) between 2 nodes worked 2 days
    (DOMUs reachable all the time) with XCP 1.0 (about 230 live migrations each DOMU), then we stopped testing
- xend freezes
- random reboots of DOM0 due to other reasons that could not be located
- lots of scaring error messages when uptime grows

but back to live migration problem of Debian Squeeze DOMU's:
DomU's are installed with FAI.


Lenny with a similar install works just fine when:
/proc/cmdline: ... clocksource=jiffies
and 
/etc/sysctl.conf: ...\nxen_independent_wallclock=1\n....

Live Migration works also fine with 
Win2008R2
Winxpsp3
ncp 3.0.1
RHEL(PUIAS clone) 5.6
RHEL(PUIAS clone) 6.1

I assume the problem is the Squeeze kernel.




-- Package-specific info:
** Version:
Linux version 2.6.32-5-amd64 (Debian 2.6.32-34) 

** Command line:
root=/dev/mapper/vg01-root ro console=tty0 console=hvc0 

** Not tainted

** Model information
not available

** Loaded modules:
Module                  Size  Used by
nls_utf8                1208  0 
isofs                  27480  0 
loop                   11799  0 
autofs4                20805  8 
nfsd                  254782  13 
exportfs                3170  1 nfsd
nfs                   241114  1 
lockd                  57603  2 nfsd,nfs
fscache                29834  1 nfs
nfs_acl                 2031  2 nfsd,nfs
auth_rpcgss            33476  2 nfsd,nfs
sunrpc                161541  15 nfsd,nfs,lockd,nfs_acl,auth_rpcgss
btrfs                 375670  1 
zlib_deflate           17746  1 btrfs
crc32c                  2560  1 
libcrc32c               1074  1 btrfs
ext3                  106518  1 
jbd                    37085  1 ext3
psmouse                49937  0 
usbhid                 33292  0 
hid                    63257  1 usbhid
uhci_hcd               18521  0 
ohci_hcd               19343  0 
ehci_hcd               32081  0 
usbcore               122386  4 usbhid,uhci_hcd,ohci_hcd,ehci_hcd
nls_base                6377  3 nls_utf8,isofs,usbcore
dm_crypt               10664  0 
snd_pcm                60503  0 
snd_timer              15582  1 snd_pcm
snd                    46446  2 snd_pcm,snd_timer
soundcore               4598  1 snd
snd_page_alloc          6249  1 snd_pcm
pcspkr                  1699  0 
joydev                  8459  0 
evdev                   7352  0 
ext4                  289033  1 
mbcache                 5050  2 ext3,ext4
jbd2                   67015  1 ext4
crc16                   1319  1 ext4
dm_mod                 53706  10 dm_crypt
raid10                 17809  0 
raid456                44500  0 
async_raid6_recov       5170  1 raid456
async_pq                3479  2 raid456,async_raid6_recov
raid6_pq               77179  2 async_raid6_recov,async_pq
async_xor               2478  3 raid456,async_raid6_recov,async_pq
xor                     4380  1 async_xor
async_memcpy            1198  2 raid456,async_raid6_recov
async_tx                1734  5 raid456,async_raid6_recov,async_pq,async_xor,async_memcpy
raid1                  18431  0 
raid0                   5517  0 
multipath               6059  0 
linear                  3455  0 
md_mod                 73872  6 raid10,raid456,raid1,raid0,multipath,linear
xen_netfront           15196  0 
xen_blkfront            8358  5 

** PCI devices:

** USB devices:
not available


-- System Information:
Debian Release: 6.0.1
  APT prefers stable
  APT policy: (990, 'stable')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.32-5-amd64 (SMP w/2 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages linux-image-2.6.32-5-amd64 depends on:
ii  debconf [debconf-2.0]         1.5.36.1   Debian configuration management sy
ii  initramfs-tools [linux-initra 0.98.8     tools for generating an initramfs
ii  linux-base                    2.6.32-31  Linux image base package
ii  module-init-tools             3.12-1     tools for managing Linux kernel mo

Versions of packages linux-image-2.6.32-5-amd64 recommends:
ii  firmware-linux-free           2.6.32-31  Binary firmware for various driver

Versions of packages linux-image-2.6.32-5-amd64 suggests:
pn  grub | lilo                   <none>     (no description available)
pn  linux-doc-2.6.32              <none>     (no description available)

Versions of packages linux-image-2.6.32-5-amd64 is related to:
ii  firmware-bnx2                 0.28       Binary firmware for Broadcom NetXt
ii  firmware-bnx2x                0.28       Binary firmware for Broadcom NetXt
pn  firmware-ipw2x00              <none>     (no description available)
pn  firmware-ivtv                 <none>     (no description available)
pn  firmware-iwlwifi              <none>     (no description available)
ii  firmware-linux                0.28       Binary firmware for various driver
ii  firmware-linux-nonfree        0.28       Binary firmware for various driver
ii  firmware-qlogic               0.28       Binary firmware for QLogic IBA7220
pn  firmware-ralink               <none>     (no description available)
pn  xen-hypervisor                <none>     (no description available)

-- debconf information:
  linux-image-2.6.32-5-amd64/postinst/depmod-error-initrd-2.6.32-5-amd64: false
  linux-image-2.6.32-5-amd64/prerm/removing-running-kernel-2.6.32-5-amd64: true
  linux-image-2.6.32-5-amd64/postinst/missing-firmware-2.6.32-5-amd64:
  linux-image-2.6.32-5-amd64/postinst/ignoring-do-bootloader-2.6.32-5-amd64:



Reply to: