[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Problems with Pacemaker + Corosync after reboot



Hi all!

I'm beginning to test HA clusters with GNU/Linux and for that I decided
to try Pacemaker + Corosync in Debian Lenny following this [1] howto.

Both packages were installed from the Backports repositories. But I am
observing that if after configuration I reboot a node, it fails to join
to the cluster after the boot.

This is what I see in /var/log/daemon.log:

--------------------------------------------------------------------------
Dec 19 17:13:13 atlantis corosync[1508]:   [pcmk  ] WARN: route_ais_message: Sending message to local.crmd failed: unknown (rc=-2)
Dec 19 17:13:13 atlantis corosync[1508]:   [pcmk  ] WARN: route_ais_message: Sending message to local.cib failed: unknown (rc=-2)
Dec 19 17:13:13 atlantis corosync[1508]:   [pcmk  ] WARN: route_ais_message: Sending message to local.attrd failed: unknown (rc=-2)
Dec 19 17:13:13 atlantis corosync[1508]:   [pcmk  ] WARN: route_ais_message: Sending message to local.cib failed: unknown (rc=-2)
Dec 19 17:13:14 atlantis corosync[1508]:   [pcmk  ] WARN: route_ais_message: Sending message to local.cib failed: unknown (rc=-2)
Dec 19 17:13:14 atlantis corosync[1508]:   [pcmk  ] WARN: route_ais_message: Sending message to local.cib failed: unknown (rc=-2)
Dec 19 17:13:21 atlantis corosync[1508]:   [TOTEM ] A processor failed, forming new configuration.
Dec 19 17:13:25 atlantis corosync[1508]:   [pcmk  ] notice: pcmk_peer_update: Transitional membership event on ring 72: memb=1, new=0, lost=1
Dec 19 17:13:25 atlantis corosync[1508]:   [pcmk  ] info: pcmk_peer_update: memb: atlantis 335544586
Dec 19 17:13:25 atlantis corosync[1508]:   [pcmk  ] info: pcmk_peer_update: lost: daedalus 369099018
Dec 19 17:13:25 atlantis corosync[1508]:   [pcmk  ] notice: pcmk_peer_update: Stable membership event on ring 72: memb=1, new=0, lost=0
Dec 19 17:13:25 atlantis corosync[1508]:   [pcmk  ] info: pcmk_peer_update: MEMB: atlantis 335544586
Dec 19 17:13:25 atlantis corosync[1508]:   [pcmk  ] info: ais_mark_unseen_peer_dead: Node daedalus was not seen in the previous transition
Dec 19 17:13:25 atlantis corosync[1508]:   [pcmk  ] info: update_member: Node 369099018/daedalus is now: lost
Dec 19 17:13:25 atlantis corosync[1508]:   [pcmk  ] info: send_member_notification: Sending membership update 72 to 0 children
Dec 19 17:13:25 atlantis corosync[1508]:   [TOTEM ] A processor joined or left the membership and a new membership was formed.
Dec 19 17:13:25 atlantis corosync[1508]:   [MAIN  ] Completed service synchronization, ready to provide service.
--------------------------------------------------------------------------


# ps auxf
[...]
root      1508  0.1  1.9 182624  4880 ?        Ssl  15:52   0:22 /usr/sbin/corosync
root      1539  0.0  1.2 168144  3240 ?        S    15:52   0:00  \_ /usr/sbin/corosync
root      1540  0.0  1.2 168144  3240 ?        S    15:52   0:00  \_ /usr/sbin/corosync
root      1541  0.0  1.2 168144  3240 ?        S    15:52   0:00  \_ /usr/sbin/corosync
root      1542  0.0  1.2 168144  3240 ?        S    15:52   0:00  \_ /usr/sbin/corosync
root      1543  0.0  1.2 168144  3240 ?        S    15:52   0:00  \_ /usr/sbin/corosync
root      1544  0.0  1.2 168144  3240 ?        S    15:52   0:00  \_ /usr/sbin/corosync


From what I see in the howto, the output should be something like this:


root     29980  0.0  0.8  44304  3808 ?        Ssl  20:55   0:00 /usr/sbin/corosync
root     29986  0.0  2.4  10812 10812 ?        SLs  20:55   0:00  \_ /usr/lib/heartbeat/stonithd
102      29987  0.0  0.8  13012  3804 ?        S    20:55   0:00  \_ /usr/lib/heartbeat/cib
root     29988  0.0  0.4   5444  1800 ?        S    20:55   0:00  \_ /usr/lib/heartbeat/lrmd
102      29989  0.0  0.5  12364  2368 ?        S    20:55   0:00  \_ /usr/lib/heartbeat/attrd
102      29990  0.0  0.5   8604  2304 ?        S    20:55   0:00  \_ /usr/lib/heartbeat/pengine
102      29991  0.0  0.6  12648  3080 ?        S    20:55   0:00  \_ /usr/lib/heartbeat/crmd


I also tried compiling Pacemaker using these [2] steps, but I get the
same result.


Thanks in advance for your reply.

Regards,
Daniel

[1] http://www.clusterlabs.org/wiki/Debian_Lenny_HowTo
[2] http://www.clusterlabs.org/wiki/Install#Building_from_Source
-- 
Fingerprint: BFB3 08D6 B4D1 31B2 72B9  29CE 6696 BF1B 14E6 1D37
Powered by Debian GNU/Linux Lenny - Linux user #188.598

Attachment: signature.asc
Description: Digital signature


Reply to: