Questions tagged [pacemaker]

Pacemaker achieves maximum availability for cluster services by detecting and recovering from node and service-level failures. It does so by using the messaging and membership capabilities provided by your preferred cluster infrastructure (currently either OpenAIS or Heartbeat).

297 questions
3
votes
1 answer

Corosync dynamically add node

I have been experimenting with Pacemaker & Corosync for an HA setup. I installed it on two Ubuntu 14.04 Linodes and, after a lot of back and forth on outdated config examples and tutorials, managed to get it up and running. This was mostly due to the…
Wildex999
  • 63
  • 6
3
votes
2 answers

Importance of ha.cf file in a heartbeat/pacemaker environment?

I'm having a few issues trying to understand ha.cf and how the cluster picks up on updates. For example, when creating a new cluster, I usually: set some default options in ha.cf on node 1 through node x; start the cluster; run crm on any node, configure…
William Hilsum
  • 3,536
  • 6
  • 29
  • 39
3
votes
1 answer

multicast address use in corosync

I'm wondering about the purpose of the multicast address in the corosync messaging software. Since we have to bind each network interface to an IP address and a specific port, and each ring communicates through those IPs, I don't understand why we have…
Kuruwan
  • 91
  • 2
  • 2
  • 8
3
votes
1 answer

NFS failover fails with stale file handles while migrating resources

I'm running into a bit of a problem here. I set up two servers (CentOS 6) with GlusterFS and a shared directory between them. I have moved the NFS directory to the shared Gluster folder and have created a symlink on both boxes. The machines can talk…
Roncioiu
  • 53
  • 8
3
votes
1 answer

2 corosync nodes not online together (one offline, one online)

I am currently trying to set up corosync based on this tutorial: http://www.howtoforge.com/how-to-set-up-an-active-passive-postgresql-cluster-with-pacemaker-corosync-and-drbd-centos-5.5-p3 However, I am not able to get my 2 nodes connected. I have…
mokalovesoulmate
  • 51
  • 1
  • 1
  • 5
3
votes
1 answer

Pacemaker between CentOS 5 and CentOS 6

I'm having some trouble configuring pacemaker+corosync between CentOS 5 and CentOS 6. Here is the output of crm_mon: On node1: Last updated: Sun Jul 21 19:02:21 2013 Last change: Sun Jul 21 18:14:48 2013 via crmd on svr077-53149.localdomain Stack:…
cuonglm
  • 2,386
  • 2
  • 16
  • 20
3
votes
1 answer

corosync binding to 127.0.0.1 instead of correct interface

I have the following problem when trying to implement a two-node failover cluster, using Hetzner as the hosting provider. My corosync.conf is as follows: # Please read the corosync.conf.5 manual page compatibility: whitetank totem { version:…
thanasisk
  • 941
  • 6
  • 16
3
votes
1 answer

Make pacemaker retry failed resources

I would like to get pacemaker to retry starting my resource primitive Imq ocf:example:imq \ op monitor on-fail="restart" interval="10s" \ op start interval="0" timeout="60s" on-fail="restart" \ meta failure-timeout="30s" Note that this…
Gareth Davis
  • 253
  • 3
  • 11
3
votes
1 answer

Heartbeat Pacemaker 3 node/ip failover

I have set up a two-node active/active IP failover with Heartbeat and Pacemaker, following the link at zivtech, on the two load balancers for the backend web servers. Each of the two LBs has a public IP that is shared for failover, and I checked if the IPs are…
user53864
  • 1,723
  • 11
  • 37
  • 66
3
votes
4 answers

NFS v4, HA Migration, and stale handles on clients

I'm managing a server running NFS v4 with Pacemaker/OpenAIS. NFS is configured to use TCP. When I migrate the NFS server to another node in the Pacemaker cluster, even though the metadata is persisted, connections from the clients 'hang' and…
Karl Katzke
  • 2,596
  • 1
  • 22
  • 24
3
votes
0 answers

Add a server to an already running DRBD9 configuration

I am on CentOS 7 and I have an existing two-node HA cluster running pacemaker (1.1.23-1.el7_9.1) and DRBD (kmod-drbd90-9.0.22-3.el7_9). The backing partition for the DRBD drive is LUKS encrypted. We are adding a third server to the stack, but after…
Matt
  • 131
  • 1
3
votes
1 answer

Corosync-Pacemaker no split brain

I'm trying to set up a cluster of two nodes using CentOS 7, Corosync, Pacemaker and pcsd. I can migrate resources manually from one node to another, but if I turn off the primary node (by unplugging the power cable), the secondary node does not become…
user46170
  • 51
  • 3
2
votes
1 answer

How to control a service from Linux pacemaker cluster?

I have set up a two-node cluster on two RHEL7 virtual machines. I'm using pacemaker as my HA management software. I created one resource for the VIP and another resource for a shared FS. The VIP and FS resources are working properly. However, I don't know how…
2
votes
0 answers

systemd service in pacemaker for high availability

I recently learned about pacemaker for high availability. I have one simple service running on a Linux VM, which I started using the systemctl start command. I want to make this HA in case my Linux host goes down. For that purpose, I…
ajay_t
  • 121
  • 1
  • 3
2
votes
1 answer

Pacemaker/Corosync resource cleanup causes restart on Ubuntu (any version)

I'm having an issue with a pacemaker/corosync (2 node) cluster on Ubuntu (12.04 / 14.04 / 16.04 and 18.04) and couldn't find anyone else describing this issue. There are two resources: res_ip (virtual IP) and res_apache (apache2). These are just…
pToker
  • 51
  • 4