Bug 906529
| Summary: | Cluster configuration fails using cluster manager pcs | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 7 | Reporter: | Justin Payne <jpayne> |
| Component: | pcs | Assignee: | Chris Feist <cfeist> |
| Status: | CLOSED NOTABUG | QA Contact: | |
| Severity: | urgent | Docs Contact: | |
| Priority: | urgent | ||
| Version: | 7.1 | CC: | bugproxy, cfeist, cluster-maint, jkachuck, jkortus, wgomerin |
| Target Milestone: | rc | ||
| Target Release: | --- | ||
| Hardware: | x86_64 | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | 894232 | Environment: | |
| Last Closed: | 2013-02-20 17:58:06 UTC | Type: | --- |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 894232 | ||
| Bug Blocks: | 744225 | ||
Description
Justin Payne
2013-01-31 19:42:48 UTC
[root@dash-01 ~]# /etc/init.d/pacemaker status
pacemaker.service - Pacemaker High Availability Cluster Manager
Loaded: loaded (/usr/lib/systemd/system/pacemaker.service; disabled)
Active: active (running) since Thu 2013-01-31 13:33:37 CST; 4s ago
Main PID: 7380 (pacemakerd)
CGroup: name=systemd:/system/pacemaker.service
├─7380 /usr/sbin/pacemakerd -f
└─7383 /usr/libexec/pacemaker/stonithd
Jan 31 13:33:37 dash-01.lab.msp.redhat.com attrd[7385]: notice: crm_cluster_connect: Connecting to cluster infrastructure: corosync
Jan 31 13:33:37 dash-01.lab.msp.redhat.com pengine[7386]: error: crm_is_writable: /var/lib/pacemaker/pengine must exist and be a directory
Jan 31 13:33:37 dash-01.lab.msp.redhat.com pengine[7386]: error: main: Bad permissions on /var/lib/pacemaker/pengine. Terminating
Jan 31 13:33:37 dash-01.lab.msp.redhat.com pacemakerd[7380]: error: pcmk_child_exit: Child process pengine exited (pid=7386, rc=100)
Jan 31 13:33:37 dash-01.lab.msp.redhat.com pacemakerd[7380]: warning: pcmk_child_exit: Pacemaker child process pengine no longer wishes to be respawned. Shutting ourselves down.
Jan 31 13:33:37 dash-01.lab.msp.redhat.com pacemakerd[7380]: notice: stop_child: Stopping attrd: Sent -15 to process 7385
Jan 31 13:33:37 dash-01.lab.msp.redhat.com attrd[7385]: notice: main: Starting mainloop...
Jan 31 13:33:37 dash-01.lab.msp.redhat.com attrd[7385]: notice: main: Exiting...
Jan 31 13:33:37 dash-01.lab.msp.redhat.com pacemakerd[7380]: notice: stop_child: Stopping lrmd: Sent -15 to process 7384
Jan 31 13:33:37 dash-01.lab.msp.redhat.com pacemakerd[7380]: notice: stop_child: Stopping stonith-ng: Sent -15 to process 7383
[root@dash-01 ~]# ls -ld /var/lib/pacemaker/pengine
drwxr-x---. 2 hacluster hacluster 6 Oct 25 05:46 /var/lib/pacemaker/pengine
[root@dash-01 ~]# getenforce
Disabled
[root@dash-01 ~]# systemctl status firewalld.service
firewalld.service - firewalld - dynamic firewall daemon
Loaded: loaded (/usr/lib/systemd/system/firewalld.service; disabled)
Active: inactive (dead)
[root@dash-01 ~]# systemctl status iptables.service
iptables.service
Loaded: error (Reason: No such file or directory)
Active: inactive (dead)
[root@dash-01 ~]# chmod 777 /var/lib/pacemaker/pengine
[root@dash-01 ~]# /etc/init.d/pacemaker restart
Restarting pacemaker (via systemctl): [ OK ]
[root@dash-01 ~]# /etc/init.d/pacemaker status
pacemaker.service - Pacemaker High Availability Cluster Manager
Loaded: loaded (/usr/lib/systemd/system/pacemaker.service; disabled)
Active: active (running) since Thu 2013-01-31 13:39:21 CST; 5s ago
Main PID: 8264 (pacemakerd)
CGroup: name=systemd:/system/pacemaker.service
├─8264 /usr/sbin/pacemakerd -f
└─8268 /usr/libexec/pacemaker/stonithd
Jan 31 13:39:22 dash-01.lab.msp.redhat.com pacemakerd[8264]: notice: stop_child: Stopping pengine: Sent -15 to process 8271
Jan 31 13:39:22 dash-01.lab.msp.redhat.com pengine[8271]: error: crm_is_writable: /var/lib/pacemaker/pengine must exist and be a directory
Jan 31 13:39:22 dash-01.lab.msp.redhat.com attrd[8270]: notice: crm_cluster_connect: Connecting to cluster infrastructure: corosync
Jan 31 13:39:22 dash-01.lab.msp.redhat.com pacemakerd[8264]: error: pcmk_child_exit: Child process pengine exited (pid=8271, rc=100)
Jan 31 13:39:22 dash-01.lab.msp.redhat.com pacemakerd[8264]: warning: pcmk_child_exit: Pacemaker child process pengine no longer wishes to be respawned. Shutting ourselves down.
Jan 31 13:39:22 dash-01.lab.msp.redhat.com pacemakerd[8264]: notice: stop_child: Stopping attrd: Sent -15 to process 8270
Jan 31 13:39:22 dash-01.lab.msp.redhat.com attrd[8270]: notice: main: Starting mainloop...
Jan 31 13:39:22 dash-01.lab.msp.redhat.com attrd[8270]: notice: main: Exiting...
Jan 31 13:39:22 dash-01.lab.msp.redhat.com pacemakerd[8264]: notice: stop_child: Stopping lrmd: Sent -15 to process 8269
Jan 31 13:39:22 dash-01.lab.msp.redhat.com pacemakerd[8264]: notice: stop_child: Stopping stonith-ng: Sent -15 to process 8268
[root@dash-01 ~]# ls -ld /var/lib/pacemaker/pengine
drwxrwxrwx. 2 hacluster hacluster 6 Oct 25 05:46 /var/lib/pacemaker/pengine
-bash-4.1$ for i in `seq 1 3`; do qarsh root@dash-0$i rpm -q pcs; done
pcs-0.9.30-1.el7.x86_64
pcs-0.9.30-1.el7.x86_64
pcs-0.9.30-1.el7.x86_64

Closing this as it was not a bug. I had created a second hacluster user and caused permissions issues during the cluster setup. Correct behavior is as follows:

[root@dash-01 ~]# ls -ld /var/lib/pacemaker/pengine
drwxr-x---. 2 hacluster haclient 6 Oct 25 05:46 /var/lib/pacemaker/pengine
[root@dash-01 ~]# id hacluster
uid=999(hacluster) gid=999(haclient) groups=999(haclient)

Once the UID and GID of the hacluster user were corrected, things worked as they should.
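
The difference between the failing and the working host is the group on /var/lib/pacemaker/pengine: hacluster in the first listing, haclient in the corrected one. A minimal sketch of a check for this kind of mismatch follows. The helper name check_owner is hypothetical and not part of pcs or pacemaker, and the sketch assumes GNU coreutils stat (the -c option).

```shell
#!/bin/sh
# Sketch: compare a directory's owner and group against expected names.
# check_owner is a hypothetical helper, not part of pcs or pacemaker.
# Assumes GNU coreutils stat (the -c option).
check_owner() {
    dir=$1
    want_user=$2
    want_group=$3

    # %U and %G print the owner name and group name of the path.
    # Return 2 if the path cannot be inspected at all.
    actual=$(stat -c '%U:%G' "$dir") || return 2

    if [ "$actual" = "$want_user:$want_group" ]; then
        echo "OK: $dir is owned by $actual"
        return 0
    fi
    echo "MISMATCH: $dir is owned by $actual, expected $want_user:$want_group"
    return 1
}
```

With the values from this report, the call would be `check_owner /var/lib/pacemaker/pengine hacluster haclient`. It returns 0 on a match, 1 on a mismatch, and 2 if the path cannot be read. Running `id hacluster` alongside it, as in the comment above, shows whether the account itself has the expected UID and GID.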