Bug 1751559
| Summary: | osp15 Overcloud deployment fails with : Task 'provide_manageable' (02793a02-c785-4682-b85d-3aca564f6f90) [RUNNING -> ERROR, msg=Failure caused by error in tasks: send_message | ||
|---|---|---|---|
| Product: | Red Hat OpenStack | Reporter: | pkomarov |
| Component: | openstack-ironic | Assignee: | RHOS Maint <rhos-maint> |
| Status: | CLOSED DUPLICATE | QA Contact: | mlammon |
| Severity: | urgent | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 15.0 (Stein) | CC: | bfournie, mburns, sathlang |
| Target Milestone: | --- | Keywords: | AutomationBlocker |
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2019-09-12 11:30:02 UTC | Type: | Bug |
| Bug Depends On: | 1751300 | ||
| Bug Blocks: | |||
Description
pkomarov
2019-09-12 07:30:27 UTC
sosreports and stack home are at: http://rhos-release.virt.bos.redhat.com/log/pkomarov_sosreports/BZ_1751559/
live env: titan88.lab.eng.tlv2.redhat.com

Hi,
just FYI, it seems we may have an SELinux issue that prevents the nodes from being provisioned. Running these commands on undercloud-0 seems to fix it:
cat /var/log/audit/audit.log | audit2allow -M local ;
semodule -i local.pp;
systemctl restart tripleo_neutron_dhcp.service ;
and on the hypervisor:
for i in compute-0 controller-0 controller-1 controller-2 ; do virsh destroy $i; virsh start $i ;done
where local.pp is the compiled version of:
[root@undercloud-0 ~]# cat local.te
module local 1.0;
require {
type unlabeled_t;
type system_dbusd_t;
type spc_t;
type container_t;
class unix_stream_socket connectto;
class key create;
}
#============= container_t ==============
allow container_t system_dbusd_t:unix_stream_socket connectto;
#============= spc_t ==============
allow spc_t unlabeled_t:key create;
Then the Heat deployment went on and Ansible started.
These commands have to be run while the Heat deployment is "stuck" (and heading toward failure).
For CI, they should be run after the undercloud deployment as a workaround (without rebooting the nodes in that case).
See https://bugzilla.redhat.com/show_bug.cgi?id=1751300
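For CI use, the undercloud-side commands above could be wrapped in one function. This is only a sketch: the function names and the `DRY_RUN` switch are hypothetical additions for illustration, and the audit log path is simply the one used in the commands above. With `DRY_RUN=1` the commands are printed instead of executed, which makes it possible to review what would run.

```shell
# Sketch only: wraps the undercloud workaround commands from this report.
# DRY_RUN and both function names are hypothetical, not part of any tool.

# Print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

apply_selinux_workaround() {
    audit_log="${1:-/var/log/audit/audit.log}"

    # Build the local policy module (local.te / local.pp) from the
    # denials recorded in the audit log. audit2allow reads stdin here,
    # which is equivalent to the "cat ... | audit2allow" form above.
    if [ "${DRY_RUN:-0}" = "1" ]; then
        printf '%s\n' "audit2allow -M local < $audit_log"
    else
        audit2allow -M local < "$audit_log"
    fi

    # Load the compiled module, then restart the DHCP agent service.
    run semodule -i local.pp
    run systemctl restart tripleo_neutron_dhcp.service
}

# Example (prints the three commands without running them):
#   DRY_RUN=1 apply_selinux_workaround
```

Note that, like the original commands, this turns every denial found in the audit log into an allow rule, so it is suitable only as a temporary workaround.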
Now, I think that we have an issue because the deployment failure somehow wasn't detected.

This is the same SELinux issue we are hitting everywhere else and which is causing tests to fail. It is due to the DHCP agent not starting because of:
"process_linux.go:430: container init caused \\"write /proc/self/attr/keycreate: permission denied\\""\n: internal libpod error\n',)]
See containers/neutron/dhcp-agent.log.1, which is filled with this error. Because the agent can't be started, no nodes can be provisioned. Marking as duplicate.

*** This bug has been marked as a duplicate of bug 1751300 ***
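Since the failure was not detected automatically, a quick check for the keycreate denial in the DHCP agent log can confirm whether a given environment is hitting this issue. The following is a minimal sketch: both function names are hypothetical, and the log file path is passed in as an argument rather than assumed.

```shell
# Sketch only: count log lines containing the keycreate denial quoted above.
# Function names are hypothetical; pass the dhcp-agent log path explicitly.

count_keycreate_denials() {
    # grep -c prints the number of matching lines. It exits non-zero when
    # there are no matches, so "|| true" keeps "set -e" callers from aborting.
    grep -c 'write /proc/self/attr/keycreate: permission denied' "$1" || true
}

dhcp_agent_blocked() {
    # Succeeds (exit 0) when at least one denial is present in the log.
    n="$(count_keycreate_denials "$1")"
    [ "${n:-0}" -gt 0 ]
}

# Example:
#   if dhcp_agent_blocked containers/neutron/dhcp-agent.log.1; then
#       echo "DHCP agent is failing on the keycreate SELinux denial"
#   fi
```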