Bug 1456562
Summary: | Cannot upgrade OSP11 to OSP12 due to freeze after executing of overcloud deploy command | ||||||
---|---|---|---|---|---|---|---|
Product: | Red Hat OpenStack | Reporter: | Artem Hrechanychenko <ahrechan> | ||||
Component: | openstack-selinux | Assignee: | Lon Hohberger <lhh> | ||||
Status: | CLOSED ERRATA | QA Contact: | Udi Shkalim <ushkalim> | ||||
Severity: | urgent | Docs Contact: | |||||
Priority: | high | ||||||
Version: | 12.0 (Pike) | CC: | dbecker, dmatthew, dprince, jschluet, lyarwood, mburns, mcornea, mgrepl, morazi, rbrady, rhallise, rhel-osp-director-maint, sasha, sathlang, sclewis, shardy, skramaja, srevivo, therve | ||||
Target Milestone: | ga | Keywords: | Triaged | ||||
Target Release: | 12.0 (Pike) | ||||||
Hardware: | x86_64 | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | openstack-selinux-0.8.8-0.20170804200925.ad96ed3.el7ost | Doc Type: | If docs needed, set a value | ||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | |||||||
: | 1478004 (view as bug list) | Environment: | |||||
Last Closed: | 2017-12-13 21:29:42 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 1478004 | ||||||
Attachments: |
|
Description
Artem Hrechanychenko
2017-05-29 15:09:19 UTC
This bugzilla has been removed from the release and needs to be reviewed and Triaged for another Target Release. This bugzilla has been removed from the release and needs to be reviewed and Triaged for another Target Release. In the Mistral executor logs I found the following error: 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor Traceback (most recent call last): 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor File "/usr/lib/python2.7/site-packages/mistral/executors/default_executor.py", line 112, in run_action 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor result = action.run() 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor File "/usr/lib/python2.7/site-packages/mistral/actions/openstack/base.py", line 159, in run 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor (self.__class__.__name__, self.client_method_name, e_str) 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor ActionException: ZaqarAction.queue_post failed: <class 'zaqarclient.transport.errors.InternalServerError'>: Error response from Zaqar. Code: 500. Text: <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN"> 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor <html><head> 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor <title>500 Internal Server Error</title> 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor </head><body> 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor <h1>Internal Server Error</h1> 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor <p>The server encountered an internal error or 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor misconfiguration and was unable to complete 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor your request.</p> 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor <p>Please contact the server administrator at 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor [no address given] to inform them of the time this error occurred, 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor and the actions you performed just before this error.</p> 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor <p>More information about this error may be available 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor in the server error log.</p> 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor </body></html> 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor . 2017-06-07 08:31:42.569 1446 ERROR mistral.executors.default_executor It seems that Zaqar is returning an error every time Mistral attempts to access it. So from the apache logs: [Wed Jun 07 09:56:16.069984 2017] [:error] [pid 1906] [remote 192.168.24.1:144] mod_wsgi (pid=1906): Target WSGI script '/var/www/cgi-bin/zaqar/zaqar-server' cannot be loaded as Python module. [Wed Jun 07 09:56:16.070035 2017] [:error] [pid 1906] [remote 192.168.24.1:144] mod_wsgi (pid=1906): Exception occurred processing WSGI script '/var/www/cgi-bin/zaqar/zaqar-server'. [Wed Jun 07 09:56:16.070054 2017] [:error] [pid 1906] [remote 192.168.24.1:144] Traceback (most recent call last): [Wed Jun 07 09:56:16.070068 2017] [:error] [pid 1906] [remote 192.168.24.1:144] File "/var/www/cgi-bin/zaqar/zaqar-server", line 41, in <module> [Wed Jun 07 09:56:16.070091 2017] [:error] [pid 1906] [remote 192.168.24.1:144] log.register_options(conf) [Wed Jun 07 09:56:16.070098 2017] [:error] [pid 1906] [remote 192.168.24.1:144] File "/usr/lib/python2.7/site-packages/oslo_log/log.py", line 253, in register_options [Wed Jun 07 09:56:16.070110 2017] [:error] [pid 1906] [remote 192.168.24.1:144] conf.register_cli_opts(_options.common_cli_opts) [Wed Jun 07 09:56:16.070115 2017] [:error] [pid 1906] [remote 192.168.24.1:144] File "/usr/lib/python2.7/site-packages/oslo_config/cfg.py", line 2318, in __inner [Wed Jun 07 09:56:16.070124 2017] [:error] [pid 1906] [remote 192.168.24.1:144] result = f(self, *args, **kwargs) [Wed Jun 07 09:56:16.070129 2017] [:error] [pid 1906] [remote 192.168.24.1:144] File "/usr/lib/python2.7/site-packages/oslo_config/cfg.py", line 2540, in register_cli_opts [Wed Jun 07 09:56:16.070136 2017] [:error] [pid 1906] [remote 192.168.24.1:144] self.register_cli_opt(opt, group, clear_cache=False) [Wed Jun 07 09:56:16.070140 2017] [:error] [pid 1906] [remote 192.168.24.1:144] File "/usr/lib/python2.7/site-packages/oslo_config/cfg.py", line 2322, in __inner [Wed Jun 07 09:56:16.070147 2017] [:error] [pid 1906] [remote 192.168.24.1:144] return f(self, *args, **kwargs) [Wed Jun 07 09:56:16.070151 2017] [:error] [pid 1906] [remote 192.168.24.1:144] File "/usr/lib/python2.7/site-packages/oslo_config/cfg.py", line 2532, in register_cli_opt [Wed Jun 07 09:56:16.070185 2017] [:error] [pid 1906] [remote 192.168.24.1:144] raise ArgsAlreadyParsedError("cannot register CLI option") [Wed Jun 07 09:56:16.070201 2017] [:error] [pid 1906] [remote 192.168.24.1:144] ArgsAlreadyParsedError: arguments already parsed: cannot register CLI option Restarting apache seems to fix it, but I don't know why. After the issue reappeared, it seemed that an additional step was required: move /var/log/zaqar/zaqar.log. I wasn't able to know why yet, but maybe permissions error? w/a restart httpd service and delete /var/log/zaqar/zaqar.log Any chance this is related to https://review.openstack.org/#/c/472618 Looks like an SELinux issue: [root@undercloud-0 stack]# grep zaqar /var/log/audit/audit.log type=AVC msg=audit(1499809225.125:6424): avc: denied { open } for pid=23130 comm="httpd" path="/var/log/zaqar/zaqar.log" dev="vda1" ino=12686192 scontext=system_u:system_r:httpd_t:s0 tcontext=system_u:object_r:var_log_t:s0 tclass=file type=AVC msg=audit(1499809391.677:6514): avc: denied { open } for pid=23563 comm="httpd" path="/var/log/zaqar/zaqar.log" dev="vda1" ino=12686192 scontext=system_u:system_r:httpd_t:s0 tcontext=system_u:object_r:var_log_t:s0 tclass=file [root@undercloud-0 stack]# ls -lZ /var/log/zaqar/zaqar.log -rw-r--r--. zaqar zaqar system_u:object_r:var_log_t:s0 /var/log/zaqar/zaqar.log After removing the file and restarting httpd: [root@undercloud-0 stack]# ls -lZ /var/log/zaqar/zaqar.log -rw-r--r--. zaqar zaqar system_u:object_r:httpd_log_t:s0 /var/log/zaqar/zaqar.log Lon, Can you help confirm this is an selinux bug? Created attachment 1304472 [details]
audit.log
After undercloud upgrade:
[root@undercloud-0 stack]# rpm -qa | grep openstack-selinux
openstack-selinux-0.8.8-0.20170622195307.74ddc0e.el7ost.noarch
[root@undercloud-0 stack]# ls -lZ /var/log/zaqar/zaqar.log
-rw-r--r--. zaqar zaqar system_u:object_r:var_log_t:s0 /var/log/zaqar/zaqar.log
Attaching /var/log/audit/audit.log after undercloud upgrade.
*** Bug 1477611 has been marked as a duplicate of this bug. *** We need to set /var/log/zaqar/zaqar.log to a different file context and then call 'restorecon' after deployment. https://github.com/redhat-openstack/openstack-selinux/commit/f7297a77903fcb8963c7796e49d357cb7cce2b28 This still needs a restorecon for it to operate correctly. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2017:3462 The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days |