Bug 1353346
Summary: | [OSP-director-9.0] After 7.3 -> 8.0 undercloud upgrade was successful, the following 8.0 -> 9.0 undercloud upgrade fails ( Tuskar leftovers ) | ||||||
---|---|---|---|---|---|---|---|
Product: | Red Hat OpenStack | Reporter: | Omri Hochman <ohochman> | ||||
Component: | instack-undercloud | Assignee: | James Slagle <jslagle> | ||||
Status: | CLOSED CURRENTRELEASE | QA Contact: | Ronnie Rasouli <rrasouli> | ||||
Severity: | high | Docs Contact: | |||||
Priority: | high | ||||||
Version: | 9.0 (Mitaka) | CC: | aschultz, augol, dbecker, dsavinea, jason.dobies, jcoufal, jschluet, mandreou, mburns, morazi, ohochman, rhel-osp-director-maint, sasha, sathlang, slinaber, tvignaud | ||||
Target Milestone: | zstream | Keywords: | TestOnly, Triaged | ||||
Target Release: | 9.0 (Mitaka) | ||||||
Hardware: | x86_64 | ||||||
OS: | Linux | ||||||
Whiteboard: | related to https://bugzilla.redhat.com/show_bug.cgi?id=1351712 | ||||||
Fixed In Version: | instack-undercloud-4.0.0-17.el7ost | Doc Type: | Bug Fix | ||||
Doc Text: |
Leftover tuskar packages that were installed with RHOSP 7 prevent a successful upgrade from RHOSP 7 to 8 to 9.
After a successful upgrade from RHOSP 7 to 8, the undercloud upgrade from RHOSP 8 to 9 fails with an error (Command 'instack-install-undercloud' returned non-zero exit status 1) and is unable to restart the httpd service after the package update. For more information about the errors from the httpd service, see https://bugzilla.redhat.com/show_bug.cgi?id=1353346#c5
After upgrading from RHOSP 7 to 8, manually remove the tuskar packages and files from the undercloud node, before proceeding with the RHOSP 8 to 9 undercloud upgrade:
sudo yum remove 'tuskar*';
tuskarpyc_files=(/usr/share/openstack-dashboard/openstack_dashboard/local/enabled/_10_admin.pyc /usr/share/openstack-dashboard/openstack_dashboard/local/enabled/_20_project.pyc /usr/share/openstack-dashboard/openstack_dashboard/local/enabled/_30_identity.pyc /usr/share/openstack-dashboard/openstack_dashboard/local/enabled/_50_tuskar.pyc /usr/share/openstack-dashboard/openstack_dashboard/local/enabled/_60_tuskar_boxes.pyc /usr/share/openstack-dashboard/openstack_dashboard/local/enabled/_60_tuskar_sat_ui.pyc);
for file in "${tuskarpyc_files[@]}"; do sudo rm -f "$file"; done
The undercloud upgrade from RHOSP 8 to 9 completes without error.
|
Story Points: | --- | ||||
Clone Of: | Environment: | ||||||
Last Closed: | 2019-07-10 10:41:32 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | 1455496, 1483233 | ||||||
Bug Blocks: | |||||||
Attachments: |
|
Description
Omri Hochman
2016-07-06 20:48:05 UTC
Marios - We believe this might be related to another issue you're looking at (sorry, I don't have the actual number accessible). Can you take a look?

I think bug 1351712 and bug 1353346 are related: both are about a failed undercloud upgrade for 8..9 and, as far as I can see, both have the same root symptom, that httpd fails to come up on the undercloud during the upgrade. I think it makes sense to keep both, as the issue manifests in slightly different circumstances: update 8..8 latest and then upgrade to 9, or upgrade 7..8 and then do the 8..9 upgrade.

I see from the logs/description that the root cause is httpd not coming up as part of the upgrade. I can't see enough information, either in the install-undercloud.log from bug 1351712 or in the description of bug 1353346. Basically we need the httpd logs. I think the rest of the errors from the trace (e.g. keystone related errors) are a consequence of httpd not starting. Can we please have the httpd logs from the undercloud when this happens?

Another thought: I suspect that incorporating a stop on all undercloud services, like at https://review.openstack.org/#/c/331804/, before invoking "openstack undercloud upgrade" might solve this problem. You could try this if you can reproduce on an environment. Otherwise this needs logs/more info. It could yet be another root cause, but if the service stop before upgrade works we can land that to unblock us on these two bugs.
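The reasoning above, that the keystone errors follow from httpd failing first, can be checked by looking at which error appears first in the install log. The following is only a minimal shell sketch, assuming the log is plain text with one entry per line. The function name `first_error` is hypothetical and not part of instack-undercloud.

```shell
# first_error FILE: print the first line of FILE that contains "Error:".
# Illustrative helper only (hypothetical name, not an instack-undercloud tool).
first_error() {
  awk '/Error:/ { print; exit }' "$1"
}
```

A possible invocation would be `first_error /path/to/install-undercloud.log`, where the path is a placeholder for wherever the log was saved.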
For clarity, before "openstack undercloud upgrade" stop services (this has been my workflow for all 8..9 upgrades testing for the undercloud):

sudo rm -rf /etc/yum.repos.d/*
sudo rhos-release 9-director -d
sudo rhos-release 9 -d
sudo yum clean all && sudo yum clean metadata && sudo yum clean dbcache && sudo yum makecache
sudo yum -y update
sudo systemctl stop openstack-*
sudo systemctl stop neutron-*
openstack undercloud upgrade

thanks, marios

We were testing the scenario of upgrading from 7.3 -> 8.0 -> 9.0, and I encountered another issue that prevents us from verifying this bug. While the upgrade from 7.3 to 8.0 finished successfully, the following upgrade (of the same environment) from 8.0 to 9.0 failed in the undercloud upgrade phase (we've tried both with and without SSL).

It seems to be an error with httpd:

07:43:32 Error: /Stage[main]/Apache::Service/Service[httpd]: Failed to call refresh: Could not restart Service[httpd]: Execution of '/bin/systemctl restart httpd' returned 1: Warning: httpd.service changed on disk. Run 'systemctl daemon-reload' to reload units.
07:43:32 Job for httpd.service failed because the control process exited with error code. See "systemctl status httpd.service" and "journalctl -xe" for details.
07:43:32 Error: /Stage[main]/Apache::Service/Service[httpd]: Could not restart Service[httpd]: Execution of '/bin/systemctl restart httpd' returned 1: Warning: httpd.service changed on disk. Run 'systemctl daemon-reload' to reload units.
07:43:32 Job for httpd.service failed because the control process exited with error code. See "systemctl status httpd.service" and "journalctl -xe" for details.
07:43:32 Wrapped exception:
07:43:32 Execution of '/bin/systemctl restart httpd' returned 1: Warning: httpd.service changed on disk. Run 'systemctl daemon-reload' to reload units.
07:43:32 Job for httpd.service failed because the control process exited with error code. See "systemctl status httpd.service" and "journalctl -xe" for details.
07:43:32 Notice: /Stage[main]/Keystone::Deps/Anchor[keystone::service::end]: Triggered 'refresh' from 2 events
07:46:13 Error: Could not prefetch keystone_service provider 'openstack': Execution of '/bin/openstack service list --quiet --format csv --long' returned 1: Unable to establish connection to http://192.168.0.1:35357/v3/services (tried 37, for a total of 170 seconds)
07:46:13 Error: Not managing Keystone_service[Image Service] due to earlier Keystone API failures.
07:46:13 Error: /Stage[main]/Glance::Keystone::Auth/Keystone::Resource::Service_identity[glance]/Keystone_service[Image Service::image]/ensure: change from absent to present failed: Not managing Keystone_service[Image Service] due to earlier Keystone API failures.
07:48:55 Error: /Stage[main]/Neutron::Keystone::Auth/Keystone::Resource::Service_identity[neutron]/Keystone_user[neutron]: Could not evaluate: Execution of '/bin/openstack domain list --quiet --format csv' returned 1: Unable to establish connection to http://192.168.0.1:35357/v3/domains (tried 38, for a total of 170 seconds)
07:51:33 Error: /Stage[main]/Heat::Keystone::Auth/Keystone::Resource::Service_identity[heat]/Keystone_user[heat]: Could not evaluate: Execution of '/bin/openstack domain list --quiet --format csv' returned 1: Unable to establish connection to http://192.168.0.1:35357/v3/domains (tried 38, for a total of 170 seconds)
07:54:11 Error: Could not prefetch keystone_role provider 'openstack': Execution of '/bin/openstack role list --quiet --format csv' returned 1: Unable to establish connection to http://192.168.0.1:35357/v3/roles (tried 37, for a total of 170 seconds)
07:54:11 Error: Not managing Keystone_role[ResellerAdmin] due to earlier Keystone API failures.
07:54:11 Error: /Stage[main]/Ceilometer::Keystone::Auth/Keystone_role[ResellerAdmin]/ensure: change from absent to present failed: Not managing Keystone_role[ResellerAdmin] due to earlier Keystone API failures.
07:54:11 Error: Not managing Keystone_service[ironic] due to earlier Keystone API failures.
07:54:11 Error: /Stage[main]/Ironic::Keystone::Auth/Keystone::Resource::Service_identity[ironic]/Keystone_service[ironic::baremetal]/ensure: change from absent to present failed: Not managing Keystone_service[ironic] due to earlier Keystone API failures.
07:56:51 Error: /Stage[main]/Aodh::Keystone::Auth/Keystone::Resource::Service_identity[aodh]/Keystone_user[aodh]: Could not evaluate: Execution of '/bin/openstack domain list --quiet --format csv' returned 1: Unable to establish connection to http://192.168.0.1:35357/v3/domains (tried 36, for a total of 170 seconds)
07:59:31 Error: /Stage[main]/Nova::Keystone::Auth/Keystone::Resource::Service_identity[nova service, user nova]/Keystone_user[nova]: Could not evaluate: Execution of '/bin/openstack domain list --quiet --format csv' returned 1: Unable to establish connection to http://192.168.0.1:35357/v3/domains (tried 37, for a total of 170 seconds)
07:59:31 Error: Not managing Keystone_service[aodh] due to earlier Keystone API failures.
07:59:31 Error: /Stage[main]/Aodh::Keystone::Auth/Keystone::Resource::Service_identity[aodh]/Keystone_service[aodh::alarming]/ensure: change from absent to present failed: Not managing Keystone_service[aodh] due to earlier Keystone API failures.
08:02:11 Error: /Stage[main]/Glance::Keystone::Auth/Keystone::Resource::Service_identity[glance]/Keystone_user[glance]: Could not evaluate: Execution of '/bin/openstack domain list --quiet --format csv' returned 1: Unable to establish connection to http://192.168.0.1:35357/v3/domains (tried 38, for a total of 170 seconds)
08:02:11 Error: Not managing Keystone_service[novav3] due to earlier Keystone API failures.
08:02:11 Error: /Stage[main]/Nova::Keystone::Auth/Keystone::Resource::Service_identity[nova v3 service, user novav3]/Keystone_service[novav3::computev3]/ensure: change from absent to present failed: Not managing Keystone_service[novav3] due to earlier Keystone API failures.
08:02:11 Error: Not managing Keystone_role[heat_stack_user] due to earlier Keystone API failures.
08:02:11 Error: /Stage[main]/Heat::Keystone::Auth/Keystone_role[heat_stack_user]/ensure: change from absent to present failed: Not managing Keystone_role[heat_stack_user] due to earlier Keystone API failures.

you can see more info on the link: https://rhos-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/view/Director/view/7.x/job/BM_rhos18_Upgrade_7.3_to_8.0_to_9.0_noSSL/lastFailedBuild/consoleFull

Reproduced the issue.

[root@instack ~]# journalctl -u httpd
-- Logs begin at Tue 2016-08-02 17:31:18 EDT, end at Wed 2016-08-03 10:12:19 EDT. --
Aug 02 17:42:10 instack.localdomain systemd[1]: Starting The Apache HTTP Server...
Aug 02 17:42:10 instack.localdomain python[28417]: WARNING:root:"dashboards" and "default_dashboard" in (local_)settings is DEPRECATED now and may be unsupported in some future release. The preferred way to specif
Aug 02 17:42:11 instack.localdomain python[28417]: 0 static files copied to '/usr/share/openstack-dashboard/static', 848 unmodified.
Aug 02 17:42:11 instack.localdomain python[28427]: WARNING:root:"dashboards" and "default_dashboard" in (local_)settings is DEPRECATED now and may be unsupported in some future release. The preferred way to specif
Aug 02 17:42:11 instack.localdomain python[28427]: WARNING:py.warnings:RemovedInDjango19Warning: "requires_model_validation" is deprecated in favor of "requires_system_checks".
Aug 02 17:42:11 instack.localdomain python[28427]: WARNING:py.warnings:RemovedInDjango19Warning: SortedDict is deprecated and will be removed in Django 1.9.
Aug 02 17:42:11 instack.localdomain python[28427]: WARNING:py.warnings:RemovedInDjango19Warning: Loading the `url` tag from the `future` library is deprecated and will be removed in Django 1.9. Use the default `ur
Aug 02 17:42:12 instack.localdomain python[28427]: WARNING:py.warnings:RemovedInDjango19Warning: SortedDict is deprecated and will be removed in Django 1.9.
Aug 02 17:42:12 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:13 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:13 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:13 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:13 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:13 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:13 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:17 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:17 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:17 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:17 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:17 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:17 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:18 instack.localdomain python[28427]: ERROR:scss.expression:Function not found: twbs-font-path:1
Aug 02 17:42:19 instack.localdomain python[28427]: Found 'compress' tags in:
Aug 02 17:42:19 instack.localdomain python[28427]: /usr/lib/python2.7/site-packages/horizon/templates/horizon/_conf.html
Aug 02 17:42:19 instack.localdomain python[28427]: /usr/lib/python2.7/site-packages/tuskar_ui/infrastructure/templates/infrastructure/overview/index.html
Aug 02 17:42:19 instack.localdomain python[28427]: /usr/lib/python2.7/site-packages/tuskar_ui/infrastructure/templates/infrastructure/overview/undeploy_confirmation.html
Aug 02 17:42:19 instack.localdomain python[28427]: /usr/lib/python2.7/site-packages/tuskar_ui/infrastructure/templates/infrastructure/overview/scale_out.html
Aug 02 17:42:19 instack.localdomain python[28427]: /usr/share/openstack-dashboard/openstack_dashboard/dashboards/theme/templates/_stylesheets.html
Aug 02 17:42:19 instack.localdomain python[28427]: /usr/lib/python2.7/site-packages/tuskar_ui/infrastructure/templates/infrastructure/overview/deploy_confirmation.html
Aug 02 17:42:19 instack.localdomain python[28427]: /usr/lib/python2.7/site-packages/tuskar_ui/infrastructure/templates/infrastructure/_workflow_base.html
Aug 02 17:42:19 instack.localdomain python[28427]: /usr/lib/python2.7/site-packages/tuskar_boxes/templates/tuskar_boxes/overview/index.html
...skipping...
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/metadata/tree/metadata-tree-item.html'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/metadata/tree/metadata-tree-item.directive.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/hz-no-items.directive.spec.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/hz-table.directive.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/hz-select.directive.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/hz-select-all.directive.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/table.spec.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/st-table.mock.html'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/hz-no-items.html'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/table.mock.html'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/hz-expand-detail.directive.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/hz-no-items.directive.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/table.controller.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/hz-table-footer.directive.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/search-bar.spec.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/no-items.mock.html'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/table.module.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/table.scss'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/hz-search-bar.directive.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/hz-table-footer.html'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/table/search-bar.html'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/modal-wait-spinner/modal-wait-spinner.module.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/modal-wait-spinner/modal-wait-spinner.scss'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/modal-wait-spinner/modal-wait-spinner.spec.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/modal-wait-spinner/modal-wait-spinner.directive.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/modal-wait-spinner/modal-wait-spinner.service.js'
Aug 02 21:16:39 instack.localdomain python[21290]: Copying '/usr/lib/python2.7/site-packages/horizon/static/framework/widgets/modal/simple-modal.spec.js'
Aug 02 21:16:41 instack.localdomain python[21313]: Compressing...
Aug 02 21:16:41 instack.localdomain systemd[1]: httpd.service: control process exited, code=exited status=1
Aug 02 21:16:41 instack.localdomain systemd[1]: Failed to start The Apache HTTP Server.
Aug 02 21:16:41 instack.localdomain systemd[1]: Unit httpd.service entered failed state.
Aug 02 21:16:41 instack.localdomain systemd[1]: httpd.service failed.

Created attachment 1187096 [details]
httpd logs
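The journal output above references templates under tuskar_ui and tuskar_boxes, which is the hint that tuskar files are still installed. A small sketch for pulling such paths out of a long journal follows. It assumes the paths contain no spaces, and `list_tuskar_paths` is a hypothetical helper name, not an existing tool.

```shell
# list_tuskar_paths: read log text on stdin and print each distinct
# absolute path that contains "tuskar", one per line, sorted.
# Hypothetical helper; assumes paths contain no spaces.
list_tuskar_paths() {
  grep -o '/[^ ]*tuskar[^ ]*' | sort -u
}
```

It could be fed from the same command used in this report, for example `journalctl -u httpd --no-pager | list_tuskar_paths`.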
We've possibly been able to work around the problem, which seems to be caused by tuskar leftovers that prevent httpd from restarting and therefore cause the undercloud upgrade to fail. The steps we took (these are not yet an official workaround): before upgrading the undercloud from 8.0 to 9.0, run:
(1) yum remove tuskar*
(2) remove all files from the following list: https://paste.fedoraproject.org/400796/70242832/

added some doctext. PLEASE NOTE: I did not verify this workaround. In particular I am not clear if the manual removal of the pyc files is absolutely necessary as included in the doctext; I went on comment #7. Will try and catch omri later.

@omri needinfo on the pyc files. Were they not removed with the yum removal of the tuskar packages?

(In reply to marios from comment #9)
> added some doctext. PLEASE NOTE: I did not verify this workaround. In
> particular I am not clear if the manual removal of the pyc files is
> absolutely necessary as included in the doctext, I went on comment #7 will
> try and catch omri later
>
> @omri needinfo on the pyc files. were they not removed with the yum removal
> of the tuskar packages?

I was trying to verify the suggested workaround, but I got stuck on this BZ: https://bugzilla.redhat.com/show_bug.cgi?id=1364583

*** Bug 1378968 has been marked as a duplicate of this bug. ***

Hi Jon, this one depends on https://bugzilla.redhat.com/show_bug.cgi?id=1455496, or am I missing something?

moving POST is merged back to OSP 8 (see clones at https://bugzilla.redhat.com/show_bug.cgi?id=1455496#c1 ) all good

(In reply to Sofer Athlan-Guyot from comment #17)
> Hi Jon,
>
> this one depends on https://bugzilla.redhat.com/show_bug.cgi?id=1455496, or
> am I missing something ?

sorry for the noise, I was doing some cleanup of clone Depends on entries and pruned one too many.

According to our records, this should be resolved by instack-undercloud-4.0.0-17.el7ost. This build is available now. |
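On the open question above about whether the .pyc files survive the package removal: Python 2 can import a .pyc file even when the matching .py source is gone, so compiled files left behind could still be picked up. The sketch below lists such orphans in a directory. It rests on the assumption, not confirmed in this report, that the leftovers have no .py file next to them, and `find_orphan_pyc` is a hypothetical name.

```shell
# find_orphan_pyc DIR: print every .pyc file in DIR that has no matching
# .py source beside it. Hypothetical helper; it only lists, never deletes.
find_orphan_pyc() {
  dir=$1
  for pyc in "$dir"/*.pyc; do
    [ -e "$pyc" ] || continue      # the glob matched nothing
    src=${pyc%c}                   # strip the trailing "c": foo.pyc -> foo.py
    [ -e "$src" ] || printf '%s\n' "$pyc"
  done
}
```

For the directory named in the doc text this would be `find_orphan_pyc /usr/share/openstack-dashboard/openstack_dashboard/local/enabled`. Listing first, and deleting only after reviewing the output, keeps the check non-destructive.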