| Summary: | osp-director-10 : Upgrade fails during UPGRADE CONTROLLER AND BLOCKSTORAGE, due to broken PCS cluster ( environment with no SSL ). | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Product: | Red Hat OpenStack | Reporter: | Omri Hochman <ohochman> | ||||||
| Component: | rhosp-director | Assignee: | Michele Baldessari <michele> | ||||||
| Status: | CLOSED CURRENTRELEASE | QA Contact: | Omri Hochman <ohochman> | ||||||
| Severity: | urgent | Docs Contact: | |||||||
| Priority: | urgent | ||||||||
| Version: | 10.0 (Newton) | CC: | dbecker, jcoufal, mandreou, mburns, mcornea, michele, mlammon, morazi, ohochman, rhel-osp-director-maint | ||||||
| Target Milestone: | rc | Keywords: | Triaged | ||||||
| Target Release: | 10.0 (Newton) | ||||||||
| Hardware: | x86_64 | ||||||||
| OS: | Linux | ||||||||
| Whiteboard: | |||||||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |||||||
| Doc Text: | Story Points: | --- | |||||||
| Clone Of: | Environment: | ||||||||
| Last Closed: | 2016-12-16 16:50:51 UTC | Type: | Bug | ||||||
| Regression: | --- | Mount Type: | --- | ||||||
| Documentation: | --- | CRM: | |||||||
| Verified Versions: | Category: | --- | |||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||
| Attachments: |
Description
Omri Hochman
2016-11-07 19:47:06 UTC
[stack@undercloud-0 ~]$ ssh heat-admin.2.10
Last login: Mon Nov 7 18:32:54 2016 from 192.0.2.1
[heat-admin@controller-0 ~]$ sudo su -
Last login: Mon Nov 7 18:15:52 UTC 2016 from 192.0.2.1 on pts/0
[root@controller-0 ~]# pcs status
Cluster name: tripleo_cluster
Stack: corosync
Current DC: controller-2 (version 1.1.15-11.el7-e174ec8) - partition with quorum
Last updated: Mon Nov 7 19:44:24 2016 Last change: Mon Nov 7 06:01:25 2016 by root via cibadmin on controller-0
3 nodes and 124 resources configured
Online: [ controller-0 controller-1 controller-2 ]
Full list of resources:
ip-172.17.1.10 (ocf::heartbeat:IPaddr2): Started controller-0
ip-192.0.2.6 (ocf::heartbeat:IPaddr2): Started controller-1
ip-172.17.4.10 (ocf::heartbeat:IPaddr2): Started controller-2
Clone Set: haproxy-clone [haproxy]
Started: [ controller-0 controller-1 controller-2 ]
Master/Slave Set: galera-master [galera]
Masters: [ controller-0 controller-1 controller-2 ]
Clone Set: memcached-clone [memcached]
Started: [ controller-0 controller-1 controller-2 ]
ip-172.17.3.10 (ocf::heartbeat:IPaddr2): Started controller-0
ip-10.0.0.101 (ocf::heartbeat:IPaddr2): Started controller-1
Clone Set: rabbitmq-clone [rabbitmq]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-core-clone [openstack-core]
Started: [ controller-0 controller-1 controller-2 ]
Master/Slave Set: redis-master [redis]
Masters: [ controller-0 ]
Slaves: [ controller-1 controller-2 ]
ip-172.17.1.11 (ocf::heartbeat:IPaddr2): Started controller-2
Clone Set: mongod-clone [mongod]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-aodh-evaluator-clone [openstack-aodh-evaluator]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-nova-scheduler-clone [openstack-nova-scheduler]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: neutron-l3-agent-clone [neutron-l3-agent]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: neutron-netns-cleanup-clone [neutron-netns-cleanup]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: neutron-ovs-cleanup-clone [neutron-ovs-cleanup]
Started: [ controller-0 controller-1 controller-2 ]
openstack-cinder-volume (systemd:openstack-cinder-volume): Started controller-0
Clone Set: openstack-heat-engine-clone [openstack-heat-engine]
Started: [ controller-0 ]
Stopped: [ controller-1 controller-2 ]
Clone Set: openstack-aodh-listener-clone [openstack-aodh-listener]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: neutron-metadata-agent-clone [neutron-metadata-agent]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-gnocchi-metricd-clone [openstack-gnocchi-metricd]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-aodh-notifier-clone [openstack-aodh-notifier]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-heat-api-clone [openstack-heat-api]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-ceilometer-collector-clone [openstack-ceilometer-collector]
Started: [ controller-0 controller-1 ]
Stopped: [ controller-2 ]
Clone Set: openstack-glance-api-clone [openstack-glance-api]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-cinder-scheduler-clone [openstack-cinder-scheduler]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-nova-api-clone [openstack-nova-api]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-nova-consoleauth-clone [openstack-nova-consoleauth]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-sahara-api-clone [openstack-sahara-api]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-heat-api-cloudwatch-clone [openstack-heat-api-cloudwatch]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-sahara-engine-clone [openstack-sahara-engine]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-glance-registry-clone [openstack-glance-registry]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-gnocchi-statsd-clone [openstack-gnocchi-statsd]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-ceilometer-notification-clone [openstack-ceilometer-notification]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-cinder-api-clone [openstack-cinder-api]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: neutron-dhcp-agent-clone [neutron-dhcp-agent]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: neutron-openvswitch-agent-clone [neutron-openvswitch-agent]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-nova-novncproxy-clone [openstack-nova-novncproxy]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: delay-clone [delay]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: neutron-server-clone [neutron-server]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-ceilometer-central-clone [openstack-ceilometer-central]
Started: [ controller-0 controller-1 ]
Stopped: [ controller-2 ]
Clone Set: httpd-clone [httpd]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-heat-api-cfn-clone [openstack-heat-api-cfn]
Started: [ controller-0 controller-1 controller-2 ]
Clone Set: openstack-nova-conductor-clone [openstack-nova-conductor]
Started: [ controller-0 controller-1 controller-2 ]
Failed Actions:
* memcached_monitor_60000 on controller-2 'not running' (7): call=41, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:41:21 2016', queued=0ms, exec=0ms
* mongod_monitor_60000 on controller-2 'not running' (7): call=79, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:40:49 2016', queued=0ms, exec=0ms
* openstack-aodh-evaluator_monitor_60000 on controller-2 'not running' (7): call=342, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:41:46 2016', queued=0ms, exec=0ms
* neutron-l3-agent_monitor_60000 on controller-2 'not running' (7): call=120, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:41:18 2016', queued=0ms, exec=0ms
* openstack-heat-engine_start_0 on controller-2 'not running' (7): call=471, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:44:07 2016', queued=0ms, exec=2280ms
* openstack-aodh-listener_monitor_60000 on controller-2 'not running' (7): call=345, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:41:48 2016', queued=0ms, exec=0ms
* openstack-aodh-notifier_monitor_60000 on controller-2 'not running' (7): call=346, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:41:48 2016', queued=0ms, exec=0ms
* openstack-ceilometer-central_start_0 on controller-2 'not running' (7): call=439, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:43:16 2016', queued=1ms, exec=2595ms
* memcached_monitor_60000 on controller-1 'not running' (7): call=45, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:41:21 2016', queued=0ms, exec=0ms
* mongod_monitor_60000 on controller-1 'not running' (7): call=79, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:41:49 2016', queued=0ms, exec=0ms
* openstack-aodh-evaluator_monitor_60000 on controller-1 'not running' (7): call=344, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:42:46 2016', queued=0ms, exec=0ms
* neutron-l3-agent_monitor_60000 on controller-1 'OCF_PENDING' (196): call=122, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:41:19 2016', queued=0ms, exec=0ms
* openstack-heat-engine_start_0 on controller-1 'not running' (7): call=466, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:44:07 2016', queued=0ms, exec=2367ms
* openstack-aodh-listener_monitor_60000 on controller-1 'not running' (7): call=347, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:42:48 2016', queued=0ms, exec=0ms
* openstack-aodh-notifier_monitor_60000 on controller-1 'not running' (7): call=348, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:42:48 2016', queued=0ms, exec=0ms
* openstack-cinder-api_monitor_60000 on controller-1 'not running' (7): call=259, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:42:21 2016', queued=0ms, exec=0ms
* openstack-ceilometer-central_monitor_60000 on controller-1 'not running' (7): call=310, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:42:30 2016', queued=0ms, exec=0ms
Daemon Status:
corosync: active/enabled
pacemaker: active/enabled
pcsd: active/enabled
[root@controller-0 ~]#
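The "Failed Actions" list above is long, so it can help to group it by node when triaging. The following is a minimal sketch, not part of pcs or of the original report: the helper name `failed_actions_by_node` and the regular expression are hypothetical and assume the entry format seen in this paste (`* <resource>_<operation>_<interval> on <node> '<reason>' (<rc>): ...`).

```python
import re
from collections import defaultdict

# Assumed shape of the first line of a "Failed Actions" entry, taken from the
# pcs status paste in this report (not from any pcs API):
#   * memcached_monitor_60000 on controller-2 'not running' (7): call=41, ...
FAILED_ACTION = re.compile(
    r"^\*\s+(?P<resource>\S+?)_(?P<op>monitor|start|stop|promote|demote)_(?P<interval>\d+)"
    r"\s+on\s+(?P<node>\S+)\s+'(?P<reason>[^']*)'\s+\((?P<rc>\d+)\)"
)


def failed_actions_by_node(status_text):
    """Group failed actions by node as (resource, operation, reason, rc) tuples."""
    by_node = defaultdict(list)
    for line in status_text.splitlines():
        match = FAILED_ACTION.match(line.strip())
        if match:
            by_node[match["node"]].append(
                (match["resource"], match["op"], match["reason"], int(match["rc"]))
            )
    return dict(by_node)


# Three entries copied from the output above.
sample = """\
Failed Actions:
* memcached_monitor_60000 on controller-2 'not running' (7): call=41, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:41:21 2016', queued=0ms, exec=0ms
* openstack-heat-engine_start_0 on controller-2 'not running' (7): call=471, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:44:07 2016', queued=0ms, exec=2280ms
* neutron-l3-agent_monitor_60000 on controller-1 'OCF_PENDING' (196): call=122, status=complete, exitreason='none',
last-rc-change='Mon Nov 7 06:41:19 2016', queued=0ms, exec=0ms
"""

summary = failed_actions_by_node(sample)
for node, actions in sorted(summary.items()):
    print(node, len(actions), [resource for resource, *_ in actions])
```

Continuation lines (`last-rc-change=...`) do not start with `*`, so the sketch skips them. Extending the pattern would be needed if the timestamps matter.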
Created attachment 1218231 [details]
heat-engine.log
Created attachment 1218232 [details]
messages_controller
The CI link https://rhos-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/view/Director/view/9.0/job/infrared_deploy_9.0_3_control_1_compute_1ceph_no-UCSSL_no-OCSSL_RHEL_7.3_upgrade_10/25/console returns a 404. Can we get sosreports or access to a live system for this one?

Deployed the latest RHOS 9, then upgraded to RHOS 10 with the latest puddle (2016-11-14.1). I no longer see this issue.