Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1597875

Summary: FFU: OpenStack services stay in systemd after osp10->osp13 upgrade
Product: Red Hat OpenStack Reporter: Chris Janiszewski <cjanisze>
Component: openstack-tripleoAssignee: Sergii Golovatiuk <sgolovat>
Status: CLOSED DUPLICATE QA Contact: Arik Chernetsky <achernet>
Severity: high Docs Contact:
Priority: high    
Version: 13.0 (Queens)CC: aschultz, augol, ccamacho, jfrancoa, mbultel, mburns, morazi
Target Milestone: zstreamKeywords: Triaged
Target Release: 13.0 (Queens)   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2018-08-07 14:59:47 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1562199    
Bug Blocks:    
Attachments:
Description Flags
sos-controller
none
sos-undercloud none

Description Chris Janiszewski 2018-07-03 18:40:18 UTC
Created attachment 1456315 [details]
sos-controller

Description of problem:
I have completed FFWD upgrade and noticed certain service on one of the controllers have remained on the list in failed state:

(undercloud) [stack@test1-undercloud ~]$ for NODE in $(openstack server list -f value -c Networks | cut -d= -f2); do echo "=== $NODE ===" ; ssh heat-admin@$NODE "sudo systemctl list-units --state=failed 'openstack*' 'neutron*' 'httpd' 'docker' 'ceph*'" ; done
=== 172.16.0.114 ===
  UNIT                       LOAD   ACTIVE SUB    DESCRIPTION
● neutron-server.service     loaded failed failed OpenStack Neutron Server
● openstack-nova-api.service loaded failed failed OpenStack Nova API Server

LOAD   = Reflects whether the unit definition was properly loaded.
ACTIVE = The high-level unit activation state, i.e. generalization of SUB.
SUB    = The low-level unit activation state, values depend on unit type.

2 loaded units listed. Pass --all to see loaded but inactive units, too.
To show all installed unit files use 'systemctl list-unit-files'.
=== 172.16.0.106 ===
0 loaded units listed. Pass --all to see loaded but inactive units, too.
To show all installed unit files use 'systemctl list-unit-files'.
=== 172.16.0.107 ===
0 loaded units listed. Pass --all to see loaded but inactive units, too.
To show all installed unit files use 'systemctl list-unit-files'.
=== 172.16.0.113 ===
0 loaded units listed. Pass --all to see loaded but inactive units, too.
To show all installed unit files use 'systemctl list-unit-files'.
=== 172.16.0.108 ===
0 loaded units listed. Pass --all to see loaded but inactive units, too.
To show all installed unit files use 'systemctl list-unit-files'.

Rebooting that controller resolved the problem and: 
neutron-server.service
openstack-nova-api.service

are no longer showing on the failed list. This might be working as designed, and if controller reboot is required after the successful upgrade, we should probably note that in a docs. Otherwise I am attaching sosreports from undercloud and controller node with failed services

Version-Release number of selected component (if applicable):
OSP10-OSP13

How reproducible:
Only on single controller

Steps to Reproduce:
1. Upgrade OSP10->OSP13
2. Verify a list of failed openstack services with:
for NODE in $(openstack server list -f value -c Networks | cut -d= -f2); do echo "=== $NODE ===" ; ssh heat-admin@$NODE "sudo systemctl list-units --state=failed 'openstack*' 'neutron*' 'httpd' 'docker' 'ceph*'" ; done


Actual results:
'httpd' 'docker' 'ceph*'" ; done
=== 172.16.0.114 ===
  UNIT                       LOAD   ACTIVE SUB    DESCRIPTION
● neutron-server.service     loaded failed failed OpenStack Neutron Server
● openstack-nova-api.service loaded failed failed OpenStack Nova API Server

LOAD   = Reflects whether the unit definition was properly loaded.
ACTIVE = The high-level unit activation state, i.e. generalization of SUB.
SUB    = The low-level unit activation state, values depend on unit type.

Expected results:
=== 172.16.0.114 ===
0 loaded units listed. Pass --all to see loaded but inactive units, too.
To show all installed unit files use 'systemctl list-unit-files'.


Additional info:

Comment 1 Chris Janiszewski 2018-07-03 18:41:29 UTC
Created attachment 1456317 [details]
sos-undercloud