Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1743402

Summary: [RHOS 15] Controller replacement failed on unable to start memcached container
Product: Red Hat OpenStack Reporter: Archit Modi <amodi>
Component: python-memcachedAssignee: Python Maintainers <python-maint>
Status: CLOSED DUPLICATE QA Contact: nlevinki <nlevinki>
Severity: urgent Docs Contact:
Priority: urgent    
Version: 15.0 (Stein)CC: cstratak, michele, pviktori, torsava
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2019-08-20 05:40:55 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1690784    
Attachments:
Description Flags
controller-replacement.log none

Description Archit Modi 2019-08-19 20:25:53 UTC
Description of problem: While trying to replace controller nodes, after redeploying overcloud, deployment failed on "Failed to start memcached-w4dspvrv container." error


Steps to Reproduce:
1. Reproduction steps attached (controller-replacement.log)
2. Deploy OSP 15 with 3 controllers, 2 compute
3. [root@controller-0 ~]# ls -la /var/log/containers/nova/ |grep manage
-rw-------.  1 42436 42436    91905 Aug 16 23:43 nova-manage.log
[root@controller-0 ~]# rm /var/log/containers/nova/nova-manage.log
rm: remove regular file '/var/log/containers/nova/nova-manage.log'? yes
[root@controller-0 ~]# ls -la /var/log/containers/nova/ |grep manage
[root@controller-0 ~]# docker exec -it -u root nova_api nova-manage db version
Emulate Docker CLI using podman. Create /etc/containers/nodocker to quiet msg.
391
[root@controller-0 ~]# ls -la /var/log/containers/nova/ |grep manage
-rw-r--r--.  1 root  root         0 Aug 17 20:33 nova-manage.log

4. Follow guide (Chapter 14. Replacing Controller Nodes)14.3 specifically [1] 
5. Redeploy overcloud 

[1] https://access.redhat.com/documentation/en-us/red_hat_openstack_platform/15-beta/html-single/director_installation_and_usage/index#preparing-the-cluster-for-controller-replacement

Actual results:
TASK [Debug output for task: Start containers for step 1] **********************                                                                                                                                                    [189/1905]
Saturday 17 August 2019  17:29:24 -0400 (0:00:15.795)       0:15:21.052 *******
fatal: [controller-1]: FAILED! => {
    "failed_when_result": true,
    "outputs.stdout_lines | default([]) | union(outputs.stderr_lines | default([]))": [                                                                                                                                                      
        "stdout: aad88dc4b754806593a8e4722028ea60287aa320e25bb7e9dfd8ca460d9616d9",
        "",
        "stderr: Trying to pull 192.168.24.1:8787/rhosp15/openstack-mariadb:20190815.1...Getting image source signatures",
        "Copying blob sha256:e61d8721e62e50814b162c8341bb235d3453b9c95bd26439bf9100fcf88338c7",
        "Copying blob sha256:c585fd5093c62ee42a56af6c09813ac8384d8145c4285bc62c357b41224b1970",
        "Copying blob sha256:305e8a83063c1cc33517b21472472f9e213387f47be0996ce4f2aeeb1f004486",
        "Copying blob sha256:f31644cdb030c658b0d6603affab726881a90a2bc0dff8729f2e85321eb0899a",
        "Copying config sha256:aad88dc4b754806593a8e4722028ea60287aa320e25bb7e9dfd8ca460d9616d9",
        "Writing manifest to image destination",
        "Storing signatures",                                                                                                                                                                                                                
        "stdout: 721514fcd22b506794578a5fe494c0937fc4aabc1f6a618e5b10f49eae348490",                                                                                                                                                          
        "stderr: Trying to pull 192.168.24.1:8787/rhosp15/openstack-memcached:20190815.1...Getting image source signatures",                                                                                                                 
        "Copying blob sha256:5f7e9eb9bee5b3dd0b676a79077c102fc564d2ea98c092d3ffc344d899067307",
        "Copying config sha256:721514fcd22b506794578a5fe494c0937fc4aabc1f6a618e5b10f49eae348490",                                                                                                                                            
        "stdout: 214660d3aad60dac1132c01c78211392e1f6670948c87592607a457b704de807",                                                                                                                                                          
        "stderr: Trying to pull 192.168.24.1:8787/rhosp15/openstack-rabbitmq:20190815.1...Getting image source signatures",                                                                                                                  
        "Copying blob sha256:46430179a92aa221117e5f2af5738c2c357c4fb911b3d068df4135d47d1efecf",                                                                                                                                              
        "Copying config sha256:214660d3aad60dac1132c01c78211392e1f6670948c87592607a457b704de807",
        "No systemd unit file was found for tripleo_mysql_bootstrap.service",                                                                                                                                                                
        "No systemd unit file was found for tripleo_mysql_bootstrap_healthcheck.service",                                                                                                                                                    
        "No systemd unit file was found for tripleo_mysql_bootstrap_healthcheck.timer",                                                                                                                                                      
        "Removed /etc/systemd/system/multi-user.target.wants/tripleo_memcached.service.",
        "Removed /etc/systemd/system/timers.target.wants/tripleo_memcached_healthcheck.timer.",
        "Removed /etc/systemd/system/tripleo_memcached.service.requires/tripleo_memcached_healthcheck.timer.",
        "stdout: c68424af4f8840ec126d46d434a3372b5da3d01333435992370b5ac8ddc11ab1",
        "stderr: ",                                                                                                                                                                                                                          
        "Created symlink /etc/systemd/system/multi-user.target.wants/tripleo_memcached-w4dspvrv.service → /etc/systemd/system/tripleo_memcached-w4dspvrv.service.",                                                                          
        "Job for tripleo_memcached-w4dspvrv.service failed because the service did not take the steps required by its unit configuration.",                                                                                                  
        "See \"systemctl status tripleo_memcached-w4dspvrv.service\" and \"journalctl -xe\" for details.",                                                                                                                                   
        "systemctl failed",                                                                                                                                                                                                                  
        "Traceback (most recent call last):",                                                                                                                                                                                                
        "  File \"/usr/lib/python3.6/site-packages/paunch/utils/systemctl.py\", line 31, in systemctl",                                                                                                                                      
        "    subprocess.check_call(cmd)",
        "  File \"/usr/lib64/python3.6/subprocess.py\", line 311, in check_call",
        "    raise CalledProcessError(retcode, cmd)",
        "subprocess.CalledProcessError: Command '['systemctl', 'enable', '--now', 'tripleo_memcached-w4dspvrv']' returned non-zero exit status 1.",                                                                                          
        "During handling of the above exception, another exception occurred:",
        "  File \"/usr/lib/python3.6/site-packages/paunch/utils/systemd.py\", line 111, in service_create",
        "    systemctl.enable(service, now=True)",
        "  File \"/usr/lib/python3.6/site-packages/paunch/utils/systemctl.py\", line 53, in enable",
        "    systemctl(cmd, log)",
        "  File \"/usr/lib/python3.6/site-packages/paunch/utils/systemctl.py\", line 33, in systemctl",
        "    raise SystemctlException(str(err))",
        "paunch.utils.systemctl.SystemctlException: Command '['systemctl', 'enable', '--now', 'tripleo_memcached-w4dspvrv']' returned non-zero exit status 1.",                                                                              
        "Command '['systemctl', 'enable', '--now', 'tripleo_memcached-w4dspvrv']' returned non-zero exit status 1."


Expected results:
Overcloud deployment passed

Additional info:

Comment 1 Archit Modi 2019-08-19 20:27:20 UTC
Created attachment 1605891 [details]
controller-replacement.log

Comment 2 Archit Modi 2019-08-19 20:41:33 UTC
Core_puddle_version: RHOS_TRUNK-15.0-RHEL-8-20190815.n.1

Version: puppet-memcached-3.4.0-0.20190322030341.66d3a66.el8ost.noarch

Comment 3 Michele Baldessari 2019-08-20 05:40:55 UTC

*** This bug has been marked as a duplicate of bug 1742169 ***

Comment 4 Petr Viktorin (pviktori) 2019-08-20 08:53:22 UTC
As one of the Python maintainers, I can't make sense of this bug. I will assume that it has nothing to do with python-memcached, and we can ignore it. Let me know if that's wrong.

Comment 5 Michele Baldessari 2019-08-20 09:05:42 UTC
(In reply to Petr Viktorin from comment #4)
> As one of the Python maintainers, I can't make sense of this bug. I will
> assume that it has nothing to do with python-memcached, and we can ignore
> it. Let me know if that's wrong.

Correct. It is a bug in OSP/paunch, I closed it as a DUPLICATE