Bug 1881027 - Cluster installation fails at with error : the container name \"assisted-installer\" is already in use
Summary: Cluster installation fails at with error : the container name \"assisted-ins...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: assisted-installer
Version: 4.6
Hardware: Unspecified
OS: Unspecified
urgent
urgent
Target Milestone: ---
: 4.7.0
Assignee: Eran Cohen
QA Contact: Yuri Obshansky
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2020-09-21 11:49 UTC by Constantin Vultur
Modified: 2021-02-24 15:20 UTC (History)
5 users (show)

Fixed In Version: OCP-Metal-v1.0.9.5
Doc Type: No Doc Update
Doc Text:
Clone Of:
Environment:
Last Closed: 2021-02-24 15:18:56 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHSA-2020:5633 0 None None None 2021-02-24 15:20:57 UTC

Description Constantin Vultur 2020-09-21 11:49:49 UTC
Description of problem:
Cluster installation fails with :
error creating container storage: the container name \"logs-sender\" is already in use 

Version-Release number of selected component (if applicable):
Stage 

How reproducible:


Steps to Reproduce:
1. deploy cluster 
2.
3.

Actual results:
error creating container storage: the container name \"logs-sender\" is already in use 
Error: error creating container storage: the container name \"assisted-installer\" is already in use by 

Expected results:
Cluster installation to run properly

Additional info:

Comment 1 Constantin Vultur 2020-09-21 12:08:47 UTC
As per dev investigation the install command is ran twice
First time: 
time="2020-09-21T11:09:09Z" level=info msg="Submitting step <install>
Second time:
time="2020-09-21T11:11:10Z" level=info msg="Submitting step <install>

Moving to service

Comment 2 Michael Filanov 2020-09-21 12:26:26 UTC
It looks like the installer didn't started in a 2 minutes window so the agent called to GetNextStep again go another install command before the first installer started and reported anything.

{code}
time="2020-09-21T11:09:09Z" level=info msg="GetNextSteps cluster: ,<d0b05ec0-7676-4b10-82cc-e4723cc7b41e> host: <40b84db8-fc45-421b-a488-f751a07b9b97>, host status: <installing>" func="github.com/openshift/assisted-service/internal/host.(*InstructionManager).GetNextSteps" file="/go/src/github.com/openshift/origin/internal/host/instructionmanager.go:89" go-id=1863456 pkg=instructions request_id=9b3a2219-86dd-48a9-8a17-9d67bebf3cd9
time="2020-09-21T11:09:09Z" level=info msg="Submitting step <install> id <install-5dda01d5> to cluster <d0b05ec0-7676-4b10-82cc-e4723cc7b41e> host <40b84db8-fc45-421b-a488-f751a07b9b97> Command: <bash> Arguments: <[-c podman run -v /dev:/dev:rw -v /opt:/opt:rw -v /run/systemd/journal/socket:/run/systemd/journal/socket --privileged --pid=host --net=host -v /var/log:/var/log:rw --env PULL_SECRET_TOKEN --name assisted-installer registry.stage.redhat.io/openshift4/assisted-installer-rhel8:v4.6.0-19 --role worker --cluster-id d0b05ec0-7676-4b10-82cc-e4723cc7b41e --boot-device /dev/sda --host-id 40b84db8-fc45-421b-a488-f751a07b9b97 --openshift-version 4.6 --controller-image registry.stage.redhat.io/openshift4/assisted-installer-reporter-rhel8:v4.6.0-15 --url https://api.stage.openshift.com --insecure=false --agent-image registry.stage.redhat.io/openshift4/assisted-installer-agent-rhel8:v4.6.0-15 --host-name worker-0-0 || ( returnCode=$?; podman run --rm --privileged -v /run/systemd/journal/socket:/run/systemd/journal/socket -v /var/log:/var/log --env PULL_SECRET_TOKEN --name logs-sender registry.stage.redhat.io/openshift4/assisted-installer-agent-rhel8:v4.6.0-15 logs_sender -url https://api.stage.openshift.com -cluster-id d0b05ec0-7676-4b10-82cc-e4723cc7b41e -host-id 40b84db8-fc45-421b-a488-f751a07b9b97 --insecure=false -bootstrap=false; exit $returnCode; )]>" func=github.com/openshift/assisted-service/internal/host.logSteps file="/go/src/github.com/openshift/origin/internal/host/instructionmanager.go:125" go-id=1863456 pkg=instructions request_id=9b3a2219-86dd-48a9-8a17-9d67bebf3cd9
time="2020-09-21T11:11:10Z" level=info msg="GetNextSteps cluster: ,<d0b05ec0-7676-4b10-82cc-e4723cc7b41e> host: <40b84db8-fc45-421b-a488-f751a07b9b97>, host status: <installing>" func="github.com/openshift/assisted-service/internal/host.(*InstructionManager).GetNextSteps" file="/go/src/github.com/openshift/origin/internal/host/instructionmanager.go:89" go-id=1865112 pkg=instructions request_id=8d4dfd21-da7f-4387-9a5e-c3efa82405e0
time="2020-09-21T11:11:10Z" level=info msg="Submitting step <install> id <install-9aa410a1> to cluster <d0b05ec0-7676-4b10-82cc-e4723cc7b41e> host <40b84db8-fc45-421b-a488-f751a07b9b97> Command: <bash> Arguments: <[-c podman run -v /dev:/dev:rw -v /opt:/opt:rw -v /run/systemd/journal/socket:/run/systemd/journal/socket --privileged --pid=host --net=host -v /var/log:/var/log:rw --env PULL_SECRET_TOKEN --name assisted-installer registry.stage.redhat.io/openshift4/assisted-installer-rhel8:v4.6.0-19 --role worker --cluster-id d0b05ec0-7676-4b10-82cc-e4723cc7b41e --boot-device /dev/sda --host-id 40b84db8-fc45-421b-a488-f751a07b9b97 --openshift-version 4.6 --controller-image registry.stage.redhat.io/openshift4/assisted-installer-reporter-rhel8:v4.6.0-15 --url https://api.stage.openshift.com --insecure=false --agent-image registry.stage.redhat.io/openshift4/assisted-installer-agent-rhel8:v4.6.0-15 --host-name worker-0-0 || ( returnCode=$?; podman run --rm --privileged -v /run/systemd/journal/socket:/run/systemd/journal/socket -v /var/log:/var/log --env PULL_SECRET_TOKEN --name logs-sender registry.stage.redhat.io/openshift4/assisted-installer-agent-rhel8:v4.6.0-15 logs_sender -url https://api.stage.openshift.com -cluster-id d0b05ec0-7676-4b10-82cc-e4723cc7b41e -host-id 40b84db8-fc45-421b-a488-f751a07b9b97 --insecure=false -bootstrap=false; exit $returnCode; )]>" func=github.com/openshift/assisted-service/internal/host.logSteps file="/go/src/github.com/openshift/origin/internal/host/instructionmanager.go:125" go-id=1865112 pkg=instructions request_id=8d4dfd21-da7f-4387-9a5e-c3efa82405e0
{code}

Comment 3 Michael Filanov 2020-09-21 12:47:53 UTC
Installer logs:
{code}
Sep 21 11:11:14 master-0-0 installer[2960]: time="2020-09-21T11:11:14Z" level=info msg="Updating node installation stage: Starting installation - bootstrap"
{code}

Next step called before:
{code}
time="2020-09-21T11:11:10Z" level=info msg="GetNextSteps cluster: ,<d0b05ec0-7676-4b10-82cc-e4723cc7b41e> host: <40b84db8-fc45-421b-a488-f751a07b9b97>, host status: <installing>" func="github.com/openshift/assisted-service/internal/host.(*InstructionManager).GetNextSteps" 
{code}

Comment 4 Michael Filanov 2020-09-21 13:14:49 UTC
Eran suggested to add a validate that if docker return specific error that indicate if another instance of the container already running then we don't fail the installation
https://github.com/openshift/assisted-service/blob/master/internal/bminventory/inventory.go#L1556

Comment 7 errata-xmlrpc 2021-02-24 15:18:56 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.7.0 security, bug fix, and enhancement update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2020:5633


Note You need to log in before you can comment on or make changes to this bug.