I was trying to assign qe to myself and messed up the dev assignment. Sorry.
Please reassign if it isn't set correctly.
Is it always reproducible and is the error the same every time?
We had seen something similar but can't reproduce it at will.
There is a race in the terraform between creating the bootstrap_ignition.ign file and consuming the file to create the Azure storage blob. Since the name of the file is known at plan-creation time, terraform does not determine that the storage blob resource has a dependency on the local file. In the reported case, the local file had not been created by the time it was needed to create the storage blob, which was 10+ seconds after starting to create the file.
I am marking this an not a blocker. There is a possibility that a user may encounter this issue, but it is not likely to affect most users. If a user does encounter this, then the user can retry their installation. The installation will fail relatively early. The user will not be left with what they think is a correctly functioning cluster that actually has issues.
Would that mean that running the `openshift-install wait-for bootstrap --dir <path> --log-level debug` command should be enough to get the install process continuing to deploy the cluster?
Or should it be more restarting the `openshift-install create cluster --dir <path> --log-level debug`?
In my case, I think I tried to run the `create cluster` again and that it failed.
If one of the commands should be enough to continue the installation, I can create a KCS to describe the process.
Otherwise, would it be possible to put a test for the file to be present, and if not, can we imagine allowing ~30 sec (in 10-sec segments) to Terraform to ensure the file is created?
@Vincent, No, you would need to redo the entire installation: Clean your install directory and run `create cluster` again.
The change that we need to make to the terraform is to create an explicit dependency between the local file and the storage blob. This will tell terraform to wait until the local file is created before attempting to create the storage blob.
(In reply to Matthew Staebler from comment #7)
> @Vincent, No, you would need to redo the entire installation: Clean your
> install directory and run `create cluster` again.
> The change that we need to make to the terraform is to create an explicit
> dependency between the local file and the storage blob. This will tell
> terraform to wait until the local file is created before attempting to
> create the storage blob.
Thank Matthew for the clarification. I thought I've missed something during the Testathon.
So that is what I tried several times (more than 6 tries in total) with the installers 4.9.0-rc.0 & -rc.1. But unfortunately, that never worked.
That's actually why I raised this BZ.
Good luck with the fix implementation, looking forward to checking the code modifications to understand the issue & solution :)
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.
For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated
files, follow the link below.
If the solution does not work for you, open a new bug report.