Bug 2102632 - a shorter cluster name leads to Uninstall fails with Observed a panic: runtime.boundsError
Summary: a shorter cluster name leads to Uninstall fails with Observed a panic: runtim...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Installer
Version: 4.11
Hardware: Unspecified
OS: Unspecified
urgent
high
Target Milestone: ---
: 4.12.0
Assignee: Brent Barbachem
QA Contact: Jianli Wei
URL:
Whiteboard:
Depends On:
Blocks: 2104906
TreeView+ depends on / blocked
 
Reported: 2022-06-30 11:58 UTC by Jiří Mencák
Modified: 2023-01-17 19:51 UTC (History)
2 users (show)

Fixed In Version:
Doc Type: No Doc Update
Doc Text:
Clone Of:
Environment:
Last Closed: 2023-01-17 19:51:09 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift installer pull 6068 0 None open BUG 2102632: destroy/gcp: Use min length for destroying disks 2022-07-01 15:26:29 UTC
Red Hat Product Errata RHSA-2022:7399 0 None None None 2023-01-17 19:51:38 UTC

Description Jiří Mencák 2022-06-30 11:58:07 UTC
Version:
4.11.0-0.nightly-2022-06-28-160049

$ ./openshift-install version
./openshift-install 4.11.0-0.nightly-2022-06-28-160049
built from commit 6daed68b9863a9b2ecebdf8a4056800aa5c60ad3
release image registry.ci.openshift.org/ocp/release@sha256:b79b1be6aa4f9f62c691c043e0911856cf1c11bb81c8ef94057752c6e5a8478a
release architecture amd64

Platform:
GCP

IPI (automated install with `openshift-install`.

What happened?
During uninstall, the cluster uninstall I received:

E0630 13:17:58.830361  271713 runtime.go:78] Observed a panic: runtime.boundsError{x:22, y:21, signed:true, code:0x1} (runtime error: slice bounds out of range [:22] with length 21)
goroutine 1 [running]:
k8s.io/apimachinery/pkg/util/runtime.logPanic({0x41d43c0?, 0xc0010637e8})
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/runtime/runtime.go:74 +0x86
k8s.io/apimachinery/pkg/util/runtime.HandleCrash({0x0, 0x0, 0x18?})
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/runtime/runtime.go:48 +0x75
panic({0x41d43c0, 0xc0010637e8})
	/usr/lib/golang/src/runtime/panic.go:838 +0x207
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).formatClusterIDForStorage(...)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/disk.go:25
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).storageIDFilter(...)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/disk.go:29
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).storageLabelOrClusterIDFilter(0xc000f22540)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/disk.go:39 +0x1fe
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).listDisks(0xc0015dc900?)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/disk.go:43 +0x1e
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).destroyDisks(0xc000f22540)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/disk.go:116 +0x36
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).destroyCluster(0xc000f22540)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/gcp.go:174 +0x78e
k8s.io/apimachinery/pkg/util/wait.ConditionFunc.WithContext.func1({0x18, 0xc000700000})
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/wait/wait.go:220 +0x1b
k8s.io/apimachinery/pkg/util/wait.runConditionWithCrashProtectionWithContext({0x19f06638?, 0xc0000721c0?}, 0xc00047d888?)
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/wait/wait.go:233 +0x57
k8s.io/apimachinery/pkg/util/wait.poll({0x19f06638, 0xc0000721c0}, 0xc8?, 0x1108485?, 0x10?)
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/wait/wait.go:580 +0x38
k8s.io/apimachinery/pkg/util/wait.PollImmediateInfiniteWithContext({0x19f06638, 0xc0000721c0}, 0x40d687?, 0x10?)
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/wait/wait.go:566 +0x49
k8s.io/apimachinery/pkg/util/wait.PollImmediateInfinite(0x19f06670?, 0xc00008b8c0?)
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/wait/wait.go:555 +0x46
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).Run(0xc000f22540)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/gcp.go:130 +0x519
main.runDestroyCmd({0x7fffe6a88d87, 0x9}, 0x0)
	/go/src/github.com/openshift/installer/cmd/openshift-install/destroy.go:67 +0x92
main.newDestroyClusterCmd.func1(0xc000536780?, {0xc000906100?, 0x2?, 0x2?})
	/go/src/github.com/openshift/installer/cmd/openshift-install/destroy.go:53 +0x7f
github.com/spf13/cobra.(*Command).execute(0xc000536780, {0xc0009060c0, 0x2, 0x2})
	/go/src/github.com/openshift/installer/vendor/github.com/spf13/cobra/command.go:860 +0x663
github.com/spf13/cobra.(*Command).ExecuteC(0xc00098db80)
	/go/src/github.com/openshift/installer/vendor/github.com/spf13/cobra/command.go:974 +0x3b4
github.com/spf13/cobra.(*Command).Execute(...)
	/go/src/github.com/openshift/installer/vendor/github.com/spf13/cobra/command.go:902
main.installerMain()
	/go/src/github.com/openshift/installer/cmd/openshift-install/main.go:60 +0x29e
main.main()
	/go/src/github.com/openshift/installer/cmd/openshift-install/main.go:38 +0xff
panic: runtime error: slice bounds out of range [:22] with length 21 [recovered]
	panic: runtime error: slice bounds out of range [:22] with length 21

goroutine 1 [running]:
k8s.io/apimachinery/pkg/util/runtime.HandleCrash({0x0, 0x0, 0x18?})
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/runtime/runtime.go:55 +0xd8
panic({0x41d43c0, 0xc0010637e8})
	/usr/lib/golang/src/runtime/panic.go:838 +0x207
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).formatClusterIDForStorage(...)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/disk.go:25
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).storageIDFilter(...)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/disk.go:29
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).storageLabelOrClusterIDFilter(0xc000f22540)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/disk.go:39 +0x1fe
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).listDisks(0xc0015dc900?)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/disk.go:43 +0x1e
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).destroyDisks(0xc000f22540)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/disk.go:116 +0x36
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).destroyCluster(0xc000f22540)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/gcp.go:174 +0x78e
k8s.io/apimachinery/pkg/util/wait.ConditionFunc.WithContext.func1({0x18, 0xc000700000})
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/wait/wait.go:220 +0x1b
k8s.io/apimachinery/pkg/util/wait.runConditionWithCrashProtectionWithContext({0x19f06638?, 0xc0000721c0?}, 0xc00047d888?)
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/wait/wait.go:233 +0x57
k8s.io/apimachinery/pkg/util/wait.poll({0x19f06638, 0xc0000721c0}, 0xc8?, 0x1108485?, 0x10?)
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/wait/wait.go:580 +0x38
k8s.io/apimachinery/pkg/util/wait.PollImmediateInfiniteWithContext({0x19f06638, 0xc0000721c0}, 0x40d687?, 0x10?)
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/wait/wait.go:566 +0x49
k8s.io/apimachinery/pkg/util/wait.PollImmediateInfinite(0x19f06670?, 0xc00008b8c0?)
	/go/src/github.com/openshift/installer/vendor/k8s.io/apimachinery/pkg/util/wait/wait.go:555 +0x46
github.com/openshift/installer/pkg/destroy/gcp.(*ClusterUninstaller).Run(0xc000f22540)
	/go/src/github.com/openshift/installer/pkg/destroy/gcp/gcp.go:130 +0x519
main.runDestroyCmd({0x7fffe6a88d87, 0x9}, 0x0)
	/go/src/github.com/openshift/installer/cmd/openshift-install/destroy.go:67 +0x92
main.newDestroyClusterCmd.func1(0xc000536780?, {0xc000906100?, 0x2?, 0x2?})
	/go/src/github.com/openshift/installer/cmd/openshift-install/destroy.go:53 +0x7f
github.com/spf13/cobra.(*Command).execute(0xc000536780, {0xc0009060c0, 0x2, 0x2})
	/go/src/github.com/openshift/installer/vendor/github.com/spf13/cobra/command.go:860 +0x663
github.com/spf13/cobra.(*Command).ExecuteC(0xc00098db80)
	/go/src/github.com/openshift/installer/vendor/github.com/spf13/cobra/command.go:974 +0x3b4
github.com/spf13/cobra.(*Command).Execute(...)
	/go/src/github.com/openshift/installer/vendor/github.com/spf13/cobra/command.go:902
main.installerMain()
	/go/src/github.com/openshift/installer/cmd/openshift-install/main.go:60 +0x29e
main.main()
	/go/src/github.com/openshift/installer/cmd/openshift-install/main.go:38 +0xff


Anything else we need to know?

Uninstall with openshift-install binary from OCP 4.10.16 worked fine.

Comment 2 Patrick Dillon 2022-07-01 14:11:22 UTC
Can we get an install config for the failing destroy?

Comment 3 Patrick Dillon 2022-07-01 14:13:30 UTC
Sorry. I see the install config is in the attachment.  I thought that was only the destroy log.

Comment 4 Patrick Dillon 2022-07-01 14:28:10 UTC
Marking this as blocker+. It looks like https://github.com/openshift/installer/pull/5976 introduced a regression when destroying disks. We should have a PR to fix up today.

Comment 6 Patrick Dillon 2022-07-01 17:56:37 UTC
For QE: This error would occur after installing and provisioning PV.

Comment 16 Jianli Wei 2022-07-12 02:52:45 UTC
>Verified in 4.12.0-0.nightly-2022-07-11-054352.

$ openshift-install version
openshift-install 4.12.0-0.nightly-2022-07-11-054352
built from commit 8879e19b4cb0256686a573f842363186f30f0ed7
release image registry.ci.openshift.org/ocp/release@sha256:e7f48276819b351a005ae69882ddcebf21c35abec633cf935f5a4a245a8c2161
release architecture amd64
$ openshift-install create cluster --dir work 
? SSH Public Key /home/fedora/.ssh/openshift-qe.pub
? Platform gcp
INFO Credentials loaded from file "/home/fedora/.gcp/osServiceAccount.json"
? Project ID OpenShift QE (openshift-qe)
? Region us-central1
? Base Domain qe.gcp.devcluster.openshift.com
? Cluster Name jiwei
? Pull Secret [? for help] *************************************************************************************************************
INFO Creating infrastructure resources...                                                                                               
INFO Waiting up to 20m0s (until 2:20AM) for the Kubernetes API at https://api.jiwei.qe.gcp.devcluster.openshift.com:6443...             
INFO API v1.24.0+b5932d4 up                                           
INFO Waiting up to 30m0s (until 2:32AM) for bootstrapping to complete...                                                                
INFO Destroying the bootstrap resources...                                                                                              
INFO Waiting up to 40m0s (until 2:52AM) for the cluster at https://api.jiwei.qe.gcp.devcluster.openshift.com:6443 to initialize...      
INFO Waiting up to 10m0s (until 2:30AM) for the openshift-console route to be created...                                                
INFO Install complete!                                                    
INFO To access the cluster as the system:admin user when using 'oc', run                                                                
INFO     export KUBECONFIG=/home/fedora/work/auth/kubeconfig                                                                            
INFO Access the OpenShift web-console here: https://console-openshift-console.apps.jiwei.qe.gcp.devcluster.openshift.com                
INFO Login to the console with user: "kubeadmin", and password: "nfXAb-h963V-SXxtV-No76J"                                               
INFO Time elapsed: 23m7s                                                      
$ export KUBECONFIG=/home/fedora/work/auth/kubeconfig                                                          
$ oc get clusterversion
NAME      VERSION                              AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.12.0-0.nightly-2022-07-11-054352   True        False         18m     Cluster version is 4.12.0-0.nightly-2022-07-11-054352
$ oc get nodes
NAME                                                 STATUS   ROLES    AGE   VERSION
jiwei-5tsrv-master-0.c.openshift-qe.internal         Ready    master   35m   v1.24.0+b5932d4
jiwei-5tsrv-master-1.c.openshift-qe.internal         Ready    master   35m   v1.24.0+b5932d4
jiwei-5tsrv-master-2.c.openshift-qe.internal         Ready    master   34m   v1.24.0+b5932d4
jiwei-5tsrv-worker-a-wsd27.c.openshift-qe.internal   Ready    worker   27m   v1.24.0+b5932d4
jiwei-5tsrv-worker-b-4857j.c.openshift-qe.internal   Ready    worker   27m   v1.24.0+b5932d4
jiwei-5tsrv-worker-c-cfblj.c.openshift-qe.internal   Ready    worker   27m   v1.24.0+b5932d4
$ oc get co
NAME                                       VERSION                              AVAILABLE   PROGRESSING   DEGRADED   SINCE   MESSAGE
authentication                             4.12.0-0.nightly-2022-07-11-054352   True        False         False      18m
baremetal                                  4.12.0-0.nightly-2022-07-11-054352   True        False         False      33m
cloud-controller-manager                   4.12.0-0.nightly-2022-07-11-054352   True        False         False      34m
cloud-credential                           4.12.0-0.nightly-2022-07-11-054352   True        False         False      34m
cluster-autoscaler                         4.12.0-0.nightly-2022-07-11-054352   True        False         False      32m
config-operator                            4.12.0-0.nightly-2022-07-11-054352   True        False         False      34m
console                                    4.12.0-0.nightly-2022-07-11-054352   True        False         False      19m
csi-snapshot-controller                    4.12.0-0.nightly-2022-07-11-054352   True        False         False      33m
dns                                        4.12.0-0.nightly-2022-07-11-054352   True        False         False      32m
etcd                                       4.12.0-0.nightly-2022-07-11-054352   True        False         False      31m
image-registry                             4.12.0-0.nightly-2022-07-11-054352   True        False         False      26m
ingress                                    4.12.0-0.nightly-2022-07-11-054352   True        False         False      26m
insights                                   4.12.0-0.nightly-2022-07-11-054352   True        False         False      27m
kube-apiserver                             4.12.0-0.nightly-2022-07-11-054352   True        False         False      30m
kube-controller-manager                    4.12.0-0.nightly-2022-07-11-054352   True        False         False      31m
kube-scheduler                             4.12.0-0.nightly-2022-07-11-054352   True        False         False      30m
kube-storage-version-migrator              4.12.0-0.nightly-2022-07-11-054352   True        False         False      34m
machine-api                                4.12.0-0.nightly-2022-07-11-054352   True        False         False      27m
machine-approver                           4.12.0-0.nightly-2022-07-11-054352   True        False         False      32m
machine-config                             4.12.0-0.nightly-2022-07-11-054352   True        False         False      32m
marketplace                                4.12.0-0.nightly-2022-07-11-054352   True        False         False      33m
monitoring                                 4.12.0-0.nightly-2022-07-11-054352   True        False         False      24m
network                                    4.12.0-0.nightly-2022-07-11-054352   True        False         False      35m
node-tuning                                4.12.0-0.nightly-2022-07-11-054352   True        False         False      33m
openshift-apiserver                        4.12.0-0.nightly-2022-07-11-054352   True        False         False      28m
openshift-controller-manager               4.12.0-0.nightly-2022-07-11-054352   True        False         False      29m
openshift-samples                          4.12.0-0.nightly-2022-07-11-054352   True        False         False      27m
operator-lifecycle-manager                 4.12.0-0.nightly-2022-07-11-054352   True        False         False      33m
operator-lifecycle-manager-catalog         4.12.0-0.nightly-2022-07-11-054352   True        False         False      33m
operator-lifecycle-manager-packageserver   4.12.0-0.nightly-2022-07-11-054352   True        False         False      28m
service-ca                                 4.12.0-0.nightly-2022-07-11-054352   True        False         False      34m
storage                                    4.12.0-0.nightly-2022-07-11-054352   True        False         False      33m
$ gcloud compute disks list --filter="name~jiwei"
NAME                        LOCATION       LOCATION_SCOPE  SIZE_GB  TYPE    STATUS
jiwei-5tsrv-master-0        us-central1-a  zone            128      pd-ssd  READY
jiwei-5tsrv-worker-a-wsd27  us-central1-a  zone            128      pd-ssd  READY
jiwei-5tsrv-master-1        us-central1-b  zone            128      pd-ssd  READY
jiwei-5tsrv-worker-b-4857j  us-central1-b  zone            128      pd-ssd  READY
jiwei-5tsrv-master-2        us-central1-c  zone            128      pd-ssd  READY
jiwei-5tsrv-worker-c-cfblj  us-central1-c  zone            128      pd-ssd  READY
$ openshift-install destroy cluster --dir work
INFO Credentials loaded from file "/home/fedora/.gcp/osServiceAccount.json"
INFO Stopped instance jiwei-5tsrv-worker-a-wsd27
INFO Stopped instance jiwei-5tsrv-worker-b-4857j
INFO Stopped instance jiwei-5tsrv-worker-c-cfblj
INFO Stopped instance jiwei-5tsrv-master-1
INFO Stopped instance jiwei-5tsrv-master-0
INFO Stopped instance jiwei-5tsrv-master-2
INFO Deleted IAM project role bindings
INFO Deleted service account projects/openshift-qe/serviceAccounts/jiwei-5tsrv-openshift-g-sjjbj.gserviceaccount.com
INFO Deleted service account projects/openshift-qe/serviceAccounts/jiwei-5tsrv-cloud-crede-hlhfl.gserviceaccount.com
INFO Deleted service account projects/openshift-qe/serviceAccounts/jiwei-5tsrv-m.gserviceaccount.com
INFO Deleted service account projects/openshift-qe/serviceAccounts/jiwei-5tsrv-openshift-g-966sn.gserviceaccount.com
INFO Deleted service account projects/openshift-qe/serviceAccounts/jiwei-5tsrv-openshift-i-c9pxk.gserviceaccount.com
INFO Deleted service account projects/openshift-qe/serviceAccounts/jiwei-5tsrv-openshift-m-nrgkj.gserviceaccount.com
INFO Deleted service account projects/openshift-qe/serviceAccounts/jiwei-5tsrv-openshift-c-xbxjb.gserviceaccount.com
INFO Deleted service account projects/openshift-qe/serviceAccounts/jiwei-5tsrv-openshift-i-pmmv2.gserviceaccount.com
INFO Deleted service account projects/openshift-qe/serviceAccounts/jiwei-5tsrv-w.gserviceaccount.com
INFO Deleted 2 recordset(s) in zone qe
INFO Deleted 3 recordset(s) in zone jiwei-5tsrv-private-zone
INFO Deleted DNS zone jiwei-5tsrv-private-zone
INFO Deleted bucket jiwei-5tsrv-image-registry-us-central1-wdesugaemhnptbibxmdminp
INFO Deleted instance jiwei-5tsrv-master-2
INFO Deleted instance jiwei-5tsrv-worker-c-cfblj
INFO Deleted instance jiwei-5tsrv-master-0
INFO Deleted instance jiwei-5tsrv-worker-a-wsd27
INFO Deleted instance jiwei-5tsrv-master-1
INFO Deleted instance jiwei-5tsrv-worker-b-4857j
INFO Deleted disk jiwei-5tsrv-master-1
INFO Deleted disk jiwei-5tsrv-worker-b-4857j
INFO Deleted disk jiwei-5tsrv-master-0
INFO Deleted disk jiwei-5tsrv-worker-a-wsd27
INFO Deleted disk jiwei-5tsrv-master-2
INFO Deleted disk jiwei-5tsrv-worker-c-cfblj
INFO Deleted firewall rule jiwei-5tsrv-health-checks
INFO Deleted firewall rule jiwei-5tsrv-internal-cluster
INFO Deleted firewall rule jiwei-5tsrv-internal-network
INFO Deleted firewall rule k8s-fw-a4fe1c396ab004289baa1f74120d29ef
INFO Deleted firewall rule k8s-a4fe1c396ab004289baa1f74120d29ef-http-hc
INFO Deleted firewall rule jiwei-5tsrv-api
INFO Deleted firewall rule jiwei-5tsrv-control-plane
INFO Deleted firewall rule jiwei-5tsrv-etcd
INFO Deleted address jiwei-5tsrv-cluster-ip
INFO Deleted address jiwei-5tsrv-cluster-public-ip
INFO Deleted forwarding rule jiwei-5tsrv-api-internal
INFO Deleted forwarding rule a4fe1c396ab004289baa1f74120d29ef
INFO Deleted forwarding rule jiwei-5tsrv-api
INFO Deleted router jiwei-5tsrv-router
INFO Deleted subnetwork jiwei-5tsrv-worker-subnet
INFO Deleted target pool a4fe1c396ab004289baa1f74120d29ef
INFO Deleted target pool jiwei-5tsrv-api
INFO Deleted backend service jiwei-5tsrv-api-internal
INFO Deleted subnetwork jiwei-5tsrv-master-subnet
INFO Deleted instance group jiwei-5tsrv-master-us-central1-b
INFO Deleted instance group jiwei-5tsrv-master-us-central1-a
INFO Deleted instance group jiwei-5tsrv-master-us-central1-c
INFO Deleted health check jiwei-5tsrv-api-internal
INFO Deleted HTTP health check a4fe1c396ab004289baa1f74120d29ef
INFO Deleted HTTP health check jiwei-5tsrv-api
INFO Deleted network jiwei-5tsrv-network
INFO Time elapsed: 4m14s
$

Comment 19 errata-xmlrpc 2023-01-17 19:51:09 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.12.0 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:7399


Note You need to log in before you can comment on or make changes to this bug.