Bug 1534275
| Summary: | apiserver pod of service catalog in CrashLoopBackOff status after upgrading to v3.9 | |||
|---|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Weihua Meng <wmeng> | |
| Component: | Service Broker | Assignee: | Jeff Peeler <jpeeler> | |
| Status: | CLOSED CURRENTRELEASE | QA Contact: | Weihua Meng <wmeng> | |
| Severity: | high | Docs Contact: | ||
| Priority: | high | |||
| Version: | 3.9.0 | CC: | aos-bugs, chezhang, dmoessne, jmatthew, jokerman, mmccomas, smunilla, wmeng, wzheng | |
| Target Milestone: | --- | |||
| Target Release: | 3.9.0 | |||
| Hardware: | Unspecified | |||
| OS: | Unspecified | |||
| Whiteboard: | ||||
| Fixed In Version: | openshift-ansible-3.9.0-0.31.0.git.0.e0a0ad8.el7 | Doc Type: | No Doc Update | |
| Doc Text: |
undefined
|
Story Points: | --- | |
| Clone Of: | ||||
| : | 1534311 (view as bug list) | Environment: | ||
| Last Closed: | 2018-06-18 17:35:28 UTC | Type: | Bug | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Embargoed: | ||||
| Bug Depends On: | 1523298 | |||
| Bug Blocks: | 1534311 | |||
|
Description
Weihua Meng
2018-01-14 17:43:10 UTC
This looks like to be a problem with the aggregator not being enabled in 3.7 (now fixed in bug 1523298). Changing QA contact to wmeng since he is the reporter. Though I have no idea about the root cause, a pod recreataion can make the pod runnning. NAME READY STATUS RESTARTS AGE po/apiserver-6kw8d 0/1 CrashLoopBackOff 26 22h # oc delete po/apiserver-6kw8d -n kube-service-catalog pod "apiserver-6kw8d" deleted NAME READY STATUS RESTARTS AGE po/apiserver-6kw8d 0/1 Terminating 26 22h NAME READY STATUS RESTARTS AGE po/apiserver-bxvtd 1/1 Running 0 8s Are you sure the pod stays running? I believe it'll crash the same way previously if the aggregator is not setup. I think all of this will be fixed with 3.7.24 or later. @Weihua, I think the root cause should be not upgraded to correct images, do you need to update title of bug? Thx I'm confused about the status of this bug. The original report said version 3.7.23 was used to install from, but a later comment suggested 3.7.26 was used.
The original report also mentioned a missing configmap data, which would not have been fixed by restarting a catalog container. In order to verify that the cluster is in the correct state with the aggregator, the following should return a certificate:
kubectl --namespace kube-system get configmap extension-apiserver-authentication -o jsonpath="{ $.data['requestheader-client-ca-file'] }"
If that doesn't work, that's the problem. If it does work, then the scenario from the original report has changed.
Sorry for confusion. I absolutely agree with your point. Now all the pods are running after upgrade so the crash issue is fixed. Thanks. We usually try latest version to keep up with the times. 3.7.23 was latest version at the time bug was reported. and 3.7.26 was the latest version two weeks later. @Cheng, for the image tag problem during upgrade, could you report another bug to track? Thanks. weihua, Report in another bug https://bugzilla.redhat.com/show_bug.cgi?id=1540840 Fixed openshift-ansible-3.9.0-0.31.0.git.0.e0a0ad8.el7.noarch |