Summary: | ovnkube-master starts failing due to "ERROR RESTARTING - nbdb - too many failed ovn-nbctl attempts, giving up" | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Kenjiro Nakayama <knakayam> |
Component: | Networking | Assignee: | Ben Bennett <bbennett> |
Networking sub component: | ovn-kubernetes | QA Contact: | Anurag saxena <anusaxen> |
Status: | CLOSED DUPLICATE | Docs Contact: | |
Severity: | unspecified | ||
Priority: | unspecified | ||
Version: | 4.7 | ||
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2021-08-19 00:36:16 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: |
Description
Kenjiro Nakayama
2021-08-17 14:30:24 UTC
Not sure if this is related or not, but pods takes long time to start as it keeps "Pending" status until the network interface is attached. $ oc -n cupcake-1000-stage describe pod helloworld-go-00001-deployment-59fb487487-bvt8t ... Events: Type Reason Age From Message ---- ------ ---- ---- ------- Normal Scheduled 8m3s default-scheduler Successfully assigned cupcake-1000-stage/helloworld-go-00001-deployment-59fb487487-qhmrx to worker002 Normal AddedInterface 5m33s multus Add eth0 [10.131.0.10/23] Normal Pulled 5m30s kubelet Container image "quay.io/wreicher/quarkus-native-hello@sha256:09eb7225f59c6147d6a2acca945ed1b284b3ac317b3e0a51ca7f99136abc711c" already present on machine Normal Created 5m23s kubelet Created container user-container Normal Started 5m23s kubelet Started container user-container Normal Pulled 5m23s kubelet Container image "registry.redhat.io/openshift-serverless-1/serving-queue-rhel8@sha256:c2a97c0868e19f4e5a269d29bfe3b7c6b6ef870e135a5419388047965cc0b19d" already present on machine Normal Created 5m18s kubelet Created container queue-proxy Normal Started 5m18s kubelet Started container queue-proxy Also, it sometimes gets the following error. $ oc -n cupcake-1000-stage describe pod Events: Type Reason Age From Message ---- ------ ---- ---- ------- Normal Scheduled 4m45s default-scheduler Successfully assigned cupcake-1000-stage/helloworld-go-00001-deployment-59fb487487-m2h7z to worker002 Warning ErrorAddingLogicalPort 4m5s controlplane failed to add IP "10.131.0.20" to address set "edcc6c70-2415-4719-b32b-1d6841058f0d/cupcake-1000-stage_v4/a5666218552365519683", stderr: "2021-08-17T13:47:43Z|00001|fatal_signal|WARN|terminating with signal 14 (Alarm clock)\n" (OVN command '/usr/bin/ovn-nbctl --timeout=15 add address_set edcc6c70-2415-4719-b32b-1d6841058f0d addresses "10.131.0.20"' failed: signal: alarm clock) I would appreciate if I could know if these slow start is related to the ovnkube-master issue or not. This issue is quite similar to bz1952819. I will ask perf team to upgrade to 4.7.22 (bz1962608 said it fixed). *** This bug has been marked as a duplicate of bug 1962608 *** |