Bug 1367161
| Summary: | After nfs server lost connection, properly clean up the node | | |
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Bradley Childs <bchilds> |
| Component: | Storage | Assignee: | Hemant Kumar <hekumar> |
| Status: | CLOSED CURRENTRELEASE | QA Contact: | Wenqi He <wehe> |
| Severity: | low | Docs Contact: | |
| Priority: | medium | | |
| Version: | 3.2.0 | CC: | aos-bugs, eparis, haowang, hchen, jhou, jsafrane, lxia, mawong, tdawson, wehe |
| Target Milestone: | --- | | |
| Target Release: | --- | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | 1337479 | Environment: | |
| Last Closed: | 2017-07-27 14:49:07 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
| Bug Depends On: | 1337479 | | |
| Bug Blocks: | | | |
Description
Bradley Childs
2016-08-15 18:00:08 UTC
This is being tracked upstream in https://github.com/kubernetes/kubernetes/issues/31272 ('Hung volumes can wedge the kubelet').

It has been fixed for 1.5 and will be cherry-picked into 1.4.
https://github.com/kubernetes/kubernetes/pull/35038

This has been merged into OCP and is in OCP v3.5.0.16 or newer.

Tested on the version below:
openshift v3.5.0.16+a26133a
kubernetes v1.5.2+43a9be4
After deleting all the pods, none remain stuck in Terminating. This bug is fixed. Thanks.

Still seeing a pod and its project stuck in Terminating status.
# openshift version
openshift v3.5.0.20+87266c6
kubernetes v1.5.2+43a9be4
etcd 3.1.0
# oc get projects | grep -i terminat
6sgm7 Terminating
# oc get pods --all-namespaces | grep -i terminat
6sgm7 mysql-1-p7x04 0/1 Terminating 1 58m
# oc describe pod mysql-1-p7x04 -n 6sgm7
Name: mysql-1-p7x04
Namespace: 6sgm7
Security Policy: restricted
Node: qe-lxia-node-registry-router-1/10.240.0.11
Start Time: Tue, 14 Feb 2017 23:50:06 -0500
Labels: app=mysql-persistent
deployment=mysql-1
deploymentconfig=mysql
name=mysql
Status: Terminating (expires Tue, 14 Feb 2017 23:59:10 -0500)
Termination Grace Period: 30s
IP:
Controllers: ReplicationController/mysql-1
Containers:
mysql:
Container ID: docker://668e73ad086d3d2cc0a307c0e7e5d971556b459ae1375f4f136af334cc2fa542
Image: registry.ops.openshift.com/rhscl/mysql-57-rhel7@sha256:3136b2989e331fecabfc0c482ca9112efa5aa08289494e844e47da2f71b3de95
Image ID: docker-pullable://registry.ops.openshift.com/rhscl/mysql-57-rhel7@sha256:3136b2989e331fecabfc0c482ca9112efa5aa08289494e844e47da2f71b3de95
Port: 3306/TCP
Limits:
memory: 512Mi
Requests:
memory: 512Mi
State: Running
Started: Tue, 14 Feb 2017 23:57:48 -0500
Last State: Terminated
Reason: Error
Exit Code: 137
Started: Tue, 14 Feb 2017 23:56:23 -0500
Finished: Tue, 14 Feb 2017 23:57:46 -0500
Ready: False
Restart Count: 1
Liveness: tcp-socket :3306 delay=30s timeout=1s period=10s #success=1 #failure=3
Readiness: exec [/bin/sh -i -c MYSQL_PWD="$MYSQL_PASSWORD" mysql -h 127.0.0.1 -u $MYSQL_USER -D $MYSQL_DATABASE -e 'SELECT 1'] delay=5s timeout=1s period=10s #success=1 #failure=3
Volume Mounts:
/var/lib/mysql/data from mysql-data (rw)
/var/run/secrets/kubernetes.io/serviceaccount from default-token-vlm5v (ro)
Environment Variables:
MYSQL_USER: <set to the key 'database-user' in secret 'mysql'>
MYSQL_PASSWORD: <set to the key 'database-password' in secret 'mysql'>
MYSQL_ROOT_PASSWORD: <set to the key 'database-root-password' in secret 'mysql'>
MYSQL_DATABASE: sampledb
Conditions:
Type Status
Initialized True
Ready False
PodScheduled True
Volumes:
mysql-data:
Type: PersistentVolumeClaim (a reference to a PersistentVolumeClaim in the same namespace)
ClaimName: mysql
ReadOnly: false
default-token-vlm5v:
Type: Secret (a volume populated by a Secret)
SecretName: default-token-vlm5v
QoS Class: Burstable
Tolerations: <none>
No events.
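
The two checks in the transcript above (listing pods in Terminating, then reading the "expires" time from oc describe) could be scripted roughly as follows. This is only a sketch, assuming the output of `oc get pods --all-namespaces` and `oc describe pod` has already been captured as text. The function names, the `router-1-abcde` row, and the `now` timestamp are hypothetical and do not come from this report.

```python
import re
from datetime import datetime, timedelta, timezone
from email.utils import parsedate_to_datetime


def terminating_pods(get_pods_output):
    """Return (namespace, pod) pairs whose STATUS column is Terminating."""
    stuck = []
    for line in get_pods_output.splitlines():
        fields = line.split()
        # Assumed column order: NAMESPACE NAME READY STATUS RESTARTS AGE
        if len(fields) >= 4 and fields[3] == "Terminating":
            stuck.append((fields[0], fields[1]))
    return stuck


def seconds_past_expiry(describe_output, now):
    """Return seconds elapsed since the 'expires' time, or None if not found."""
    match = re.search(r"Terminating \(expires ([^)]+)\)", describe_output)
    if match is None:
        return None
    # The timestamp is in RFC 2822 style, e.g. "Tue, 14 Feb 2017 23:59:10 -0500".
    expires = parsedate_to_datetime(match.group(1))
    return (now - expires).total_seconds()


# The first data row is copied from the report; the second is a made-up example.
get_pods = """NAMESPACE NAME READY STATUS RESTARTS AGE
6sgm7 mysql-1-p7x04 0/1 Terminating 1 58m
default router-1-abcde 1/1 Running 0 2h"""

describe = "Status: Terminating (expires Tue, 14 Feb 2017 23:59:10 -0500)"

# Hypothetical "current time", chosen only for illustration.
now = datetime(2017, 2, 15, 0, 48, 0, tzinfo=timezone(timedelta(hours=-5)))

print(terminating_pods(get_pods))          # [('6sgm7', 'mysql-1-p7x04')]
print(seconds_past_expiry(describe, now))  # 2930.0
```

A positive value from `seconds_past_expiry` means the pod has outlived its termination deadline, which is the symptom reported here.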