Bug 1279683
| Summary: | NFS Recycling error: Pod failed, pod.Status.Message unknown | ||
|---|---|---|---|
| Product: | OKD | Reporter: | Liang Xia <lxia> |
| Component: | Storage | Assignee: | Sami Wagiaalla <swagiaal> |
| Status: | CLOSED CURRENTRELEASE | QA Contact: | Liang Xia <lxia> |
| Severity: | medium | Docs Contact: | |
| Priority: | medium | ||
| Version: | 3.x | CC: | aos-bugs, bchilds, bleanhar, jkrieger, jliggitt, mmcgrath, mturansk |
| Target Milestone: | --- | Keywords: | Reopened |
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2016-05-12 17:15:01 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Liang Xia
2015-11-10 03:20:53 UTC
The new security functionality caused errors in the recycler, which was used to NFS volumes without any security. The recycler needs to work with UID:GID. This BZ should be fixed by https://github.com/openshift/origin/pull/5792 *** This bug has been marked as a duplicate of bug 1279335 *** https://github.com/openshift/origin/pull/5792 is superceded by https://github.com/openshift/origin/pull/5847 ForkAMI available at https://ci.openshift.redhat.com/jenkins/job/fork_ami/132/ Reopening, since this was related to NFS permissions. The other bug was related to hostmount SCC. https://github.com/openshift/origin/pull/5847 is in the merge queue Check again on devenv-rhel7_2695, following exactly the same steps as in #comment 0 , PV still can not be recycled. # oc describe pv template-pv-5 Name: template-pv-5 Labels: template=PVs Status: Failed Claim: lxiap001/template-pvc-1 Reclaim Policy: Recycle Access Modes: RWO Capacity: 5Gi Message: Recycling error: Pod was active on the node longer than specified deadline Source: Type: NFS (an NFS mount that lasts the lifetime of a pod) Server: localhost Path: /home/data/pv05 ReadOnly: false The error is a little confuse since it says "Pod was active on the node longer than specified deadline", but actually there are no pods on this environment. # openshift version openshift v1.0.8-40-g42ad235 kubernetes v1.1.0-origin-1107-g4c8e6f4 etcd 2.1.2 https://bugzilla.redhat.com/show_bug.cgi?id=1281726 contains the same error ("Pod was active on the node longer than specified"). Are these two dupes? I attempted a MySQL pod with NFS using 700 and 770 (as indicated above). Only 777 worked. 700 had an error when mounting, others when writing. Try again with 777, please. Tried again on devenv-rhel7_2712 with openshift version openshift v1.1-25-g0c0e452 kubernetes v1.1.0-origin-1107-g4c8e6f4 etcd 2.1.2 and PV can be recycled when NFS exported with 777, adn PV Failed to recycle when NFS exported with 700/770. Hi Mark, Since NFS exported with 777 is not good. Could you confirm that NFS exported with 777 is required ? Thanks, Liang Assgin back to get the confirmation. There is a feature request for the automatic addition of GID to pod's running shared storage volumes (NFS, Gluster). The recycler would run a pod using the same GID that is stored on the PV. This allows permissions less than 777. Reassigning to Sami who I believe is handling that feature. Otherwise, Sami, please reassign to the feature owner. PR opened upstream to support a GID annotation which indicates the GID with which to access the volume. The recycler pod will use the same feature. https://github.com/kubernetes/kubernetes/pull/20490 Check on version, openshift v3.1.1.904 kubernetes v1.2.0-alpha.7-703-gbc4550d etcd 2.2.5 The PV (persistent volume) can be recycled now. Once the bug moved to ON_QA, we can move it to verified. On a closer look at this bug it seems like the UID GID setting is not the issue. The recycler script merged here: https://github.com/openshift/origin/pull/5847 and referenced above has a 'becomeUser' method which switches the UID to that of the file which requires deletion. Liang, This is working for you now then ? I think what happened is that only your most recent test contained the patch referenced above. Moving to ON_QA. Please reopen of you encounter this issue again. Moving the verified. |