Description of problem: Seeing many test failures related to storage/filesystems. The following tests have highest flake ratings: [sig-storage] In-tree Volumes [Driver: local][LocalVolumeType: blockfs] [Testpattern: Pre-provisioned PV (default fs)] subPath should support readOnly file specified in the volumeMount [Suite:openshift/conformance/parallel] [Suite:k8s] [sig-storage] PersistentVolumes-local [Volume type: blockfswithformat] Set fsGroup for local volume should set same fsGroup for two pods simultaneously [Suite:openshift/conformance/parallel] [Suite:k8s] See: https://testgrid.k8s.io/redhat-openshift-ocp-release-4.2-informing#release-openshift-ocp-installer-e2e-aws-rhel7-workers-4.2&sort-by-flakiness= Example jobs: https://prow.svc.ci.openshift.org/view/gcs/origin-ci-test/logs/release-openshift-ocp-installer-e2e-aws-rhel7-workers-4.2/352 https://prow.svc.ci.openshift.org/view/gcs/origin-ci-test/logs/release-openshift-ocp-installer-e2e-aws-rhel7-workers-4.2/354 https://prow.svc.ci.openshift.org/view/gcs/origin-ci-test/logs/release-openshift-ocp-installer-e2e-aws-rhel7-workers-4.2/355 https://prow.svc.ci.openshift.org/view/gcs/origin-ci-test/logs/release-openshift-ocp-installer-e2e-aws-rhel7-workers-4.2/356 Many of the failures seem linked to a `rm` hitting `EBUSY`. E.g.: - rm: cannot remove '/tmp/local-driver-a7012570-763e-11ea-affc-0a58ac105853': Device or resource busy - rm: cannot remove '/tmp/local-volume-test-e247402f-763f-11ea-9129-0a58ac105853': Device or resource busy - rm: cannot remove '/tmp/local-driver-43fcbeab-763e-11ea-87fc-0a58ac105853': Device or resource busy
Yes, indeed looks like a dupe of 1820717. Thanks! *** This bug has been marked as a duplicate of bug 1820717 ***
This report was closed as a duplicate of bug 1820717, which cites this failure: > [sig-storage] In-tree Volumes [Driver: nfs] [Testpattern: Pre-provisioned PV (block volmode)] volumeMode should fail to create pod by failing to mount volume [Suite:openshift/conformance/parallel] [Suite:k8s] I no longer see that failure in CI. However, I still see the failures that were reported in comment 0 in these jobs: > [sig-storage] In-tree Volumes [Driver: local][LocalVolumeType: blockfs] [Testpattern: Pre-provisioned PV (default fs)] subPath should support readOnly file specified in the volumeMount [Suite:openshift/conformance/parallel] [Suite:k8s] > > [sig-storage] PersistentVolumes-local [Volume type: blockfswithformat] Set fsGroup for local volume should set same fsGroup for two pods simultaneously [Suite:openshift/conformance/parallel] [Suite:k8s] In these recent failures, I still see the following error messages: rm: cannot remove '/tmp/local-driver-eabf8dc1-862d-11ea-8344-0a58ac1071e9': Device or resource busy rm: cannot remove '/tmp/local-volume-test-eaee3143-862d-11ea-824a-0a58ac1071e9': Device or resource busy These are some recent failures: https://prow.svc.ci.openshift.org/view/gcs/origin-ci-test/logs/release-openshift-ocp-installer-e2e-aws-rhel7-workers-4.2/385 https://prow.svc.ci.openshift.org/view/gcs/origin-ci-test/logs/release-openshift-ocp-installer-e2e-aws-rhel7-workers-4.2/391 https://prow.svc.ci.openshift.org/view/gcs/origin-ci-test/logs/release-openshift-ocp-installer-e2e-aws-rhel7-workers-4.2/392 It appears that the originally reported failures were not in fact the same issue as bug 1820717, so I am reopening this report.
Setting target release to current development version (4.5) for investigation. Where fixes (if any) are required/requested for prior versions, cloned BZs will be created when appropriate.
Closing as a dupe, since 4.4 has a fix. Backports to 4.2 are for CVE fixes mostly. *** This bug has been marked as a duplicate of bug 1823374 ***