Bug 2000872 - [tracker] container is not able to list on some directories within the nfs after upgrade to 4.7.24
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: RHCOS
Version: 4.7
Hardware: x86_64
OS: Linux
Priority: urgent
Severity: urgent
Target Milestone: ---
Target Release: 4.10.0
Assignee: Micah Abbott
QA Contact: Michael Nguyen
URL:
Whiteboard:
Depends On:
Blocks: 2004332
Reported: 2021-09-03 09:06 UTC by Pamela Escorza
Modified: 2022-03-10 16:07 UTC (History)
CC List: 15 users

Fixed In Version:
Doc Type: No Doc Update
Doc Text:
Clone Of:
Clones: 2004332
Environment:
Last Closed: 2022-03-10 16:07:01 UTC
Target Upstream Version:
Embargoed:
fbertina: needinfo-


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Red Hat Product Errata | RHSA-2022:0056 | 0 | None | None | None | 2022-03-10 16:07:30 UTC

Description Pamela Escorza 2021-09-03 09:06:00 UTC
Description of problem:
The customer has hit a new issue and cannot find a workaround. It is probably related to the kernel or a new NFS version.

From the drupal pod, some NFS directories cannot be listed on the OCP app nodes after the upgrade.


Version-Release number of selected component (if applicable):

Red Hat OpenShift Container Platform (OCP) 4.7.24
Kernel-4.18.0-305.10.2.el8_4.x86_64
nfs-utils-2.3.3-41.el8.x86_64

OCP UPI-VMWare

How reproducible:
Not able to reproduce the issue, probably related to the upgrade

Actual results:
The application cannot run because it cannot list the files in the NFS directories.

Expected results:
The application finishes without issue.

Additional info:

Comment 20 Micah Abbott 2021-09-24 19:50:05 UTC
It appears the root cause of this issue is tracked in https://bugzilla.redhat.com/show_bug.cgi?id=1982825

The 8.4.z version of that BZ is https://bugzilla.redhat.com/show_bug.cgi?id=1993895, which notes that it is fixed in `kernel-4.18.0-305.16.1.el8_4`.

Since RHCOS 4.7+ is using RHEL 8.4, the following RHCOS builds included a version of the kernel with that fix:

  - 48.84.202109082138-0 (OCP 4.8.11)
  - 47.84.202109082139-0 (OCP 4.7.30)
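
To check whether a given node's kernel already carries the fix, the output of `rpm -q kernel` can be compared against `kernel-4.18.0-305.16.1.el8_4`. The following is a minimal sketch, not an official tool: the names `FIXED_KERNEL`, `parse_kernel` and `has_nfs_fix` are hypothetical, and the comparison is a simplified numeric one that assumes the kernel belongs to the same `el8_4` stream. It does not implement RPM's full version-comparison rules.

```python
import re

# Fixed kernel build named in BZ 1993895 (quoted in the comment above).
FIXED_KERNEL = "kernel-4.18.0-305.16.1.el8_4"

# Matches strings such as "kernel-4.18.0-305.19.1.el8_4.x86_64" or
# "4.18.0-305.10.2.el8_4.x86_64". Only the el8_4 stream is handled here;
# that restriction is an assumption of this sketch.
_KERNEL_RE = re.compile(
    r"(?:kernel-)?(\d+(?:\.\d+)*)-(\d+(?:\.\d+)*)\.el8_4",
    re.IGNORECASE,
)


def parse_kernel(nvr):
    """Return ((version parts), (release parts)) as tuples of ints."""
    match = _KERNEL_RE.match(nvr.strip())
    if match is None:
        raise ValueError(f"not an el8_4 kernel string: {nvr!r}")
    version = tuple(int(part) for part in match.group(1).split("."))
    release = tuple(int(part) for part in match.group(2).split("."))
    return version, release


def has_nfs_fix(nvr):
    """True if the kernel is at or above the fixed build (same stream only)."""
    return parse_kernel(nvr) >= parse_kernel(FIXED_KERNEL)


if __name__ == "__main__":
    # Kernel from the original report (OCP 4.7.24): predates the fix.
    print(has_nfs_fix("Kernel-4.18.0-305.10.2.el8_4.x86_64"))  # False
    # The fixed build itself.
    print(has_nfs_fix("kernel-4.18.0-305.16.1.el8_4.x86_64"))  # True
```

Kernels from other RHEL streams raise `ValueError` rather than being silently compared, since their release numbers are not comparable to the 8.4.z series in this simplified scheme.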

This BZ should have clones to track the inclusion of the fixed kernel in the different OCP releases, but since the releases have already been made and errata issued, it seems excessive.

---

The current version of OCP under development is 4.10 and since the devel builds already include a newer version of the kernel with the fix for this issue, I am going to move this BZ to MODIFIED.

Comment 22 Michael Nguyen 2021-10-12 19:42:48 UTC
Verified on RHCOS 410.84.202110081440-0 which is part of registry.ci.openshift.org/ocp/release:4.10.0-0.nightly-2021-10-10-083341.  It is running a kernel version newer than kernel-4.18.0-305.16.1.el8_4

[core@localhost ~]$ rpm -q kernel
kernel-4.18.0-305.19.1.el8_4.x86_64
[core@localhost ~]$ rpm-ostree status
State: idle
Deployments:
● ostree://67be3786510771cc550aa4be162f2b45cb796e4340755d7a67fa4720aa10e9ba
                   Version: 410.84.202110081440-0 (2021-10-08T14:43:28Z)

Comment 26 errata-xmlrpc 2022-03-10 16:07:01 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:0056

