Description of problem:
IBM is providing a grub patch for PPC systems. This patch will allow grub to retry IO and connection requests to fc devices in the event that the device returns a timeout.
Systems fail to boot with storage related errors.
Systems will no fail to boot.
Adding IBM team to this BZ. As this is affecting some very large customers, there is a heavy push to have this backported into RHEL8.
------- Comment From diegodo.com 2021-04-07 09:00 EDT-------
this is the patch that I'm willing to send upstream.
Please, let me know your thoughts.
This patch is on top of patch: Avoiding many unecessary open close
Created attachment 1769869 [details]
Retry open and read on failure
(In reply to IBM Bug Proxy from comment #1)
> ------- Comment From diegodo.com 2021-04-07 09:00 EDT-------
> This patch is on top of patch: Avoiding many unecessary open close
... which was integrated into RHEL8.4 through
LTC bug 187174 - RH1862632- RHEL8.3 Beta - ISST-LTE:PowerVM: Fleetwood:raylp83: LPAR installed on a namespace of Kona NVME card takes long time to boot ...
------- Comment From fnovak.com 2021-05-11 09:48 EDT-------
Where does this stand? No update in month+
(In reply to IBM Bug Proxy from comment #4)
> ------- Comment From fnovak.com 2021-05-11 09:48 EDT-------
> Where does this stand? No update in month+
Sorry, we were busy with other tasks. How far should this go for z-stream ?
We don't currently have any customers on RHEL8 hitting this race condition, but I don't want to wait until 8.5 to release this. I think 8.3.z and 8.4.z is sufficient.
(In reply to sgardner from comment #6)
> We don't currently have any customers on RHEL8 hitting this race condition,
> but I don't want to wait until 8.5 to release this. I think 8.3.z and 8.4.z
> is sufficient.
Thanks for the info. Yes, I set to 8.5 because we first need to push the fix in
before backporting to z-stream. As far as I know, 8.3.z is not an EUS release
and was EOL after the 8.4 release, so then we only need to fix this in 8.4.z ?
I suppose IBM is able to test it when patch is merged. So providing qa_ack.
early access to packages (as requested):
added grub2-ppc64le-modules.noarch.rpm to the package list:
------- Comment From diegodo.com 2021-08-02 09:28 EDT-------
the provided packages are working as expected.
Please make it available.
Let me know if something is missing from our side.
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.
For information on the advisory (grub2 bug fix and enhancement update), and where to find the updated
files, follow the link below.
If the solution does not work for you, open a new bug report.