RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.
Bug 1942148 - Make grub2 more robust against Open Firmware storage race condition causing system boot failures [rhel-7.9.z]
Summary: Make grub2 more robust against Open Firmware storage race condition causing s...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 7
Classification: Red Hat
Component: grub2
Version: 7.9
Hardware: ppc64le
OS: Linux
high
high
Target Milestone: rc
: 7.9
Assignee: Jan Hlavac
QA Contact: Petr Janda
URL:
Whiteboard:
Depends On:
Blocks: 1857216
TreeView+ depends on / blocked
 
Reported: 2021-03-23 18:28 UTC by sgardner
Modified: 2021-10-12 16:11 UTC (History)
11 users (show)

Fixed In Version: grub2-2.02-0.87.el7_9.7
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2021-10-12 15:27:21 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
Retry open and read on failure (3.62 KB, patch)
2021-04-01 13:41 UTC, IBM Bug Proxy
no flags Details | Diff
Patch avoiding many unecessary open/close during the boot (4.44 KB, patch)
2021-04-01 13:41 UTC, IBM Bug Proxy
no flags Details | Diff


Links
System ID Private Priority Status Summary Last Updated
IBM Linux Technology Center 192173 0 None None None 2021-03-25 17:18:13 UTC
Red Hat Product Errata RHBA-2021:3794 0 None None None 2021-10-12 15:27:24 UTC

Description sgardner 2021-03-23 18:28:10 UTC
Description of problem:
IBM is providing a grub patch for PPC systems.  This patch will allow grub to retry IO and connection requests to fc devices in the event that the device returns a timeout.



Actual results:
Systems fail to boot with storage related errors.


Expected results:
Systems will no fail to boot.


Additional info:
Adding IBM team to this BZ.  As this is affecting some very large customers, there is a heavy push to have this backported into RHEL7.9.

Comment 4 IBM Bug Proxy 2021-04-01 13:41:12 UTC
Created attachment 1768256 [details]
Retry open and read on failure


------- Comment on attachment From diegodo.com 2021-04-01 09:33 EDT-------


Hello Redhat,

this is the patch that I'm willing to send upstream.

Please, let me know your thoughts.

This patch is on top of Avoiding many unecessary open close that I'll attach here as well (I don't know if it is already applied to RHEL7.9)

Comment 5 IBM Bug Proxy 2021-04-01 13:41:13 UTC
Created attachment 1768257 [details]
Patch avoiding many unecessary open/close during the boot

Comment 6 Petr Janda 2021-05-17 08:56:01 UTC
I expect IBM will verify it, providing qa_ack.

Comment 7 Javier Martinez Canillas 2021-05-17 14:14:32 UTC
(In reply to IBM Bug Proxy from comment #4)
> Created attachment 1768256 [details]
> Retry open and read on failure
> 
> 
> ------- Comment on attachment From diegodo.com 2021-04-01 09:33
> EDT-------
> 
> 
> Hello Redhat,
> 
> this is the patch that I'm willing to send upstream.
> 
> Please, let me know your thoughts.
> 
> This patch is on top of Avoiding many unecessary open close that I'll attach
> here as well (I don't know if it is already applied to RHEL7.9)

Yes, the latter was included in build grub2-2.02-0.87.el7_9.6.

Comment 8 Hanns-Joachim Uhl 2021-05-19 09:38:12 UTC
(In reply to Javier Martinez Canillas from comment #7)
...
> 
> Yes, the latter was included in build grub2-2.02-0.87.el7_9.6.
.
Hello Red Hat / Javier,
... can you please provide us the updated grub2 rpm (for ppc64le ..) for our early testing ...?
Please advise ...
Thanks in advance for your support.

Comment 9 sgardner 2021-06-03 15:54:25 UTC
I just wanted to provide some clarification just in case there is some confusion.  The patch to "avoiding many unnecessary open/close during the boot" has already been backported into RHEL7.9.  This BZ is ONLY for backporting the "Retry open and read on failure" code from attachment "https://bugzilla.redhat.com/attachment.cgi?id=1768256".

Created attachment 1768256 [details]
Retry open and read on failure

Comment 11 IBM Bug Proxy 2021-06-10 19:10:42 UTC
------- Comment From janani.com 2021-06-10 15:00 EDT-------
Thank you Brock

Comment 16 Brock Organ 2021-06-13 03:13:05 UTC
(In reply to Brock Organ from comment #10)
> (In reply to Hanns-Joachim Uhl from comment #8)
> > (In reply to Javier Martinez Canillas from comment #7)
> > ...
> > > 
> > > Yes, the latter was included in build grub2-2.02-0.87.el7_9.6.
> > .
> > Hello Red Hat / Javier,
> > ... can you please provide us the updated grub2 rpm (for ppc64le ..) for our
> > early testing ...?
> > Please advise ...
> > Thanks in advance for your support.
> 
> early access to packages:
> 
> http://people.redhat.com/~borgan/.8.5/grub2-2.02-0.87.el7_9.6.ppc64le/


Hi Team,

Steven has corrected my package list, here is the right set of new packages to test, sorry for the miscommunication:

http://people.redhat.com/~borgan/.8.5/grub2-2.02-0.87.el7_9.7.ppc64le/

Comment 17 IBM Bug Proxy 2021-07-22 14:41:37 UTC
------- Comment From diegodo.com 2021-07-22 10:36 EDT-------
(In reply to comment #15)
> (In reply to Brock Organ from comment #10)
> > (In reply to Hanns-Joachim Uhl from comment #8)
> > > (In reply to Javier Martinez Canillas from comment #7)
> > > ...
> > > >
> > > > Yes, the latter was included in build grub2-2.02-0.87.el7_9.6.
> > > .
> > > Hello Red Hat / Javier,
> > > ... can you please provide us the updated grub2 rpm (for ppc64le ..) for our
> > > early testing ...?
> > > Please advise ...
> > > Thanks in advance for your support.
> >
> > early access to packages:
> >
> > http://people.redhat.com/~borgan/.8.5/grub2-2.02-0.87.el7_9.6.ppc64le/
> Hi Team,
> Steven has corrected my package list, here is the right set of new packages
> to test, sorry for the miscommunication:
> http://people.redhat.com/~borgan/.8.5/grub2-2.02-0.87.el7_9.7.ppc64le/

Hi Redhat,

just for my better understading: what is the next step here?
Is the package already available to customers?

Thanks

Comment 18 IBM Bug Proxy 2021-08-02 13:32:15 UTC
------- Comment From diegodo.com 2021-08-02 09:27 EDT-------
Hello Redhat,

the provided packages are working as expected.

Please make it available.

Let me know if something is missing from our side.

Thanks

Comment 20 Petr Janda 2021-08-17 06:29:04 UTC
Hello

I consider it as verified by customer.

Petr

Comment 31 errata-xmlrpc 2021-10-12 15:27:21 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (grub2 bug fix and enhancement update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2021:3794


Note You need to log in before you can comment on or make changes to this bug.