Bug 2060031
| Summary: | NMCLI OVS connections intermittently get stuck in "activating" state after power cycle or crash [rhel-8] | ||||||
|---|---|---|---|---|---|---|---|
| Product: | Red Hat Enterprise Linux 8 | Reporter: | Rick Alongi <ralongi> | ||||
| Component: | NetworkManager | Assignee: | Thomas Haller <thaller> | ||||
| Status: | CLOSED ERRATA | QA Contact: | Vladimir Benes <vbenes> | ||||
| Severity: | unspecified | Docs Contact: | |||||
| Priority: | unspecified | ||||||
| Version: | 8.6 | CC: | bgalvani, ctrautma, lrintel, ralongi, rkhan, sfaye, sukulkar, thaller, till, vbenes | ||||
| Target Milestone: | rc | Keywords: | OtherQA, Triaged, ZStream | ||||
| Target Release: | --- | Flags: | pm-rhel: mirror+ | ||||
| Hardware: | Unspecified | ||||||
| OS: | Unspecified | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | NetworkManager-1.40.2-1.el8 | Doc Type: | No Doc Update | ||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | |||||||
| : | 2153429 2153430 2173890 | Environment: | |||||
| Last Closed: | 2023-05-16 09:04:54 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Bug Depends On: | |||||||
| Bug Blocks: | 2153429, 2153430, 2173890 | ||||||
| Attachments: | ||||||
|
Description
Rick Alongi
2022-03-02 14:43:52 UTC
Still seeing this issue using RHEL-8.6 with openvswitch2.17. This time it happens after a forced crash as part of a test, and it can be reproduced manually.

Beaker job link: https://beaker.engineering.redhat.com/jobs/6611291

Still seeing this issue in FDP 22.J testing using RHEL-8.6 (RHEL-8.6.0-updates-20221014.0) with openvswitch2.15-2.15.0-124.el8fdp and openvswitch2.17-2.17.0-58.el8fdp:

[root@netqe40 ~]# rpm -qa | grep NetworkManager
NetworkManager-libnm-1.36.0-9.el8_6.x86_64
NetworkManager-tui-1.36.0-9.el8_6.x86_64
NetworkManager-ovs-1.36.0-9.el8_6.x86_64
NetworkManager-team-1.36.0-9.el8_6.x86_64
NetworkManager-1.36.0-9.el8_6.x86_64
[root@netqe40 ~]# uname -r
4.18.0-372.32.1.el8_6.x86_64

Rick, sorry for taking so long to reply. Thank you for being persistent and for continuing to ping the rhbz :)

This looks to me as if it could be fixed by https://gitlab.freedesktop.org/NetworkManager/NetworkManager/-/commit/4f60fe293cd5461c47d218b632753ecdfb50cbab.

@Beniamino, what do you think?

This indeed seems to be fixed upstream by [1]. [1] was backported to the upstream nm-1-40 branch as [2]. [2] was released upstream as 1.40.2. rhel-8.8 is about to get NetworkManager-1.40.2-1.el8, which contains [2].

[1] https://gitlab.freedesktop.org/NetworkManager/NetworkManager/-/commit/4f60fe293cd5461c47d218b632753ecdfb50cbab
[2] https://gitlab.freedesktop.org/NetworkManager/NetworkManager/-/commit/f702be2992f0f34c82e96b420947f9056a4cb24e

This should be fixed by NetworkManager-1.40.2-1.el8. If possible, please try that package. Thanks for the report!

Hi Thomas, I will test this with NetworkManager-1.40.2-1.el8 as soon as it is merged into RHEL-8.8.
I should mention that I am also seeing this same issue with RHEL-9.0:

[root@netqe40 ~]# rpm -qa | grep NetworkManager
NetworkManager-libnm-1.36.0-5.el9_0.x86_64
NetworkManager-1.36.0-5.el9_0.x86_64
NetworkManager-team-1.36.0-5.el9_0.x86_64
NetworkManager-tui-1.36.0-5.el9_0.x86_64
NetworkManager-ovs-1.36.0-5.el9_0.x86_64

Do you know if there is also a fix available for NetworkManager for RHEL-9.0? Would it make sense for me to log a separate BZ to track this issue for RHEL-9.0?

Thanks!
Rick

The fix [1] is on the upstream main branch and is included in upstream 1.41.3, which is about to come to rhel-9.2 as "NetworkManager-1.41.3-1.el9".

> [1] https://gitlab.freedesktop.org/NetworkManager/NetworkManager/-/commit/4f60fe293cd5461c47d218b632753ecdfb50cbab

> Do you know if there is also a fix available for NetworkManager for RHEL-9.0?

This rhbz tracks development for upcoming RHEL releases (in this case, rhel-8.8), where the issue is about to be fixed. rhel-9.2 is also about to be fixed.

Fixing any older release (rhel-8.7/rhel-9.1 or older) requires following the Z-stream process. Given the severity, that would be appropriate. I will discuss that internally.

It would still be interesting if you could comment on how this issue affects you (or a RH customer), so we get data about the severity/priority.

(In reply to Thomas Haller from comment #6)

Rick, although you seem to reproduce the issue easily, Beniamino (who fixed the bug) was not able to reproduce it locally. It seems something about your setup is special. So it is only a working assumption that the patch really fixes your issue (or any issue at all). It would be very useful if you could test either the rhel-8.8 or the rhel-9.2 package and see whether the issue is avoided. That might be particularly relevant if we do a Z-stream fix for this bug. Is that cumbersome for you to do?
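
For anyone triaging a host against the versions named above, the check can be sketched in Python. This is only a heuristic under stated assumptions: the helper name `has_upstream_fix` and the `FIXED_UPSTREAM` table are hypothetical, the table holds just the two versions mentioned in this report (1.40.2 for el8, 1.41.3 for el9), and a Z-stream build with a lower upstream version could still carry a backport that this comparison cannot see.

```python
import re

# Upstream versions named in this report as containing the fix.
# Assumption: a plain version comparison; Z-stream backports are not detected.
FIXED_UPSTREAM = {"el8": (1, 40, 2), "el9": (1, 41, 3)}

# Matches the main package only, e.g. "NetworkManager-1.36.0-9.el8_6.x86_64".
_NVR = re.compile(r"^NetworkManager-(\d+)\.(\d+)\.(\d+)-[^.]+\.(el\d+)")


def has_upstream_fix(package):
    """Return True/False for the main NetworkManager package, None otherwise."""
    m = _NVR.match(package)
    if m is None:
        return None  # subpackage (e.g. NetworkManager-ovs) or unrelated line
    version = tuple(int(g) for g in m.group(1, 2, 3))
    fixed = FIXED_UPSTREAM.get(m.group(4))
    if fixed is None:
        return None  # distribution not covered by this report
    return version >= fixed
```

Feeding it the lines printed by `rpm -qa | grep NetworkManager` marks the 1.36.0 builds quoted in this report as not fixed and skips the subpackages.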
Hi Thomas, I saw that compose RHEL-9.2.0-20221019.2 contains NetworkManager-1.41.3-1.el9, so I just ran a beaker job using that compose. I did not see the failure where a connection is stuck in "activating" state, so it may be that the fix in question does address the problem.

I'd like to run multiple iterations of the test using a script, on one system using RHEL-9.0 and on one using RHEL-9.2.0-20221019.2, to see whether I can reproduce the issue on the former and see no occurrences on the latter. I'd also like to run similar tests using a RHEL-8.8 compose that contains the fix when it becomes available (the latest stable compose for RHEL-8.8 is RHEL-8.8.0-20221017.2, and that does not appear to have the newer NetworkManager packages yet). I'll let you know what I find.

Thanks,
Rick

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (NetworkManager bug fix and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:2968 |
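
A repeated-iteration check like the one described in the thread could look roughly like the following Python sketch. It is not the script used in the beaker jobs. It assumes the state is sampled with `nmcli -t -f NAME,STATE connection show` (terse output, one `NAME:STATE` line per profile), and it treats a profile as stuck only if it is reported as "activating" in every sample taken. The function names and the connection names in the usage note are hypothetical.

```python
import subprocess


def activating_connections(nmcli_output):
    """Return the names of profiles whose STATE field is 'activating'."""
    names = []
    for line in nmcli_output.splitlines():
        if not line.strip():
            continue
        # STATE is the last field; profile names may contain escaped colons.
        name, _, state = line.rpartition(":")
        if state.strip() == "activating":
            names.append(name.replace("\\:", ":"))
    return names


def stuck_across(samples):
    """Return the names reported as 'activating' in every sample."""
    sets = [set(activating_connections(s)) for s in samples]
    return set.intersection(*sets) if sets else set()


def query_nmcli():
    """Take one sample from the local system (requires nmcli)."""
    result = subprocess.run(
        ["nmcli", "-t", "-f", "NAME,STATE", "connection", "show"],
        check=True, capture_output=True, text=True,
    )
    return result.stdout
```

In use, a test loop would call `query_nmcli()` several times after each power cycle or crash, with a delay between calls, and pass the collected samples to `stuck_across`. For two samples in which a hypothetical `ovs-port0` stays in "activating" while `ovs-iface0` reaches "activated" in the second one, only `ovs-port0` is reported.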