Bug 2047302

Summary: [NMCI] dracut_NM_vlan_over_bridge and dracut_NM_vlan_over_bond test failure
Product: Red Hat Enterprise Linux 9 Reporter: Vladimir Benes <vbenes>
Component: NetworkManager Assignee: Beniamino Galvani <bgalvani>
Status: CLOSED ERRATA QA Contact: Filip Pokryvka <fpokryvk>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 9.0 CC: bgalvani, ferferna, lrintel, rkhan, sukulkar, till
Target Milestone: rc Keywords: Triaged
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: NetworkManager-1.36.0-0.8.el9 Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-05-17 15:48:19 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
NMCI patch to delay udev messages none

Comment 1 Vladimir Benes 2022-01-27 14:54:38 UTC
not sure if the third bug is related but there is one more
https://jenkins-networkmanager.apps.ocp.ci.centos.org/job/NetworkManager-main-c9s/75/artifact/FAIL-report_NetworkManager-ci-M0_Test0318_dracut_legacy_iSCSI_ibft_table.html

We can file a new bug for this one if needed (but we somehow planned to turn off legacy dracut tests completely). What do you think?

Comment 2 Beniamino Galvani 2022-02-01 14:09:11 UTC
(In reply to Vladimir Benes from comment #1)
> not sure if the third bug is related but there is one more
> https://jenkins-networkmanager.apps.ocp.ci.centos.org/job/NetworkManager-
> main-c9s/75/artifact/FAIL-report_NetworkManager-ci-
> M0_Test0318_dracut_legacy_iSCSI_ibft_table.html

It's not clear what's happening there, as NM completes the configuration and one minute later there are I/O errors. Anyway, it looks like a different issue. If it happens again, please file a separate bz.

Comment 3 Beniamino Galvani 2022-02-02 09:06:07 UTC
BTW, I was not able to reproduce the failure even after running the test in a loop hundreds of times; it seems the issue happens only when, for some reason, udev announces the interface late.

I could reproduce the failure by rebuilding the initrd image with an additional service that sends SIGSTOP and SIGCONT to udev so that its messages get delayed. It's a hack, but I'm attaching it in case it's useful for verification. Maybe the service could have a ConditionKernelCommandLine= so that we can activate it only when needed, and we could add a test for it.
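
For illustration only, the SIGSTOP/SIGCONT idea can be sketched as below. This is not the attached patch: the helper name delay_process and its parameters are hypothetical, and how the service finds the udev PID inside the initrd is left out.

```python
import os
import signal
import time


def delay_process(pid: int, pause_seconds: float) -> None:
    """Freeze process `pid` for `pause_seconds`, then let it continue.

    Hypothetical helper: while the target is stopped it cannot emit
    any messages, so whatever it would have announced arrives late.
    """
    # SIGSTOP cannot be caught or ignored, so the target always freezes.
    os.kill(pid, signal.SIGSTOP)
    try:
        time.sleep(pause_seconds)
    finally:
        # Always resume the target, even if the sleep is interrupted.
        os.kill(pid, signal.SIGCONT)
```

In the scenario described here the target would be the udev daemon running in the initrd, and the pause would need to be long enough for NM to start before the interface is announced; both are assumptions about the setup, not details taken from the patch.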

Comment 4 Beniamino Galvani 2022-02-02 09:07:29 UTC
Created attachment 1858563 [details]
NMCI patch to delay udev messages

Comment 8 Vladimir Benes 2022-02-24 09:17:21 UTC
Pushing this to a verified state as we don't see such issues anymore.

Comment 10 errata-xmlrpc 2022-05-17 15:48:19 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (new packages: NetworkManager), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2022:3915