Bug 1940569

Summary: Restart lldpad and fcoe services only if needed
Product: Red Hat Enterprise Virtualization Manager Reporter: Marian Jankular <mjankula>
Component: vdsmAssignee: Ales Musil <amusil>
Status: CLOSED CURRENTRELEASE QA Contact: Michael Burman <mburman>
Severity: high Docs Contact:
Priority: high    
Version: 4.4.4CC: amusil, dholler, jortialc, lsurette, mburman, michal.skrivanek, mkalinin, mkranz, mperina, mtessun, pelauter, pratshar, rhodain, srevivo, wmarx, ycui
Target Milestone: ovirt-4.4.6   
Target Release: 4.4.6   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: vdsm-4.40.60.4 Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-04-22 08:28:49 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Network RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1944724    

Description Marian Jankular 2021-03-18 16:09:29 UTC
Description of problem:
customer is removing vlan from host interface, lldpad and fccoe services are restarted as result of /usr/libexec/vdsm/hooks/before_network_setup/50_fcoe hook


Version-Release number of selected component (if applicable):
rhv 4.4.4

How reproducible:
everytime

Steps to Reproduce:
1. remove vlan from host wihile host is Up

Actual results:
storage domains become invalid dor shot time because fcoe service is restarted

Expected results:

no storage outage

Additional info:

hw used on the host side:

37:00.0 Ethernet controller [0200]: QLogic Corp. FastLinQ QL45000 Series 25GbE Controller [1077:1656] (rev 10)
37:00.1 Ethernet controller [0200]: QLogic Corp. FastLinQ QL45000 Series 25GbE Controller [1077:1656] (rev 10)
37:00.2 Ethernet controller [0200]: QLogic Corp. FastLinQ QL45000 Series 10/25/40/50GbE Controller (FCoE) [1077:165c] (rev 10)
37:00.3 Ethernet controller [0200]: QLogic Corp. FastLinQ QL45000 Series 10/25/40/50GbE Controller (FCoE) [1077:165c] (rev 10)
37:00.4 Ethernet controller [0200]: QLogic Corp. FastLinQ QL45000 Series 25GbE Controller [1077:1656] (rev 10)
37:00.5 Ethernet controller [0200]: QLogic Corp. FastLinQ QL45000 Series 25GbE Controller [1077:1656] (rev 10)

there is no setup for the interfaces, it looks like the interfaces are full offloading

Comment 1 Michal Skrivanek 2021-03-19 10:25:15 UTC
why does it happen on vlan removal? this hook is run on setup networks, that shouldn't be such a common operation?

Comment 3 Ales Musil 2021-03-19 10:38:53 UTC
(In reply to Michal Skrivanek from comment #1)
> why does it happen on vlan removal? this hook is run on setup networks, that
> shouldn't be such a common operation?

Almost every network operation goes through setup networks and because this hook is in before setup it will 
run before every setup no matter what is in that setup. 

Looking at this hook we could probably restart those services only if something has changed with the fcoe 
and not every time.

Comment 22 Michael Burman 2021-04-22 08:28:49 UTC
No regression seen with this fix. This can be closed as current release.
Customer has approved the fix.

Comment 23 Michael Burman 2021-04-22 08:30:46 UTC
Regression tested on vdsm-4.40.60.4-1.el8ev.x86_64 and rhvm-4.4.6.5-447.gd80dda7.9.el8ev.noarch

Comment 29 Red Hat Bugzilla 2023-09-15 01:03:39 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 500 days