Bug 1313586
Summary: | Host unreacheble if vdsm-network fail to setup | ||
---|---|---|---|
Product: | [oVirt] vdsm | Reporter: | Badalyan Vyacheslav <v.badalyan> |
Component: | Services | Assignee: | Edward Haas <edwardh> |
Status: | CLOSED UPSTREAM | QA Contact: | Meni Yakove <myakove> |
Severity: | high | Docs Contact: | |
Priority: | unspecified | ||
Version: | 4.18.0 | CC: | bugs, danken, v.badalyan |
Target Milestone: | ovirt-4.0.0-beta | Flags: | ykaul:
ovirt-4.0.0?
rule-engine: planning_ack? rule-engine: devel_ack? rule-engine: testing_ack? |
Target Release: | --- | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2016-05-22 13:29:19 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | Network | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Badalyan Vyacheslav
2016-03-02 00:43:36 UTC
My configuration 2-4 ETH -> Bonding LACP -> VLANs (3-5 vlans. 1500 and 900 MTU. Managment in vlan with MTU 9000) Also here MainProcess|jsonrpc.Executor/5::ERROR::2016-02-17 23:26:59,345::supervdsmServer::118::SuperVdsm.ServerCallback::(wrapper) Error in setupNetworks Traceback (most recent call last): File "/usr/share/vdsm/supervdsmServer", line 116, in wrapper res = func(*args, **kwargs) File "/usr/share/vdsm/supervdsmServer", line 241, in setupNetworks return setupNetworks(networks, bondings, **options) File "/usr/share/vdsm/network/api.py", line 939, in setupNetworks logger, _netinfo) File "/usr/share/vdsm/network/api.py", line 768, in _add_missing_networks implicitBonding=True, _netinfo=_netinfo, **d) File "/usr/share/vdsm/network/api.py", line 222, in wrapped ret = func(**attrs) File "/usr/share/vdsm/network/api.py", line 314, in _addNetwork _netinfo=_netinfo, configurator=configurator, opts=options) File "/usr/share/vdsm/network/api.py", line 138, in _objectivizeNetwork nics, mtu, _netinfo, implicitBonding) File "/usr/share/vdsm/network/models.py", line 301, in objectivize destroyOnMasterRemoval=destroyOnMasterRemoval) File "/usr/share/vdsm/network/models.py", line 209, in __init__ self.validateOptions(options) File "/usr/share/vdsm/network/models.py", line 341, in validateOptions 'valid bonding option' % key) ConfigNetworkError: (25, "'miinmon' is not a valid bonding option") Please include the complete supervdsm.log, as a rollback should have taken place, and specify the precise vdsm version and release (4.18.0 has not yet been released). i recreate all configurations and get worked. 1. If i have 2 networks with gateway, vdsm create Route Tables with bad. If you host do forwardng and packet recive from NETWORK1, ip route look to route table 1 and don't see second network. Route tables must know about ALL networks. 2. If i save network, in vdsm logs i see many PING HOSTS and its not save. In Engine forewer task SetUp Network. Only reboot engine helps. If i dont check "test connection to engine" - all done normal! 3. If default GW or some cluster host is unreacheble for engine host. Engine do forewer check. But GW is HA VM on one of host. I must emulate GW in network to engine start and run VM! Part 3 its VERY bad! Engine (HOSTED) MUST CAN start in degress mode without all storage domains online! It's must check only storage domain needed to start ENGINE. I can't access to Engine WEB if one of hosts with storages is offline. Is do meny trubles to start datacenert after mass power down! i recreate all configurations and get worked. 1. If i have 2 networks with gateway, vdsm create Route Tables with bad. If you host do forwardng and packet recive from NETWORK1, ip route look to route table 1 and don't see second network. Route tables must know about ALL networks. 2. If i save network, in vdsm logs i see many PING HOSTS and its not save. In Engine forewer task SetUp Network. Only reboot engine helps. If i dont check "test connection to engine" - all done normal! 3. If default GW or some cluster host is unreacheble for engine host. Engine do forewer check. But GW is HA VM on one of host. I must emulate GW in network to engine start and run VM! Part 3 its VERY bad! Engine (HOSTED) MUST CAN start in degress mode without all storage domains online! It's must check only storage domain needed to start ENGINE. I can't access to Engine WEB if one of hosts with storages is offline. Is do meny trubles to start datacenert after mass power down! Badalyan, this bug was opened about Vdsm's failure to handle mistyped "miinmon" on boot time. Vdsm should have rejected this bond option in the first place, hence my requests of comment 3. Please provide the information that was requested there. Regarding your three new points: I must admit that I do not understand the worries expressed there, but they seem unrelated to this bug. Would you open open a fresh bug for each, and remember to include package versions and all relevant logs. Moving from 4.0 alpha to 4.0 beta since 4.0 alpha has been already released and bug is not ON_QA. Please reopen when the information rerequested in comment 6 is available. Fixed in 3.6.6 |