Red Hat Bugzilla – Bug 1270732
[engine-backend] Undesired handling with a UnsupportedGlusterVolumeReplicaCountError from vdsm
Last modified: 2016-02-10 12:10:27 EST
Created attachment 1081929 [details]
engine.log and vdsm.log
Description of problem:
Engine fails with NullPointerException when VDSM response with UnsupportedGlusterVolumeReplicaCountError while trying to create a Gluster domain which resides on a non 3 bricks volume (tried with 2 bricks).
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. Create a Gluster storage domain resides on a replicate volume which has 2 bricks
VDSM fails to create the Gluster domain which is the desired behaviour. Engine gets VDSM response and doesn't know how to handle with it:
2015-10-12 11:20:21,437 ERROR [org.ovirt.engine.core.bll.storage.AddStorageServerConnectionCommand] (ajp-/127.0.0.1:8702-2)  Command 'org.ovirt.engine.core.bll.storage.AddStorageServerConnectionCommand' failed: null
2015-10-12 11:20:21,437 ERROR [org.ovirt.engine.core.bll.storage.AddStorageServerConnectionCommand] (ajp-/127.0.0.1:8702-2)  Exception: java.lang.NullPointerException
at org.ovirt.engine.core.common.errors.EngineException.<init>(EngineException.java:24) [common.jar:]
at org.ovirt.engine.core.bll.storage.AddStorageServerConnectionCommand.executeCommand(AddStorageServerConnectionCommand.java:51) [bll.jar:]
at org.ovirt.engine.core.bll.CommandBase.executeWithoutTransaction(CommandBase.java:1211) [bll.jar:]
at org.ovirt.engine.core.bll.CommandBase.executeActionInTransactionScope(CommandBase.java:1355) [bll.jar:]
at org.ovirt.engine.core.bll.CommandBase.runInTransaction(CommandBase.java:1979) [bll.jar:]
Engine should know how to catch the failure from VDSM.
engine.log and vdsm.log
Root cause is inconsistency between error code sent from vdsm (480) and error code expected by the engine (4710).
Changing error code at engine to 480.
Ala, since the BZ is in POST, I assume that a patch exists. Can you add a reference to it?
(In reply to Allon Mureinik from comment #2)
> Ala, since the BZ is in POST, I assume that a patch exists. Can you add a
> reference to it?
In oVirt testing is done on single release by default. Therefore I'm removing the 4.0 flag. If you think this bug must be tested in 4.0 as well, please re-add the flag. Please note we might not have testing resources to handle the 4.0 clone.
Engine handles well with an unsupported replica volume count:
2015-11-30 13:26:41,208 ERROR [org.ovirt.engine.core.bll.storage.BaseFsStorageHelper] (ajp-/127.0.0.1:8702-1) [1e6a8626] The connection with details '10.35.65.25:/elad1' failed because of error code '480' and error message is: unsupported gluster volume replica count
According to verification status and target milestone this issue should be fixed in oVirt 3.6.1. Closing current release.