Bug 1349068 - vdsm unable to start: wdmd: unrecognized service
Summary: vdsm unable to start: wdmd: unrecognized service
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: vdsm
Version: 3.5.7
Hardware: All
OS: Linux
high
urgent
Target Milestone: ovirt-3.5.9
: ---
Assignee: Nir Soffer
QA Contact: Ying Cui
URL:
Whiteboard:
Depends On: 1344139
Blocks: rhevh-3.5-6.8
TreeView+ depends on / blocked
 
Reported: 2016-06-22 16:01 UTC by Douglas Schilling Landgraf
Modified: 2016-11-08 17:33 UTC (History)
20 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2016-08-19 10:20:03 UTC
oVirt Team: Node
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
host-deploy log (248.59 KB, text/plain)
2016-06-22 16:01 UTC, Douglas Schilling Landgraf
no flags Details
vdsm files (9.63 KB, application/x-gzip)
2016-06-22 16:02 UTC, Douglas Schilling Landgraf
no flags Details
rpm-qa (21.33 KB, text/plain)
2016-06-22 20:08 UTC, Douglas Schilling Landgraf
no flags Details
vdsm-rpm-qa-el6.txt (15.84 KB, text/plain)
2016-06-22 20:48 UTC, Douglas Schilling Landgraf
no flags Details


Links
System ID Private Priority Status Summary Last Updated
oVirt gerrit 59655 0 master ABANDONED spec: Require sanlock 2.8-3 2016-06-24 14:08:20 UTC
oVirt gerrit 59656 0 ovirt-3.5 NEW spec: Require sanlock 2.8-3 2016-06-22 22:05:58 UTC

Description Douglas Schilling Landgraf 2016-06-22 16:01:12 UTC
Description of problem:


Version-Release number of selected component (if applicable):

# rpm -qa | grep -i vdsm
vdsm-yajsonrpc-4.16.37-1.el6ev.noarch
vdsm-hook-vhostmd-4.16.37-1.el6ev.noarch
vdsm-reg-4.16.37-1.el6ev.noarch
vdsm-cli-4.16.37-1.el6ev.noarch
vdsm-jsonrpc-4.16.37-1.el6ev.noarch
vdsm-python-zombiereaper-4.16.37-1.el6ev.noarch
vdsm-xmlrpc-4.16.37-1.el6ev.noarch
vdsm-4.16.37-1.el6ev.x86_64
vdsm-hook-ethtool-options-4.16.37-1.el6ev.noarch
ovirt-node-plugin-vdsm-0.2.0-26.el6ev.noarch
vdsm-python-4.16.37-1.el6ev.noarch

Red Hat Enterprise Virtualization Manager Version 3.5.8-0.1.el6ev

How reproducible:

- Install rhev-hypervisor6-6.8-20160621.1.iso
- Register and approve the node
- Registration will fail

Actual results:
Registration failed.

Expected results:
Node become UP.

Additional info:

# service vdsmd start
wdmd: unrecognized service
vdsm: Start dependent wdmd                                 [FAILED]
vdsm start    

# vdsm-tool configure --force

Checking configuration status...  [FAILED]

Traceback (most recent call last):
  File "/usr/bin/vdsm-tool", line 219, in main
    return tool_command[cmd]["command"](*args)
  File "/usr/lib/python2.6/site-packages/vdsm/tool/__init__.py", line 37, in wrapper
  File "/usr/lib/python2.6/site-packages/vdsm/tool/configurator.py", line 62, in configure
  File "/usr/lib/python2.6/site-packages/vdsm/tool/configurators/sanlock.py", line 71, in isconfigured
KeyError: 'getpwnam(): name not found: sanlock'

Comment 1 Douglas Schilling Landgraf 2016-06-22 16:01:49 UTC
Created attachment 1170871 [details]
host-deploy log

Comment 2 Douglas Schilling Landgraf 2016-06-22 16:02:50 UTC
Created attachment 1170873 [details]
vdsm files

Comment 4 Fabian Deutsch 2016-06-22 16:07:18 UTC
Note: That RHEV-H build is using the RHEV 3.5 package set

Comment 5 Nir Soffer 2016-06-22 19:43:28 UTC
(In reply to Douglas Schilling Landgraf from comment #0)
> # service vdsmd start
> wdmd: unrecognized service
> vdsm: Start dependent wdmd                                 [FAILED]
> vdsm start

You must run vdsm-tool configure --force before starting vdsmd.

> # vdsm-tool configure --force
...
> KeyError: 'getpwnam(): name not found: sanlock'

Is sanlock installed?

Please attach output of "rpm -qa"

Comment 6 Douglas Schilling Landgraf 2016-06-22 20:08:01 UTC
Created attachment 1171073 [details]
rpm-qa

Comment 7 Nir Soffer 2016-06-22 20:36:00 UTC
Douglass, the attached rpm-qa is from el7 system, but the bug is describing el6
system :-)

Comment 8 Douglas Schilling Landgraf 2016-06-22 20:48:30 UTC
Created attachment 1171077 [details]
vdsm-rpm-qa-el6.txt

Comment 9 Douglas Schilling Landgraf 2016-06-22 20:49:34 UTC
(In reply to Nir Soffer from comment #7)
> Douglass, the attached rpm-qa is from el7 system, but the bug is describing
> el6
> system :-)

Sorry Nir, I have updated a new attachment!

Comment 10 Nir Soffer 2016-06-22 20:58:41 UTC
Thanks Douglas,

Vdsm is requiring sanlock and sanlock is installed, but when host deploy is configuring sanlock, sanlock user is missing.

May be vdsm installation issue, host-deploy issue or sanlock installation issue.

David, do you have and idea why user sanlock was missing? Did sanlock install
changed recently for el6?

Comment 11 Nir Soffer 2016-06-22 21:01:13 UTC
Didi, can you check the host-deploy log? Do you have an idea why sanlock user is 
missing when host-deploy is trying to configure vdsm? Did we change anything in
host-deploy for el6 recently?

Comment 12 David Teigland 2016-06-22 21:07:24 UTC
Yes, sanlock was just recently rebuilt in rhel6 for bug 1344139.  The problem there was a missing dependency on the useradd and groupadd commands, which the sanlock rpm needs.

Comment 13 Nir Soffer 2016-06-22 21:41:31 UTC
Douglas, vdsm must require sanlock >= 2.8.3 to consume this fix, would you clone
this bug to vdsm?

Comment 14 Douglas Schilling Landgraf 2016-06-23 01:29:27 UTC
(In reply to Nir Soffer from comment #13)
> Douglas, vdsm must require sanlock >= 2.8.3 to consume this fix, would you
> clone
> this bug to vdsm?

Hi Nir, I have used edit-node and updated sanlock to 2.8.3 and registration process continued, so I can confirm it should work. Howerver, now I see: https://bugzilla.redhat.com/show_bug.cgi?id=1349218

Comment 16 Douglas Schilling Landgraf 2016-06-30 20:47:02 UTC
Nir, FYI:

As vdsm 3.5 won't get re-spin, I have included these changes into ovirt-node package.

Comment 17 Yaniv Lavi 2016-07-03 08:21:25 UTC
(In reply to Douglas Schilling Landgraf from comment #16)
> Nir, FYI:
> 
> As vdsm 3.5 won't get re-spin, I have included these changes into ovirt-node
> package.

moving to node.

Comment 18 Nir Soffer 2016-07-05 21:41:55 UTC
Based on comment 16 and comment 17, moved to ovirt-node.

Comment 19 Douglas Schilling Landgraf 2016-07-06 02:37:41 UTC
(In reply to Nir Soffer from comment #18)
> Based on comment 16 and comment 17, moved to ovirt-node.

Hi Nir, vdsm in 3.5 will be re-spin based on the yesterday meeting.
Moving back to vdsm.

Comment 20 Nir Soffer 2016-07-07 12:17:40 UTC
Gil, I get too much spam from ack-police about missing qe ack. Would
you handle this?

Comment 21 Anatoly Litovsky 2016-07-12 09:49:42 UTC
The vdsm was respinned

Comment 22 Anatoly Litovsky 2016-07-12 09:50:41 UTC
Does it happens in the latest RHEHV 6.8 ?

Comment 24 cshao 2016-08-01 12:31:00 UTC
Test version:
rhev-hypervisor6-6.8-20160707.3
vdsm-4.16.38-1.el6ev.x86_64
ovirt-node-3.2.3-34.el6.noarch
RHEV-M vt 21(3.5.8-0.1.)


Test steps:
1. Install rhev-hypervisor6-6.8-20160707.3
2. Register and approve the node
3. Check Node status

Test result:
1. Node become UP.

So the bug is fixed, change bug status to VERIFIED.


Note You need to log in before you can comment on or make changes to this bug.