Red Hat Bugzilla – Bug 837735
[VDSM] Node randomally goes offline.
Last modified: 2012-10-27 19:20:11 EDT
Created attachment 596310 [details]
Log Collector files from the crashing system.
Description of problem:
1 of my 3 Nodes keeps randomally going off line the other 3 nodes are running fine without any issues.
Version-Release number of selected component (if applicable):
Name : vdsm
Arch : x86_64
Version : 4.10.0
Release : 0.58.gita6f4929.el6
Size : 2.3 M
Repo : installed
From repo : vdsm-dre
Summary : Virtual Desktop Server Manager
URL : http://www.ovirt.org/wiki/Vdsm
License : GPLv2+
Description : The VDSM service is required by a Virtualization Manager to manage
: the Linux hosts. VDSM manages and monitors the host's storage,
: memory and networks as well as virtual machine creation, other
: host administration tasks, statistics gathering, and log
Seems to be happening ever few hours but only the one host.
Steps to Reproduce:
1.No steps needed it happens
The other hosts are stable but not this one.
All 3 being stable.
Attached is ovirt-log-collector files.
not sure which OS/distro this is from (I'm guessing centos 6.2), but:
all root files (dmidecode, etc.) are 0 byte size
vdsm log is missing
I think some sos plugins on this host are missing.
keith - thoughts
In addition to proper logs, What do you mean it goes offline?
The physical host shuts down?
It is non-operational in engine?
Please define what you mean by SOS plugins?
Host goes non-operational.
I moved to a newer build and the one node hasn't gone offline since. If it pop's back up what logs are you looking for?
I'd like to see the output of getVdsCaps on the non-op machine. Running `sosreport -o vdsm` should provide it (and more). Please reopen if needed.