Bug 1506620

Summary: collectd with netlink plugin read-function fails
Product: Red Hat OpenStack Reporter: David Vallee Delisle <dvd>
Component: collectdAssignee: Matthias Runge <mrunge>
Status: CLOSED CURRENTRELEASE QA Contact: Leonid Natapov <lnatapov>
Severity: low Docs Contact:
Priority: low    
Version: 10.0 (Newton)CC: apannu, jbadiapa, lars, marjones, mmagr, mrunge, rmccabe, slinaber, tdosek
Target Milestone: zstreamKeywords: Rebase, TestOnly, Triaged, ZStream
Target Release: 10.0 (Newton)   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: collectd-5.8.0-8.el7ost Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2019-01-15 11:37:14 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description David Vallee Delisle 2017-10-26 13:03:33 UTC
Description of problem:
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: Initialization complete, entering read-loop.
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: netlink plugin: link_filter_cb: IFLA_STATS64 mnl_attr_validate2 failed.
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: netlink plugin: ir_read: mnl_socket_recvfrom failed.
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: read-function of plugin `netlink' failed. Will suspend it for 600.000 seconds.

Seems to be generated by https://github.com/collectd/collectd/blob/cb46533c34d5ab991e7df09312739b9be1f8821b/src/netlink.c#L360
and https://github.com/collectd/collectd/blob/cb46533c34d5ab991e7df09312739b9be1f8821b/src/netlink.c#L681

I think this upstream bug talks about this:
https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=829634

Version-Release number of selected component (if applicable):
collectd-5.5.2-2.el7ost.x86_64
collectd-netlink-5.5.2-2.el7ost.x86_64


How reproducible:
Everytime

Steps to Reproduce:
1. Start collectd with netlink plugin


Actual results:
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: Initialization complete, entering read-loop.
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: netlink plugin: link_filter_cb: IFLA_STATS64 mnl_attr_validate2 failed.
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: netlink plugin: ir_read: mnl_socket_recvfrom failed.
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: read-function of plugin `netlink' failed. Will suspend it for 600.000 seconds.

Comment 1 Matthias Runge 2017-10-27 10:14:29 UTC
On a centos 7 test machine using a build from epel:
The version coming from EPEL is 5.7.1

first run, it didn't work
second one, after fully updating the machine, it works

Re-testing with collectd-5.7.2 from centos-opstools does not show any issues

Comment 2 Matthias Runge 2017-10-28 08:54:28 UTC
5.7.2 from osp works, once fixing the config issue https://bugzilla.redhat.com/show_bug.cgi?id=1507203

Comment 12 Lon Hohberger 2018-08-16 10:38:40 UTC
According to our records, this should be resolved by collectd-5.8.0-10.el7ost.  This build is available now.

Comment 13 Leonid Natapov 2019-01-10 14:27:27 UTC
collectd-netlink-5.8.0-12.el7ost.x86_64

Created conf file for netlink plugin:

LoadPlugin "netlink"
<Plugin "netlink">
 Interface "eth0"
</Plugin>

Jan 10 09:26:07 compute-0 collectd[221601]: plugin_load: plugin "netlink" successfully loaded.