Bug 1506620 - collectd with netlink plugin read-function fails
Summary: collectd with netlink plugin read-function fails
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: collectd
Version: 10.0 (Newton)
Hardware: Unspecified
OS: Unspecified
Priority: low
Severity: low
Target Milestone: zstream
Target Release: 10.0 (Newton)
Assignee: Matthias Runge
QA Contact: Leonid Natapov
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2017-10-26 13:03 UTC by David Vallee Delisle
Modified: 2022-08-16 11:47 UTC (History)

Fixed In Version: collectd-5.8.0-8.el7ost
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2019-01-15 11:37:14 UTC
Target Upstream Version:
Embargoed:



Links
System ID Private Priority Status Summary Last Updated
Debian BTS 829634 0 None None None 2017-10-26 13:03:33 UTC
Github collectd collectd issues 2510 0 'None' 'closed' 'netlink plugin not working on CentOS, RHEL' 2019-11-11 00:08:11 UTC
Red Hat Issue Tracker OSP-4738 0 None None None 2022-08-16 11:47:10 UTC

Description David Vallee Delisle 2017-10-26 13:03:33 UTC
Description of problem:
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: Initialization complete, entering read-loop.
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: netlink plugin: link_filter_cb: IFLA_STATS64 mnl_attr_validate2 failed.
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: netlink plugin: ir_read: mnl_socket_recvfrom failed.
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: read-function of plugin `netlink' failed. Will suspend it for 600.000 seconds.

Seems to be generated by https://github.com/collectd/collectd/blob/cb46533c34d5ab991e7df09312739b9be1f8821b/src/netlink.c#L360
and https://github.com/collectd/collectd/blob/cb46533c34d5ab991e7df09312739b9be1f8821b/src/netlink.c#L681

I think this upstream bug talks about this:
https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=829634

Version-Release number of selected component (if applicable):
collectd-5.5.2-2.el7ost.x86_64
collectd-netlink-5.5.2-2.el7ost.x86_64


How reproducible:
Every time

Steps to Reproduce:
1. Start collectd with netlink plugin


Actual results:
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: Initialization complete, entering read-loop.
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: netlink plugin: link_filter_cb: IFLA_STATS64 mnl_attr_validate2 failed.
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: netlink plugin: ir_read: mnl_socket_recvfrom failed.
Oct 06 23:38:57 wcnec2-l-rh-cmp-9.hqplan.lab collectd[45142]: read-function of plugin `netlink' failed. Will suspend it for 600.000 seconds.
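The failure signature in the log above can be checked for mechanically. The following is only a minimal sketch, assuming log lines in exactly the format quoted in this report; the function name `find_suspended_plugins` and the regular expression are illustrative and not part of collectd.

```python
import re

# Matches the suspension message as quoted in this report, e.g.
#   read-function of plugin `netlink' failed. Will suspend it for 600.000 seconds.
# The pattern is an assumption based on that single sample line.
SUSPEND_RE = re.compile(
    r"read-function of plugin `(?P<plugin>[^']+)' failed\. "
    r"Will suspend it for (?P<seconds>\d+(?:\.\d+)?) seconds\."
)


def find_suspended_plugins(lines):
    """Return a list of (plugin, seconds) pairs, one per suspension message."""
    found = []
    for line in lines:
        match = SUSPEND_RE.search(line)
        if match:
            found.append((match.group("plugin"), float(match.group("seconds"))))
    return found
```

The function accepts any iterable of strings, so it could be fed the lines of a saved journal excerpt. An empty result only means that no suspension message was seen in the given lines, not that the plugin is collecting data.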

Comment 1 Matthias Runge 2017-10-27 10:14:29 UTC
On a CentOS 7 test machine, using the build from EPEL (version 5.7.1):

On the first run, it didn't work.
On the second run, after fully updating the machine, it works.

Re-testing with collectd-5.7.2 from centos-opstools does not show any issues.

Comment 2 Matthias Runge 2017-10-28 08:54:28 UTC
5.7.2 from OSP works once the config issue is fixed: https://bugzilla.redhat.com/show_bug.cgi?id=1507203

Comment 12 Lon Hohberger 2018-08-16 10:38:40 UTC
According to our records, this should be resolved by collectd-5.8.0-10.el7ost. This build is available now.

Comment 13 Leonid Natapov 2019-01-10 14:27:27 UTC
collectd-netlink-5.8.0-12.el7ost.x86_64

Created conf file for netlink plugin:

LoadPlugin "netlink"
<Plugin "netlink">
 Interface "eth0"
</Plugin>

Jan 10 09:26:07 compute-0 collectd[221601]: plugin_load: plugin "netlink" successfully loaded.

