Bug 1419729

Summary: AgentX Subagent Crashes on snmpd Stop/Restart
Product: Red Hat Enterprise Linux 7 Reporter: aaron
Component: net-snmpAssignee: Josef Ridky <jridky>
Status: CLOSED WONTFIX QA Contact: BaseOS QE - Apps <qe-baseos-apps>
Severity: high Docs Contact:
Priority: unspecified    
Version: 7.2CC: caleb.hensley
Target Milestone: rc   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2019-12-10 14:28:30 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description aaron 2017-02-06 21:47:19 UTC
Description of problem:

A connected agentx subagent will crash on an snmpd stop or restart. The backtrace shows the crash originates in libnetsnmp snmp_sess_close.

Debugging shows the crash is within netsnmp_transport_free [SNMP_FREE(t)]. The transport pointer (t) is null and the final backtrace call is in glibc.

Prior to the crash any queries to snmpd for OID's the subagent is registered for gets a `Timeout: no response from localhost`. 

Both the agent and the snmpd foreground process show the agent connects successfully.

Version-Release number of selected component (if applicable):

net-snmp-5.7.2-24


How reproducible:

Every time snmpd is stopped or restarted with a connected net-snmp AgentX subagent written against the 5.7.2-24 library.

Steps to Reproduce:

1. Connect AgentX sub-agent to snmpd running foreground or background
2. Restart or Stop snmpd
3. AgentX sub-agent will crash with mention of a double free in glibc. Backtrace and debugging shows this call is made from within netsnmp_transport_free. 

Actual results:

The subagent thread will crash bringing down any application process it is running within.

Expected results:

No crash on snmp stop/restart. snmpget to registered OID's of the subagent should return data, not timeouts.

Additional info:

This subagent was developed on Fedora 25 with net-snmp-5.7.3-13 where the crash does not exist. A stop restart of snmpd either in the foreground or background simply results in the subagent attempting a reconnect every 15 seconds.

The same correct behavior can be seen with the same code compiled on Ubuntu 16.04 with the libsnmp 5.7.3.

Additionally, the OID's mentioned previously are found and reported properly - with no change in the associated MIB's.

The issue only exists when the subagent is compiled against net-snmp-5.7.2-24.

Comment 1 Josef Ridky 2017-02-07 10:11:35 UTC
Thank you for taking the time to report this issue to us. We appreciate the feedback and use reports such as this one to guide our efforts at improving our products. That being said, this bug tracking system is not a mechanism for requesting support, and we are not able to guarantee the timeliness or suitability of a resolution.

If this issue is critical or in any way time sensitive, please raise a ticket through the regular Red Hat support channels to ensure it receives the proper attention and prioritization to assure a timely resolution. 

For information on how to contact the Red Hat production support team, please visit:
    https://www.redhat.com/support/process/production/#howto

Comment 2 Josef Ridky 2019-12-10 14:28:30 UTC
Red Hat Enterprise Linux version 7 is entering the Maintaining phase 1 of its lifetime and this bug doesn't meet the criteria for it, i.e. only high severity issues will be fixed. Please see https://access.redhat.com/support/policy/updates/errata/ for further information.

This issue should be fixed in Red Hat Enterprise Linux 8.

Closing as WONTFIX.

Comment 3 Caleb Hensley 2021-09-14 21:07:04 UTC
This bug is also seen using net-snmp 5.7.3 on Debian Stretch. I do not believe it is a Fedora issue, rather, it seems to be a net-snmp issue. I am currently messaging net-snmp SourceForge forums (-users and -coders) to push for a solution to this.

Comment 4 aaron 2021-09-15 01:01:03 UTC
(In reply to Caleb Hensley from comment #3)
> This bug is also seen using net-snmp 5.7.3 on Debian Stretch. I do not
> believe it is a Fedora issue, rather, it seems to be a net-snmp issue. I am
> currently messaging net-snmp SourceForge forums (-users and -coders) to push
> for a solution to this.

That would be great, I appreciate you updating this thread. I have little confidence that it is fixed in RHEL 8 (above seems to be a generic "all your problems go away"), but have not tested it.