Both NodeNetworkTransmitErrs and NodeNetworkTransmitErrs alerts fire when more than 10 errors happen in the last 2 minutes. Depending on the amount of network traffic, the alerts might be too noisy. It would be better to measure errors against the total amount of traffic. See https://github.com/openshift/cluster-monitoring-operator/issues/937#issuecomment-698191872
(In reply to Simon Pasquier from comment #0)
> Both NodeNetworkTransmitErrs and NodeNetworkTransmitErrs alerts fire when
> more than 10 errors happen in the last 2 minutes.

This should read NodeNetworkReceiveErrs and NodeNetworkTransmitErrs.
Tested with 4.7.0-0.nightly-2020-10-26-152308. The expressions for the NodeNetworkReceiveErrs and NodeNetworkTransmitErrs alerts now measure errors against the total amount of traffic:

    alert: NodeNetworkTransmitErrs
    expr: rate(node_network_transmit_errs_total[2m]) / rate(node_network_transmit_packets_total[2m]) > 0.01
    for: 1h
    labels:
      severity: warning
    annotations:
      description: '{{ $labels.instance }} interface {{ $labels.device }} has encountered {{ printf "%.0f" $value }} transmit errors in the last two minutes.'
      summary: Network interface is reporting many transmit errors.

    alert: NodeNetworkReceiveErrs
    expr: rate(node_network_receive_errs_total[2m]) / rate(node_network_receive_packets_total[2m]) > 0.01
    for: 1h
    labels:
      severity: warning
    annotations:
      description: '{{ $labels.instance }} interface {{ $labels.device }} has encountered {{ printf "%.0f" $value }} receive errors in the last two minutes.'
      summary: Network interface is reporting many receive errors.
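For readers comparing the two behaviours, here is a rough Python sketch of an absolute threshold (more than 10 errors in 2 minutes) next to a ratio threshold (more than 1% of packets in error). It is only an illustration, not how Prometheus evaluates the rules: the function names and traffic figures are made up, per_second() is a plain average that ignores the extrapolation done by rate(), and the "for: 1h" hold period is not modelled.

```python
WINDOW_SECONDS = 120.0  # the [2m] range used by the alert expressions


def per_second(delta: float, window: float = WINDOW_SECONDS) -> float:
    """Average per-second increase over the window (simplified stand-in for rate())."""
    return delta / window


def absolute_rule_fires(errors_in_window: float, threshold: float = 10.0) -> bool:
    """Absolute rule: more than `threshold` errors in the window."""
    return errors_in_window > threshold


def ratio_rule_fires(error_rate: float, packet_rate: float,
                     threshold: float = 0.01) -> bool:
    """Ratio rule: errors per packet above `threshold`."""
    if packet_rate == 0:
        # Assumption mirroring float division: x/0 with x > 0 is +Inf (fires),
        # while 0/0 is NaN, which never satisfies a comparison.
        return error_rate > 0
    return error_rate / packet_rate > threshold


# Hypothetical busy interface: 12 errors among 1,200,000 packets in 2 minutes.
busy_errors, busy_packets = 12, 1_200_000
# Hypothetical quiet interface: 6 errors among 300 packets in 2 minutes.
quiet_errors, quiet_packets = 6, 300

for name, errs, pkts in [("busy", busy_errors, busy_packets),
                         ("quiet", quiet_errors, quiet_packets)]:
    print(name,
          "absolute:", absolute_rule_fires(errs),
          "ratio:", ratio_rule_fires(per_second(errs), per_second(pkts)))
```

With these made-up figures, the busy interface trips only the absolute rule although its error ratio is tiny, while the quiet interface trips only the ratio rule because 2% of its packets fail.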
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.7.0 security, bug fix, and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2020:5633