Bug 1650397
Summary: | [RFE] Dashboard to display MTU setting per node under network details | ||||||
---|---|---|---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat Ceph Storage | Reporter: | Mike Hackett <mhackett> | ||||
Component: | Ceph-Dashboard | Assignee: | Aashish sharma <aasharma> | ||||
Status: | CLOSED ERRATA | QA Contact: | Sunil Angadi <sangadi> | ||||
Severity: | medium | Docs Contact: | Ranjini M N <rmandyam> | ||||
Priority: | high | ||||||
Version: | 4.0 | CC: | bniver, ceph-eng-bugs, epuertat, flucifre, gmeno, kdreyer, mkasturi, pcuzner, rmandyam, sangadi, vereddy, vumrao | ||||
Target Milestone: | --- | Keywords: | FutureFeature | ||||
Target Release: | 5.0 | ||||||
Hardware: | Unspecified | ||||||
OS: | Unspecified | ||||||
Whiteboard: | |||||||
Fixed In Version: | ceph-16.1.0-486.el8cp | Doc Type: | Enhancement | ||||
Doc Text: |
.The Prometheus Alertmanager rule triggers an alert for different MTU settings on the {storage-product} Dashboard
Previously, mismatch in MTU settings, which is a well-known cause of networking issues, had to be identified and managed using the command-line interface.
With this release, when a node or a minority of them have an MTU setting that differs from the majority of nodes, an alert is triggered on the {storage-product} Dashboard. The user can either mute the alert or fix the MTU mismatched settings.
See the link:{dashboard-guide}#management-of-alerts-on-the-ceph-dashboard[_Management of Alerts on the Ceph dashboard_] section in the _{storage-product} Dashboard Guide_ for more information.
|
Story Points: | --- | ||||
Clone Of: | Environment: | ||||||
Last Closed: | 2021-08-30 08:22:53 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 1832807, 1959686 | ||||||
Attachments: |
|
Description
Mike Hackett
2018-11-16 05:23:00 UTC
Created attachment 1514326 [details]
Dashboard Hosts Grafana
Mike, RHCS 4.0 Dashboard integrates several Cephmetrics charts, but as you may see in the attached picture, there's no similar placeholder for detailed networks stats. Another question I have, given this is mostly a static setting. Does it make sense to display it into Grafana dashboard, or can it be moved somewhere else? @Federico? I agree with the idea of putting the MTU value in the host's info page. Targeting 4.1 for delivery. @Mike, when reviewing this for 4.1, I think this request does not fit into the dashboard concerns. Let me explain: - The goal of this is to let the user know about a sub-optimal/wrong setting and 'call for action' on that setting. - MTU is not highly valuable data to display: it's not a changing setting, and once fixed it is unlikely to change again. Based on the above, I'd suggest to move this to ceph-medic. In fact there's an upstream RFE for this (https://github.com/ceph/ceph-medic/issues/7). (In reply to Ernesto Puerta from comment #10) > @Mike, when reviewing this for 4.1, I think this request does not fit into > the dashboard concerns. Let me explain: > - The goal of this is to let the user know about a sub-optimal/wrong setting > and 'call for action' on that setting. > - MTU is not highly valuable data to display: it's not a changing setting, > and once fixed it is unlikely to change again. > > Based on the above, I'd suggest to move this to ceph-medic. In fact there's > an upstream RFE for this (https://github.com/ceph/ceph-medic/issues/7). So why is this on 4.1? Please handle: - Close - Move to 5.x - Fix. @Yaniv, Federico targeted this specifically at 4.1 (https://bugzilla.redhat.com/show_bug.cgi?id=1650397#c7). That's why I asked him (and Mike & Paul) for their thoughts. (In reply to Ernesto Puerta from comment #12) > @Yaniv, Federico targeted this specifically at 4.1 That was a year+ ago... > (https://bugzilla.redhat.com/show_bug.cgi?id=1650397#c7). That's why I asked > him (and Mike & Paul) for their thoughts. BTW, a different idea would be to launch the host's Cockpit - which has all this data + configuration for this host. Anyway, I'm moving it to 5.0. We can always bring it back. @Yaniv @Ernesto No issues with the 5.0 plan, I think ceph-medic and even an insights rule would cover the supportability aspect I was looking to address. Thanks @Yaniv (and @Mike for the clarification)! I think the intention of this feature was lost with all the details. The idea is to alert the user if a specific network is configured with a custom MTU and a host is mis-configured. The idea is to alert the user. If it is already part of node exporter, so a nice chunk of the work is done. We now need some integration work - understand which NIC is under which network and what is the expected MTU. Once completed in RHCS 5, please evaluate if we can backport it to 4. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Red Hat Ceph Storage 5.0 bug fix and enhancement), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2021:3294 |