Bug 1887490
Summary: | [CNV][Chaos] Networking issues between masters and workers | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat OpenShift Data Foundation | Reporter: | Piotr Kliczewski <pkliczew> |
Component: | ocs-operator | Assignee: | Jose A. Rivera <jarrpa> |
Status: | CLOSED INSUFFICIENT_DATA | QA Contact: | Raz Tamir <ratamir> |
Severity: | high | Docs Contact: | |
Priority: | unspecified | ||
Version: | 4.6 | CC: | bniver, danken, gmeno, jarrpa, kramdoss, madam, mbukatov, muagarwa, ocs-bugs, odf-bz-bot, sostapov, ycui |
Target Milestone: | --- | Keywords: | AutomationBackLog |
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2021-10-06 16:14:41 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 1908661 |
Description
Piotr Kliczewski
2020-10-12 15:40:26 UTC
Not sure whether this should be an OCS issue or not but lets start with installation rather than unclassified. Mudit, so far "installation" has not proven to be really useful. Can we get more logs? If the deployment never rolls out, isn't that an OCP issue? Moving to OCS-op based on "NeedsReinstall installing: waiting for deployment ocs-operator to become ready: Waiting for rollout to finish: 0 of 1 updated replicas are available...". Important information is that OCS was fully operational before disrupting test and there was workload (vm) using it. The issue is that when connectivity between nodes were restored the cluster recovered by OCS became unusable. While interesting, this is not something that should be a blocker for OCS 4.6. Moving to OCS 4.7. This is still interesting, and still not a blocker. We really need more information before we can proceed, at the very least full OCP and OCS must-gather after the chaos was initiated. It may end up being a general OCP bug. Also, what platform was this on? Setting NEEDINFO and moving out to OCS 4.8. Here is must-gather [1] of one of such scenarios. Unfortunately OCS was stable but it could give you more information about what is happening in the cluster. This issue could be time dependent and as such not easy to reproduce. [1] https://drive.google.com/file/d/1nqIiuCu9zVeZZJESE-8SkU0N36XUCoHE/view?usp=sharing Jose, what is needed to move forward with this? Sorry for letting this sit around so long. Since there hasn't been any other follow-up from the chaos testing, we can probably safely move this to ODF 4.9. However, I'll try and set up something with the CNV team to see if this is still relevant and look further into it if desired. No update for a long time, not sure whether it is still relevant. Closing it, please reopen if required. |