Bug 1871408
Summary: | Noobaa falsely reports AWS endpoint as having IO_ERRORS if its name doesn't contain "amazonaws.com" | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat OpenShift Container Storage | Reporter: | Ben Eli <belimele> |
Component: | Multi-Cloud Object Gateway | Assignee: | Ohad <omitrani> |
Status: | CLOSED ERRATA | QA Contact: | Ben Eli <belimele> |
Severity: | high | Docs Contact: | |
Priority: | unspecified | ||
Version: | 4.5 | CC: | dahorak, ebenahar, etamir, muagarwa, nbecker, ocs-bugs, pbalogh |
Target Milestone: | --- | Keywords: | AutomationBackLog |
Target Release: | OCS 4.5.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | 4.5.0-64.ci | Doc Type: | Bug Fix |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2020-09-15 10:19:07 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 1862755 |
Description
Ben Eli
2020-08-23 06:44:38 UTC
Moving out of 4.5, its not a blocker But Nimrod - it blocks 1862755, which is a blocker It's not, its only when you use and endpoint which is not *.amazonaws.com or at least this is how its described, am I missing something ? It does; BZ1862755 is about backingstores failing to communicate with their target buckets, which is the exact same case here. When trying to connect to buckets that don't reside in the same availability zone as the cluster, NooBaa has to connect to their endpoint via the proxy, which in turn changes the address. Ohad should be able to explain this in more detail, but the bottom line is that NB still can't connect to any proxied backingstores, even if both the cluster and target buckets are on AWS. Nimrod, from Ben's explanation to me, the endpoint name is being modified when communicating through a proxy so the name is not something the admin has control over. In addition to that, if this is the situation and this BZ blocks the verification of bug 1862755, we must treat this one (1871408) as a blocker for 4.5. Elad, Ben and Nimrod Please let me calrify, The name not containing an "amazonaws.com" was a prelimiary assumption about the origin of the failure in coomunication with the aws endpoints. It was WRONG The actual issue here was that one call site, that try to communicate with the cloud, did not took into account the proxy settings. Apperently this call site was used when checking the health of a backing store, which resulted with an IO_ERRORS status. I fixed this issue in an upstream PR (see links section) and Ben and I manually run the test on a patched proxied environment, where it finished successfully. As a side note, the title for this bug does not reflect the actual issue and its resolution. Is this part of new RC2 build? If so, can this be moved to ON_QE? (In reply to Petr Balogh from comment #10) > Is this part of new RC2 build? If so, can this be moved to ON_QE? Yes, it is and ideally errata should do that. Moving it manually. Cluster is on us-east-2. I created two backingstores - us-east-2, and us-west-1. Created a bucketclass that uses both, and an OBC that uses the bucketclass. OBC was healthy, backingstores were ready. Verified on 4.5.0-64.ci. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Red Hat OpenShift Container Storage 4.5.0 bug fix and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:3754 |