Bug 1984804
| Summary: | [Tracker for OCP BZ #1988013] AWS - degradation in RBD pod reattach time in OCP 4.8 vs 4.7 | |||
|---|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat OpenShift Data Foundation | Reporter: | Yuli Persky <ypersky> | |
| Component: | csi-driver | Assignee: | Humble Chirammal <hchiramm> | |
| Status: | CLOSED NOTABUG | QA Contact: | Elad <ebenahar> | |
| Severity: | high | Docs Contact: | ||
| Priority: | unspecified | |||
| Version: | 4.8 | CC: | alayani, kramdoss, madam, muagarwa, ocs-bugs, odf-bz-bot, owasserm, ratamir, rcyriac | |
| Target Milestone: | --- | Keywords: | Automation, Performance, Regression | |
| Target Release: | --- | Flags: | kramdoss:
needinfo+
kramdoss: needinfo+ kramdoss: needinfo+ |
|
| Hardware: | Unspecified | |||
| OS: | Unspecified | |||
| Whiteboard: | ||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | ||
| Doc Text: | Story Points: | --- | ||
| Clone Of: | ||||
| : | 1988013 (view as bug list) | Environment: | ||
| Last Closed: | 2021-09-14 09:09:19 UTC | Type: | Bug | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Embargoed: | ||||
| Bug Depends On: | ||||
| Bug Blocks: | 1988013 | |||
|
Description
Yuli Persky
2021-07-22 09:35:16 UTC
Must-gather logs are available here: 10.70.39.233:/home/ypersky/bz_1984804/logs-20210722-145415 Yuli, how can we access the MG logs @ 10.70.39.233:/home/ypersky/bz_1984804/logs-20210722-145415 We re-ran the tests after discussing with engineering with the following combinations to rule out any issues in OCP 1) OCP 4.8 + OCS 4.8 - https://ocs4-jenkins-csb-ocsqe.apps.ocp4.prod.psi.redhat.com/job/qe-deploy-ocs-cluster/4756/consoleFull 2) OCP 4.7 + OCS 4.8 - https://ocs4-jenkins-csb-ocsqe.apps.ocp4.prod.psi.redhat.com/job/qe-deploy-ocs-cluster/4757/consoleFull Reattach time for RBD in OCP 4.8 + OCS 4.8 is 43.99 seconds Reattach time for RBD in OCP 4.7 + OCS 4.8 is 29.17 seconds Mustgather logs for both cases shall be attached shortly. Thanks Karthick. So, the above data suggests that there is a regression in OCP 4.8 Just for the records, we are using new side car images in OCS4.8 Hi Avi, Thanks for the collecting above metrics too. With that, from the available data it looks like below. Reattach time for RBD in OCP 4.8 + OCS 4.7 is 43.99 seconds Reattach time for RBD in OCP 4.8 + OCS 4.8 is 43.99 seconds Reattach time for RBD in OCP 4.7 + OCS 4.8 is 29.17 seconds Reattach time for RBD in OCP 4.7 + OCS 4.7 is 29.1 seconds As mentioned earlier, it seems that OCS 4.7 and 4.8 against same OCP versions respond pretty much the same way. However while looking at the vmware test result [1] for reattach, it has reported an improvement in performance with 4.8 versions: For POD attach time we can observe improvement of ~70% on CephFS For POD reattach time we can observe improvement of ~50% on RBD Are these build and hardware remains same across these tests in different ( aws and vmware) platforms? [1] https://docs.google.com/document/d/1KDPPfVywM5-Y4MzYOSUndAnAbPfhgth9UazppOOfMck/edit# (In reply to Humble Chirammal from comment #11) > Hi Avi, Thanks for the collecting above metrics too. With that, from the > available data it looks like below. > > Reattach time for RBD in OCP 4.8 + OCS 4.7 is 43.99 seconds > Reattach time for RBD in OCP 4.8 + OCS 4.8 is 43.99 seconds > Reattach time for RBD in OCP 4.7 + OCS 4.8 is 29.17 seconds > Reattach time for RBD in OCP 4.7 + OCS 4.7 is 29.1 seconds > > As mentioned earlier, it seems that OCS 4.7 and 4.8 against same OCP > versions respond pretty much the same way. However while looking at the > vmware test result [1] for reattach, it has reported an improvement in > performance with 4.8 versions: > > For POD attach time we can observe improvement of ~70% on CephFS > For POD reattach time we can observe improvement of ~50% on RBD > > Are these build and hardware remains same across these tests in different ( > aws and vmware) platforms? Yes, during the test hardware and build remains the same. > > [1] > https://docs.google.com/document/d/1KDPPfVywM5- > Y4MzYOSUndAnAbPfhgth9UazppOOfMck/edit# (In reply to Avi Liani from comment #12) > (In reply to Humble Chirammal from comment #11) > > Hi Avi, Thanks for the collecting above metrics too. With that, from the > > available data it looks like below. > > > > Reattach time for RBD in OCP 4.8 + OCS 4.7 is 43.99 seconds > > Reattach time for RBD in OCP 4.8 + OCS 4.8 is 43.99 seconds > > Reattach time for RBD in OCP 4.7 + OCS 4.8 is 29.17 seconds > > Reattach time for RBD in OCP 4.7 + OCS 4.7 is 29.1 seconds > > > > As mentioned earlier, it seems that OCS 4.7 and 4.8 against same OCP > > versions respond pretty much the same way. However while looking at the > > vmware test result [1] for reattach, it has reported an improvement in > > performance with 4.8 versions: > > > > For POD attach time we can observe improvement of ~70% on CephFS > > For POD reattach time we can observe improvement of ~50% on RBD > > > > Are these build and hardware remains same across these tests in different ( > > aws and vmware) platforms? > > Yes, during the test hardware and build remains the same. This is bit confusing, if all the OCP builds and hardware remains same and reattach time regression showed in AWS but not in VMWARE platform. Its difficult to reach into a conclusion that, even OCP code have a regression. > > > > > [1] > > https://docs.google.com/document/d/1KDPPfVywM5- > > Y4MzYOSUndAnAbPfhgth9UazppOOfMck/edit# Hi Yuli/Avi/Karthick We have a request from Jan on the OCP BZ, PTAL https://bugzilla.redhat.com/show_bug.cgi?id=1988013#c19 I am closing this bug as per the comment (https://bugzilla.redhat.com/show_bug.cgi?id=1988013#c23) in the tracking issue. Please feel free to open a new issue if we come across the same issue. |