Bug 1435238
| Summary: | heketi is too sluggish, volume create takes more than 15 minutes | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | krishnaram Karthick <kramdoss> | ||||||
| Component: | heketi | Assignee: | Ramakrishna Reddy Yekulla <rreddy> | ||||||
| Status: | CLOSED ERRATA | QA Contact: | krishnaram Karthick <kramdoss> | ||||||
| Severity: | high | Docs Contact: | |||||||
| Priority: | unspecified | ||||||||
| Version: | cns-3.5 | CC: | hchiramm, kramdoss, madam, mliyazud, pprakash, rcyriac, rhs-bugs, rreddy, sselvan, storage-qa-internal | ||||||
| Target Milestone: | --- | ||||||||
| Target Release: | CNS 3.6 | ||||||||
| Hardware: | Unspecified | ||||||||
| OS: | Unspecified | ||||||||
| Whiteboard: | |||||||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |||||||
| Doc Text: |
The minimum requirements for the memory need to adhere for creating volumes as mentioned in the OpenShift Container Guide is 32 GB.
|
Story Points: | --- | ||||||
| Clone Of: | Environment: | ||||||||
| Last Closed: | 2017-10-11 07:07:22 UTC | Type: | Bug | ||||||
| Regression: | --- | Mount Type: | --- | ||||||
| Documentation: | --- | CRM: | |||||||
| Verified Versions: | Category: | --- | |||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||
| Embargoed: | |||||||||
| Bug Depends On: | |||||||||
| Bug Blocks: | 1445447 | ||||||||
| Attachments: |
|
||||||||
|
Description
krishnaram Karthick
2017-03-23 12:46:56 UTC
Could be related to this https://bugzilla.redhat.com/show_bug.cgi?id=1320977 . I am placing this as a reference. When a pv create request is submitted, two of the three nodes has accepted the request immediately while the node which was down and restored later yesterday is taking a while to process the request. I'll attach the complete heketi log from the time volume request was submitted and the volume request completed. log snippet: [negroni] Started POST /volumes [negroni] Completed 202 Accepted in 1.195593ms [asynchttp] INFO 2017/03/24 03:51:24 Started job 2956fbc50176ed7c2c15d179b64456e8 [heketi] INFO 2017/03/24 03:51:24 Creating volume b9baf9b7a534d6fcdb0d0cc5101a48fe [negroni] Started GET /queue/2956fbc50176ed7c2c15d179b64456e8 [negroni] Completed 200 OK in 69.048µs [heketi] INFO 2017/03/24 03:51:24 brick_num: 0 [heketi] INFO 2017/03/24 03:51:24 Creating brick d105f896d650aa40dfa4352261936836 [heketi] INFO 2017/03/24 03:51:24 Creating brick af467c53680ce258dca7ff5327dbea4f [heketi] INFO 2017/03/24 03:51:24 Creating brick 2277909c5c4deeb497b3da7b06647df9 [kubeexec] DEBUG 2017/03/24 03:51:24 /src/github.com/heketi/heketi/executors/kubeexec/kubeexec.go:250: Host: dhcp47-51.lab.eng.blr.redhat.com Pod: glusterfs-vx1s0 Command: mkdir -p /var/lib/heketi/mounts/vg_fe3c78e3d576382ccb5acf8267db4623/brick_d105f896d650aa40dfa4352261936836 Result: [kubeexec] DEBUG 2017/03/24 03:51:24 /src/github.com/heketi/heketi/executors/kubeexec/kubeexec.go:250: Host: dhcp46-165.lab.eng.blr.redhat.com Pod: glusterfs-hcp7j Command: mkdir -p /var/lib/heketi/mounts/vg_f0565469addfecd5ec737e86bdda2950/brick_2277909c5c4deeb497b3da7b06647df9 Result: [negroni] Started GET /queue/2956fbc50176ed7c2c15d179b64456e8 [negroni] Completed 200 OK in 89.243µs . . . . [kubeexec] DEBUG 2017/03/24 03:52:09 /src/github.com/heketi/heketi/executors/kubeexec/kubeexec.go:250: Host: dhcp47-21.lab.eng.blr.redhat.com Pod: glusterfs-jg4kw Command: mkdir -p /var/lib/heketi/mounts/vg_685f6ed53fca4ae417831b5ecd297079/brick_af467c53680ce258dca7ff5327dbea4f Result: [negroni] Started GET /queue/2956fbc50176ed7c2c15d179b64456e8 [negroni] Completed 200 OK in 71.028µs . . . . . [kubeexec] DEBUG 2017/03/24 03:56:18 /src/github.com/heketi/heketi/executors/kubeexec/kubeexec.go:250: Host: dhcp47-21.lab.eng.blr.redhat.com Pod: glusterfs-jg4kw Command: chmod 2775 /var/lib/heketi/mounts/vg_685f6ed53fca4ae417831b5ecd297079/brick_af467c53680ce258dca7ff5327dbea4f/brick . . [sshexec] INFO 2017/03/24 03:56:18 Creating volume vol_b9baf9b7a534d6fcdb0d0cc5101a48fe replica 3 [kubeexec] DEBUG 2017/03/24 03:56:19 /src/github.com/heketi/heketi/executors/kubeexec/kubeexec.go:250: Host: dhcp46-165.lab.eng.blr.redhat.com Pod: glusterfs-hcp7j Command: gluster --mode=script volume create vol_b9baf9b7a534d6fcdb0d0cc5101a48fe replica 3 10.70.46.165:/var/lib/heketi/mounts/vg_f0565469addfecd5ec737e86bdda2950/brick_2277909c5c4deeb497b3da7b06647df9/brick 10.70.47.51:/var/lib/heketi/mounts/vg_fe3c78e3d576382ccb5acf8267db4623/brick_d105f896d650aa40dfa4352261936836/brick 10.70.47.21:/var/lib/heketi/mounts/vg_685f6ed53fca4ae417831b5ecd297079/brick_af467c53680ce258dca7ff5327dbea4f/brick Created attachment 1265966 [details]
volumecreatelog_comment4
As observed in 1435401, the restarted node would have lost labels. This makes the cluster have only 2 working nodes out of 3. The third node will not respond at all, hence the command is not sluggish, it is hung. Fix for now is to relabel the rebooted node again. This issue is not seen in the recent 3.6 builds. Moving the bug to verified. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2017:2879 |