Bug 1702413
Summary: | Overcloud deployment fails when deploying nodes with more than 2 disks | ||
---|---|---|---|
Product: | Red Hat OpenStack | Reporter: | Marius Cornea <mcornea> |
Component: | rhosp-director | Assignee: | RHOS Maint <rhos-maint> |
Status: | CLOSED DUPLICATE | QA Contact: | Sasha Smolyak <ssmolyak> |
Severity: | urgent | Docs Contact: | |
Priority: | unspecified | ||
Version: | 15.0 (Stein) | CC: | bfournie, dbecker, derekh, dsneddon, dtantsur, hjensas, johfulto, mburns, morazi |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2019-04-24 23:39:30 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Marius Cornea
2019-04-23 17:31:40 UTC
This error doesn't seem to be related to disks: 2019-04-23 15:14:07.330 8 ERROR ironic.drivers.modules.agent_client [req-a8564393-0c08-4adf-8d0c-af1a2e26dff4 - - - - -] Failed to connect to the agent running on node 81724853-5131-4d60-b568-134931b8b60e for invoking command image.install_bootloader. Error: HTTPConnectionPool(host='192.168.24.16', port=9999): Read timed out. (read timeout=60): requests.exceptions.ReadTimeout: HTTPConnectionPool(host='192.168.24.16', port=9999): Read timed out. (read timeout=60) It also seems transient, since some commands succeed. Can you confirm that 192.168.24.16 is the correct address and is reachable from the undercloud? (In reply to Dmitry Tantsur from comment #2) > This error doesn't seem to be related to disks: > > 2019-04-23 15:14:07.330 8 ERROR ironic.drivers.modules.agent_client > [req-a8564393-0c08-4adf-8d0c-af1a2e26dff4 - - - - -] Failed to connect to > the agent running on node 81724853-5131-4d60-b568-134931b8b60e for invoking > command image.install_bootloader. Error: > HTTPConnectionPool(host='192.168.24.16', port=9999): Read timed out. (read > timeout=60): requests.exceptions.ReadTimeout: > HTTPConnectionPool(host='192.168.24.16', port=9999): Read timed out. (read > timeout=60) > > It also seems transient, since some commands succeed. Can you confirm that > 192.168.24.16 is the correct address and is reachable from the undercloud? I don't have this environment anymore but I'll confirm on the next one. Based on my observations though deployment passes when I leave only 2 disks for the ceph nodes. Nice find Derek! As this has the same symptoms as https://bugzilla.redhat.com/show_bug.cgi?id=1691551, marking this as a duplicate so we have one place to track this issue *** This bug has been marked as a duplicate of bug 1691551 *** |