Bug 1727955 - DVR OSP-10 CI job failing due to SSH failures
Summary: DVR OSP-10 CI job failing due to SSH failures
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-neutron
Version: 10.0 (Newton)
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: ---
Assignee: Slawek Kaplonski
QA Contact: Eran Kuris
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2019-07-08 16:08 UTC by Slawek Kaplonski
Modified: 2019-10-16 09:41 UTC (History)
6 users (show)

Fixed In Version: openstack-neutron-9.4.1-47.el7ost
Doc Type: No Doc Update
Doc Text:
Clone Of:
Environment:
Last Closed: 2019-10-16 09:41:17 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Launchpad 1835808 0 None None None 2019-07-08 16:10:50 UTC
Red Hat Product Errata RHBA-2019:3115 0 None None None 2019-10-16 09:41:36 UTC

Description Slawek Kaplonski 2019-07-08 16:08:34 UTC
DVR CI job - many tests failing due to FIP ssh connectivity failure.
See https://rhos-qe-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/job/DFG-network-neutron-10_director-rhel-virthost-3cont_2comp-ipv4-vxlan-dvr/51/#showFailuresLink for example

It looks that this is due to bug when external_network_bridge is set in L3 agent's config.

Comment 1 Slawek Kaplonski 2019-07-10 14:28:35 UTC
I run CI job with openstack-neutron-9.4.1-47.el7ost and tests didn't failed due to SSH issue: https://rhos-qe-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/job/DFG-network-neutron-10_director-rhel-virthost-3cont_2comp-ipv4-vxlan-dvr/54/#showFailuresLink
So this looks like fixed.

Comment 4 Tomer 2019-07-23 12:56:02 UTC
I ran it again with the latest build. Seems it still has the same issues -
2019-07-23 07:15:38,448 8702 ERROR    [tempest.lib.common.ssh] Failed to establish authenticated ssh connection to cirros.0.210 after 20 attempts
2019-07-23 07:15:38.448 8702 ERROR tempest.lib.common.ssh Traceback (most recent call last):
2019-07-23 07:15:38.448 8702 ERROR tempest.lib.common.ssh   File "/home/stack/tempest-dir/tempest/lib/common/ssh.py", line 75, in _get_ssh_connection
2019-07-23 07:15:38.448 8702 ERROR tempest.lib.common.ssh     timeout=self.channel_timeout, pkey=self.pkey)
2019-07-23 07:15:38.448 8702 ERROR tempest.lib.common.ssh   File "/usr/lib/python2.7/site-packages/paramiko/client.py", line 324, in connect
2019-07-23 07:15:38.448 8702 ERROR tempest.lib.common.ssh     raise NoValidConnectionsError(errors)
2019-07-23 07:15:38.448 8702 ERROR tempest.lib.common.ssh NoValidConnectionsError: [Errno None] Unable to connect to port 22 on 10.0.0.210
2019-07-23 07:15:38.448 8702 ERROR tempest.lib.common.ssh 
2019-07-23 07:15:38,450 8702 ERROR    [tempest.scenario.manager] (TestGettingAddress:test_dhcp6_stateless_from_os) Initializing SSH connection to 10.0.0.210 failed. Error: Connection to the 10.0.0.210 via SSH timed out.


https://rhos-qe-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/job/DFG-network-neutron-10_director-rhel-virthost-3cont_2comp-ipv4-vxlan-dvr/58//artifact/tempest-results/tempest-results-neutron.1.html

Comment 6 Slawek Kaplonski 2019-08-05 10:20:04 UTC
CI run https://rhos-qe-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/job/DFG-network-neutron-10_director-rhel-virthost-3cont_2comp-ipv4-vxlan-dvr/58//artifact/tempest-results/tempest-results-neutron.1.html where those tests were still failing were still using older puddle without version which includes fix for this.
So, I'm moving this to MODIFIED again to wait for puddle which contains this fix.

Comment 15 errata-xmlrpc 2019-10-16 09:41:17 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2019:3115


Note You need to log in before you can comment on or make changes to this bug.