Bug 1380461

Summary: Receiving Azure::Armrest::ApiException during a provider refresh after successfully adding the provider
Product: Red Hat CloudForms Management Engine Reporter: Colin Arnott <carnott>
Component: Providers Assignee: Daniel Berger <dberger>
Status: CLOSED CURRENTRELEASE QA Contact: Jeff Teehan <jteehan>
Severity: urgent Docs Contact:
Priority: urgent    
Version: 5.6.0 CC: benglish, cpelland, dberger, jfrey, jhardy, jteehan, obarenbo, simaishi
Target Milestone: GA Keywords: TestOnly
Target Release: 5.8.0   
Hardware: x86_64   
OS: Linux   
Whiteboard: provider:azure:refresh
Fixed In Version: 5.8.0.0 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
: 1382408 1386940 Environment:
Last Closed: 2017-06-12 16:14:53 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: Azure Target Upstream Version:
Bug Depends On:    
Bug Blocks: 1382408, 1386940    

Comment 2 Colin Arnott 2016-09-29 16:20:03 UTC
*** Bug 1380459 has been marked as a duplicate of this bug. ***

Comment 4 Daniel Berger 2016-09-29 16:48:17 UTC
I believe this may be an issue with Azure that started this morning. At 7:41am I received this notification within the Azure portal:

"The portal is having issues getting authentication tokens for Microsoft_Azure_Storage."

I'm seeing some strange, but sporadic, behavior with storage endpoint DNS resolution as well. I suspect it may be related.

We'll continue to investigate.

Comment 6 Daniel Berger 2016-09-29 19:21:17 UTC
The actual error that you're hitting (and we're hitting it locally also) is an OpenTimeout error. This is a connection timeout, meaning we're not able to connect to the storage account endpoint.
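
To illustrate what a connection timeout looks like in plain Ruby, here is a minimal sketch using Net::HTTP from the standard library. It is not the provider's code: build_client and probe are hypothetical helper names, the 2-second value is arbitrary, and whether the Azure client library exposes an equivalent setting is not stated in this report.

```ruby
require 'net/http'

# Hypothetical helper (not from the provider code): builds an HTTP client
# whose connection attempt gives up after open_timeout seconds.
def build_client(host, port = 443, open_timeout: 2)
  http = Net::HTTP.new(host, port)
  http.open_timeout = open_timeout # seconds allowed for opening the connection
  http
end

# Hypothetical helper: returns :ok if the connection opens in time, or
# :open_timeout if Net::HTTP raises Net::OpenTimeout while connecting.
# TLS is left out on purpose; the sketch only exercises the connect step.
def probe(http)
  http.start { :ok }
rescue Net::OpenTimeout
  :open_timeout
end
```

Calling probe(build_client(host)) against an endpoint that cannot be reached in time would be expected to return :open_timeout, where host stands for whatever storage account endpoint is being checked.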

IMO this confirms that it's an issue on the Azure side.

In any case, going forward we're going to skip over private image collection if it fails so that it doesn't interfere with the rest of inventory collection.

https://github.com/ManageIQ/manageiq/pull/11587
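
The skip-on-failure approach described above can be sketched roughly as follows. This is an illustration of the pattern only, not the code from the pull request: InventorySketch, collect_instances, collect_private_images, and the image_source callable are all hypothetical names, and rescuing Net::OpenTimeout specifically is an assumption about which exception to catch.

```ruby
require 'logger'
require 'net/http' # defines Net::OpenTimeout

# Hypothetical stand-in for a refresh parser; all names are illustrative.
class InventorySketch
  attr_reader :inventory

  # image_source is any callable returning a list of private images.
  def initialize(image_source, logger: Logger.new($stderr))
    @image_source = image_source
    @logger       = logger
    @inventory    = { :instances => [], :images => [] }
  end

  def refresh
    collect_instances
    collect_private_images
    inventory
  end

  private

  def collect_instances
    @inventory[:instances] << "vm-1" # placeholder data for the sketch
  end

  # If the image source cannot be reached, log a warning and move on so the
  # rest of the inventory is still returned.
  def collect_private_images
    @inventory[:images].concat(@image_source.call)
  rescue Net::OpenTimeout => err
    @logger.warn("Skipping private images: #{err.class}: #{err.message}")
  end
end
```

With an image source that raises Net::OpenTimeout, refresh still returns the collected instances and leaves the image list empty instead of aborting the whole refresh.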

Comment 7 Jeff Teehan 2016-10-04 23:44:07 UTC
I think to test this I'll need to be able to set the timeout to a few seconds. Daniel, can you let me know if that's possible?

Comment 9 CFME Bot 2016-10-17 16:20:21 UTC
New commit detected on ManageIQ/manageiq/euwe:
https://github.com/ManageIQ/manageiq/commit/45c9db4e1c6adbc9c0ce59172556dd2a15be00a9

commit 45c9db4e1c6adbc9c0ce59172556dd2a15be00a9
Author:     Greg Blomquist <blomquisg>
AuthorDate: Wed Oct 5 16:49:58 2016 -0400
Commit:     Oleg Barenboim <chessbyte>
CommitDate: Thu Oct 13 14:52:03 2016 -0400

    Merge pull request #11587 from djberg96/azure_refresh_parser
    
    Skip over private image collection for Azure on failure
    (cherry picked from commit a1c1ec03807e6ca35b0af8c868caa1f10c64f41f)
    
    https://bugzilla.redhat.com/show_bug.cgi?id=1380461

 .../manageiq/providers/azure/cloud_manager/refresh_parser.rb       | 7 +++++++
 1 file changed, 7 insertions(+)

Comment 10 Daniel Berger 2016-10-19 14:21:44 UTC
https://github.com/ManageIQ/manageiq/pull/11587 was merged.

Comment 13 Jeff Teehan 2017-04-24 18:42:45 UTC
Just retested for 5.8.0.11 and it's still working fine.  Moving to verified.  This has a test case that gets hit regularly so this issue is unlikely to be missed.