888031 – Better handling when cloud resource providers go offline

Keywords:
Status:	CLOSED EOL
Alias:	None
Product:	CloudForms Cloud Engine
Classification:	Retired
Component:	rhev-agent
Sub Component:
Version:	2.0.0
Hardware:	Unspecified
OS:	Unspecified
Priority:	low
Severity:	medium
Target Milestone:	2.0.0
Assignee:	Mike Orazi
QA Contact:	Dave Johnson
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:

Reported:	2012-12-17 20:57 UTC by Matthew Davis
Modified:	2020-03-27 18:36 UTC
CC List:	3 users
Fixed In Version:
Clone Of:
Environment:
Last Closed:	2020-03-27 18:36:59 UTC
Embargoed:

Description Matthew Davis 2012-12-17 20:57:13 UTC

Description of problem:
The North American SA group runs a CloudForms instance with hooks into 3 different datacenters (as well as EC2). One of the datacenters went offline. The NFS mount became inaccessible and problems started. I didn't run a full test suite, but I know that programs that tried to stat the list of mounted filesystems would hang.

Unfortunately, when it happened I was more interested in getting df and other system commands to return, so I forcibly unmounted it with 'umount -f'.

Version-Release number of selected component (if applicable):
CloudForms v1.1

How reproducible:
Every time

Steps to Reproduce:
1. Add a Cloud Resource Provider (rhev based, with an NFS backed export domain)
2. Interrupt communication between CF & RHEV (turn off NFS for example)

  
Actual results:


Expected results:
Some way for CF to detect that the CRP has gone offline and/or take appropriate action to umount or disable the NFS store.
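
As a rough illustration of the kind of detection meant here (not how CF actually works), a probe could stat the mount point in a child process and give up after a timeout. The function name `mount_is_responsive` and the 5 second default are assumptions made up for this sketch. The stat runs in a separate process because a stat on a dead hard NFS mount can block indefinitely, and the parent must not wait on it without a limit. Note that a path that does not exist is also reported as not responsive.

```python
import subprocess
import sys


def mount_is_responsive(path, timeout_s=5.0):
    """Return True if os.stat(path) succeeds in a child process within timeout_s.

    Hypothetical helper for illustration only; not part of CloudForms.
    """
    # The child does the potentially blocking stat so the parent never hangs.
    cmd = [sys.executable, "-c", "import os, sys; os.stat(sys.argv[1])", path]
    proc = subprocess.Popen(
        cmd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL
    )
    try:
        # Exit code 0 means the stat returned without raising.
        return proc.wait(timeout=timeout_s) == 0
    except subprocess.TimeoutExpired:
        # Best effort only: we deliberately do not wait for the child again,
        # since it may stay blocked until the NFS server comes back.
        proc.kill()
        return False
```

A caller could poll each export domain's mount point with this and mark the provider offline after a few consecutive failures, rather than letting other tools block on the mount.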

Additional info:
I'll admit I may still not fully understand the full workflow of the product. But I feel this needs at least another look to see if there's a better way to handle this exception.

Comment 2 Thom Carlin 2012-12-17 22:13:52 UTC

This is for hard NFS mounts.  I went in and manually disabled the Cloud Resource Provider.
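
For anyone checking which mounts are affected, here is a minimal sketch that lists hard-mounted NFS filesystems from mount table text in the Linux /proc/mounts format. The function name `hard_nfs_mounts` is made up for this example, and it assumes the `hard` option appears in the options field for hard NFS mounts.

```python
def hard_nfs_mounts(mounts_text):
    """Return mount points of NFS filesystems whose options include 'hard'.

    Expects text in /proc/mounts format:
    device mount_point fs_type options dump pass
    """
    result = []
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        mount_point, fs_type, options = fields[1], fields[2], fields[3]
        if fs_type in ("nfs", "nfs4") and "hard" in options.split(","):
            result.append(mount_point)
    return result


if __name__ == "__main__":
    # Reading the mount table itself does not touch the NFS server.
    with open("/proc/mounts") as f:
        for mount_point in hard_nfs_mounts(f.read()):
            print(mount_point)
```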

Comment 3 Thom Carlin 2012-12-17 22:14:57 UTC

RHEVM Cloud Resource Provider, that is