Bug 888031 - Better handling when cloud resource providers go offline
Summary: Better handling when cloud resource providers go offline
Keywords:
Status: CLOSED EOL
Alias: None
Product: CloudForms Cloud Engine
Classification: Retired
Component: rhev-agent
Version: 2.0.0
Hardware: Unspecified
OS: Unspecified
low
medium
Target Milestone: 2.0.0
Assignee: Mike Orazi
QA Contact: Dave Johnson
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2012-12-17 20:57 UTC by Matthew Davis
Modified: 2020-03-27 18:36 UTC (History)
3 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2020-03-27 18:36:59 UTC
Embargoed:


Attachments (Terms of Use)

Description Matthew Davis 2012-12-17 20:57:13 UTC
Description of problem:
The North American SA group runs a cloudforms instance with hooks into 3 different datacenters (as well as EC2). One of the datacenters went offline. The NFS mount when unaccessable and problems started. I didn't do a full test suite, but I know programs that tried to stat the filesystem list would hang.

Unfortunately when it happened I was more interested in getting df and other system commands to return, so I forcibly umounted it with 'umount -f'.

Version-Release number of selected component (if applicable):
Cloduforms v1.1

How reproducible:
Everytime

Steps to Reproduce:
1. Add a Cloud Resource Provider (rhev based, with an NFS backed export domain)
2. Interrupt communication between CF & RHEV (turn off NFS for example)

  
Actual results:


Expected results:
Some way for CF to detect the CRP has gone offline and/or taken approprite actions to umount or disable the NFS store.

Additional info:
I'll admit I may not fully understand the full workflow of the product still. But I feel this may need at least another revisit to see if there's a better way to handle this exception.

Comment 2 Thom Carlin 2012-12-17 22:13:52 UTC
This is for hard NFS mounts.  I went in and manually disabled the Cloud Resource Provider.

Comment 3 Thom Carlin 2012-12-17 22:14:57 UTC
RHEVM Cloud Resource Provider, that is


Note You need to log in before you can comment on or make changes to this bug.