| Summary: | need smarter conductor rhevm heartbeat | |||
|---|---|---|---|---|
| Product: | [Retired] CloudForms Cloud Engine | Reporter: | Dave Johnson <dajohnso> | |
| Component: | aeolus-conductor | Assignee: | Angus Thomas <athomas> | |
| Status: | CLOSED ERRATA | QA Contact: | Dave Johnson <dajohnso> | |
| Severity: | high | Docs Contact: | ||
| Priority: | unspecified | |||
| Version: | 1.0.0 | CC: | akarol, clalance, cpelland, dajohnso, deltacloud-maint, dgao, lutter, morazi, ssachdev, whayutin | |
| Target Milestone: | rc | |||
| Target Release: | --- | |||
| Hardware: | Unspecified | |||
| OS: | Unspecified | |||
| Whiteboard: | ||||
| Fixed In Version: | Doc Type: | Bug Fix | ||
| Doc Text: | Story Points: | --- | ||
| Clone Of: | ||||
| : | 723896 (view as bug list) | Environment: | ||
| Last Closed: | 2012-05-15 21:44:56 UTC | Type: | --- | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
|
Description
Dave Johnson
2011-07-14 15:49:28 UTC
Yes, we knew this was going to be a problem. I don't think tuning down the poll interval is the right solution. I think that we need to: 1) Upgrade to RHEV-M 3.0, where this should be a lot faster 2) Do a lighter weight status update, which should also be faster We'll need to look at doing one (or both) of these in the deltacloud driver to improve this situation. *** Bug 722226 has been marked as a duplicate of this bug. *** I think the easiest fix is to skip update runs if a previous one hasn't finished yet. Slow backends will be slow, no matter what we do. Markmc gives me the impression that asking for fewer details when we list instances isn't going to speed matters up. Actually, that is what condor does today. If a batch status times out, it will skip it and try to re-ping it later. That being said, there is likely a bug in there, in that it can take a long time for condor to re-activate the backend. Also, I'm not sure how we would ever get out of this situation. If the last status update took > 30 seconds (for instance), why won't the next one? doc it making sure all the bugs are at the right version for future queries adding to sprint tracker Condor, whose operation this bug relates to, is no longer part of cloudforms. This bug should either be refreshed with a description of the issue which pertains to the current software stack, or closed. Dave.. double check how this now runs w/o condor This all seems to be good now with condor no longer part of the mix. Marking this as verified aeolus-all-0.8.0-16.el6.noarch aeolus-conductor-0.8.0-16.el6.noarch aeolus-conductor-daemons-0.8.0-16.el6.noarch aeolus-conductor-doc-0.8.0-16.el6.noarch Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. http://rhn.redhat.com/errata/RHEA-2012-0583.html |