Bug 1905801

Summary: Add check for disk usage in candlepin-directory
Product: Red Hat Satellite Reporter: Amit Upadhye <aupadhye>
Component: Satellite Maintain Assignee: Amit Upadhye <aupadhye>
Status: CLOSED ERRATA QA Contact: Gaurav Talreja <gtalreja>
Severity: medium Docs Contact:
Priority: high    
Version: 6.8.0 CC: ahumbe, apatel, gtalreja, kgaikwad, ktordeur, pcreech
Target Milestone: 6.9.0 Keywords: PrioBumpGSS, Triaged
Target Release: Unused   
Hardware: All   
OS: All   
Whiteboard:
Fixed In Version: rubygem-foreman_maintain-0.7.2-1 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
: 1905804 1933165 Environment:
Last Closed: 2021-04-21 14:48:24 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1905804    

Description Amit Upadhye 2020-12-09 05:55:33 UTC
Description of problem:

After moving to Foreman 2.1 / Katello 3.16, we experienced hanging tasks on some of our test systems.

The cause was that Candlepin, or rather the Artemis service used by Candlepin, started blocking requests as soon as disk usage rose above 90% (even though 200 GB were still free at that point).

To keep users from running into this issue unknowingly, there should be a check that makes the root cause visible.
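
A small arithmetic sketch may help show why a percentage-based limit can trigger while a large amount of space is still free. The 90% limit and the 200 GB figure come from the description above; the volume size, the function names, and the strict "above the limit" comparison are assumptions made for illustration, and this is not Artemis or Candlepin code.

```python
def usage_percent(total_bytes: int, free_bytes: int) -> float:
    """Return the share of the disk that is in use, as a percentage."""
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    return 100.0 * (total_bytes - free_bytes) / total_bytes


def exceeds_limit(total_bytes: int, free_bytes: int,
                  limit_percent: float = 90.0) -> bool:
    """True when usage is strictly above the limit.

    The absolute amount of free space plays no role in the decision.
    """
    return usage_percent(total_bytes, free_bytes) > limit_percent


GB = 10**9

# Hypothetical 2.5 TB volume with 200 GB free: usage is above 90%.
print(exceeds_limit(2500 * GB, 200 * GB))

# Hypothetical 1 TB volume with the same 200 GB free: usage is below 90%.
print(exceeds_limit(1000 * GB, 200 * GB))
```

Under these assumptions, any volume larger than 2 TB with exactly 200 GB free is already past a 90% limit, which matches the situation described in the report.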

Version-Release number of selected component (if applicable):

Foreman 2.1 (Satellite 6.8) onward

Comment 1 Brad Buckingham 2020-12-10 15:07:40 UTC
*** Bug 1905804 has been marked as a duplicate of this bug. ***

Comment 2 Bryan Kearney 2020-12-24 21:40:36 UTC
Moving this bug to POST for triage into Satellite since the upstream issue https://projects.theforeman.org/issues/31210 has been resolved.

Comment 3 Gaurav Talreja 2021-02-11 11:08:43 UTC
Verified.

Satellite 6.9.0 Snap 12
Version: rubygem-foreman_maintain-0.7.4-1.el7sat.noarch

Steps:
# foreman-maintain health list | grep available-space-cp
[available-space-cp] Check to make sure /var/lib/candlepin has enough space [pre-upgrade]


# foreman-maintain health check --label available-space-cp
Running ForemanMaintain::Scenario::FilteredScenario
================================================================================
Check to make sure /var/lib/candlepin has enough space:               [OK]
--------------------------------------------------------------------------------


# foreman-maintain upgrade check --target-version 6.9.z
<snip>
Running Checks before upgrading to Satellite 6.9.z
================================================================================
Check number of fact names in database:                               [OK]
--------------------------------------------------------------------------------
Check whether all services are running:                               [OK]
--------------------------------------------------------------------------------
Check whether all services are running using the ping call:           [OK]
--------------------------------------------------------------------------------
Check for paused tasks:                                               [OK]
--------------------------------------------------------------------------------
Check to verify no empty CA cert requests exist:                      [OK]
--------------------------------------------------------------------------------
Check whether system is self-registered or not:                       [OK]
--------------------------------------------------------------------------------
Check to make sure root(/) partition has enough space:                [OK]
--------------------------------------------------------------------------------
Check to make sure /var/lib/candlepin has enough space:               [OK]
--------------------------------------------------------------------------------
Check to validate candlepin database:                                 [OK]
</snip>


Observations:
There is now a check that verifies the Candlepin directory (/var/lib/candlepin) has enough space. If disk usage is above 90 percent, the check points to https://bugzilla.redhat.com/show_bug.cgi?id=1898605
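
The kind of check described in the observation can be sketched roughly as follows. This is not the foreman_maintain implementation (which is written in Ruby); it is a minimal Python illustration. The directory and the 90 percent limit are taken from this comment, while the function name, the return shape, and the message wording are hypothetical.

```python
import shutil


def check_directory_usage(path, limit_percent=90.0):
    """Return (ok, message) for the filesystem that holds `path`.

    `ok` is False when the used share of the filesystem is above
    `limit_percent`. Hypothetical helper, for illustration only.
    """
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    if usage.total == 0:
        raise OSError(f"filesystem for {path} reports zero total size")
    used_percent = 100.0 * usage.used / usage.total
    ok = used_percent <= limit_percent
    state = "OK" if ok else "FAIL"
    message = (f"{path}: {used_percent:.1f}% used "
               f"(limit {limit_percent:.0f}%) [{state}]")
    return ok, message


if __name__ == "__main__":
    # On a Satellite server the directory would be /var/lib/candlepin;
    # "/" is used here so the sketch runs on any system.
    ok, message = check_directory_usage("/")
    print(message)
```

Note that `shutil.disk_usage` reports on the whole filesystem containing the path, so the percentage may differ slightly from the value shown by `df`, which excludes blocks reserved for root.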

Comment 6 errata-xmlrpc 2021-04-21 14:48:24 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Satellite 6.9 Satellite Maintenance Release), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2021:1312