Bug 1504999

Summary: [RFE] Replication slot monitoring
Product: Red Hat CloudForms Management Engine
Reporter: Nick Carboni <ncarboni>
Component: Replication
Assignee: Gregg Tanzillo <gtanzill>
Status: CLOSED DUPLICATE
QA Contact: Dave Johnson <dajohnso>
Severity: high
Docs Contact:
Priority: high
Version: 5.8.0
CC: jhardy, jocarter, mfeifer, obarenbo
Target Milestone: GA
Keywords: FutureFeature, RFE
Target Release: 5.8.3
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2017-11-13 19:08:41 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: CFME Core
Target Upstream Version:
Embargoed:

Description Nick Carboni 2017-10-20 22:33:13 UTC
Description of problem:
A replication slot can end up with nothing consuming from it. When that happens, WAL accumulates on the database server, which eventually fills the database filesystem and causes system failure.
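
To put a number on how much WAL a stalled slot is holding back, one option is to compare the slot's restart_lsn (a column of pg_replication_slots) with the server's current WAL write location. The sketch below is not taken from the product; it only assumes the two locations have already been fetched as text in PostgreSQL's usual "high/low" hexadecimal form. The function names lsn_to_int and retained_wal_bytes are hypothetical. PostgreSQL can also compute this difference server-side (pg_xlog_location_diff in 9.x, renamed pg_wal_lsn_diff in 10).

```python
def lsn_to_int(lsn: str) -> int:
    """Convert a textual LSN such as '0/16B3748' to a byte position.

    An LSN is printed as two hexadecimal numbers separated by a slash:
    the high 32 bits and the low 32 bits of a 64-bit position.
    """
    high, low = lsn.split("/")
    return (int(high, 16) << 32) | int(low, 16)


def retained_wal_bytes(current_lsn: str, restart_lsn: str) -> int:
    """Approximate bytes of WAL the server must keep for a slot.

    current_lsn: the server's current WAL write location (assumed input).
    restart_lsn: the slot's restart_lsn from pg_replication_slots.
    """
    return lsn_to_int(current_lsn) - lsn_to_int(restart_lsn)
```

A monitoring job could compare this value against a threshold chosen for the size of the database filesystem.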

The application can monitor for this and, ideally, send an alert or notification so the admin can fix the situation.

In more detail, we will want to check that the 'active' column of the pg_replication_slots view is always true. When a slot is not being consumed from, it becomes inactive, which is a precursor to disk usage issues.
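
A minimal sketch of that check, for illustration only: it assumes the monitoring code has already run the query below and holds the result as (slot_name, active) tuples, as a Python DB-API cursor would return them. The names INACTIVE_CHECK_SQL and inactive_slots are hypothetical and are not part of the product.

```python
from typing import Iterable, List, Tuple

# slot_name and active are columns of the pg_replication_slots system view.
INACTIVE_CHECK_SQL = "SELECT slot_name, active FROM pg_replication_slots"


def inactive_slots(rows: Iterable[Tuple[str, bool]]) -> List[str]:
    """Return the names of slots whose 'active' flag is false.

    rows: (slot_name, active) tuples produced by INACTIVE_CHECK_SQL.
    """
    return [slot_name for slot_name, active in rows if not active]
```

Any non-empty result is a candidate for an alert or notification to the admin, since each listed slot is retaining WAL without a consumer.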

Comment 2 Nick Carboni 2017-11-13 19:08:41 UTC
oops

*** This bug has been marked as a duplicate of bug 1445291 ***