Bug 1439910

Summary: Large partitions make Cassandra unstable and cause requests to fail in Hawkular Metric
Product: OpenShift Container Platform Reporter: John Sanda <jsanda>
Component: HawkularAssignee: Matt Wringe <mwringe>
Status: CLOSED ERRATA QA Contact: Liming Zhou <lizhou>
Severity: high Docs Contact:
Priority: unspecified    
Version: 3.4.0CC: aos-bugs, bmorriso, gburges, jforrest, jgoulding, jsanda, juzhao, mmahut, mtayer, mwringe, pdwyer, penli, sten, tdawson, whearn, wsun, xiazhao, zhiwliu, zhizhang
Target Milestone: ---Keywords: OpsBlocker
Target Release: 3.4.z   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 1422271 Environment:
Last Closed: 2017-05-18 09:27:59 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1422271, 1439912    
Bug Blocks: 1439852    

Comment 3 Matt Wringe 2017-04-27 20:35:20 UTC
*** Bug 1440548 has been marked as a duplicate of this bug. ***

Comment 6 Liming Zhou 2017-05-10 08:53:22 UTC
@mwringe,

This bug is similiar with bug "1439912" with different OCP version, in that bug, @juzhao is asking if the following scenario is ok to cover the test for bug:
###
Thanks a lot, I see compaction_large_partition_warning_threshold_mb=100 in hawkular-cassandra pod log, I think we can verify this fix by the following steps:

1. Create a lot of projects to consume memory, CPU and network resources, so data can be kept in cassandra partition.

2. Check the hawkular-cassandra and hawkular-metrics pod logs, make sure there are no such warn info
"WARN  18:29:53 Writing large partition hawkular_metrics/metrics_idx:ops-health-monitoring:2 (****** bytes)"

Do you think my solution is well enough to verify this defect?
###
So my question is also does above steps ok to verify the bug?

Thanks,
lizhou

Comment 7 Matt Wringe 2017-05-10 18:43:16 UTC
@jsanda: can you provide a test case which can be used to verify this is fixed?

Comment 10 Junqi Zhao 2017-05-16 05:49:25 UTC
Vlaad(vlaad) created 6500 pods and deleted them under one project, and I checked the hawkular-cassandra and hawkular-metrics pod logs, there were no such warn info exists:
"WARN  18:29:53 Writing large partition hawkular_metrics/metrics_idx:ops-health-monitoring:2 (****** bytes)"

Set it to VERIFIED

Comment 12 errata-xmlrpc 2017-05-18 09:27:59 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2017:1235

Comment 13 Red Hat Bugzilla 2023-09-14 03:56:05 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days