Bug 1608059 - post opgrade to 3.9, metrics not stable
Summary: post opgrade to 3.9, metrics not stable
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Hawkular
Version: 3.9.0
Hardware: Unspecified
OS: Unspecified
unspecified
urgent
Target Milestone: ---
: 3.11.z
Assignee: Ruben Vargas Palma
QA Contact: Junqi Zhao
URL:
Whiteboard:
Depends On:
Blocks: 1607667 1611896
TreeView+ depends on / blocked
 
Reported: 2018-07-24 21:27 UTC by Eric Jones
Modified: 2021-12-10 16:45 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
: 1611896 (view as bug list)
Environment:
Last Closed: 2020-04-14 15:40:10 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker HWKMETRICS-800 0 Critical Resolved NPE in job scheduler prevents server from starting up 2020-04-14 15:33:29 UTC

Description Eric Jones 2018-07-24 21:27:01 UTC
Description of problem:
Hawkular-metrics pod complaining about NullPointerException (HAWKMETRICS00006) trying to connect to cassandra cluster.

3 cassandra instances available.

Uploading pod logs, nodetool information from cassandra, and project yamls shortly.

Version-Release number of selected component (if applicable):
hawkular-cassandra instances all running 3.9.31
heapster and hawkular-metrics running 3.9.33

Additional info:
Per conversation with engineering, requesting [0] from customer to be uploaded here.

$ oc -n openshift-infra exec <cassandra pod> -- cqlsh --ssl -e "select * from hawkular_metrics.scheduled_jobs_idx"

$ oc -n openshift-infra exec <cassandra pod> -- cqlsh --ssl -e "select * from hawkular_metrics.finished_jobs_idx"


Note You need to log in before you can comment on or make changes to this bug.