1459877 – Hawkular-metrics fails when one instance Cassandra is disabled

Summary: Hawkular-metrics fails when one instance Cassandra is disabled

Keywords:
Status:	CLOSED DUPLICATE of bug 1459968
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	Hawkular
Sub Component:
Version:	3.4.0
Hardware:	Unspecified
OS:	Unspecified
Priority:	high
Severity:	high
Target Milestone:	---
Target Release:	---
Assignee:	Matt Wringe
QA Contact:	Liming Zhou
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:

Reported:	2017-06-08 12:35 UTC by Vladislav Walek
Modified:	2020-12-14 08:50 UTC
CC List:	2 users
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2017-06-08 18:55:12 UTC
Target Upstream Version:
Embargoed:

Description Vladislav Walek 2017-06-08 12:35:00 UTC

Description of problem:

Metrics are running with two Cassandra nodes, each with its own replication controller and PV. However, when one node is scaled down, Hawkular fails with the following error:

com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) tried for query failed (tried: /10.12.1.5:9042 

Here 10.12.1.5 is the IP of the pod that was scaled down.
I found that Hawkular, when starting, adds the Cassandra nodes like this:

08:02:44,344 INFO  [com.datastax.driver.core.Cluster] (ServerService Thread Pool -- 78) New Cassandra host /10.12.1.5:9042 added
08:02:44,344 INFO  [com.datastax.driver.core.Cluster] (ServerService Thread Pool -- 78) New Cassandra host hawkular-cassandra/172.30.57.143:9042 added

One of these is the pod's IP and the other is the service's IP. However, if that pod is scaled down, the error above appears.
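
The two address forms in the startup log can be told apart mechanically. The sketch below is for illustration only and assumes the addresses are printed in Java's usual `hostname/ip:port` form, where an empty hostname means the host was added by bare IP. The function name `parse_added_hosts` and the constant `SAMPLE_LOG` are hypothetical, not part of Hawkular or the Cassandra driver.

```python
import re

# The two startup log lines quoted in this report.
SAMPLE_LOG = """\
08:02:44,344 INFO  [com.datastax.driver.core.Cluster] (ServerService Thread Pool -- 78) New Cassandra host /10.12.1.5:9042 added
08:02:44,344 INFO  [com.datastax.driver.core.Cluster] (ServerService Thread Pool -- 78) New Cassandra host hawkular-cassandra/172.30.57.143:9042 added
"""

# Assumed address format: "<hostname>/<ip>:<port>", hostname may be empty.
HOST_ADDED = re.compile(
    r"New Cassandra host (?P<name>[^/\s]*)/(?P<ip>[\d.]+):(?P<port>\d+) added"
)


def parse_added_hosts(log_text):
    """Return one dict per 'New Cassandra host ... added' log line."""
    hosts = []
    for match in HOST_ADDED.finditer(log_text):
        hosts.append(
            {
                # An empty hostname means the host was added by IP only.
                "name": match["name"] or None,
                "ip": match["ip"],
                "port": int(match["port"]),
            }
        )
    return hosts


if __name__ == "__main__":
    for host in parse_added_hosts(SAMPLE_LOG):
        kind = "by name" if host["name"] else "by bare IP"
        print(f'{host["ip"]}:{host["port"]} added {kind}')
```

Run against the two lines above, this reports 10.12.1.5 as added by bare IP and 172.30.57.143 as added under the name hawkular-cassandra.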
Note that the replication factor is set to 1.
(com.datastax.driver.core.exceptions.UnavailableException: Not enough replicas available for query at consistency LOCAL_ONE (1 required but only 0 alive)))
(cf. output included)
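
The "1 required but only 0 alive" message matches what a replication factor of 1 would produce once the only replica of some data is gone. The toy model below illustrates that arithmetic; it is not Cassandra or driver code. The names `check_replicas` and `Unavailable` are hypothetical, and 10.12.2.7 is an invented address standing in for the surviving pod, whose real IP is not given in this report.

```python
class Unavailable(Exception):
    """Raised when fewer replicas are alive than the consistency level needs."""


# Replica acknowledgements needed per consistency level (simplified model).
REQUIRED = {"ONE": 1, "LOCAL_ONE": 1}


def check_replicas(replicas, alive_hosts, consistency="LOCAL_ONE"):
    """Return the live replicas, or raise if there are too few of them."""
    required = REQUIRED[consistency]
    alive = [host for host in replicas if host in alive_hosts]
    if len(alive) < required:
        raise Unavailable(
            "Not enough replicas available for query at consistency "
            f"{consistency} ({required} required but only {len(alive)} alive)"
        )
    return alive


if __name__ == "__main__":
    # With replication factor 1, each piece of data has exactly one replica.
    replicas_for_row = ["10.12.1.5"]

    # Both pods up: the query can be served.
    print(check_replicas(replicas_for_row, {"10.12.1.5", "10.12.2.7"}))

    # The pod at 10.12.1.5 is scaled down: no replica is left for this row.
    try:
        check_replicas(replicas_for_row, {"10.12.2.7"})
    except Unavailable as exc:
        print(exc)
```

In this model the surviving node does not help, because with a replication factor of 1 it holds no copy of the rows owned by the node that was scaled down.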

I know that Cassandra has no HA or load balancing here, but the issue is with Hawkular: why does it use the pod's IP rather than the service's IP as its primary source?

Version-Release number of selected component (if applicable):
OpenShift Container Platform 3.4.0

How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

Comment 3 Matt Wringe 2017-06-08 18:55:12 UTC

I am marking this as a duplicate of 1459968

*** This bug has been marked as a duplicate of bug 1459968 ***