Bug 1459877 - Hawkular-metrics fails when one instance Cassandra is disabled
Summary: Hawkular-metrics fails when one instance Cassandra is disabled
Keywords:
Status: CLOSED DUPLICATE of bug 1459968
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Hawkular
Version: 3.4.0
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: ---
Assignee: Matt Wringe
QA Contact: Liming Zhou
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2017-06-08 12:35 UTC by Vladislav Walek
Modified: 2020-12-14 08:50 UTC (History)
2 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2017-06-08 18:55:12 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description Vladislav Walek 2017-06-08 12:35:00 UTC
Description of problem:

The metrics are running with two node of Cassandra with each replication controller and pv. However, when you scale down one node, the hawkular will fail with following error:

com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) tried for query failed (tried: /10.12.1.5:9042 

Where ip 10.12.1.5 is the ip of the pod which was scaled down.
I found that hawkular when starting is attaching the cassandra nodes like this:

08:02:44,344 INFO  [com.datastax.driver.core.Cluster] (ServerService Thread Pool -- 78) New Cassandra host /10.12.1.5:9042 added
08:02:44,344 INFO  [com.datastax.driver.core.Cluster] (ServerService Thread Pool -- 78) New Cassandra host hawkular-cassandra/172.30.57.143:9042 added

Which one os the pod's ip and one svc's ip. However, if the one pod is scaled down, it will show the error above.
Just adding that the replication factor is set to 1.
(com.datastax.driver.core.exceptions.UnavailableException: Not enough replicas available for query at consistency LOCAL_ONE (1 required but only 0 alive)))
(cf. output included)

I know that there is no HA on Cassandra or load balancing, but the issue is hawkular, why it is taking the ip of the pod and not the ip of the service as primary source?

Version-Release number of selected component (if applicable):
OpenShift Container Platform 3.4.0

How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

Comment 3 Matt Wringe 2017-06-08 18:55:12 UTC
I am marking this as a duplicate of 1459968

*** This bug has been marked as a duplicate of bug 1459968 ***


Note You need to log in before you can comment on or make changes to this bug.