1540413 – [3.9]hawkular-metrics pod took a long time to become running if set openshift_metrics_cassandra_replicas as non-default value

Bug 1540413 - [3.9]hawkular-metrics pod took a long time to become running if set openshift_metrics_cassandra_replicas as non-default value

Summary: [3.9]hawkular-metrics pod took a long time to become running if set openshift...

Keywords:
Status:	CLOSED CURRENTRELEASE
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	Hawkular
Sub Component:
Version:	3.9.0
Hardware:	Unspecified
OS:	Unspecified
Priority:	low
Severity:	low
Target Milestone:	---
Target Release:	3.10.0
Assignee:	John Sanda
QA Contact:	Junqi Zhao
Docs Contact:
URL:
Whiteboard:
Depends On:	1469423 1544546 1560695 1590449 1590451 1592966
Blocks:
TreeView+	depends on / blocked

Reported:	2018-01-31 01:58 UTC by Junqi Zhao
Modified:	2018-12-20 21:45 UTC (History)
CC List:	5 users (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:	1469423
Environment:
Last Closed:	2018-12-20 21:11:02 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)
metrics logs (62.71 KB, application/x-gzip) 2018-01-31 01:58 UTC, Junqi Zhao	no flags	Details
View All

Description Junqi Zhao 2018-01-31 01:58:16 UTC

Created attachment 1388714 [details]
metrics logs

+++ This bug was initially created as a clone of Bug #1469423 +++

Description of problem:
Set openshift_metrics_cassandra_replicas=2 to deploy,it take a long time, about 15 minutes, to make hawkular-metrics and heapster pod get ready.

Same phenomenon if set openshift_metrics_hawkular_replicas as non-default value

Version-Release number of selected component (if applicable):
# openshift version
openshift v3.9.0-0.31.0
kubernetes v1.9.1+a0ce1bc657
etcd 3.2.8

Metrics version
metrics-heapster-v3.9.0-0.34.0.0
metrics-hawkular-metrics-v3.9.0-0.34.0.0
metrics-cassandra-v3.9.0-0.34.0.0

How reproducible:
Always

Steps to Reproduce:
1. Set openshift_metrics_cassandra_replicas=2 in inventory file, and deploy metrics via ansible
2. 
3.

Actual results:
It take a long time, about 15 minutes, to make hawkular-metrics and heapster pod get ready

Expected results:
hawkular-metrics and heapster pod should not wait for so long to become ready.

Comment 1 John Sanda 2018-08-06 20:23:13 UTC

I am moving to ON_QA since this ticket depended on the work done in bug 1560695.

Comment 2 Junqi Zhao 2018-08-09 09:08:28 UTC

It takes around 9 minutes to reach Running status for all pods, much more quickly than before, but one hawkular-cassandra pod had restarted for 3 times, 2 times for hawkular-metrics-schema pod, since the restart mechanism is expected, close this defect as VERIFIED.

NAME                            READY     STATUS    RESTARTS   AGE       IP            NODE
hawkular-cassandra-1-gb77z      1/1       Running   0          8m        10.129.0.12   *****-qeos-nrr-1
hawkular-cassandra-2-vxq6n      1/1       Running   3          8m        10.128.0.10   *****-qeos-master-etcd-1
hawkular-metrics-2h2mw          0/1       Running   0          7m        10.129.0.11   *****-qeos-nrr-1
hawkular-metrics-schema-qznfv   1/1       Running   2          9m        10.129.0.9    *****-qeos-nrr-1
heapster-dkpjx                  0/1       Running   0          7m        10.128.0.9    *****-qeos-master-etcd-1


parameters
openshift_metrics_install_metrics=true
openshift_metrics_cassandra_storage_type=dynamic
openshift_metrics_cassandra_replicas=2

metrics version: v3.10.27-1

# openshift version
openshift v3.10.27

Note You need to log in before you can comment on or make changes to this bug.