Bug 1417528 - [RFE] cfme-openshift-app pod need to support scaling up on OCP 3.5
Summary: [RFE] cfme-openshift-app pod need to support scaling up on OCP 3.5
Alias: None
Product: Red Hat CloudForms Management Engine
Classification: Red Hat
Component: cfme-openshift-app
Version: 5.7.0
Hardware: Unspecified
OS: Unspecified
Target Milestone: GA
: 5.8.0
Assignee: Loic Avenel
QA Contact: Einat Pacifici
Red Hat CloudForms Documentation
Whiteboard: container:pod
Depends On:
TreeView+ depends on / blocked
Reported: 2017-01-30 05:49 UTC by Jaspreet Kaur
Modified: 2021-03-11 14:55 UTC (History)
5 users (show)

Fixed In Version:
Doc Type: Enhancement
Doc Text:
Clone Of:
Last Closed: 2017-05-31 14:00:19 UTC
Category: ---
Cloudforms Team: Container Management
Target Upstream Version:

Attachments (Terms of Use)

System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHBA-2017:1366 0 normal SHIPPED_LIVE CFME 5.8.0 images for OpenShift 2017-06-01 11:38:16 UTC

Description Jaspreet Kaur 2017-01-30 05:49:17 UTC
Description of problem: When trying to scale cloudforms pod below errors are seen in the pod  and it doesn't scale :

 oc get pods
NAME                 READY     STATUS    RESTARTS   AGE
cloudforms-3-03cdq   1/1       Running   28         2d
cloudforms-3-p61zz   0/1       Running   358        2d
memcached-1-6ahiy    1/1       Running   2          2d
postgresql-1-ndp3c   1/1       Running   1          2d
[root@ansible-m1 ~]# os rsh cloudforms-3-03cdq

[----] I, [2017-01-27T12:44:33.654925 #1722:e61130]  INFO -- : <AutomationEngine> MiqAeEvent.build_evm_event >> event=<"evm_worker_start"> inputs=<{:event_details=>"Worker started: ID [2982], PID [16020], GUID [574155ca-e48e-11e6-ae92-4ec637fe7c9c]", :type=>"MiqReportingWorker", "MiqEvent::miq_event"=>3036, :miq_event_id=>3036, "EventStream::event_stream"=>3036, :event_stream_id=>3036}>
[----] I, [2017-01-27T12:44:33.666973 #15989:e61130]  INFO -- : MIQ(MiqGenericWorker#log_status) [Generic Worker] Worker ID [2978], PID [15989], GUID [56fb61a0-e48e-11e6-ae92-4ec637fe7c9c], Last Heartbeat [2017-01-27 12:44:32 UTC], Process Info: Memory Usage [287612928], Memory Size [589725696], Proportional Set Size: [154088000], Memory % [2.31], CPU Time [68.0], CPU % [0.0], Priority [30]
[----] E, [2017-01-27T12:44:33.667414 #15989:e61130] ERROR -- : MIQ(MiqGenericWorker::Runner) ID [2978] PID [15989] GUID [56fb61a0-e48e-11e6-ae92-4ec637fe7c9c] Error heartbeating to MiqServer because DRb::DRbConnError: druby:// - #<Errno::ECONNREFUSED: Connection refused - connect(2) for "" port 41860> Worker exiting.

the 2nd appliance is starting but fails to seek the first one. Also, cfme with the same GUID...

sh-4.2# cat GUID

Version-Release number of selected component (if applicable):

Cfme 4.3
OCP 4.0

How reproducible:

Steps to Reproduce:
1. Deploy cloudforms on openshift using below article :

2. Once the pods start running to scale :

oc scale --replicas=2 rc cloudforms

3. The new pod never starts with status 1/1

Actual results: Second pod never starts and the 
the 2nd appliance is starting but fails to seek the first one

Expected results: It should have started successfully.

Additional info:

Comment 3 Barak 2017-03-23 18:41:55 UTC
Patch that supports PetSet was merged

As this will be supported in CF 4.5 on OCP 3.5 we will have to move to SatefulSet so not moving this bug to POST yet.

Comment 5 Barak 2017-05-01 15:41:56 UTC
Move to statefulset was completed and released downstream,
It was included in the template released on build

Therefor I am moving this bug to ON_QA

Comment 6 Einat Pacifici 2017-05-10 09:54:09 UTC
Verified. Setup OCP3.5 and Podified CFME ( 
ran command: 
oc scale statefulset cloudforms --replicas=2
oc get pods
two cloudforms pods were seen (cloudforms-0 & cloudforms-1)

Comment 8 errata-xmlrpc 2017-05-31 14:00:19 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.


Note You need to log in before you can comment on or make changes to this bug.