Bug 1934877 - Add timeout parameters for the readiness and liveness probes of the Registry pod
Summary: Add timeout parameters for the readiness and liveness probes of the Registry pod
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Migration Toolkit for Containers
Classification: Red Hat
Component: Controller
Version: 1.4.1
Hardware: x86_64
OS: Linux
Priority: medium
Severity: medium
Target Milestone: ---
Target Release: 1.5.0
Assignee: Pranav Gaikwad
QA Contact: Xin jiang
Docs Contact: Avital Pinnick
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2021-03-03 22:15 UTC by Guillermo Ojea Quitana
Modified: 2024-10-01 17:36 UTC (History)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2021-07-28 04:08:00 UTC
Target Upstream Version:
Embargoed:




Links
System | ID | Private | Priority | Status | Summary | Last Updated
Github | konveyor mig-controller pull 1097 | 0 | None | open | Bug 1934877: Add configuration for liveness and readiness timeout values for mig registry | 2021-04-30 16:49:15 UTC
Github | konveyor mig-operator pull 664 | 0 | None | open | Bug 1934877: Add registry timeout configuration variable | 2021-05-03 13:56:32 UTC
Red Hat Product Errata | RHEA-2021:2929 | 0 | None | None | None | 2021-07-28 04:08:07 UTC

Description Guillermo Ojea Quitana 2021-03-03 22:15:03 UTC
Description of problem:
With the current 3-second limit, the registry pod readiness and liveness probes time out during migration.

Event:
Readiness probe failed: Get http://10.128.6.214:5000/v2/_catalog?n=5: net/http: request canceled (Client.Timeout exceeded while awaiting headers)

Please allow a configurable timeout for the readiness and liveness probes.

Version-Release number of selected component (if applicable):
v1.4.1

How reproducible:
Always

Steps to Reproduce:
1. Create migration plan
2. Launch staging
3. When staging reaches the image migration phase, the registry pods in the source and destination clusters start crash-looping

Actual results:
Migration fails

Expected results:
Migration succeeds

Additional info:

Comment 1 Pranav Gaikwad 2021-05-03 16:49:06 UTC
We have now introduced the timeout configuration values in the MigCluster configmap on each cluster. The upstream documentation on configuring the timeouts is here: https://github.com/konveyor/mig-operator/blob/master/docs/usage/MigClusterConfiguration.md

Comment 7 Sergio 2021-06-16 13:07:20 UTC
Verified using MTC 1.5.0
SOURCE CLUSTER: AWS OCP 3.11 (CONTROLLER + UI)
TARGET CLUSTER: AWS OCP 4.7

Operator:
    registry.redhat.io/rhmtc/openshift-migration-rhel7-operator@sha256:c0375fa6ecff4d50c181fc3f31d66b6c13023fecb8bcef6899197ccd96c50a30
    - name: MIG_CONTROLLER_REPO
      value: openshift-migration-controller-rhel8@sha256
    - name: MIG_CONTROLLER_TAG
      value: 83f26020b731f78dc9e817186d3247ab46d7daedec62c808be3259ed571656aa
    - name: MIG_UI_REPO
      value: openshift-migration-ui-rhel8@sha256
    - name: MIG_UI_TAG
      value: 4e177e58e311ff2d9c37935308591df5680838255e35b138a696b065c03044f8
    - name: VELERO_REPO
      value: openshift-migration-velero-rhel8@sha256
    - name: VELERO_TAG
      value: e776a798ce8c1b1e6fcc10edaded1e70514a5c6cc2c177dead2d82ef562becde

With this configuration, we get the following values for the readiness and liveness probes:

    migration_registry_liveness_timeout: 100
    migration_registry_readiness_timeout: 200


      livenessProbe:
        failureThreshold: 3
        httpGet:
          path: /v2/_catalog?n=5
          port: 5000
          scheme: HTTP
        initialDelaySeconds: 15
        periodSeconds: 5
        successThreshold: 1
        timeoutSeconds: 100
 
      readinessProbe:
        failureThreshold: 3
        httpGet:
          path: /v2/_catalog?n=5
          port: 5000
          scheme: HTTP
        initialDelaySeconds: 15
        periodSeconds: 5
        successThreshold: 1
        timeoutSeconds: 200


Every OCP cluster in MTC uses the values configured in its own MigrationController resource, so the source and destination clusters can be configured with different values.
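
For illustration, a MigrationController resource carrying the two values above might look like the sketch below. Only the two timeout parameter names and their values come from this report. The apiVersion, resource name, namespace, and the placement of the parameters directly under spec are assumptions based on a default MTC installation and should be checked against the upstream documentation linked in comment 1.

```yaml
# Sketch only: apiVersion, name and namespace are assumed defaults,
# not taken from this bug report.
apiVersion: migration.openshift.io/v1alpha1
kind: MigrationController
metadata:
  name: migration-controller
  namespace: openshift-migration
spec:
  # Timeouts in seconds, as verified above; the registry pod's
  # livenessProbe and readinessProbe pick these up as timeoutSeconds.
  migration_registry_liveness_timeout: 100
  migration_registry_readiness_timeout: 200
```

Because each cluster reads its own resource, the same edit would have to be applied separately on the source and destination clusters, with whatever values suit each one.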



We move the status to VERIFIED.

Comment 13 errata-xmlrpc 2021-07-28 04:08:00 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Migration Toolkit for Containers (MTC) image release advisory 1.5.0), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2021:2929

