2088022 – Default CPU requests on Velero/Restic are too demanding making scheduling fail in certain environments

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Migration Toolkit for Containers
Classification:	Red Hat
Component:	Operator
Sub Component:
Version:	1.7.1
Hardware:	Unspecified
OS:	Unspecified
Priority:	high
Severity:	high
Target Milestone:	---
Target Release:	1.7.2
Assignee:	Pranav Gaikwad
QA Contact:	Prasad Joshi
Docs Contact:	Richard Hoch
URL:
Whiteboard:
Depends On:
Blocks:

Reported:	2022-05-18 16:51 UTC by Pranav Gaikwad
Modified:	2022-07-01 09:53 UTC
CC List:	2 users
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2022-07-01 09:53:23 UTC
Target Upstream Version:
Embargoed:

Links
System	ID	Priority	Status	Summary	Last Updated
Github	konveyor mig-operator pull 819	None	Merged	Add restic CPU/Memory resources requirements	2022-05-18 17:53:07 UTC
Github	konveyor mig-operator pull 820	None	open	Bug 2088022: Update merge type for DPA CR	2022-05-18 17:51:52 UTC
Github	konveyor mig-operator pull 826	None	Merged	Bug 2088022: Lower velero/restic resource requirements	2022-05-25 14:52:53 UTC
Red Hat Product Errata	RHSA-2022:5483	None	None	None	2022-07-01 09:53:31 UTC

Description Pranav Gaikwad 2022-05-18 16:51:12 UTC

Description of problem:
The default CPU requests for the Velero and Restic Pods are set to 500m, which is high. These resources can be configured in the DPA through the `podConfig` field for Velero and Restic. The migration operator should set the CPU requests to a lower value, such as 100m, so that the Velero/Restic Pods can be scheduled in the resource-constrained environments MTC often operates in.
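
To illustrate why the request size matters, here is a minimal sketch of the CPU fit check, assuming the simplified rule that a Pod fits on a node only if its request is no larger than the node's allocatable CPU minus the CPU already requested by other Pods. The helper names and the node figures (4 CPUs allocatable, 3700m already requested) are hypothetical and are not taken from this report.

```python
def parse_cpu_millicores(quantity: str) -> int:
    """Convert a Kubernetes CPU quantity such as "500m" or "1" to millicores."""
    quantity = quantity.strip()
    if quantity.endswith("m"):
        return int(quantity[:-1])
    # Whole or fractional cores, e.g. "1" or "0.5".
    return round(float(quantity) * 1000)


def fits(node_allocatable: str, already_requested: str, pod_request: str) -> bool:
    """Simplified CPU-only fit check: request <= allocatable - already requested."""
    free = parse_cpu_millicores(node_allocatable) - parse_cpu_millicores(already_requested)
    return parse_cpu_millicores(pod_request) <= free


# Hypothetical node: 4 CPUs allocatable, 3700m already requested (300m free).
print(fits("4", "3700m", "500m"))  # old default request of 500m
print(fits("4", "3700m", "100m"))  # proposed request of 100m
```

On this hypothetical node the 500m request does not fit while the 100m request does. The real scheduler also considers memory and other constraints, which this sketch ignores.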

Version-Release number of selected component (if applicable):
1.7.1

How reproducible:
Always

Steps to Reproduce:
Deploy the migration operator in an environment where some of the nodes do not have 500m CPU available.


Actual results:
Restic Pods fail to schedule on nodes that do not have enough CPU.

Expected results:
Restic Pods should request less CPU so that they can be scheduled on all nodes.

Additional info:

Comment 6 Prasad Joshi 2022-06-20 13:04:44 UTC

Verified with MTC 1.7.2 Pre-stage 

metadata_nvr: openshift-migration-operator-metadata-container-v1.7.2-15

DPA CR: 
  spec:
    backupImages: false
    configuration:
      restic:
        enable: true
        podConfig:
          labels:
            app.kubernetes.io/part-of: openshift-migration
          resourceAllocations:
            requests:
              cpu: 100m
        supplementalGroups: []
        timeout: 1h
      velero:
        defaultPlugins:
        - openshift
        - aws
        - gcp
        - azure
        noDefaultBackupLocation: true
        podConfig:
          labels:
            app.kubernetes.io/part-of: openshift-migration
          resourceAllocations:
            requests:
              cpu: 100m

$ oc get pod -n openshift-migration  velero-57c48b4bb-82mff -o yaml
    resources:
      limits:
        cpu: "1"
        memory: 512Mi
      requests:
        cpu: 100m
        memory: 128Mi

$ oc get pod -n openshift-migration  restic-xdsdb -o yaml 
    name: restic
    resources:
      limits:
        cpu: "1"
        memory: 512Mi
      requests:
        cpu: 100m
        memory: 128Mi

I see the correct CPU request value (100m), as per the PR above.
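
The same check could be scripted rather than read by eye. The following is a rough sketch, assuming the Pod is fetched as JSON (for example with `oc get pod -n openshift-migration restic-xdsdb -o json`); the `cpu_requests` helper is hypothetical, and the sample document is a hand-trimmed stand-in that mirrors only the fields shown in the output above.

```python
import json


def cpu_requests(pod: dict) -> dict:
    """Map each container name to its CPU request (None if no request is set)."""
    return {
        c["name"]: c.get("resources", {}).get("requests", {}).get("cpu")
        for c in pod["spec"]["containers"]
    }


# Hand-trimmed sample mirroring the restic Pod output above (not a full Pod object).
sample = json.loads("""
{
  "spec": {
    "containers": [
      {
        "name": "restic",
        "resources": {
          "limits": {"cpu": "1", "memory": "512Mi"},
          "requests": {"cpu": "100m", "memory": "128Mi"}
        }
      }
    ]
  }
}
""")

requests = cpu_requests(sample)
print(requests)
assert requests["restic"] == "100m", "unexpected CPU request on the restic container"
```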

Moving this to verified status.

Comment 12 errata-xmlrpc 2022-07-01 09:53:23 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Migration Toolkit for Containers (MTC) 1.7.2 security and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:5483
