Bug 2008290 - ImageStream with RHCOS version tag needed for RHODS GPU support
Summary: ImageStream with RHCOS version tag needed for RHODS GPU support
Keywords:
Status: CLOSED NEXTRELEASE
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Driver Toolkit
Version: 4.8
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: high
Target Milestone: ---
Target Release: 4.10.0
Assignee: dagray
QA Contact: Walid A.
URL:
Whiteboard:
Depends On:
Blocks: 2008619 2011024
Reported: 2021-09-27 19:59 UTC by dagray
Modified: 2022-01-04 19:14 UTC (History)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Clones: 2008619
Environment:
Last Closed: 2022-01-04 19:14:40 UTC
Target Upstream Version:
Embargoed:




Links
System ID Private Priority Status Summary Last Updated
Github openshift driver-toolkit pull 62 0 None Merged (reimplement) Add imagestream for driver-toolkit 2021-09-28 17:24:28 UTC
Github openshift kubernetes pull 963 0 None Merged UPSTREAM: <carry>: openshift-hack/images/os/Dockerfile: Add io.openshift.build.versions, etc. 2021-09-28 17:25:09 UTC

Description dagray 2021-09-27 19:59:42 UTC
The driver-toolkit ImageStream with the RHCOS version as a tag needs to be backported to 4.9 and 4.8. This is required in order for the GPU operator to build the GPU drivers without entitlements, as required by the RHODS team.

See driver-toolkit PR for 4.10 which added the ImageStream:
https://github.com/openshift/driver-toolkit/pull/62/files

Note: This driver-toolkit change was temporarily reverted because it caused some CI jobs to fail when creating a release. Some machine-os-content images built by ci-tools lacked the io.openshift.build.versions label, which consumers such as the driver-toolkit require.

After https://github.com/openshift/kubernetes/pull/963 was merged, the driver-toolkit PR #62 re-added the ImageStream tags. Therefore, we also need to cherry-pick https://github.com/openshift/kubernetes/pull/963 to 4.9 and 4.8 before the imagestream can be cherry-picked.

Comment 1 Walid A. 2021-09-28 23:56:21 UTC
The driver-toolkit imagestream on OCP 4.10.0-0.nightly-2021-09-28-020957 shows a tag that matches the OSTREE_VERSION in /etc/os-release on the RHCOS worker nodes:

# oc get clusterversion
NAME      VERSION                              AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.10.0-0.nightly-2021-09-28-020957   True        False         3h42m   Cluster version is 4.10.0-0.nightly-2021-09-28-020957

# oc get imagestreams -n openshift | grep driver
driver-toolkit                                        image-registry.openshift-image-registry.svc:5000/openshift/driver-toolkit                                        49.84.202109241334-0,latest                              4 hours ago
jboss-datavirt64-driver-openshift                     image-registry.openshift-image-registry.svc:5000/openshift/jboss-datavirt64-driver-openshift                     1.0,1.1,1.2,1.3,1.4,1.5,1.6,1.7                          3 hours ago

# oc describe imagestreams -n openshift  driver-toolkit
Name:			driver-toolkit
Namespace:		openshift
Created:		4 hours ago
Labels:			<none>
Annotations:		include.release.openshift.io/ibm-cloud-managed=true
			include.release.openshift.io/self-managed-high-availability=true
			include.release.openshift.io/single-node-developer=true
			openshift.io/image.dockerRepositoryCheck=2021-09-28T20:05:39Z
Image Repository:	image-registry.openshift-image-registry.svc:5000/openshift/driver-toolkit
Image Lookup:		local=false
Unique Images:		1
Tags:			2

latest
  updates automatically from registry quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:849cec3b62455bbd2e61f1d2b6f733d690daca149e1eed9adf9548292003a66a

  * quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:849cec3b62455bbd2e61f1d2b6f733d690daca149e1eed9adf9548292003a66a
      4 hours ago

49.84.202109241334-0
  updates automatically from registry quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:849cec3b62455bbd2e61f1d2b6f733d690daca149e1eed9adf9548292003a66a

  * quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:849cec3b62455bbd2e61f1d2b6f733d690daca149e1eed9adf9548292003a66a
      4 hours ago


# oc get nodes
NAME                                                          STATUS   ROLES    AGE     VERSION
walid410gcpcia-dkbpv-master-0.c.openshift-qe.internal         Ready    master   3h44m   v1.22.0-rc.0+af080cb
walid410gcpcia-dkbpv-master-1.c.openshift-qe.internal         Ready    master   3h43m   v1.22.0-rc.0+af080cb
walid410gcpcia-dkbpv-master-2.c.openshift-qe.internal         Ready    master   3h43m   v1.22.0-rc.0+af080cb
walid410gcpcia-dkbpv-worker-a-r6vjq.c.openshift-qe.internal   Ready    worker   3h34m   v1.22.0-rc.0+af080cb
walid410gcpcia-dkbpv-worker-b-tgthz.c.openshift-qe.internal   Ready    worker   3h34m   v1.22.0-rc.0+af080cb
walid410gcpcia-dkbpv-worker-c-mtf6x.c.openshift-qe.internal   Ready    worker   3h34m   v1.22.0-rc.0+af080cb

# oc debug node/walid410gcpcia-dkbpv-worker-a-r6vjq.c.openshift-qe.internal
W0928 23:38:37.907185 1664618 warnings.go:70] would violate "latest" version of "baseline" PodSecurity profile: host namespaces (hostNetwork=true, hostPID=true), hostPath volumes (volume "host"), privileged (container "container-00" must not set securityContext.privileged=true)
Starting pod/walid410gcpcia-dkbpv-worker-a-r6vjqcopenshift-qeinternal-debug ...
To use host binaries, run `chroot /host`
Pod IP: 10.0.128.4
If you don't see a command prompt, try pressing enter.
sh-4.4# 
sh-4.4# chroot /host
sh-4.4# 
sh-4.4# cat /etc/os-release
NAME="Red Hat Enterprise Linux CoreOS"
VERSION="49.84.202109241334-0"
ID="rhcos"
ID_LIKE="rhel fedora"
VERSION_ID="4.9"
PLATFORM_ID="platform:el8"
PRETTY_NAME="Red Hat Enterprise Linux CoreOS 49.84.202109241334-0 (Ootpa)"
ANSI_COLOR="0;31"
CPE_NAME="cpe:/o:redhat:enterprise_linux:8::coreos"
HOME_URL="https://www.redhat.com/"
DOCUMENTATION_URL="https://docs.openshift.com/container-platform/4.9/"
BUG_REPORT_URL="https://bugzilla.redhat.com/"
REDHAT_BUGZILLA_PRODUCT="OpenShift Container Platform"
REDHAT_BUGZILLA_PRODUCT_VERSION="4.9"
REDHAT_SUPPORT_PRODUCT="OpenShift Container Platform"
REDHAT_SUPPORT_PRODUCT_VERSION="4.9"
OPENSHIFT_VERSION="4.9"
RHEL_VERSION="8.4"
OSTREE_VERSION='49.84.202109241334-0'
sh-4.4# 
sh-4.4# exit
exit
sh-4.4# exit
exit

Removing debug pod ...

# oc describe imagestreams -n openshift  driver-toolkit | grep "49.84.202109241334-0"
49.84.202109241334-0
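
The manual check above (comparing OSTREE_VERSION from /etc/os-release against the ImageStream tags) could be scripted roughly as follows. This is a minimal sketch, not part of the original verification: the function names ostree_version and has_tag are hypothetical, and the inputs are assumed to be os-release text on stdin and the comma-separated TAGS column as printed by `oc get imagestreams`.

```shell
#!/usr/bin/env bash
# Sketch only: helper names are illustrative, not part of oc or the driver-toolkit.

# Print the OSTREE_VERSION value found in os-release formatted text on stdin,
# with surrounding single or double quotes removed.
ostree_version() {
  sed -n "s/^OSTREE_VERSION=//p" | tr -d "\"'" | head -n 1
}

# Succeed if tag $1 appears as a whole entry in the comma-separated list $2.
has_tag() {
  case ",$2," in
    *",$1,"*) return 0 ;;
    *) return 1 ;;
  esac
}

# Example with values copied from the transcript above:
#   version=$(printf "%s\n" "OSTREE_VERSION='49.84.202109241334-0'" | ostree_version)
#   has_tag "$version" "49.84.202109241334-0,latest" && echo "tag present"
```

Matching on whole comma-delimited entries avoids a false positive when one tag is a prefix of another.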

