Bug 1922262
| Summary: | [4.5.z] real-time kernel in RHCOS is not synchronized | |||
|---|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Micah Abbott <miabbott> | |
| Component: | RHCOS | Assignee: | Steve Milner <smilner> | |
| Status: | CLOSED ERRATA | QA Contact: | Michael Nguyen <mnguyen> | |
| Severity: | urgent | Docs Contact: | ||
| Priority: | high | |||
| Version: | 4.5 | CC: | bbreard, brault, dornelas, imcleod, jligon, kholtz, mapfelba, miabbott, mnguyen, mrussell, nstielau, smilner | |
| Target Milestone: | --- | |||
| Target Release: | 4.5.z | |||
| Hardware: | All | |||
| OS: | All | |||
| Whiteboard: | ||||
| Fixed In Version: | Doc Type: | Bug Fix | ||
| Doc Text: | Cause: The RHCOS build config was using a stage repo location for the kernel-rt package instead of the production repo location. Consequence: The kernel-rt package was not synchronized with the vanilla kernel package. Fix: Change the RHCOS build config to use the production repo location. Result: The kernel-rt package is synchronized with the vanilla kernel package. | Story Points: | --- | |
| Clone Of: | 1914988 | |||
| : | 1922263 (view as bug list) | Environment: | ||
| Last Closed: | 2021-03-03 04:40:35 UTC | Type: | --- | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Embargoed: | ||||
| Bug Depends On: | 1914469, 1914988 | |||
| Bug Blocks: | 1922263 | |||
Description
Micah Abbott
2021-01-29 14:42:35 UTC
Verified on 4.6.0-0.nightly-2021-01-30-211400.
$ oc get clusterversion
NAME VERSION AVAILABLE PROGRESSING SINCE STATUS
version 4.6.0-0.nightly-2021-01-30-211400 True False 129m Cluster version is 4.6.0-0.nightly-2021-01-30-211400
$ cat << EOF > 99-worker-realtime.yaml
> apiVersion: machineconfiguration.openshift.io/v1
> kind: MachineConfig
> metadata:
>   labels:
>     machineconfiguration.openshift.io/role: "worker"
>   name: 99-worker-realtime
> spec:
>   config:
>     ignition:
>       version: 3.1.0
>   kernelType: realtime
> EOF
$
$
$
$ oc create -f 99-worker-realtime.yaml
machineconfig.machineconfiguration.openshift.io/99-worker-realtime created
$
$
$ oc get mc
NAME GENERATEDBYCONTROLLER IGNITIONVERSION AGE
00-master fc2e69c4408d898b24760eea9e889f0673369e67 3.1.0 111m
00-worker fc2e69c4408d898b24760eea9e889f0673369e67 3.1.0 111m
01-master-container-runtime fc2e69c4408d898b24760eea9e889f0673369e67 3.1.0 111m
01-master-kubelet fc2e69c4408d898b24760eea9e889f0673369e67 3.1.0 111m
01-worker-container-runtime fc2e69c4408d898b24760eea9e889f0673369e67 3.1.0 111m
01-worker-kubelet fc2e69c4408d898b24760eea9e889f0673369e67 3.1.0 111m
99-master-generated-registries fc2e69c4408d898b24760eea9e889f0673369e67 3.1.0 111m
99-master-ssh 3.1.0 121m
99-worker-generated-registries fc2e69c4408d898b24760eea9e889f0673369e67 3.1.0 111m
99-worker-realtime 3.1.0 2s
99-worker-ssh 3.1.0 121m
rendered-master-6f3f026c1d4096998a67917ad4d4b48c fc2e69c4408d898b24760eea9e889f0673369e67 3.1.0 111m
rendered-worker-5b593a73abfd633d505de51bbd3e98f6 fc2e69c4408d898b24760eea9e889f0673369e67 3.1.0 111m
$ oc get mcp/worker
NAME CONFIG UPDATED UPDATING DEGRADED MACHINECOUNT READYMACHINECOUNT UPDATEDMACHINECOUNT DEGRADEDMACHINECOUNT AGE
worker rendered-worker-5b593a73abfd633d505de51bbd3e98f6 False True False 3 0 0 0 112m
$ watch oc get mcp/worker
$ oc get nodes
NAME STATUS ROLES AGE VERSION
ip-10-0-142-78.us-west-2.compute.internal Ready master 147m v1.19.0+e49167a
ip-10-0-146-27.us-west-2.compute.internal Ready worker 142m v1.19.0+e49167a
ip-10-0-176-172.us-west-2.compute.internal Ready worker 139m v1.19.0+e49167a
ip-10-0-191-140.us-west-2.compute.internal Ready master 148m v1.19.0+e49167a
ip-10-0-206-98.us-west-2.compute.internal Ready worker 142m v1.19.0+e49167a
ip-10-0-223-245.us-west-2.compute.internal Ready master 148m v1.19.0+e49167a
$ oc debug node/ip-10-0-142-78.us-west-2.compute.internal
Starting pod/ip-10-0-142-78us-west-2computeinternal-debug ...
To use host binaries, run `chroot /host`
If you don't see a command prompt, try pressing enter.
sh-4.2# chroot /host
sh-4.4# uname -a
Linux ip-10-0-142-78 4.18.0-193.41.1.el8_2.x86_64 #1 SMP Wed Jan 13 11:33:33 EST 2021 x86_64 x86_64 x86_64 GNU/Linux
sh-4.4# rpm -qa | grep kernel
kernel-modules-extra-4.18.0-193.41.1.el8_2.x86_64
kernel-modules-4.18.0-193.41.1.el8_2.x86_64
kernel-4.18.0-193.41.1.el8_2.x86_64
kernel-core-4.18.0-193.41.1.el8_2.x86_64
sh-4.4# exit
exit
sh-4.2# exit
exit
Removing debug pod ...
$ oc debug node/ip-10-0-146-27.us-west-2.compute.internal
Starting pod/ip-10-0-146-27us-west-2computeinternal-debug ...
To use host binaries, run `chroot /host`
If you don't see a command prompt, try pressing enter.
sh-4.2# chroot /host
sh-4.4# uname -a
Linux ip-10-0-146-27 4.18.0-193.41.1.rt13.91.el8_2.x86_64 #1 SMP PREEMPT RT Wed Jan 13 15:16:38 EST 2021 x86_64 x86_64 x86_64 GNU/Linux
sh-4.4# rpm -qa | grep kernel
kernel-rt-modules-extra-4.18.0-193.41.1.rt13.91.el8_2.x86_64
kernel-rt-modules-4.18.0-193.41.1.rt13.91.el8_2.x86_64
kernel-rt-kvm-4.18.0-193.41.1.rt13.91.el8_2.x86_64
kernel-rt-core-4.18.0-193.41.1.rt13.91.el8_2.x86_64
sh-4.4# rpm-ostree status
State: idle
Deployments:
* pivot://quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:09e3228d294d07940efebc06301179cf376f50ca16efe21ba0e343ab12ffa131
CustomOrigin: Managed by machine-config-operator
Version: 46.82.202101301821-0 (2021-01-30T18:25:04Z)
RemovedBasePackages: kernel-core kernel-modules kernel kernel-modules-extra 4.18.0-193.41.1.el8_2
LayeredPackages: kernel-rt-core kernel-rt-kvm kernel-rt-modules kernel-rt-modules-extra
pivot://quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:09e3228d294d07940efebc06301179cf376f50ca16efe21ba0e343ab12ffa131
CustomOrigin: Managed by machine-config-operator
Version: 46.82.202101301821-0 (2021-01-30T18:25:04Z)
sh-4.4# exit
exit
sh-4.2# exit
exit
Removing debug pod ...
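The check being made by eye in the output above is that the kernel-rt packages carry the same base version as the vanilla kernel packages. A minimal sketch of that comparison follows. It assumes the RT build differs from the vanilla NVRA only by an inserted `.rtN.M` component, which is what the package names in this report show. The function names `base_version` and `kernels_in_sync` are hypothetical helpers, not part of any RHCOS or OpenShift tooling.

```shell
#!/usr/bin/env bash
# Hypothetical helper: reduce a kernel-core or kernel-rt-core NVRA to its
# base version by dropping the package name, the ".rtN.M" component that
# marks an RT build (assumed format), and the trailing architecture.
base_version() {
  printf '%s\n' "$1" |
    sed -E 's/^kernel(-rt)?-core-//; s/\.rt[0-9]+\.[0-9]+//; s/\.[^.]+$//'
}

# Hypothetical helper: succeed when both packages share a base version.
kernels_in_sync() {
  [ "$(base_version "$1")" = "$(base_version "$2")" ]
}

# Package names copied from the master and worker output above.
vanilla="kernel-core-4.18.0-193.41.1.el8_2.x86_64"
realtime="kernel-rt-core-4.18.0-193.41.1.rt13.91.el8_2.x86_64"

if kernels_in_sync "$vanilla" "$realtime"; then
  echo "in sync: $(base_version "$vanilla")"
else
  echo "out of sync: $(base_version "$vanilla") vs $(base_version "$realtime")"
fi
```

The comparison is purely textual, so it can run anywhere against saved `rpm -qa` output; it does not query the cluster.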
This bug should be targeted at 4.5.z; moving back to MODIFIED so that it will be added to the correct errata.
Verified on 4.5.0-0.nightly-2021-02-05-192721.
$ oc get clusterversion
NAME VERSION AVAILABLE PROGRESSING SINCE STATUS
version 4.5.0-0.nightly-2021-02-05-192721 True False 18m Cluster version is 4.5.0-0.nightly-2021-02-05-192721
$ cat 99-worker-realtime.yaml
apiVersion: machineconfiguration.openshift.io/v1
kind: MachineConfig
metadata:
  labels:
    machineconfiguration.openshift.io/role: "worker"
  name: 99-worker-realtime
spec:
  config:
    ignition:
      version: 2.2.0
  kernelType: realtime
$ oc create -f 99-worker-realtime.yaml
machineconfig.machineconfiguration.openshift.io/99-worker-realtime created
$ oc get mc
NAME GENERATEDBYCONTROLLER IGNITIONVERSION AGE
00-master f0034240b4ad1a5537bca817454717e418a85370 2.2.0 39m
00-worker f0034240b4ad1a5537bca817454717e418a85370 2.2.0 39m
01-master-container-runtime f0034240b4ad1a5537bca817454717e418a85370 2.2.0 39m
01-master-kubelet f0034240b4ad1a5537bca817454717e418a85370 2.2.0 39m
01-worker-container-runtime f0034240b4ad1a5537bca817454717e418a85370 2.2.0 39m
01-worker-kubelet f0034240b4ad1a5537bca817454717e418a85370 2.2.0 39m
99-master-078e1dc0-15c5-4e47-b9a5-e994ed3e5b1a-registries f0034240b4ad1a5537bca817454717e418a85370 2.2.0 39m
99-master-generated-crio-capabilities 2.2.0 39m
99-master-ssh 2.2.0 48m
99-worker-0c5242b1-e122-4688-b002-18d81a1e8f5f-registries f0034240b4ad1a5537bca817454717e418a85370 2.2.0 39m
99-worker-generated-crio-capabilities 2.2.0 39m
99-worker-realtime 2.2.0 3s
99-worker-ssh 2.2.0 48m
rendered-master-9fdb731f2dd849f3d0264c06dd493d01 f0034240b4ad1a5537bca817454717e418a85370 2.2.0 39m
rendered-worker-1971ecf737395f2fd5ad6aeb5943b7be f0034240b4ad1a5537bca817454717e418a85370 2.2.0 39m
$ oc get mcp/worker
NAME CONFIG UPDATED UPDATING DEGRADED MACHINECOUNT READYMACHINECOUNT UPDATEDMACHINECOUNT DEGRADEDMACHINECOUNT AGE
worker rendered-worker-1971ecf737395f2fd5ad6aeb5943b7be False True False 3 0 0 0 41m
$ oc get mcp/worker
NAME CONFIG UPDATED UPDATING DEGRADED MACHINECOUNT READYMACHINECOUNT UPDATEDMACHINECOUNT DEGRADEDMACHINECOUNT AGE
worker rendered-worker-6fe9a019855a23f056c6579bedad2820 True False False 3 3 3 0 54m
$ oc get nodes
NAME STATUS ROLES AGE VERSION
ip-10-0-132-142.us-west-2.compute.internal Ready worker 43m v1.18.3+c3b5327
ip-10-0-151-200.us-west-2.compute.internal Ready master 55m v1.18.3+c3b5327
ip-10-0-174-228.us-west-2.compute.internal Ready worker 43m v1.18.3+c3b5327
ip-10-0-174-83.us-west-2.compute.internal Ready master 55m v1.18.3+c3b5327
ip-10-0-218-111.us-west-2.compute.internal Ready master 56m v1.18.3+c3b5327
ip-10-0-218-233.us-west-2.compute.internal Ready worker 43m v1.18.3+c3b5327
$ for i in $(oc get nodes -o name); do oc debug $i -- chroot /host rpm -qa kernel*; done
Starting pod/ip-10-0-132-142us-west-2computeinternal-debug ...
To use host binaries, run `chroot /host`
kernel-devel-4.18.0-193.41.1.el8_2.x86_64
kernel-rt-modules-4.18.0-193.41.1.rt13.91.el8_2.x86_64
kernel-headers-4.18.0-193.41.1.el8_2.x86_64
kernel-rt-core-4.18.0-193.41.1.rt13.91.el8_2.x86_64
kernel-rt-modules-extra-4.18.0-193.41.1.rt13.91.el8_2.x86_64
kernel-rt-kvm-4.18.0-193.41.1.rt13.91.el8_2.x86_64
Removing debug pod ...
Starting pod/ip-10-0-151-200us-west-2computeinternal-debug ...
To use host binaries, run `chroot /host`
kernel-devel-4.18.0-193.41.1.el8_2.x86_64
kernel-core-4.18.0-193.41.1.el8_2.x86_64
kernel-modules-4.18.0-193.41.1.el8_2.x86_64
kernel-modules-extra-4.18.0-193.41.1.el8_2.x86_64
kernel-headers-4.18.0-193.41.1.el8_2.x86_64
kernel-4.18.0-193.41.1.el8_2.x86_64
Removing debug pod ...
Starting pod/ip-10-0-174-228us-west-2computeinternal-debug ...
To use host binaries, run `chroot /host`
kernel-devel-4.18.0-193.41.1.el8_2.x86_64
kernel-rt-modules-4.18.0-193.41.1.rt13.91.el8_2.x86_64
kernel-headers-4.18.0-193.41.1.el8_2.x86_64
kernel-rt-core-4.18.0-193.41.1.rt13.91.el8_2.x86_64
kernel-rt-modules-extra-4.18.0-193.41.1.rt13.91.el8_2.x86_64
kernel-rt-kvm-4.18.0-193.41.1.rt13.91.el8_2.x86_64
Removing debug pod ...
Starting pod/ip-10-0-174-83us-west-2computeinternal-debug ...
To use host binaries, run `chroot /host`
kernel-devel-4.18.0-193.41.1.el8_2.x86_64
kernel-core-4.18.0-193.41.1.el8_2.x86_64
kernel-modules-4.18.0-193.41.1.el8_2.x86_64
kernel-modules-extra-4.18.0-193.41.1.el8_2.x86_64
kernel-headers-4.18.0-193.41.1.el8_2.x86_64
kernel-4.18.0-193.41.1.el8_2.x86_64
Removing debug pod ...
Starting pod/ip-10-0-218-111us-west-2computeinternal-debug ...
To use host binaries, run `chroot /host`
kernel-devel-4.18.0-193.41.1.el8_2.x86_64
kernel-core-4.18.0-193.41.1.el8_2.x86_64
kernel-modules-4.18.0-193.41.1.el8_2.x86_64
kernel-modules-extra-4.18.0-193.41.1.el8_2.x86_64
kernel-headers-4.18.0-193.41.1.el8_2.x86_64
kernel-4.18.0-193.41.1.el8_2.x86_64
Removing debug pod ...
Starting pod/ip-10-0-218-233us-west-2computeinternal-debug ...
To use host binaries, run `chroot /host`
kernel-devel-4.18.0-193.41.1.el8_2.x86_64
kernel-rt-modules-4.18.0-193.41.1.rt13.91.el8_2.x86_64
kernel-headers-4.18.0-193.41.1.el8_2.x86_64
kernel-rt-core-4.18.0-193.41.1.rt13.91.el8_2.x86_64
kernel-rt-modules-extra-4.18.0-193.41.1.rt13.91.el8_2.x86_64
kernel-rt-kvm-4.18.0-193.41.1.rt13.91.el8_2.x86_64
Removing debug pod ...
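The per-node listings above show the expected split: workers have the kernel-rt packages and masters keep the vanilla kernel. A minimal sketch for condensing one node's package list to a single word is given here. The name `kernel_flavor` is hypothetical, and the sketch assumes that the presence of `kernel-rt-core` in the list is enough to call the node realtime. It inspects installed packages only, not the running kernel.

```shell
#!/usr/bin/env bash
# Hypothetical helper: read package names on stdin and print "realtime"
# if a kernel-rt-core package is listed, otherwise "standard".
kernel_flavor() {
  if grep -q '^kernel-rt-core-'; then
    echo realtime
  else
    echo standard
  fi
}

# Sample input copied from the ip-10-0-132-142 (worker) output above.
printf '%s\n' \
  kernel-devel-4.18.0-193.41.1.el8_2.x86_64 \
  kernel-rt-modules-4.18.0-193.41.1.rt13.91.el8_2.x86_64 \
  kernel-headers-4.18.0-193.41.1.el8_2.x86_64 \
  kernel-rt-core-4.18.0-193.41.1.rt13.91.el8_2.x86_64 |
  kernel_flavor
```

Against a live cluster the same function could presumably be fed from the loop shown earlier. In that case it is safer to quote the glob as `'kernel*'` so the local shell cannot expand it against files in the current directory.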
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Important: OpenShift Container Platform 4.5.33 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.
https://access.redhat.com/errata/RHSA-2021:0428