Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1860926

Summary: [4.5] rpm-ostree crashes when deploying realtime kernel
Product: OpenShift Container Platform Reporter: Johnny Liu <jialiu>
Component: RHCOSAssignee: Jonathan Lebon <jlebon>
Status: CLOSED DEFERRED QA Contact: Michael Nguyen <mnguyen>
Severity: high Docs Contact:
Priority: medium    
Version: 4.5CC: bbreard, behoward, imcleod, jlebon, jligon, miabbott, nstielau, skumari, smilner
Target Milestone: ---   
Target Release: 4.5.z   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-08-26 20:22:08 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1859269, 1862233    
Bug Blocks:    
Attachments:
Description Flags
bootstrap gather log none

Description Johnny Liu 2020-07-27 13:10:25 UTC
Description of problem:


Version-Release number of selected component (if applicable):
4.5.0-0.nightly-2020-07-25-031342

How reproducible:
always

Steps to Reproduce:
1. ./openshift-install create manifests
2. add this MachineConfig to manifests:
   cat <<EOF >openshift/99-master-kerneltype.yaml
apiVersion: machineconfiguration.openshift.io/v1
kind: MachineConfig
metadata:
  labels:
    machineconfiguration.openshift.io/role: "master"
  name: 99-master-kerneltype
spec:
  kernelType: realtime
EOF
    cat <<EOF >openshift/99-worker-kerneltype.yaml
apiVersion: machineconfiguration.openshift.io/v1
kind: MachineConfig
metadata:
  labels:
    machineconfiguration.openshift.io/role: "worker"
  name: 99-worker-kerneltype
spec:
  kernelType: realtime
EOF


Actual results:
bootstrap get failed. 

Expected results:
installation get completed.

Additional info:
Applying RT machineconfig as Day 2 deployment get successful.

Comment 1 Johnny Liu 2020-07-27 13:12:42 UTC
Created attachment 1702537 [details]
bootstrap gather log

Comment 2 Micah Abbott 2020-07-27 14:21:11 UTC
This looks like it could be a dupe of https://bugzilla.redhat.com/show_bug.cgi?id=1859269

@Sinny @Jonathan could either of you have a look and confirm my suspicion?

Comment 3 Sinny Kumari 2020-07-27 15:07:48 UTC
(In reply to Micah Abbott from comment #2)
> This looks like it could be a dupe of
> https://bugzilla.redhat.com/show_bug.cgi?id=1859269
> 
> @Sinny @Jonathan could either of you have a look and confirm my suspicion?

Yes, this looks like duplicate of BZ#1859269

From log:
$cat control-plane/10.0.99.119/unit-status/machine-config-daemon-firstboot.service.txt
...
Jul 27 12:10:35 jialiu451-lpmzx-control-plane-0 machine-config-daemon[1807]: rpm-ostreed.service: Consumed 2min 11.403s CPU time
Jul 27 12:10:35 jialiu451-lpmzx-control-plane-0 machine-config-daemon[1807]: I0727 12:10:35.701919    1807 update.go:1404] Initiating switch from kernel realtime to default
Jul 27 12:10:35 jialiu451-lpmzx-control-plane-0 machine-config-daemon[1807]: I0727 12:10:35.707378    1807 rpm-ostree.go:368] Running captured: rpm-ostree status --json
Jul 27 12:10:35 jialiu451-lpmzx-control-plane-0 machine-config-daemon[1807]: I0727 12:10:35.799701    1807 update.go:813] Updating files
Jul 27 12:10:35 jialiu451-lpmzx-control-plane-0 machine-config-daemon[1807]: I0727 12:10:35.800482    1807 update.go:850] Deleting stale data
Jul 27 12:10:35 jialiu451-lpmzx-control-plane-0 machine-config-daemon[1807]: error: error rolling back Real time Kernel No kernel-rt package installed on host: failed to run pivot: failed to start machine-config-daemon-host.service: exit status 1

Comment 4 Jonathan Lebon 2020-07-27 15:15:09 UTC
Yeah, this is a dupe though this one is targeted at 4.5 while https://bugzilla.redhat.com/show_bug.cgi?id=1859269 is targeted at 4.6. https://bugzilla.redhat.com/show_bug.cgi?id=1859269#c12 makes me think it might be related to a glib bump in 8.2.

Anyway, let's keep this one open for now as the 4.5 version of this bug as I investigate this.

Comment 5 Micah Abbott 2020-07-27 15:27:03 UTC
Targeted for 4.5.z, but the BZ bots might be unhappy about that.  I may end up marking this as a dupe and cloning the original for 4.5

Comment 7 Johnny Liu 2020-07-28 01:41:45 UTC
When I was opening this bug, I already noticed BZ#1859269, but 1859269 is targeted 4.6. Let us use this one for tracking 4.5 issue. Thx.

Comment 8 Jonathan Lebon 2020-07-31 19:14:13 UTC
This is blocked on the RHEL 8.2.0 respin. Bumping to next sprint.