Bug 1860926 - [4.5] rpm-ostree crashes when deploying realtime kernel
Summary: [4.5] rpm-ostree crashes when deploying realtime kernel
Keywords:
Status: CLOSED DEFERRED
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: RHCOS
Version: 4.5
Hardware: Unspecified
OS: Unspecified
medium
high
Target Milestone: ---
: 4.5.z
Assignee: Jonathan Lebon
QA Contact: Michael Nguyen
URL:
Whiteboard:
Depends On: 1859269 1862233
Blocks:
TreeView+ depends on / blocked
 
Reported: 2020-07-27 13:10 UTC by Johnny Liu
Modified: 2020-08-26 20:22 UTC (History)
9 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2020-08-26 20:22:08 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
bootstrap gather log (1.23 MB, application/gzip)
2020-07-27 13:12 UTC, Johnny Liu
no flags Details

Description Johnny Liu 2020-07-27 13:10:25 UTC
Description of problem:


Version-Release number of selected component (if applicable):
4.5.0-0.nightly-2020-07-25-031342

How reproducible:
always

Steps to Reproduce:
1. ./openshift-install create manifests
2. add this MachineConfig to manifests:
   cat <<EOF >openshift/99-master-kerneltype.yaml
apiVersion: machineconfiguration.openshift.io/v1
kind: MachineConfig
metadata:
  labels:
    machineconfiguration.openshift.io/role: "master"
  name: 99-master-kerneltype
spec:
  kernelType: realtime
EOF
    cat <<EOF >openshift/99-worker-kerneltype.yaml
apiVersion: machineconfiguration.openshift.io/v1
kind: MachineConfig
metadata:
  labels:
    machineconfiguration.openshift.io/role: "worker"
  name: 99-worker-kerneltype
spec:
  kernelType: realtime
EOF


Actual results:
bootstrap get failed. 

Expected results:
installation get completed.

Additional info:
Applying RT machineconfig as Day 2 deployment get successful.

Comment 1 Johnny Liu 2020-07-27 13:12:42 UTC
Created attachment 1702537 [details]
bootstrap gather log

Comment 2 Micah Abbott 2020-07-27 14:21:11 UTC
This looks like it could be a dupe of https://bugzilla.redhat.com/show_bug.cgi?id=1859269

@Sinny @Jonathan could either of you have a look and confirm my suspicion?

Comment 3 Sinny Kumari 2020-07-27 15:07:48 UTC
(In reply to Micah Abbott from comment #2)
> This looks like it could be a dupe of
> https://bugzilla.redhat.com/show_bug.cgi?id=1859269
> 
> @Sinny @Jonathan could either of you have a look and confirm my suspicion?

Yes, this looks like duplicate of BZ#1859269

From log:
$cat control-plane/10.0.99.119/unit-status/machine-config-daemon-firstboot.service.txt
...
Jul 27 12:10:35 jialiu451-lpmzx-control-plane-0 machine-config-daemon[1807]: rpm-ostreed.service: Consumed 2min 11.403s CPU time
Jul 27 12:10:35 jialiu451-lpmzx-control-plane-0 machine-config-daemon[1807]: I0727 12:10:35.701919    1807 update.go:1404] Initiating switch from kernel realtime to default
Jul 27 12:10:35 jialiu451-lpmzx-control-plane-0 machine-config-daemon[1807]: I0727 12:10:35.707378    1807 rpm-ostree.go:368] Running captured: rpm-ostree status --json
Jul 27 12:10:35 jialiu451-lpmzx-control-plane-0 machine-config-daemon[1807]: I0727 12:10:35.799701    1807 update.go:813] Updating files
Jul 27 12:10:35 jialiu451-lpmzx-control-plane-0 machine-config-daemon[1807]: I0727 12:10:35.800482    1807 update.go:850] Deleting stale data
Jul 27 12:10:35 jialiu451-lpmzx-control-plane-0 machine-config-daemon[1807]: error: error rolling back Real time Kernel No kernel-rt package installed on host: failed to run pivot: failed to start machine-config-daemon-host.service: exit status 1

Comment 4 Jonathan Lebon 2020-07-27 15:15:09 UTC
Yeah, this is a dupe though this one is targeted at 4.5 while https://bugzilla.redhat.com/show_bug.cgi?id=1859269 is targeted at 4.6. https://bugzilla.redhat.com/show_bug.cgi?id=1859269#c12 makes me think it might be related to a glib bump in 8.2.

Anyway, let's keep this one open for now as the 4.5 version of this bug as I investigate this.

Comment 5 Micah Abbott 2020-07-27 15:27:03 UTC
Targeted for 4.5.z, but the BZ bots might be unhappy about that.  I may end up marking this as a dupe and cloning the original for 4.5

Comment 7 Johnny Liu 2020-07-28 01:41:45 UTC
When I was opening this bug, I already noticed BZ#1859269, but 1859269 is targeted 4.6. Let us use this one for tracking 4.5 issue. Thx.

Comment 8 Jonathan Lebon 2020-07-31 19:14:13 UTC
This is blocked on the RHEL 8.2.0 respin. Bumping to next sprint.


Note You need to log in before you can comment on or make changes to this bug.