Bug 1896381
| Summary: | NTO fails to load kernel modules | | |
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Jiří Mencák <jmencak> |
| Component: | Node Tuning Operator | Assignee: | Jiří Mencák <jmencak> |
| Status: | CLOSED ERRATA | QA Contact: | Simon <skordas> |
| Severity: | high | Docs Contact: | |
| Priority: | high | | |
| Version: | 4.6 | CC: | sejug |
| Target Milestone: | --- | | |
| Target Release: | 4.6.z | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2020-11-30 16:46:09 UTC | Type: | --- |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
| Bug Depends On: | 1895919 | | |
| Bug Blocks: | | | |
Description
Jiří Mencák
2020-11-10 12:33:26 UTC
$ oc get clusterversion
NAME      VERSION                             AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.6.0-0.nightly-2020-11-22-160856   True        False         50m     Cluster version is 4.6.0-0.nightly-2020-11-22-160856
$ oc get nodes
NAME                                         STATUS   ROLES    AGE   VERSION
ip-10-0-154-108.us-east-2.compute.internal   Ready    master   74m   v1.19.0+43983cd
ip-10-0-156-254.us-east-2.compute.internal   Ready    worker   67m   v1.19.0+43983cd
ip-10-0-168-148.us-east-2.compute.internal   Ready    worker   70m   v1.19.0+43983cd
ip-10-0-176-62.us-east-2.compute.internal    Ready    master   75m   v1.19.0+43983cd
ip-10-0-219-123.us-east-2.compute.internal   Ready    worker   67m   v1.19.0+43983cd
ip-10-0-222-120.us-east-2.compute.internal   Ready    master   76m   v1.19.0+43983cd
$ worker=ip-10-0-156-254.us-east-2.compute.internal
$ oc get pods -o wide | grep $worker
tuned-dqskq 1/1 Running 0 68m 10.0.156.254 ip-10-0-156-254.us-east-2.compute.internal <none> <none>
$ pod=tuned-dqskq
$ oc get node $worker --show-labels
NAME STATUS ROLES AGE VERSION LABELS
ip-10-0-156-254.us-east-2.compute.internal Ready worker 68m v1.19.0+43983cd beta.kubernetes.io/arch=amd64,beta.kubernetes.io/instance-type=m5.large,beta.kubernetes.io/os=linux,failure-domain.beta.kubernetes.io/region=us-east-2,failure-domain.beta.kubernetes.io/zone=us-east-2a,kubernetes.io/arch=amd64,kubernetes.io/hostname=ip-10-0-156-254,kubernetes.io/os=linux,node-role.kubernetes.io/worker=,node.kubernetes.io/instance-type=m5.large,node.openshift.io/os_id=rhcos,topology.ebs.csi.aws.com/zone=us-east-2a,topology.kubernetes.io/region=us-east-2,topology.kubernetes.io/zone=us-east-2a
$ # kubernetes.io/hostname=ip-10-0-156-254 label
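The `kubernetes.io/hostname` value used in the `match` section below can be extracted from the label list rather than copied by hand. A minimal sketch, assuming the labels arrive as one comma-separated `key=value` string as in the `--show-labels` output; `label_value` is a hypothetical helper, not part of `oc`:

```shell
# label_value KEY LABELS
# Print the value of KEY from a comma-separated "k=v,k=v" label string.
# Returns non-zero if KEY is absent. (Hypothetical helper for illustration.)
label_value() {
    local key="$1" labels="$2" pair
    local IFS=,
    for pair in $labels; do          # split the label string on commas
        if [ "${pair%%=*}" = "$key" ]; then
            printf '%s\n' "${pair#*=}"
            return 0
        fi
    done
    return 1
}

# Possible use against the cluster (the labels are the last column):
#   labels=$(oc get node "$worker" --show-labels --no-headers | awk '{print $NF}')
#   label_value kubernetes.io/hostname "$labels"
```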
$ oc create -f- <<EOF
apiVersion: tuned.openshift.io/v1
kind: Tuned
metadata:
  name: fuse-for-buildah
  namespace: openshift-cluster-node-tuning-operator
spec:
  profile:
  - data: |
      [main]
      summary=An OpenShift profile to load 'fuse' module
      include=openshift-node
      [modules]
      fuse=+r
    name: openshift-fuse
  recommend:
  - match:
    - label: kubernetes.io/hostname
      value: ip-10-0-156-254
    priority: 5
    profile: openshift-fuse
EOF
tuned.tuned.openshift.io/fuse-for-buildah created
$ for pr in $(oc get profiles -n openshift-cluster-node-tuning-operator --no-headers | cut -d ' ' -f 1); do echo $pr; oc get profile $pr -n openshift-cluster-node-tuning-operator -o json | jq ".spec.config.tunedProfile"; done
ip-10-0-154-108.us-east-2.compute.internal
"openshift-control-plane"
ip-10-0-156-254.us-east-2.compute.internal
"openshift-fuse"
ip-10-0-168-148.us-east-2.compute.internal
"openshift-node"
ip-10-0-176-62.us-east-2.compute.internal
"openshift-control-plane"
ip-10-0-219-123.us-east-2.compute.internal
"openshift-node"
ip-10-0-222-120.us-east-2.compute.internal
"openshift-control-plane"
$ # Correct profile on correct node
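The check above is done by eye. A sketch of how it could be asserted in a script, assuming the output format printed by the loop (a node name line followed by the JSON-quoted profile name); `profile_for` is a hypothetical helper:

```shell
# profile_for NODE
# Read alternating "node name" / "\"profile\"" lines on stdin and print the
# profile recorded for NODE with the JSON quotes stripped.
# Returns non-zero if NODE is not listed.
profile_for() {
    awk -v node="$1" '
        $0 == node { want = 1; next }                     # next line holds the profile
        want       { gsub(/"/, ""); print; found = 1; exit }
        END        { exit !found }
    '
}
```

Piping the loop's output into `profile_for "$worker"` would then be expected to print `openshift-fuse` for the node used in this report.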
$ oc logs $pod
2020-11-24 19:47:03,616 INFO tuned.plugins.base: instance disk: assigning devices dm-0
2020-11-24 19:47:03,618 INFO tuned.plugins.base: instance net: assigning devices ens5
2020-11-24 19:47:03,647 INFO tuned.plugins.plugin_sysctl: reapplying system sysctl
2020-11-24 19:47:03,653 INFO tuned.daemon.daemon: static tuning from profile 'openshift-node' applied
I1124 20:35:24.183017 2280 tuned.go:281] extracting Tuned profiles
I1124 20:35:24.330718 2280 tuned.go:315] recommended Tuned profile openshift-node content unchanged
I1124 20:35:24.330895 2280 tuned.go:359] written "/etc/tuned/recommend.d/50-openshift.conf" to set Tuned profile openshift-fuse
I1124 20:35:25.385964 2280 tuned.go:563] active profile (openshift-node) != recommended profile (openshift-fuse)
I1124 20:35:25.386008 2280 tuned.go:445] reloading tuned...
I1124 20:35:25.386014 2280 tuned.go:448] sending HUP to PID 3530
2020-11-24 20:35:25,386 INFO tuned.daemon.daemon: stopping tuning
2020-11-24 20:35:25,402 INFO tuned.daemon.daemon: terminating Tuned, rolling back all changes
2020-11-24 20:35:25,409 INFO tuned.daemon.daemon: Running in automatic mode, checking what profile is recommended for your configuration.
2020-11-24 20:35:25,409 INFO tuned.daemon.daemon: Using 'openshift-fuse' profile
2020-11-24 20:35:25,410 INFO tuned.profiles.loader: loading profile: openshift-fuse
2020-11-24 20:35:25,455 INFO tuned.daemon.daemon: starting tuning
2020-11-24 20:35:25,458 INFO tuned.plugins.base: instance cpu: assigning devices cpu1, cpu0
2020-11-24 20:35:25,459 INFO tuned.plugins.plugin_cpu: We are running on an x86 GenuineIntel platform
2020-11-24 20:35:25,461 WARNING tuned.plugins.plugin_cpu: your CPU doesn't support MSR_IA32_ENERGY_PERF_BIAS, ignoring CPU energy performance bias
2020-11-24 20:35:25,463 INFO tuned.plugins.base: instance disk: assigning devices dm-0
2020-11-24 20:35:25,465 INFO tuned.plugins.base: instance net: assigning devices ens5
2020-11-24 20:35:25,469 INFO tuned.plugins.plugin_sysctl: reapplying system sysctl
2020-11-24 20:35:25,511 INFO tuned.daemon.daemon: static tuning from profile 'openshift-fuse' applied
$ # ^^ No errors
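A clean log shows that the daemon applied the profile without complaint, but it does not by itself show that the `fuse` module is loaded. A hedged sketch of both checks follows: `log_clean` scans log text on stdin for `ERROR` lines and for the "static tuning from profile ... applied" message, and `module_loaded` looks a module up in a `/proc/modules`-style file. Both function names are hypothetical, and running the second check on the node itself (for example from an `oc debug node/$worker` session) is an assumption about how one would reach the host, not a step taken from this report.

```shell
# log_clean PROFILE
# Succeeds if the log on stdin contains no ERROR-level line and reports
# that static tuning from PROFILE was applied.
log_clean() {
    awk -v msg="static tuning from profile '$1' applied" '
        / ERROR /       { bad = 1 }
        index($0, msg)  { ok = 1 }
        END             { exit !(ok && !bad) }
    '
}

# module_loaded NAME [FILE]
# Succeeds if NAME is the first field of some line in FILE, which defaults
# to /proc/modules. A module built into the kernel is not listed there.
module_loaded() {
    awk -v mod="$1" '$1 == mod { found = 1; exit } END { exit !found }' \
        "${2:-/proc/modules}"
}
```

For example, `oc logs $pod | log_clean openshift-fuse` exits with status 0 for the log shown above, and `module_loaded fuse` would be run on the worker node.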
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (OpenShift Container Platform 4.6.6 bug fix update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.
https://access.redhat.com/errata/RHBA-2020:5115