Bug 2016600

Summary: Duplicate iommu entries in /proc/cmdline on a DU node deployed via ZTP process
Product: OpenShift Container Platform Reporter: Marius Cornea <mcornea>
Component: Telco EdgeAssignee: Ian Miller <imiller>
Telco Edge sub component: ZTP QA Contact: yliu1
Status: CLOSED DUPLICATE Docs Contact:
Severity: low    
Priority: unspecified CC: angwang
Version: 4.9   
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-03-04 17:55:20 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Marius Cornea 2021-10-22 08:04:32 UTC
Description of problem:

On a DU node deployed via ZTP flow we can see duplicte' intel_iommu=on iommu=pt' entries in /proc/cmdline.

ssh core.lab.eng.rdu2.redhat.com     'cat /proc/cmdline' 
BOOT_IMAGE=(hd3,gpt3)/ostree/rhcos-9b401531c3ea4339fa17ccc833c28a91c917b6dd04f50ee078703bdbb3abf56f/vmlinuz-4.18.0-305.19.1.rt7.91.el8_4.x86_64 random.trust_cpu=on console=tty0 console=ttyS0,115200n8 ignition.platform.id=metal ostree=/ostree/boot.1/rhcos/9b401531c3ea4339fa17ccc833c28a91c917b6dd04f50ee078703bdbb3abf56f/0 ip=ens2f0:dhcp6 root=UUID=f19b44d7-1c06-4c42-85e8-7f685ff7d78f rw rootflags=prjquota intel_iommu=on iommu=pt skew_tick=1 nohz=on rcu_nocbs=2-23,26-47 tuned.non_isolcpus=03000003 intel_pstate=disable nosoftlockup tsc=nowatchdog intel_iommu=on iommu=pt isolcpus=managed_irq,2-23,26-47 systemd.cpu_affinity=0,1,24,25 default_hugepagesz=1G hugepagesz=1G hugepages=32 idle=poll rcupdate.rcu_normal_after_boot=0 nohz_full=2-23,26-47


Version-Release number of selected component (if applicable):
4.9.0

How reproducible:
not always

Steps to Reproduce:
1. Deploy DU node via ZTP process
siteconfig and policygentemplates are in http://registry.kni-qe-0.lab.eng.rdu2.redhat.com:3000/kni-qe/ztp-site-configs/src/kni-qe-1-4.9, site kni-qe-2


Actual results:
ssh core.lab.eng.rdu2.redhat.com     'cat /proc/cmdline' 
BOOT_IMAGE=(hd3,gpt3)/ostree/rhcos-9b401531c3ea4339fa17ccc833c28a91c917b6dd04f50ee078703bdbb3abf56f/vmlinuz-4.18.0-305.19.1.rt7.91.el8_4.x86_64 random.trust_cpu=on console=tty0 console=ttyS0,115200n8 ignition.platform.id=metal ostree=/ostree/boot.1/rhcos/9b401531c3ea4339fa17ccc833c28a91c917b6dd04f50ee078703bdbb3abf56f/0 ip=ens2f0:dhcp6 root=UUID=f19b44d7-1c06-4c42-85e8-7f685ff7d78f rw rootflags=prjquota intel_iommu=on iommu=pt skew_tick=1 nohz=on rcu_nocbs=2-23,26-47 tuned.non_isolcpus=03000003 intel_pstate=disable nosoftlockup tsc=nowatchdog intel_iommu=on iommu=pt isolcpus=managed_irq,2-23,26-47 systemd.cpu_affinity=0,1,24,25 default_hugepagesz=1G hugepagesz=1G hugepages=32 idle=poll rcupdate.rcu_normal_after_boot=0 nohz_full=2-23,26-47

Expected results:
No duplicate entries in /proc/cmdline

Additional info:
Must-gather available at:
http://registry.kni-qe-0.lab.eng.rdu2.redhat.com:8080/images/must-gather.local.5832368568665900044/

Comment 1 yliu1 2021-11-01 17:02:49 UTC
When this issue happens, master MCP will be in degraded state with error in appending iommu kernel args, it will stay in Degraded and Updating status for extended amount of time, and eventually give in to append the extra kernel args. So we could observe much longer than avg deployment time when this issue happens.

Comment 2 Marius Cornea 2021-11-23 11:54:42 UTC
I believe this is caused by the same root cause as described in https://bugzilla.redhat.com/show_bug.cgi?id=2021151

Comment 3 Angie Wang 2022-03-04 17:55:20 UTC

*** This bug has been marked as a duplicate of bug 2021151 ***