Description of problem: On a DU node deployed via ZTP flow we can see duplicte' intel_iommu=on iommu=pt' entries in /proc/cmdline. ssh core.lab.eng.rdu2.redhat.com 'cat /proc/cmdline' BOOT_IMAGE=(hd3,gpt3)/ostree/rhcos-9b401531c3ea4339fa17ccc833c28a91c917b6dd04f50ee078703bdbb3abf56f/vmlinuz-4.18.0-305.19.1.rt7.91.el8_4.x86_64 random.trust_cpu=on console=tty0 console=ttyS0,115200n8 ignition.platform.id=metal ostree=/ostree/boot.1/rhcos/9b401531c3ea4339fa17ccc833c28a91c917b6dd04f50ee078703bdbb3abf56f/0 ip=ens2f0:dhcp6 root=UUID=f19b44d7-1c06-4c42-85e8-7f685ff7d78f rw rootflags=prjquota intel_iommu=on iommu=pt skew_tick=1 nohz=on rcu_nocbs=2-23,26-47 tuned.non_isolcpus=03000003 intel_pstate=disable nosoftlockup tsc=nowatchdog intel_iommu=on iommu=pt isolcpus=managed_irq,2-23,26-47 systemd.cpu_affinity=0,1,24,25 default_hugepagesz=1G hugepagesz=1G hugepages=32 idle=poll rcupdate.rcu_normal_after_boot=0 nohz_full=2-23,26-47 Version-Release number of selected component (if applicable): 4.9.0 How reproducible: not always Steps to Reproduce: 1. Deploy DU node via ZTP process siteconfig and policygentemplates are in http://registry.kni-qe-0.lab.eng.rdu2.redhat.com:3000/kni-qe/ztp-site-configs/src/kni-qe-1-4.9, site kni-qe-2 Actual results: ssh core.lab.eng.rdu2.redhat.com 'cat /proc/cmdline' BOOT_IMAGE=(hd3,gpt3)/ostree/rhcos-9b401531c3ea4339fa17ccc833c28a91c917b6dd04f50ee078703bdbb3abf56f/vmlinuz-4.18.0-305.19.1.rt7.91.el8_4.x86_64 random.trust_cpu=on console=tty0 console=ttyS0,115200n8 ignition.platform.id=metal ostree=/ostree/boot.1/rhcos/9b401531c3ea4339fa17ccc833c28a91c917b6dd04f50ee078703bdbb3abf56f/0 ip=ens2f0:dhcp6 root=UUID=f19b44d7-1c06-4c42-85e8-7f685ff7d78f rw rootflags=prjquota intel_iommu=on iommu=pt skew_tick=1 nohz=on rcu_nocbs=2-23,26-47 tuned.non_isolcpus=03000003 intel_pstate=disable nosoftlockup tsc=nowatchdog intel_iommu=on iommu=pt isolcpus=managed_irq,2-23,26-47 systemd.cpu_affinity=0,1,24,25 default_hugepagesz=1G hugepagesz=1G hugepages=32 idle=poll rcupdate.rcu_normal_after_boot=0 nohz_full=2-23,26-47 Expected results: No duplicate entries in /proc/cmdline Additional info: Must-gather available at: http://registry.kni-qe-0.lab.eng.rdu2.redhat.com:8080/images/must-gather.local.5832368568665900044/
When this issue happens, master MCP will be in degraded state with error in appending iommu kernel args, it will stay in Degraded and Updating status for extended amount of time, and eventually give in to append the extra kernel args. So we could observe much longer than avg deployment time when this issue happens.
I believe this is caused by the same root cause as described in https://bugzilla.redhat.com/show_bug.cgi?id=2021151
*** This bug has been marked as a duplicate of bug 2021151 ***