Because of the nature of the hypershift install flow, the time it takes CNO to probe MTU causes a lot of head-of-line blocking. Now, in the general case, there's nothing we can do: we don't necessarily know the MTU of the worker nodes until they come online. However, in certain cases, we can safely assume the MTU of the underlay, and remove this ~90 second delay.
Verified on 4.11.0-0.nightly-2022-05-25-193227 AWS 268:[pod/cluster-network-operator-69b866655b-k6nl9/cluster-network-operator] I0601 13:20:41.719200 1 mtu_probe.go:44] AWS cluster, omitting MTU probing and using default of 9001 269:[pod/cluster-network-operator-69b866655b-k6nl9/cluster-network-operator] I0601 13:20:41.719223 1 log.go:195] Using detected MTU 9001 400:[pod/cluster-network-operator-69b866655b-k6nl9/cluster-network-operator] I0601 13:20:43.565223 1 mtu_probe.go:44] AWS cluster, omitting MTU probing and using default of 9001 401:[pod/cluster-network-operator-69b866655b-k6nl9/cluster-network-operator] I0601 13:20:43.565273 1 log.go:195] Using detected MTU 9001 It looks like MCO doesn't exist for the hosted cluster, so MTU migration of the hosted cluster is not supported.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Important: OpenShift Container Platform 4.11.0 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2022:5069