test: Node process segfaulted
is failing frequently in CI, see search results:
https://search.ci.openshift.org/?maxAge=168h&context=1&type=bug%2Bjunit&name=&maxMatches=5&maxBytes=20971520&groupBy=job&search=Node+process+segfaulted

https://prow.ci.openshift.org/view/gcs/origin-ci-test/logs/release-openshift-ocp-installer-e2e-aws-ovn-4.6/1312061952540807168

nodes/ip-10-0-249-199.us-east-2.compute.internal/journal.gz:Oct 02 16:21:44.182453 ip-10-0-249-199 kernel: gensnippet_if[1443]: segfault at 7ffc7fdbefc8 ip 00007f720a676af5 sp 00007ffc7fdbefb0 error 6 in libc-2.28.so[7f720a64f000+1b9000]
nodes/ip-10-0-249-199.us-east-2.compute.internal/journal.gz:Oct 02 16:22:06.595875 ip-10-0-249-199 kernel: gensnippet_if[6038]: segfault at 7ffc600a0ff8 ip 00007f62c38fcadf sp 00007ffc600a1000 error 6 in libc-2.28.so[7f62c38d5000+1b9000]
nodes/ip-10-0-249-199.us-east-2.compute.internal/journal.gz:Oct 02 16:22:31.868243 ip-10-0-249-199 kernel: gensnippet_if[10458]: segfault at 7ffee4a93f10 ip 000055a05a1b25d9 sp 00007ffee4a93ec0 error 6 in bash[55a05a179000+108000]
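
For anyone triaging similar journal entries, the kernel segfault lines above can be summarized with a small shell helper. This is only a sketch: `parse_segfault` is a hypothetical name, not part of any tool referenced in this bug. It decodes the low three bits of the x86 page-fault error code (bit 0: protection fault vs. page not present, bit 1: write vs. read, bit 2: user vs. kernel mode) and prints how far the faulting address is from the stack pointer.

```shell
# Hypothetical helper: summarize one kernel "segfault at" journal line.
# Prints nothing and returns 1 if the line does not match.
parse_segfault() {
    # Pull out: comm, pid, fault address, stack pointer, error code, object.
    fields=$(printf '%s\n' "$1" | sed -n -E 's/.*kernel: ([^[]+)\[([0-9]+)\]: segfault at ([0-9a-f]+) ip ([0-9a-f]+) sp ([0-9a-f]+) error ([0-9]+) in ([^[]+)\[.*/\1 \2 \3 \5 \6 \7/p')
    [ -n "$fields" ] || return 1
    # Word-split the extracted fields into positional parameters.
    set -- $fields
    comm=$1; pid=$2; addr=$3; sp=$4; err=$5; obj=$6

    # Decode the x86 page-fault error code bits.
    cause=not-present; access=read; mode=kernel
    if [ $(( err & 1 )) -ne 0 ]; then cause=protection; fi
    if [ $(( err & 2 )) -ne 0 ]; then access=write; fi
    if [ $(( err & 4 )) -ne 0 ]; then mode=user; fi

    # Distance in bytes between the faulting address and the stack pointer.
    delta=$(( 0x$addr - 0x$sp ))

    echo "${comm} pid=${pid} obj=${obj}: ${mode}-mode ${access}, page ${cause}, fault-sp=${delta}"
}
```

Fed the journal lines from this report, error 6 decodes as a user-mode write to a page that is not present, and the faulting address sits within a few dozen bytes of the stack pointer, which is what one would expect from stack exhaustion.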
Seth, this is on the OCP 4.6 list; can you please evaluate?
This is indeed caused by the same bug as https://bugzilla.redhat.com/show_bug.cgi?id=1884236. gensnippet_if contains a recursive call that overflows the stack. The fix is in console-login-helper-messages v0.19-3, which has been synced into the 4.6 plashets.
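
To illustrate the failure mode only: a shell function that calls itself without a terminating condition eventually exhausts the stack and the process is killed with a segfault. The sketch below shows one common way to bound such recursion with an explicit depth limit. It is not the actual gensnippet_if code or the actual fix shipped in v0.19-3; `walk` and `MAX_DEPTH` are hypothetical names chosen for the example.

```shell
# Hypothetical example: bound recursion with an explicit depth limit.
MAX_DEPTH=64

walk() {
    depth=$1
    # Stop and report an error instead of recursing until the stack overflows.
    if [ "$depth" -ge "$MAX_DEPTH" ]; then
        echo "walk: giving up at depth $depth" >&2
        return 1
    fi
    walk $(( depth + 1 ))
}
```

Calling `walk 0` returns a non-zero status once the limit is reached, so the caller can handle the error rather than crash.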
The new version of `console-login-helper-messages` landed in 46.82.202010021940-0, which is available in the latest nightly-4.6 payloads.
Node process segfaulted

Adding the test name above to see if that helps the sippy test linking.
$ oc get clusterversion
NAME      VERSION                             AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.7.0-0.nightly-2020-10-26-124513   True        False         5h56m   Cluster version is 4.7.0-0.nightly-2020-10-26-124513

$ oc get nodes
NAME                                         STATUS   ROLES    AGE     VERSION
ip-10-0-136-111.us-west-2.compute.internal   Ready    worker   6h11m   v1.19.0+e67f5dc
ip-10-0-138-139.us-west-2.compute.internal   Ready    master   6h16m   v1.19.0+e67f5dc
ip-10-0-166-15.us-west-2.compute.internal    Ready    worker   6h11m   v1.19.0+e67f5dc
ip-10-0-187-209.us-west-2.compute.internal   Ready    master   6h16m   v1.19.0+e67f5dc
ip-10-0-218-114.us-west-2.compute.internal   Ready    worker   6h6m    v1.19.0+e67f5dc
ip-10-0-219-131.us-west-2.compute.internal   Ready    master   6h17m   v1.19.0+e67f5dc

$ oc debug node/ip-10-0-218-114.us-west-2.compute.internal -- chroot /host rpm -q console-login-helper-messages
Starting pod/ip-10-0-218-114us-west-2computeinternal-debug ...
To use host binaries, run `chroot /host`
console-login-helper-messages-0.19-3.rhaos4.6.el8.noarch

Removing debug pod ...
$
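
The check above covers a single worker. A minimal sketch for repeating it on every node follows; it only wraps the same `oc get nodes` and `oc debug` commands used in this comment. `check_pkg_on_nodes` is a hypothetical helper name, and the sketch assumes that the debug-pod status messages go to stderr, so that only the `rpm -q` result is captured.

```shell
# Hypothetical helper: print the installed version of a package on every node.
check_pkg_on_nodes() {
    pkg=$1
    # "oc get nodes -o name" prints one "node/<name>" entry per line.
    for node in $(oc get nodes -o name); do
        # Assumption: "Starting pod/..." and "Removing debug pod ..." are
        # written to stderr, so discarding stderr leaves only rpm's output.
        ver=$(oc debug "$node" -- chroot /host rpm -q "$pkg" 2>/dev/null)
        printf '%s\t%s\n' "$node" "$ver"
    done
}

# Usage (requires a logged-in oc session):
#   check_pkg_on_nodes console-login-helper-messages
```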
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.7.0 security, bug fix, and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2020:5633