Bug 1892159

Summary: Upgrade failed for Task Wait for node to be ready.
Product: OpenShift Container Platform Reporter: Abhishek <aabhishe>
Component: Installer Assignee: Russell Teague <rteague>
Installer sub component: openshift-ansible QA Contact: Gaoyun Pei <gpei>
Status: CLOSED ERRATA Docs Contact:
Severity: high    
Priority: high CC: bleanhar
Version: 3.11.0   
Target Milestone: ---   
Target Release: 3.11.z   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Cause: Output of previous task does not have all values set for validating task execution conditionals.
Consequence: The task fails due to undefined values.
Fix: Explicitly check each step in the value tree to ensure it is defined.
Result: Task conditionals are validated as expected even when not all values are defined.
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-11-12 10:08:21 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---

Description Abhishek 2020-10-28 04:22:29 UTC
openshift-ansible-3.11.306-3.git.1.22c55e7.el7.noarch

Invoked PLAYBOOK: upgrade.yml
/usr/share/ansible/openshift-ansible/playbooks/byo/openshift-cluster/upgrades/v3_11/upgrade.yml


Play [Restart nodes]: /usr/share/ansible/openshift-ansible/playbooks/openshift-node/private/registry_auth.yaml

  - name: Wait for node to be ready
    oc_obj:
      state: list
      kind: node
      name: "{{ l_kubelet_node_name | lower }}"
    register: node_output
    delegate_to: "{{ groups.oo_first_master.0 }}"
    when: inventory_hostname in groups.oo_nodes_to_config
    until:
    - node_output.module_results is defined
    - node_output.module_results.returncode is defined
    - node_output.module_results.results is defined
    - node_output.module_results.returncode == 0
    - node_output.module_results.results[0].status.conditions | selectattr('type', 'match', '^Ready$') | map(attribute='status') | join | bool == True
    # Give the node three minutes to come back online.
    retries: 36
    delay: 5



Error message:

    "msg": "The conditional check 'node_output.module_results.results[0].status.conditions | selectattr('type', 'match', '^Ready$') | map(attribute='status') | join | bool == True' failed. The error was: error while evaluating conditional (node_output.module_results.results[0].status.conditions | selectattr('type', 'match', '^Ready$') | map(attribute='status') | join | bool == True): 'dict object' has no attribute 'status'"
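The error shows that results[0] exists but has no 'status' key, so the final condition raises an error instead of evaluating to false and triggering a retry. Below is a minimal sketch of how the Doc Text fix ("explicitly check each step in the value tree") could look for this task. It is an illustration, not necessarily the change shipped in the advisory. The three added guard lines are assumptions, and the sketch relies on Ansible evaluating the conditions in the list in order and stopping at the first one that is false.

```yaml
# Sketch only: the task from the description with extra "is defined" guards.
- name: Wait for node to be ready
  oc_obj:
    state: list
    kind: node
    name: "{{ l_kubelet_node_name | lower }}"
  register: node_output
  delegate_to: "{{ groups.oo_first_master.0 }}"
  when: inventory_hostname in groups.oo_nodes_to_config
  until:
  - node_output.module_results is defined
  - node_output.module_results.returncode is defined
  - node_output.module_results.results is defined
  - node_output.module_results.returncode == 0
  # Assumed guards: walk the value tree one level at a time so that a
  # missing key makes the condition false (retry) rather than an error.
  - node_output.module_results.results[0] is defined
  - node_output.module_results.results[0].status is defined
  - node_output.module_results.results[0].status.conditions is defined
  - node_output.module_results.results[0].status.conditions | selectattr('type', 'match', '^Ready$') | map(attribute='status') | join | bool == True
  # Give the node three minutes to come back online.
  retries: 36
  delay: 5
```

With guards like these, a node object that is returned without a status block would cause another retry, up to the existing 36 x 5 second limit, instead of failing the play immediately.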

Comment 2 Gaoyun Pei 2020-10-29 09:45:57 UTC
Verified this bug with openshift-ansible-3.11.313-1.git.0.3ccc470.el7.noarch.rpm. 

Upgraded a v3.11.219 cluster to v3.11.306; no error occurred during the step "Wait for node to be ready".

Comment 5 errata-xmlrpc 2020-11-12 10:08:21 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 3.11.317 bug fix and enhancement update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:4430