Bug 1852619

Summary: must-gather creates empty files occasionally
Product: OpenShift Container Platform Reporter: Denys Shchedrivyi <dshchedr>
Component: Performance Addon OperatorAssignee: Artyom <alukiano>
Status: CLOSED ERRATA QA Contact: Denys Shchedrivyi <dshchedr>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 4.6CC: alukiano, aos-bugs, fromani, grajaiya
Target Milestone: ---   
Target Release: 4.6.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: performance-addon-operator-must-gather-rhel8-container-v4.6.0-35 Doc Type: Bug Fix
Doc Text:
Cause: We tried to collect must-gather logs before the ds was ready. Consequence: Creation of empty files. Fix: Add check for the ds readiness. Result: We do not have empty files anymore, but only files with the real content.
Story Points: ---
Clone Of:
: 1856317 (view as bug list) Environment:
Last Closed: 2021-01-18 17:57:34 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1856317    

Description Denys Shchedrivyi 2020-06-30 21:36:26 UTC
Description of problem:
 Sometimes (rarely) must-gather creates empty files:

[root@dell-r640-004 ~]# ls -la nodes/worker-0/lscpu
-rw-r--r--. 1 root root 0 Jun 30 14:29 nodes/worker-0/lscpu


Version-Release number of selected component (if applicable):
# oc version
Client Version: 4.4.10
Server Version: 4.4.10

must-gather image:
        "url": "https://access.redhat.com/containers/#/registry.access.redhat.com/openshift4/performance-addon-operator-must-gather-rhel8/images/v4.4.0-6"

Comment 1 Francesco Romani 2020-07-01 09:34:06 UTC
(In reply to Denys Shchedrivyi from comment #0)
> Description of problem:
>  Sometimes (rarely) must-gather creates empty files:
> 
> [root@dell-r640-004 ~]# ls -la nodes/worker-0/lscpu
> -rw-r--r--. 1 root root 0 Jun 30 14:29 nodes/worker-0/lscpu
> 
> 
> Version-Release number of selected component (if applicable):
> # oc version
> Client Version: 4.4.10
> Server Version: 4.4.10
> 
> must-gather image:
>         "url":
> "https://access.redhat.com/containers/#/registry.access.redhat.com/
> openshift4/performance-addon-operator-must-gather-rhel8/images/v4.4.0-6"

How often does this happen? Say 1 in 10 times, 1 in 5, 1 in 20?

Comment 2 Denys Shchedrivyi 2020-07-02 13:52:50 UTC
hm, I can't reproduce issue with empty files, but I see another behavior: on every 3-4 runs it does not collect lscpu, lspci and proc_cmdline files for some of worker nodes:

# ls -la nodes/worker-0
total 96876
drwxr-xr-x.  2 root root       81 Jul  2 09:48 .
drwxr-xr-x. 10 root root      159 Jul  2 09:48 ..
-rw-r--r--.  1 root root      198 Jul  2 09:47 lscpu
-rw-r--r--.  1 root root    36451 Jul  2 09:47 lspci
-rw-r--r--.  1 root root      347 Jul  2 09:47 proc_cmdline
-rw-r--r--.  1 root root 99155968 Jul  2 09:48 worker-0_logs_kubelet

# ls -la nodes/worker-1
total 110368
drwxr-xr-x.  2 root root        35 Jul  2 09:48 .
drwxr-xr-x. 10 root root       159 Jul  2 09:48 ..
-rw-r--r--.  1 root root 113016832 Jul  2 09:48 worker-1_logs_kubelet

Comment 3 Denys Shchedrivyi 2020-07-16 15:11:01 UTC
can't reproduce it after 700 runs, bug fixed

Comment 6 errata-xmlrpc 2021-01-18 17:57:34 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.6.12 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:0037