Description of problem: During an upgrade of logging components on starter-ca-central-1, the logging playbooks were executed but had to be forcibly terminated after running for over 8 hours and generating ~50G of log output. Version-Release number of the following components: Ansible 2.4.3 OCP v3.9.0-0.42.0 A link to a portion of the log output will be attached.
If this merges in as well, it would save some more time/output https://github.com/openshift/openshift-ansible/pull/7150
Installed logging with -vvv on 85 node cluster with openshift-ansible.noarch 3.9.0-0.53.0.git.0.f8f01ef.el7 INSTALLER STATUS **************************************************************************************** Initialization : Complete (0:00:17) Logging Install : Complete (0:06:21) real 6m42.417s user 2m14.022s sys 1m39.366s Output was 4.4MB
2nd -vvv install in 148 node cluster INSTALLER STATUS **************************************************************************************** Initialization : Complete (0:00:38) Logging Install : Complete (0:09:03) real 9m48.398s user 3m2.816s sys 2m55.410s 6.3MB log
@Mike, Many thanks. Have you try the logging redeploy/upgrade?
@Mike Redeploy is ok. Move to verified.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2018:0489