Description of problem:RedHat Support plugin for rhev randomly fails to collect sosreports from hypervisors on a customer setup, while running log-collector from command line works well and as desired. Version-Release number of selected component (if applicable): RHEV 3.4 redhat-support-plugin-rhev-3.4.0-4.el6ev.noarch How reproducible: Randomly, but happens more often than not Steps to Reproduce: 1.Login to supoort plugin for rhev 2.Try collecting sosreports for hypervisors 3. Actual results: Failing to collect sosreport for hypervisors Expected results: It should be able to collect sosreports or report back a meaningful error Additional info:
I have seen this problem before when /tmp does not have enough space to hold all of the SOS reports. When attaching Hypervisor SOS reports the plugin asynchronously runs SOS on the hypervisors, it then transfers each SOS report back to /tmp on the Manger and then uploaded the report to Red Hat. Once uploaded the sos report is deleted. Due to the large nature of the SOS reports and the fact that multiple are selected, space runs out quickly. I am currently investigating why the code is not failing in a better way. An interim solution would be to either add more space to /tmp or upload 1 SOS report at a time. I will us this bug to track progress on finding a better solution to this bug.
I've had a customer try this and it seems that we're getting all of the files from the RHEV-M, RHEV-Hs, sos_pgdump.tar, and the timeskew.txt files, but they are all getting uploaded separately rather than being combined into a sosreport-LogCollector file like normal. A RHEV-M + 4 hypervisors leads to 10 files being uploaded to a case. This process did take approximately an hour to start til upload, for what it's worth, so it doesn't seem like a space issue in this case either.
based on irc discussion
Cause: A timeout was happing during upload of a SOS report. Consequence: This was resulting in a 500 error displayed on the screen and the user thinking the SOS upload failed. Fix: The timeout has now been increased to 2 hours. Result: The plugin now successfully displays the upload status of the SOS reports.
ok, redhat-support-plugin-rhev-3.5.0-1.el6ev.noarch see https://access.redhat.com/support/cases/#/case/01302112
Spenser, please mark the require_doc_text flag as '-' if any of these bugs needs doc for errata or ? if you need to document it and request assistance from doc team? this is needed asap for the last rc build due this week.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHBA-2015-0211.html