Bug 1447672

Summary:

SSA fails on timeout for large images

Product:

Red Hat CloudForms Management Engine

Reporter:

Einat Pacifici <epacific>

Component:

SmartState Analysis

Assignee:

Erez Freiberger <efreiber>

Status:

CLOSED CURRENTRELEASE

QA Contact:

Einat Pacifici <epacific>

Severity:

high

Docs Contact:

Priority:

high

Version:

5.8.0

CC:

cpelland, dajohnso, dlamotta, efreiber, fsimonce, jhardy, mtayer, obarenbo, simaishi

Target Milestone:

Keywords:

Regression, TestOnly

Target Release:

5.9.0

Flags:

epacific: automate_bug+

Hardware:

Unspecified

OS:

Unspecified

Whiteboard:

container:smartstate

Fixed In Version:

5.9.0.1

Doc Type:

If docs needed, set a value

Doc Text:

Story Points:

---

Clone Of:

Clones:

1450470

Environment:

Last Closed:

2018-03-06 14:34:25 UTC

Type:

Bug

Regression:

---

Mount Type:

---

Documentation:

---

CRM:

Verified Versions:

Category:

---

oVirt Team:

---

RHEL 7.3 requirements from Atomic Host:

Cloudforms Team:

Container Management

Target Upstream Version:

Embargoed:

Bug Depends On:

Bug Blocks:

1450470

Attachments:

Description	Flags
ssa failed task	none

Description Einat Pacifici 2017-05-03 12:46:41 UTC

Created attachment 1275912
ssa failed task

Description of problem:
The CFME podified image is 1.8 GB.
When running SSA on the image, the image download times out, and as a result SSA cannot proceed.

Version-Release number of selected component (if applicable):
5.8.0.12

How reproducible:
Always

Steps to Reproduce:
1. Add an OpenShift 3.5 provider to CFME
2. Go to Compute --> Containers --> Images
3. Find the CFME image
4. Ensure the OpenSCAP policy is assigned to the image
5. Select the image and select Configuration --> Perform SmartState Analysis

Actual results:
The task always fails with a timeout.

Expected results:


Additional info:
Image attached.

Comment 2 Mooli Tayer 2017-05-03 14:16:16 UTC

Erez, please add details on the related Docker change.

Comment 3 Erez Freiberger 2017-05-03 14:41:05 UTC

IIUC, Docker stopped continuing a download after the client disconnects. I can't find it in the docs, but I see it referenced in issues. [2] is where I think it was disabled.


[1] https://docs.docker.com/engine/reference/commandline/pull/#cancel-a-pull
[2] https://github.com/moby/moby/pull/18353
[3] https://github.com/moby/moby/issues/6928
[4] https://github.com/moby/moby/issues/3136
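
To illustrate why this matters for a large image, here is a rough simulation of the two behaviours as I understand them. Nothing here talks to Docker: simulate_pull, the 3 MB/s rate and the 300 second client timeout are all made-up values for the sketch, and only the image size is taken from the report.

```python
def simulate_pull(image_mb, rate_mb_per_s, client_timeout_s, cancel_on_disconnect):
    """Return how many MB the daemon has downloaded when the pull ends.

    Time is simulated in whole seconds; nothing is actually downloaded.
    """
    downloaded = 0.0
    elapsed = 0
    while downloaded < image_mb:
        client_connected = elapsed < client_timeout_s
        if not client_connected and cancel_on_disconnect:
            break  # the daemon abandons the pull once the client is gone
        downloaded = min(image_mb, downloaded + rate_mb_per_s)
        elapsed += 1
    return downloaded


IMAGE_MB = 1800          # roughly the 1.8 GB image from the report
RATE_MB_PER_S = 3        # assumed registry throughput (hypothetical)
CLIENT_TIMEOUT_S = 300   # assumed client-side timeout (hypothetical)

old = simulate_pull(IMAGE_MB, RATE_MB_PER_S, CLIENT_TIMEOUT_S, cancel_on_disconnect=False)
new = simulate_pull(IMAGE_MB, RATE_MB_PER_S, CLIENT_TIMEOUT_S, cancel_on_disconnect=True)
print(f"pull continues after disconnect: {old:.0f} MB of {IMAGE_MB} MB")
print(f"pull cancelled on disconnect:    {new:.0f} MB of {IMAGE_MB} MB")
```

If the pull keeps running in the background, a later retry can find the image already present. If it is cancelled on disconnect, every retry starts over and hits the same timeout.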

Comment 4 Mooli Tayer 2017-05-09 11:51:06 UTC

Some notes from email thread:
In VM scanning there are some timeout adjustment mechanisms that we can adopt [1].
The timeout there is 20 min, not 5. It should be safe to increase our timeout as well.

[1]
https://github.com/manageiq/manageiq/blob/01fb29ed1b9b3e046976bcd123ecfa8de87e649a/app/models/job.rb#L164
https://github.com/manageiq/manageiq/blob/6abb5559f81e73446712488fe27b77be0f5d4fac/app/models/vm_scan.rb#L8-L13
https://github.com/manageiq/manageiq/blob/01fb29ed1b9b3e046976bcd123ecfa8de87e649a/app/models/job.rb#L178-L186
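
For a sense of scale, a quick back-of-the-envelope calculation of the sustained download rate needed to fetch the 1.8 GB image inside each timeout. This is only arithmetic, not ManageIQ code; min_throughput_mb_per_s is a hypothetical helper, and 1.8 GB is treated as 1800 MB.

```python
def min_throughput_mb_per_s(image_mb, timeout_min):
    """Sustained rate (MB/s) needed to download image_mb within timeout_min minutes."""
    return image_mb / (timeout_min * 60)


IMAGE_MB = 1800  # the 1.8 GB image from the report, taken as 1800 MB

for timeout_min in (5, 20):
    rate = min_throughput_mb_per_s(IMAGE_MB, timeout_min)
    print(f"{timeout_min:>2} min timeout: needs at least {rate:.1f} MB/s sustained")
```

With 5 min the pull needs 6.0 MB/s end to end, with 20 min it needs 1.5 MB/s, before counting any time spent on the scan itself.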

Comment 5 Erez Freiberger 2017-05-09 13:24:04 UTC

PR to make the timeout for the scanning jobs configurable: https://github.com/ManageIQ/manageiq-providers-kubernetes/pull/14
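
The general shape of a configurable timeout with a fallback could look like the sketch below. This is not the code from the PR: the settings keys (container_scanning, scanning_job_timeout) and the 20 minute default are assumptions picked for illustration.

```python
# Assumed default, mirroring the 20 min value used for VM scanning.
DEFAULT_SCAN_TIMEOUT_S = 20 * 60


def scanning_job_timeout(settings):
    """Return the scanning job timeout in seconds, falling back to the default.

    `settings` is a plain nested dict standing in for the appliance settings;
    both key names are hypothetical.
    """
    value = settings.get("container_scanning", {}).get("scanning_job_timeout")
    if value is None:
        return DEFAULT_SCAN_TIMEOUT_S
    timeout = int(value)
    if timeout <= 0:
        raise ValueError("scanning_job_timeout must be positive")
    return timeout


print(scanning_job_timeout({}))
print(scanning_job_timeout({"container_scanning": {"scanning_job_timeout": 3600}}))
```

Keeping a default means existing deployments behave the same unless someone overrides the value for unusually large images.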