Bug 2124419
| Summary: | Jobs pushed in MQTT queue is not delivered if yggdrasild was not running and communicating with the right broker before the jobs were pushed | | |
|---|---|---|---|
| Product: | Red Hat Satellite | Reporter: | Sayan Das <saydas> |
| Component: | Remote Execution | Assignee: | Adam Ruzicka <aruzicka> |
| Status: | CLOSED ERRATA | QA Contact: | Peter Ondrejka <pondrejk> |
| Severity: | high | Docs Contact: | |
| Priority: | high | | |
| Version: | 6.12.0 | CC: | ahumbe, aruzicka, bbuckingham, mdolezel, pcreech, pmoravec, saydas, timo.alatalo |
| Target Milestone: | 6.13.0 | Keywords: | Triaged |
| Target Release: | Unused | | |
| Hardware: | All | | |
| OS: | Linux | | |
| Whiteboard: | | | |
| Fixed In Version: | rubygem-foreman_remote_execution-8.1.0, rubygem-smart_proxy_remote_execution_ssh-0.9.0 | Doc Type: | Known Issue |
| Doc Text: | | Story Points: | --- |
| Clone Of: | 2124287 | Environment: | |
| Last Closed: | 2023-05-03 13:21:46 UTC | Type: | --- |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
| Bug Depends On: | | | |
| Bug Blocks: | 2124287 | | |
Description
Sayan Das
2022-09-06 06:01:01 UTC
This bug should be treated as a product bug to fix the behavior described in https://bugzilla.redhat.com/show_bug.cgi?id=2124287 (https://github.com/RedHatInsights/yggdrasil/issues/82).

I agree that this should get fixed eventually, but I don't see us fixing it in 6.12. If this does not get fixed in yggdrasil itself, we have a workaround in the works that should land in 6.13.

This should be fixed in foreman_remote_execution-8.1.0 and smart_proxy_remote_execution_ssh-0.9.0 in the spirit of [1] and [2]. The MQTT notification is re-sent every 15 minutes (configurable in /etc/foreman-proxy/settings.d/remote_execution_ssh.yml under the mqtt_resend_interval key). There is also a Satellite-wide setting (overridable per job) that sets a time to pickup. If the host does not pick up the job within that interval, the job fails.

[1] - https://issues.redhat.com/browse/SAT-1668
[2] - https://issues.redhat.com/browse/SAT-11349

Verified on Satellite 6.13 snap 5: pending jobs are successfully picked up after yggdrasil starts on the host, and the resend interval can be set as expected.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Important: Satellite 6.13 Release), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:2097
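To make the interaction between the resend interval and the time to pickup easier to reason about, here is a minimal Python sketch of the behavior described above. It is not the smart proxy's code: the names `PullJob`, `notification_times`, and `outcome` are hypothetical, and expressing both settings in seconds is an assumption made for illustration. It only models the rule that a notification is repeated at a fixed interval until the host picks the job up or the time to pickup expires.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class PullJob:
    """Hypothetical model of a pull-mode job; not the plugin's real data structure."""
    created_at: float                     # seconds, when the job was first announced
    time_to_pickup: float                 # seconds the host has to pick the job up
    picked_up_at: Optional[float] = None  # seconds, or None if never picked up


def notification_times(job: PullJob, resend_interval: float) -> List[float]:
    """Return the times at which a notification would be published.

    The first notification goes out when the job is created; repeats follow
    every `resend_interval` seconds until pickup or the pickup deadline.
    """
    if resend_interval <= 0:
        raise ValueError("resend_interval must be positive")
    deadline = job.created_at + job.time_to_pickup
    stop = deadline if job.picked_up_at is None else min(job.picked_up_at, deadline)
    times = [job.created_at]
    t = job.created_at + resend_interval
    while t < stop:
        times.append(t)
        t += resend_interval
    return times


def outcome(job: PullJob) -> str:
    """A job fails if it was not picked up within its time to pickup."""
    deadline = job.created_at + job.time_to_pickup
    if job.picked_up_at is not None and job.picked_up_at <= deadline:
        return "picked_up"
    return "failed"
```

With a 15-minute (900 second) interval and a one-hour time to pickup, a host whose yggdrasild starts 1000 seconds after the job was created would have missed the first notification but can still receive a repeat and pick the job up. A host that never connects causes the job to fail once the hour is over.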