Bug 1732729
| Summary: | [BUG] Errata Management action/tasks hang | ||
|---|---|---|---|
| Product: | Red Hat Satellite | Reporter: | Imaan <ikaur> |
| Component: | Errata Management | Assignee: | satellite6-bugs <satellite6-bugs> |
| Status: | CLOSED NOTABUG | QA Contact: | Perry Gagne <pgagne> |
| Severity: | unspecified | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 6.6.0 | CC: | bbuckingham, bkearney, mmccune |
| Target Milestone: | Unspecified | ||
| Target Release: | Unused | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2019-09-28 16:12:07 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Imaan
2019-07-24 08:46:42 UTC
Applying the following tunings resolved the situation/problem: https://github.com/redhat-performance/satellite-tune/blob/master/ansible/roles/qpidd-fs-aio-max-nr/tasks/main.yaml https://github.com/redhat-performance/satellite-tune/blob/master/ansible/roles/qdrouterd-max-open-files/tasks/main.yaml https://github.com/redhat-performance/satellite-tune/blob/master/ansible/roles/qpidd-max-open-files/tasks/main.yaml clearing the needinfo as no request was made. We definitely have a '500' connected agents limit in 6.6 with default settings. I configured ~700 agent containers to connect to a Satellite 6.6 server and it maxes out at 500:
# qpid-stat -q --ssl-certificate=/etc/pki/pulp/qpid/client.crt -b amqps://localhost:5671 |grep pulp.agent | wc -l
503
# docker ps | wc -l
700
new connection attempts result in:
Sep 28 11:37:13 ci-vm-10-0-150-175.hosted.upshift.rdu2.redhat.com goferd[13653]: [ERROR][worker-0] gofer.messaging.adapter.connect:33 - connect: proton+amqps://sat-r220-09.lab.eng.rdu2.redhat.com:5647, failed: Connection amqps://sat-r220-09.lab.eng.rdu2.redhat.com:5647 disconnected: Condition('amqp:resource-limit-exceeded', 'local-idle-timeout expired')
the fix is to add these 2 configurations to /etc/foreman-installer/custom-hiera.yaml
qpid::open_file_limit: 65536
qpid::router::open_file_limit: 150100
run 'satellite-installer' and restart. Once applied, clients are able to connect:
Sep 28 11:58:18 ci-vm-10-0-150-175.hosted.upshift.rdu2.redhat.com goferd[13653]: [INFO][pulp.agent.70dc6424-48d7-43bf-92a0-f465df9eea89] gofer.messaging.adapter.connect:30 - connected: proton+amqps://sat-r220-09.lab.eng.rdu2.redhat.com:5647
This is covered in the 6.5 and 6.6 Tuning Guide:
https://access.redhat.com/solutions/4224211
as well as the Tuning Profiles documented:
https://github.com/RedHatSatellite/satellite-support/tree/master/tuning-profiles
Going to close this out as NOTABUG as it is documented in our tuning guides
note, fs.aio-max-nr is not required tuning for 500 gofer/katello-agent clients, just the open_file_limit. |