1573892 – regenerate applicability of a consumer takes many minutes

Red Hat Satellite engineering is moving the tracking of its product development work on Satellite to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "Satellite project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs will be migrated starting at the end of May. If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "Satellite project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/SAT-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.

Bug 1573892 - regenerate applicability of a consumer takes many minutes

Summary: regenerate applicability of a consumer takes many minutes

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat Satellite
Classification:	Red Hat
Component:	Pulp
Sub Component:
Version:	6.3.1
Hardware:	x86_64
OS:	Linux
Priority:	high
Severity:	high with 1 vote
Target Milestone:	Unspecified
Assignee:	satellite6-bugs
QA Contact:	jcallaha
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2018-05-02 13:03 UTC by Pavel Moravec
Modified:	2023-03-24 14:04 UTC (History)
CC List:	33 users (show)
Fixed In Version:	pulp-rpm-2.13.4.9-1,pulp-2.13.4.11-1
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Clones:	1596341 1596360 (view as bug list)
Environment:
Last Closed:	2018-08-22 20:07:08 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)
tested patch (11.12 KB, patch) 2018-05-31 12:55 UTC, Pavel Moravec	no flags	Details \| Diff
verification screenshot (37.18 KB, image/png) 2018-08-13 20:30 UTC, jcallaha	no flags	Details
View All

Links
System	ID	Priority	Status	Summary	Last Updated
Pulp Redmine	3172	Normal	CLOSED - CURRENTRELEASE	Celery worker consumes large number of memory when regenerating applicability for a consumer that binds to many reposito...	2018-07-09 15:07:06 UTC
Pulp Redmine	3795	Normal	CLOSED - CURRENTRELEASE	Errata are not shown as applicable if epoch info is absent in pkglist	2018-07-09 15:05:31 UTC
Pulp Redmine	3886	Normal	CLOSED - CURRENTRELEASE	CursorNotFound while queueing applicability tasks	2020-01-15 15:01:30 UTC
Red Hat Bugzilla	1523433	high	CLOSED	Celery worker consumes large number of memory when regenerating applicability for a consumer that binds to many reposito...	2021-12-10 15:28:48 UTC

Internal Links: 1523433

Description Pavel Moravec 2018-05-02 13:03:40 UTC

Description of problem:
in a customer setup reproduced internally, a single 

pulp.server.managers.consumer.applicability.regenerate_applicability_for_consumers

task takes many minutes for a single consumer. This makes Satellite unusable when many systems are updated (as they send package profile to Sat what triggers the reg.app. task).

coredumps taken during the task execution showed the task spends almost whole time in the same code that https://bugzilla.redhat.com/show_bug.cgi?id=1523433 refers to. Applying the patch from the BZ led to approx. 1/4 time improvement, but still a reg.app. task running for 5minutes or so is tooo much.

Typical consumer is bound to few repos and (after adding some debugs) the majority of time is spent on RHEL6 and RHEL6 Extras repos calculation (for the consumer):

Apr 24 07:41:21 dell-per820-2 pulp: celery.worker.strategy:INFO: Received task: pulp.server.managers.consumer.applicability.regenerate_applicability_for_consumers[c990e256-a580-476f-8205-044fd8ee807d]
Apr 24 07:41:21 dell-per820-2 pulp: pulp.server.managers.consumer.applicability:INFO: PavelM: regenerate_applicability for bound_repo_id ORG-Linux-Red_Hat_Enterprise_Linux_Server-Red_Hat_Enterprise_Linux_6_Server_RPMs_x86_64_6Server
Apr 24 07:45:02 dell-per820-2 pulp: pulp.server.managers.consumer.applicability:INFO: PavelM: regenerate_applicability for bound_repo_id ORG-Linux-Red_Hat_Enterprise_Linux_Server-Red_Hat_Satellite_Tools_6_2_for_RHEL_6_Server_RPMs_x86_64
Apr 24 07:45:03 dell-per820-2 pulp: pulp.server.managers.consumer.applicability:INFO: PavelM: regenerate_applicability for bound_repo_id ORG-Linux-Red_Hat_Enterprise_Linux_Server-Red_Hat_Enterprise_Linux_6_Server_-_RH_Common_RPMs_x86_64_6Server
Apr 24 07:45:04 dell-per820-2 pulp: pulp.server.managers.consumer.applicability:INFO: PavelM: regenerate_applicability for bound_repo_id ORG-Linux-Red_Hat_Enterprise_Linux_Server-Red_Hat_Enterprise_Linux_6_Server_-_Optional_RPMs_x86_64_6Server
Apr 24 07:47:58 dell-per820-2 pulp: pulp.server.managers.consumer.applicability:INFO: PavelM: regenerate_applicability for bound_repo_id ORG-Linux-Red_Hat_Enterprise_Linux_Server-Red_Hat_Enterprise_Linux_6_Server_-_Extras_RPMs_x86_64
Apr 24 07:47:58 dell-per820-2 pulp: celery.worker.job:INFO: Task pulp.server.managers.consumer.applicability.regenerate_applicability_for_consumers[c990e256-a580-476f-8205-044fd8ee807d] succeeded in 397.601516957s: None

See some further observation in:
https://bugzilla.redhat.com/show_bug.cgi?id=1523433#c17
https://bugzilla.redhat.com/show_bug.cgi?id=1523433#c18

Reproducer:
https://bugzilla.redhat.com/show_bug.cgi?id=1523433#c19 (beaker default password)


Version-Release number of selected component (if applicable):
6.3.1


How reproducible:
100%


Steps to Reproduce:
1. Use the reproducer machine / bz1523433#c19 (beaker default password)
2. Check the time the reg.app. task will take


Actual results:
>5minutes for the specified consumers


Expected results:
below 1 minute (?) will be a win


Additional info:
Problem seen on 6.2.14, customer upgrade to 6.3.1 didnt help here; reproducer machines are being updated to 6.3.1

Sizes of some mongo collections:
# for i in consumer_bindings consumers consumer_unit_profiles erratum_pkglists repo_content_units repo_profile_applicability repos units_erratum units_package_group units_rpm ; do echo $i $(mongo pulp_database --eval "db.${i}.count()" | grep "^[0-9]"); done
consumer_bindings 15766
consumers 4641
consumer_unit_profiles 4628
erratum_pkglists 188923
repo_content_units 16372066
repo_profile_applicability 197668
repos 4433
units_erratum 23975
units_package_group 89676
units_rpm 238287
#

(is the 16M repo_content_units the key slow down factor?)

Comment 3 Pavel Moravec 2018-05-04 12:26:48 UTC

Another observation from the same customer / user scenario: remove orphans takes >6 hours (and still running with 100%CPU on mongo).

Comment 4 Pavel Moravec 2018-05-07 15:12:41 UTC

(In reply to Pavel Moravec from comment #3)
> Another observation from the same customer / user scenario: remove orphans
> takes >6 hours (and still running with 100%CPU on mongo).

Remove orphans took over 2 days there :-S

Comment 22 pulp-infra@redhat.com 2018-05-18 09:03:23 UTC

The Pulp upstream bug status is at MODIFIED. Updating the external tracker on this bug.

Comment 23 pulp-infra@redhat.com 2018-05-18 09:03:29 UTC

The Pulp upstream bug priority is at Normal. Updating the external tracker on this bug.

Comment 24 pulp-infra@redhat.com 2018-05-18 09:33:11 UTC

All upstream Pulp bugs are at MODIFIED+. Moving this bug to POST.

Comment 31 Pavel Moravec 2018-05-23 07:51:05 UTC

There must be a bug in serializers part of the patch, as units search of a repo fails. Try searches like:

pulpAdminPassword=$(grep ^default_password /etc/pulp/server.conf | cut -d' ' -f2)

repo=whatever-Repository-you-have

curl -i -H "Content-Type: application/json" -X POST -d "{\"criteria\":{\"type_ids\":[\"erratum\"],\"fields\":{\"unit\":[],\"association\":[\"unit_id\"]}}}" -u admin:$pulpAdminPassword https://$(hostname -f)/pulp/api/v2/repositories/${repo}/search/units/

(this POST request is queried by katello when processing Katello::Api::Rhsm::CandlepinProxiesController#get requests)

Comment 36 Pavel Moravec 2018-05-31 12:55:39 UTC

Created attachment 1446278 [details]
tested patch

Tested the cumulative patch of pulp_rpm PRs 1107 (without unit tests) and 1111 - see attached, applied via:

cd /usr/lib/python2.7/site-packages/pulp_rpm
cat /root/bz1573892-improvement-and-serializers.patch | patch -p3

(the above can be shared as officially _untested_ patch; "yum reinstall pulp-rpm-plugins" is a rollback)


My testing results are all green:

(*) reg.app. on the "benchmarked" consumers was still similarly significantly faster (3-20 times, now)

(*) reg.app. properly updates errata applicability (played with downloading&upgrading&removing a package)

(*) errata search works fine:

repo=someRepoName
curl -H "Content-Type: application/json" -X POST -d "{\"criteria\":{\"type_ids\":[\"erratum\"]}}" -u admin:$pulpAdminPassword https://$(hostname -f)/pulp/api/v2/repositories/${repo}/search/units/

(*) recursive units association works fine (tested per #c28)

(*) previously failing errata search per #c31 works fine

(*) tried several CV actions, all work OK

Comment 40 pulp-infra@redhat.com 2018-06-28 13:14:26 UTC

The Pulp upstream bug status is at MODIFIED. Updating the external tracker on this bug.

Comment 41 pulp-infra@redhat.com 2018-06-28 13:14:33 UTC

The Pulp upstream bug priority is at Normal. Updating the external tracker on this bug.

Comment 44 pulp-infra@redhat.com 2018-07-09 15:05:33 UTC

The Pulp upstream bug status is at CLOSED - CURRENTRELEASE. Updating the external tracker on this bug.

Comment 45 pulp-infra@redhat.com 2018-07-09 15:07:07 UTC

The Pulp upstream bug status is at CLOSED - CURRENTRELEASE. Updating the external tracker on this bug.

Comment 53 pulp-infra@redhat.com 2018-07-27 17:04:00 UTC

Requesting needsinfo from upstream developer ttereshc because the 'FailedQA' flag is set.

Comment 54 pulp-infra@redhat.com 2018-07-27 17:04:14 UTC

Requesting needsinfo from upstream developer ttereshc because the 'FailedQA' flag is set.

Comment 65 pulp-infra@redhat.com 2018-07-30 14:04:06 UTC

Requesting needsinfo from upstream developer ttereshc because the 'FailedQA' flag is set.

Comment 66 pulp-infra@redhat.com 2018-07-30 14:04:19 UTC

Requesting needsinfo from upstream developer ttereshc because the 'FailedQA' flag is set.

Comment 68 pulp-infra@redhat.com 2018-07-30 19:15:43 UTC

Requesting needsinfo from upstream developer ttereshc because the 'FailedQA' flag is set.

Comment 69 pulp-infra@redhat.com 2018-07-30 19:15:58 UTC

Requesting needsinfo from upstream developer ttereshc because the 'FailedQA' flag is set.

Comment 75 Mike McCune 2018-08-02 17:58:30 UTC

There was a missed CP as we didn't associate 3886  https://pulp.plan.io/issues/3886 to this bug, moving back to POST

Comment 76 Mike McCune 2018-08-02 18:01:42 UTC

Ignore above comment, I was looking at the wrong RPM/repo

Comment 77 pulp-infra@redhat.com 2018-08-03 11:34:29 UTC

The Pulp upstream bug status is at MODIFIED. Updating the external tracker on this bug.

Comment 78 pulp-infra@redhat.com 2018-08-03 11:34:38 UTC

The Pulp upstream bug priority is at Normal. Updating the external tracker on this bug.

Comment 79 jcallaha 2018-08-13 20:30:43 UTC

Created attachment 1475668 [details]
verification screenshot

Verified in Satellite 6.3.3 Snap 2.

Regenerate Applicability now only takes me less than a minute for RHEL 6 and RHEL 7 systems.

RHEL 6 Systems had 135 applicable updates.
RHEL 7 Systems had 376 applicable updates.

See attached screenshot for task execution times.

Comment 81 errata-xmlrpc 2018-08-22 20:07:08 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:2550

Comment 82 pulp-infra@redhat.com 2020-01-15 15:01:31 UTC

The Pulp upstream bug status is at CLOSED - CURRENTRELEASE. Updating the external tracker on this bug.

Note You need to log in before you can comment on or make changes to this bug.

amasolov
andrew.schofield
bkearney
bmbouter
cduryee
cmarinea
daniele
daviddavis
dconsoli
dkliban
fgarciad
gapatil
ggainey
hyu
ipanova
jentrena
jsenkyri
jstrong
kabbott
katello-qa-list
ktordeur
mhrivnak
mjia
mkearey
mmccune
mruzicka
pcreech
pdwyer
pmoravec
rchan
rplevka
ttereshc
vijsingh