1603706 – multiple virt-who configurations create multiples users == *undefined method `[]' for nil:NilClass* error on Hypervisor tasks

Red Hat Satellite engineering is moving the tracking of its product development work on Satellite to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "Satellite project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs will be migrated starting at the end of May. If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "Satellite project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/SAT-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.

Bug 1603706 - multiple virt-who configurations create multiples users == *undefined method `[]' for nil:NilClass* error on Hypervisor tasks

Summary: multiple virt-who configurations create multiples users == *undefined method ...

Keywords:
Status:	CLOSED DEFERRED
Alias:	None
Product:	Red Hat Satellite
Classification:	Red Hat
Component:	Subscriptions - virt-who
Sub Component:
Version:	6.3.2
Hardware:	All
OS:	All
Priority:	urgent
Severity:	high
Target Milestone:	Released
Assignee:	satellite6-bugs
QA Contact:	Perry Gagne
Docs Contact:
URL:
Whiteboard:
Depends On:	1615405
Blocks:
TreeView+	depends on / blocked

Reported:	2018-07-19 18:30 UTC by Waldirio M Pinheiro
Modified:	2023-03-24 14:09 UTC (History)
CC List:	22 users (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Clones:	1615405 (view as bug list)
Environment:
Last Closed:	2019-01-18 19:17:50 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)
virt-who outputs and snippets from production.log (5.19 KB, text/plain) 2018-07-23 18:48 UTC, Pablo Hess	no flags	Details
trace from 6.5 snap 1 (15.08 KB, text/plain) 2018-11-09 12:42 UTC, Marek Hulan	no flags	Details
View All

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Bugzilla	1599752	0	high	CLOSED	virt-who failed to upload data to satellite 6 if VM count is more than 17000	2023-09-07 19:14:07 UTC
Red Hat Knowledge Base (Solution)	3930351	0	Performance tune	None	Candlepin performance is much degraded, "Can't write records bigger than the bufferSize" in error log	2019-02-21 06:49:46 UTC

Internal Links: 1599752

Description Waldirio M Pinheiro 2018-07-19 18:30:20 UTC

Description of problem:
Customer with +1 different configuration files *different vCenters*, then virt-who is able to reach all of them and collect the info BUT the behaviour to prepare the data and upload to Satellite is different *causing error on Satellite side*

Version-Release number of selected component (if applicable):
virt-who-0.21.7-1.el7_5.noarch
Satellite 6.3.x

How reproducible:
Just on customer environment. I'm not able to reproduce locally.

Steps to Reproduce:
1. Configure multiple vCenters
2. Execute virt-who -o
3. Check the output

Actual results:
are are able to see one *Sending updated Host-to-guest mapping to* to each *Hosts-to-guests mapping for config*, then customer with 4 config files, we will see
---
Hosts-to-guests mapping for config conf_file_1
Sending updated Host-to-guest mapping to "ORG"

Hosts-to-guests mapping for config conf_file_2
Sending updated Host-to-guest mapping to "ORG"

Hosts-to-guests mapping for config conf_file_3
Sending updated Host-to-guest mapping to "ORG"

Hosts-to-guests mapping for config conf_file_4
Sending updated Host-to-guest mapping to "ORG"
---

Expected results:
See just one *Host-to-guest mapping to* according below
---
Hosts-to-guests mapping for config conf_file_1
Hosts-to-guests mapping for config conf_file_2
Hosts-to-guests mapping for config conf_file_3
Hosts-to-guests mapping for config conf_file_4

Sending updated Host-to-guest mapping to "ORG"
---


Additional info:
More detail below.

Comment 4 Barnaby Court 2018-07-20 15:01:31 UTC

Waldirio, I see one error in the candlepin log file:

RROR org.hornetq.core.server - HQ224016: Caught exception java.lang.IllegalStateException: Can't write records bigger than the bufferSize(102400) on the journal

I would recommend setting the 
candlepin.audit.hornetq.large_msg_size
property in the /etc/candlepin/candlepin.conf file to something larger than 102400 and see if it makes a difference. 

Please try setting that value to 1024000 and see if that fixes the problem.

Comment 5 Barnaby Court 2018-07-20 15:02:51 UTC

Waldirio, if setting the larger value does not not fix the problem please scan the candlepin log file for new errors If you are no longer seeing the HQ224016 error (or something else that stands out) in the candlepin log then the next place to look would be the foreman log.

Comment 7 Pablo Hess 2018-07-23 18:45:46 UTC

On my reproducer increasing large_msg_size to 1024000 as per c#4 resolves the failed upload issue.


I'll be attaching log files and virt-who -o output files (in a tarball) following this scheme:

* Running virt-who on the capsule, identified as $CAPSULE on the logs
* Two hypervisors, $HYPERVISOR01 and $HYPERVISOR02
* Two virt-who config files, virt-who-config-1.conf and virt-who-config-2.conf pointing to respective hypervisors.


Run virt-who -o -c virt-who-config-1.conf on capsule, capture command output on capsule command line and production.log on satellite.
* Files:
   - virt-who-hypervisor01.log
   - virt-who-1-output.out


Run virt-who -o -c virt-who-config-2.conf on capsule, capture command output on capsule command line and production.log on satellite.
* Files:
   - virt-who-hypervisor02.log
   - virt-who-2-output.out


Run virt-who -o on capsule, capture output, and production.log on satellite with a backtrace:
* Files:
   - virt-who-all-together-07.log
   - virt-who-all-together-output-07.out



Add line to /etc/candlepin/candlepin.conf:

 candlepin.audit.hornetq.large_msg_size=1024000

Restart tomcat service, run virt-who -o a few times on capsule (see why below).

Then run virt-who -o on capsule, capture command output, and production.log with no backtrace.
* Files:
   - virt-who-all-together.log
   - virt-who-all-together-output.out



## NOTE: not every run of virt-who was failing before modifying the config file, only between 20% and 50% of runs were failing.

## NOTE 2: after modifying candlepin config file to increase or decrease large_msg_size the first run of virt-who goes as if no modification was made.

Example:

1. Candlepin is running with increased large_msg_size=1024000. No tracebacks on the logs after 10 runs of virt-who -o.
2. Reduce large_msg_size by commenting out the line in candlepin conf file.
3. Restart tomcat
4. Run virt-who -o, no traceback appears.
5. Run virt-who -o again 30 seconds after the last one finished, get a traceback.
6. Further runs of virt-who -o will produce a traceback between 20% and 50% of the time.

Comment 8 Pablo Hess 2018-07-23 18:48:30 UTC

Created attachment 1470037 [details]
virt-who outputs and snippets from production.log

As described on my latest comment.

Comment 33 Marek Hulan 2018-11-09 12:42:40 UTC

Created attachment 1503629 [details]
trace from 6.5 snap 1

Comment 38 William Poteat 2019-01-18 19:17:50 UTC

This BZ has become far too unwieldy. Customer issues are attached simply because there is a problem with virt-who and not because of the specific scenario. This will not help get the issues resolved, just the opposite. Individual problems are likely to be ignored.


If the issue is about a db error in candlepin because the IN clause is too large you should use:
https://bugzilla.redhat.com/show_bug.cgi?id=1599752


If the issue is that you are using multiple users to do hypervisor updates for one org, instead of a single one, you should use:
https://bugzilla.redhat.com/show_bug.cgi?id=1667545


If the issue is that you are trying to submit a hypervisor in the report that does not have a value for the field you have chosen as the hostname then you should use:
https://bugzilla.redhat.com/show_bug.cgi?id=1667522

Note You need to log in before you can comment on or make changes to this bug.

aeladawy
akapse
arahaman
bbuckingham
bcourt
bkearney
cmarinea
dpeess
jalviso
jmcdonald
khowell
ktordeur
mhulan
mjia
mmccune
patalber
phess
pmoravec
rjerrido
syangsao
wpinheir
wpoteat