Bug 465365 - Upgraded x86_64 520 Sat locks up when performing a Kickstart with the Config Management flag.
Summary: Upgraded x86_64 520 Sat locks up when performing a Kickstart with the Config ...
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat Satellite 5
Classification: Red Hat
Component: Provisioning
Version: 520
Hardware: All
OS: Linux
medium
high
Target Milestone: ---
Assignee: Mike McCune
QA Contact: Steve Salevan
URL:
Whiteboard:
Depends On:
Blocks: 457075
TreeView+ depends on / blocked
 
Reported: 2008-10-02 21:33 UTC by Jeff Browning
Modified: 2009-09-10 19:23 UTC (History)
4 users (show)

Fixed In Version: sat530
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2009-09-10 19:23:41 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
error_log from test03-64 (621.13 KB, application/octet-stream)
2008-10-02 21:35 UTC, Jeff Browning
no flags Details
catalina.out from test03-64 (196.87 KB, application/octet-stream)
2008-10-02 21:35 UTC, Jeff Browning
no flags Details

Description Jeff Browning 2008-10-02 21:33:41 UTC
Description of problem:
There is a particular kickstart I've been working with on test03-64 that causes httpd and Jabber to shut down, and eventually the system to lock up if not canceled. Please see the repro steps for full details.

Version-Release number of selected component (if applicable):


How reproducible:
100%

Steps to Reproduce:
1. Install Sat 511 with an embedded DB on a testxx-64 box.
2. Sync the RHEL5 i386 base and tools channels.
3. Upgrade the Sat to 520
4. Register a RHEL4 system to the Sat
5. Create a Kickstart for the RHEL5 base channel, the RHEL5 tools channel, and that includes the option to install RHN Configuration Management on the target system.
6. Using this Kickstart file, provision the RHEL4 box.

  
Actual results:
The Kickstart itself is very slow, and if you monitor top on the Sat box, you'll notice that over time the spikes in CPU usage of kswamp0 grow larger and larger until they reach the high 90%s.

Eventually (it could take an hour), the system will grind to a hault and will have to be manually rebooted, as terminal access to the machine will become unresponsive. You may also encounter a situation where the WebUI fails to refresh during the Kickstart. At this point, restarting the rhn-satellite service will show that both httpd and Jabber had already been stopped. Restarting rhn-satellite at this point will spare the system the full lockup that occurs if you let it continue to run past this point.

The target system is not completely kickstarted, and must be recovered to use again.

Expected results:
This should complete in a timely manner without any problems.


Additional info:
My Sat: test03-64.rhndev.redhat.com
The i386 systems I'm reproduced this one: fjs-0-01 and rlx-0-06
The kickstart file in question on test03-64: Jeff_RHEL_5

I am attaching the error_log and cataline.out files from test03-64. The dates of interest should be 10/01 and 10/02.

Also, when this same type of Kickstart is tried on an i386 RHEL4 520 Sat (not upgraded), the Kickstart completes with a failure. The three rhncfg files fail to install on the target system.

Comment 1 Jeff Browning 2008-10-02 21:35:11 UTC
Created attachment 319297 [details]
error_log from test03-64

Comment 2 Jeff Browning 2008-10-02 21:35:38 UTC
Created attachment 319298 [details]
catalina.out from test03-64

Comment 3 Brad Buckingham 2008-10-03 13:24:48 UTC
Jeff,  
Do you see this problem only with an x86_64 Satellite?
Does the problem occur on 520 only ? (i.e. not on 511)
Does it occur on a 520 fresh install or only after an upgrade from 511 to 520?

Comment 4 Brad Buckingham 2008-10-03 13:26:06 UTC
Input requested for comment #3.

Comment 5 Jeff Browning 2008-10-07 20:48:04 UTC
Provided information for this problem offline.

So far we are only seeing it on x86_64, 520, with RHEL5 kickstarts.

Comment 10 Brad Buckingham 2008-10-09 19:02:26 UTC
moving to sat530-triage since this is not a regression... 

The behavior we see may be the what is expected; however, it would be worthwhile to understand why the kickstart is requiring so much memory/swap resources to complete successfully.

Comment 12 Mike McCune 2009-02-05 23:18:17 UTC
I'm actually tracking and fixing this one here:


https://bugzilla.redhat.com/show_bug.cgi?id=472595

I'm going to be rewriting the handler that sends down RPMs during a kickstart to be done in Java.

Comment 13 Mike McCune 2009-03-19 02:19:02 UTC
moving to MODIFIED as I also fixed 472595

Comment 14 Mike McCune 2009-03-23 21:18:10 UTC
This landed in:

/mnt/redhat/devel/candidate-trees/RHN-Proxy-5.3.0-RHEL4-re20090323.0/

I recommend a regression test against existing kickstart functionality.

Comment 15 Steve Salevan 2009-07-20 16:33:12 UTC
VERIFIED on 7/14 x86_64 build.

Comment 16 Steve Salevan 2009-08-14 18:51:08 UTC
RELEASE_PENDING from latest Stage build.

Comment 17 Brandon Perkins 2009-09-10 19:23:41 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on therefore solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHEA-2009-1434.html


Note You need to log in before you can comment on or make changes to this bug.