Bug 601461

Summary: Auto installation onto NetApp iSCSI storage hangs with Broadcom Corporation NetXtreme II BCM5708 Gigabit Ethernet (bnx2)
Product: Red Hat Enterprise Linux 5 Reporter: Vivian Bian <vbian>
Component: ovirt-node Assignee: Alan Pevec <apevec>
Status: CLOSED ERRATA QA Contact: Virtualization Bugs <virt-bugs>
Severity: high Docs Contact:
Priority: high    
Version: 5.5 CC: benlu, cpelland, gouyang, llim, mshao, ovirt-maint, ycui
Target Milestone: rc Keywords: TestOnly
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
On machines with the Broadcom NetXtreme II BCM5708 Gigabit Ethernet controller, an automatic installation of Red Hat Enterprise Virtualization Hypervisor onto NetApp iSCSI storage could stop responding during an attempt to restart the machine. With this update, this error no longer occurs, and the automatic installation no longer fails to reboot the computer.
Story Points: ---
Clone Of: Environment:
Last Closed: 2011-01-13 18:41:05 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
serial console log for reproducer with initscript changed with set -x in /etc/rc none
serial console log for reproducer with initscript changed with set -x in the __umount_loop() function of /etc/init.d/functions none
kernel_panic2.txt none

Comment 1 Vivian Bian 2010-06-08 03:00:48 UTC
QE lab bnx2 info
==================

[root@intel-5205-32-1 ~]# lspci |grep Ethernet -i
03:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708 Gigabit Ethernet (rev 12)
07:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708 Gigabit Ethernet (rev 12)
09:01.0 Ethernet controller: QLogic Corp. ISP4032-based Ethernet IPv6 NIC
[root@intel-5205-32-1 ~]# lspci -n |grep 03:00.0
03:00.0 0200: 14e4:164c (rev 12) 

QE checked 
============
The system hangs while unmounting filesystems at the end of the auto installation, during the reboot.


devel lab bnx2 info
=====================

 # lspci | grep Ethernet -i
 39:02.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5706 Gigabit Ethernet (rev 02)


devel checked 
================

pxeboot command line:
management_server=rhevm.virt.bos.redhat.com storage_vol=:::::
hostname=rhev.virt.bos.redhat.com local_boot console=tty1
console=ttyS0,115200n8 rootpw=8QkUVaou1iVw6
storage_init=/dev/mapper/3600c0ff000d7a*
netconsole_server=rhevm.virt.bos.redhat.com 

Did not encounter this hang.
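
When comparing command lines between the QE and devel setups, it can help to split the boot parameters into key/value pairs. The following is a minimal sketch only: `parse_cmdline` is a hypothetical helper written for this illustration, not how RHEV-H itself reads its parameters, and the sample string is a subset of the command line above.

```python
def parse_cmdline(cmdline):
    """Split a kernel command line into {key: [values]}.

    Hypothetical helper for illustration. A key may repeat (e.g. console=),
    so values are collected in a list; bare flags such as local_boot
    are stored with the value None.
    """
    params = {}
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        params.setdefault(key, []).append(value if sep else None)
    return params


# Subset of the pxeboot command line quoted above.
sample = (
    "storage_vol=::::: local_boot console=tty1 console=ttyS0,115200n8 "
    "storage_init=/dev/mapper/3600c0ff000d7a*"
)
params = parse_cmdline(sample)
print(params["console"])
print(params["storage_init"])
```

Keeping repeated keys as lists makes it easy to see that both a VGA and a serial console were requested.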



conclusion
==========
This bug needs further investigation to narrow down where the problem lies. In the meantime, please add a release note for this quirk.

Comment 2 Vivian Bian 2010-06-09 06:11:18 UTC
QE installed RHEV-H onto Dell EqualLogic iSCSI storage with a Broadcom NIC and did not hit the hang when unmounting filesystems.

QE is now trying to install RHEL 5.5 onto NetApp iSCSI storage with a Broadcom NIC. After collecting the needed information, we will decide whether this should be a blocker bug.

Comment 3 Vivian Bian 2010-06-09 08:03:35 UTC
Created attachment 422467 [details]
serial console log for reproducer with initscript changed with set -x in /etc/rc

This file is for comparing installations onto NetApp iSCSI storage. It records three installation runs. The last one is the reproducer, which is the PXE auto installation. The other runs are automatic installations from CD-ROM.

Comment 5 Vivian Bian 2010-06-12 10:04:12 UTC
Created attachment 423473 [details]
serial console log for reproducer with initscript changed with set -x in the __umount_loop() function of /etc/init.d/functions

Comment 9 Ying Cui 2010-10-14 06:22:09 UTC
Created attachment 453380 [details]
kernel_panic2.txt

Comment 12 Ying Cui 2010-10-20 09:38:29 UTC
Regarding comment #9, I have filed a new bug for the kernel panic issue:
 https://bugzilla.redhat.com/show_bug.cgi?id=644777

Comment 13 Alan Pevec 2010-11-03 22:26:14 UTC
The original issue looks like a duplicate of bug 583218; the next RHEV-H 5.6 build will pick up initscripts-8.45.33-1.el5.

Comment 16 Guohua Ouyang 2010-12-07 07:59:07 UTC
I tried to reproduce the bug on the same machine (intel-5205-32-1) with the same storage (NetApp iSCSI storage) and could not reproduce the hang. Test matrix:

Build             auto-install-by       times
5.5-2.2.7.3       pxe                   8
5.5-2.2.7.3       usb                   2
5.5-2.2.1         cdrom                 2
5.5-2.2.8.4       pxe                   1
5.5-2.2.8.5       pxe                   2
5.6.4             pxe                   3
5.6-5.1           pxe                   2
5.6-6.1           pxe                   4
5.6-6.1           usb                   1

In total, the issue did not reproduce in more than 20 attempts.
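
As a quick cross-check of the matrix above, the per-row counts can be tallied. This is an illustrative sketch only; the rows are copied from the table, and the variable names are invented for this example.

```python
# Rows copied from the test matrix: (build, install method, number of runs).
runs = [
    ("5.5-2.2.7.3", "pxe", 8),
    ("5.5-2.2.7.3", "usb", 2),
    ("5.5-2.2.1", "cdrom", 2),
    ("5.5-2.2.8.4", "pxe", 1),
    ("5.5-2.2.8.5", "pxe", 2),
    ("5.6.4", "pxe", 3),
    ("5.6-5.1", "pxe", 2),
    ("5.6-6.1", "pxe", 4),
    ("5.6-6.1", "usb", 1),
]

# Total number of installation attempts across all builds.
total = sum(count for _, _, count in runs)

# Attempts grouped by installation method.
by_method = {}
for _, method, count in runs:
    by_method[method] = by_method.get(method, 0) + count

print(total)
print(by_method)
```

With the rows as listed, the tally is 25 runs, 20 of them via PXE.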

Comment 17 Guohua Ouyang 2010-12-13 09:02:51 UTC
Tried to reproduce this bug about 15 times on build 5.6-7 without success, so it seems fixed.
Closing this bug for now, as it has a low reproduction rate; will reopen it if it is encountered again.

Comment 18 Guohua Ouyang 2010-12-13 09:09:25 UTC
(In reply to comment #17)
> Tried to reproduce this bug about 15 times on build 5.6-7 without success, so it seems fixed.
> Closing this bug for now, as it has a low reproduction rate; will reopen it
> if it is encountered again.

Clarification: I reproduced the bug on build 5.5-2.2.8.5 once, not every time.
Status: verified.

Comment 19 Ying Cui 2010-12-13 09:14:05 UTC
According to comment 17 and comment 18, changing the status to 'Verified'.

Comment 20 Jaromir Hradilek 2011-01-06 13:52:44 UTC
    Technical note added. If any revisions are required, please edit the "Technical Notes" field
    accordingly. All revisions will be proofread by the Engineering Content Services team.
    
    New Contents:
On machines with the Broadcom NetXtreme II BCM5708 Gigabit Ethernet controller, an automatic installation of Red Hat Enterprise Virtualization Hypervisor onto NetApp iSCSI storage could stop responding during an attempt to restart the machine. With this update, this error no longer occurs, and the automatic installation no longer fails to reboot the computer.

Comment 22 errata-xmlrpc 2011-01-13 18:41:05 UTC
An advisory has been issued which should resolve the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2011-0148.html