Bug 643831
Summary: | Guest kernel panic during bonding test with e1000 nic | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 5 | Reporter: | juzhang <juzhang> | ||||||
Component: | kvm | Assignee: | Michael S. Tsirkin <mst> | ||||||
Status: | CLOSED WONTFIX | QA Contact: | Virtualization Bugs <virt-bugs> | ||||||
Severity: | low | Docs Contact: | |||||||
Priority: | low | ||||||||
Version: | 5.6 | CC: | akong, jasowang, jpirko, juzhang, michen, mkenneth, mst, tburke, virt-maint, ykaul | ||||||
Target Milestone: | rc | Keywords: | Triaged | ||||||
Target Release: | --- | ||||||||
Hardware: | All | ||||||||
OS: | Linux | ||||||||
Whiteboard: | |||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||
Doc Text: | Story Points: | --- | |||||||
Clone Of: | 643577 | Environment: | |||||||
Last Closed: | 2011-08-02 15:18:18 UTC | Type: | --- | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Bug Depends On: | 643577 | ||||||||
Bug Blocks: | 580948, 640580 | ||||||||
Attachments: |
|
Description
juzhang
2010-10-18 09:06:35 UTC
Created attachment 454053 [details]
guest crash screendump
Created attachment 454054 [details]
guest crash screendump
can you get kdump instead please we can analyse with crash? (In reply to comment #4) > can you get kdump instead please we can analyse with crash? http://fileshare.englab.nay.redhat.com/pub/kvm/akong/vmcore-bz643831 snip form vmcore #crash /usr/lib/debug/lib/modules/2.6.32-71.el6.x86_64/vmlinux vmcore This GDB was configured as "x86_64-unknown-linux-gnu"... KERNEL: /usr/lib/debug/lib/modules/2.6.32-71.el6.x86_64/vmlinux DUMPFILE: vmcore [PARTIAL DUMP] CPUS: 2 DATE: Mon Oct 18 08:18:29 2010 UPTIME: 00:03:59 LOAD AVERAGE: 0.30, 0.41, 0.21 TASKS: 167 NODENAME: dhcp-91-78.nay.redhat.com RELEASE: 2.6.32-71.el6.x86_64 VERSION: #1 SMP Wed Sep 1 01:33:01 EDT 2010 MACHINE: x86_64 (2826 Mhz) MEMORY: 4 GB PANIC: "Oops: 0000 [#1] SMP " (check log for details) PID: 8569 COMMAND: "ifconfig" TASK: ffff88013809f520 [THREAD_INFO: ffff880139ac2000] CPU: 0 STATE: TASK_RUNNING (PANIC) crash> bt PID: 8569 TASK: ffff88013809f520 CPU: 0 COMMAND: "ifconfig" #0 [ffff880028203a90] machine_kexec at ffffffff8103695b #1 [ffff880028203af0] crash_kexec at ffffffff810b8f08 #2 [ffff880028203bc0] oops_end at ffffffff814cbbd0 #3 [ffff880028203bf0] no_context at ffffffff8104651b #4 [ffff880028203c40] __bad_area_nosemaphore at ffffffff810467a5 #5 [ffff880028203c90] bad_area_nosemaphore at ffffffff81046873 #6 [ffff880028203ca0] do_page_fault at ffffffff814cd658 #7 [ffff880028203cf0] page_fault at ffffffff814caf45 [exception RIP: e1000_clean+274] RIP: ffffffffa0121442 RSP: ffff880028203da0 RFLAGS: 00010246 RAX: ffff880139b5f000 RBX: 0000000000000000 RCX: ffff880139b5f000 RDX: 0000000000000000 RSI: ffffc90001eae000 RDI: ffff880133a348f0 RBP: ffff880028203e60 R8: ffff8800282141c0 R9: 0000000000000000 R10: 000000000000803b R11: ffffffff8172fa80 R12: 0000000000000000 R13: ffff880137709e40 R14: 0000000000000000 R15: 0000000000000001 ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018 #8 [ffff880028203e68] net_rx_action at ffffffff8140fe53 #9 [ffff880028203ec8] __do_softirq at ffffffff81073bd7 #10 [ffff880028203f38] call_softirq at ffffffff810142cc #11 [ffff880028203f50] do_softirq at ffffffff81015f35 #12 [ffff880028203f70] irq_exit at ffffffff810739d5 #13 [ffff880028203f80] do_IRQ at ffffffff814cf915 --- <IRQ stack> --- #14 [ffff880139ac3c38] ret_from_intr at ffffffff81013ad3 [exception RIP: e1000_open+239] RIP: ffffffffa01253ef RSP: ffff880139ac3ce8 RFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff880139ac3d08 RCX: ffffc90001040000 RDX: 0000000000000004 RSI: 0000000000000246 RDI: 0000000000000246 RBP: ffffffff81013ace R8: 0000000000000000 R9: 0000000000000026 R10: 000000000000803b R11: ffffffff8172fa80 R12: ffff880100000000 R13: ffffffff8172fa80 R14: ffffffff810d9f64 R15: ffff880139ac3cb8 ORIG_RAX: ffffffffffffffc4 CS: 0010 SS: 0018 #15 [ffff880139ac3d10] dev_open at ffffffff814115a1 #16 [ffff880139ac3d30] dev_change_flags at ffffffff81410cd1 #17 [ffff880139ac3d70] devinet_ioctl at ffffffff81470b2b #18 [ffff880139ac3e20] inet_ioctl at ffffffff81471b78 #19 [ffff880139ac3e30] sock_ioctl at ffffffff813fcdaa #20 [ffff880139ac3e60] vfs_ioctl at ffffffff8117f182 #21 [ffff880139ac3ea0] do_vfs_ioctl at ffffffff8117f324 #22 [ffff880139ac3f30] sys_ioctl at ffffffff8117f8a1 #23 [ffff880139ac3f80] system_call_fastpath at ffffffff81013172 RIP: 00007f9af550d5f7 RSP: 00007fffd3e0a258 RFLAGS: 00010206 RAX: 0000000000000010 RBX: ffffffff81013172 RCX: 0000000000000062 RDX: 00007fffd3e0ad30 RSI: 0000000000008914 RDI: 0000000000000004 RBP: 00007fffd3e0ae40 R8: 000000000000000a R9: 000000000000000a R10: 00007fffd3e0aab0 R11: 0000000000000202 R12: 000000000060fc40 R13: 00007fffd3e0b020 R14: 0000000000000041 R15: 00007fffd3e0ae40 ORIG_RAX: 0000000000000010 CS: 0033 SS: 002b I lowered the priority since it is not a common use case. Does all the host backend tap are connected to different hosts? Make sure there are no l2 loops. (In reply to comment #9) > I lowered the priority since it is not a common use case. > Does all the host backend tap are connected to different hosts? Make sure there > are no l2 loops. Just one host,boot a guest with 4 virtual e1000 nics,then bonding testing,the following is details CML on rhel5.6 host. RHEL5.6 CML: /usr/libexec/qemu-kvm -no-hpet -usbdevice tablet -rtc-td-hack -m 4G -smp 2 -monitor stdio -drive file=/root/zhangjunyi/rhel6.0_64.qcow2,if=virtio,boot=on,werror=stop -drive file=/root/zhangjunyi/boot.iso,media=cdrom -fda /usr/share/virtio-win/virtio-drivers-1.0.0-45801-1.0.0.vfd -net nic,vlan=0,macaddr=22:11:22:45:66:83,model=e1000 -net tap,vlan=0,script=/etc/qemu-ifup -uuid `uuidgen` -cpu qemu64,+sse2 -balloon none -boot c -vnc :10 -notify all -net nic,macaddr=10:10:20:34:23:13,model=e1000,vlan=1 -net tap,script=/etc/qemu-ifup,vlan=1 -net nic,macaddr=10:10:20:34:23:14,model=e1000,vlan=2 -net tap,script=/etc/qemu-ifup,vlan=2 -net nic,macaddr=10:10:20:34:23:15,model=e1000,vlan=3 -net tap,script=/etc/qemu-ifup,vlan=3 MY questions is are all those tap connected to the same bridge on the host? My hunch that they are and that's wrong. What's the content of /etc/qemu-ifup and the output of brctl show (In reply to comment #11) > MY questions is are all those tap connected to the same bridge on the host? > My hunch that they are and that's wrong. > What's the content of /etc/qemu-ifup and the output of brctl show On host #brctl show bridge name bridge id STP enabled interfaces breth0 8000.0023ae7a6f2e no tap3 tap2 tap1 tap0 eth0 virbr0 8000.000000000000 yes #cat /etc/qemu-ifup #!/bin/sh switch=breth0 /sbin/ifconfig $1 0.0.0.0 up /usr/sbin/brctl addif ${switch} $1 #ifconfig breth0 Link encap:Ethernet HWaddr 00:23:AE:7A:6F:2E inet addr:10.66.91.91 Bcast:10.66.91.255 Mask:255.255.255.0 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:22150182 errors:0 dropped:0 overruns:0 frame:0 TX packets:4592948 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:0 RX bytes:30034121520 (27.9 GiB) TX bytes:2749706908 (2.5 GiB) eth0 Link encap:Ethernet HWaddr 00:23:AE:7A:6F:2E UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:23444522 errors:0 dropped:0 overruns:0 frame:0 TX packets:6024239 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:31896407961 (29.7 GiB) TX bytes:2888201680 (2.6 GiB) Interrupt:58 Memory:febe0000-fec00000 lo Link encap:Local Loopback inet addr:127.0.0.1 Mask:255.0.0.0 UP LOOPBACK RUNNING MTU:16436 Metric:1 RX packets:1046954 errors:0 dropped:0 overruns:0 frame:0 TX packets:1046954 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:0 RX bytes:1425116516 (1.3 GiB) TX bytes:1425116516 (1.3 GiB) tap0 Link encap:Ethernet HWaddr 16:79:FD:23:5B:3B UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:21 errors:0 dropped:0 overruns:0 frame:0 TX packets:332 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:500 RX bytes:3049 (2.9 KiB) TX bytes:56619 (55.2 KiB) tap1 Link encap:Ethernet HWaddr C2:5F:7A:7A:F0:30 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:14 errors:0 dropped:0 overruns:0 frame:0 TX packets:339 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:500 RX bytes:2925 (2.8 KiB) TX bytes:56743 (55.4 KiB) tap2 Link encap:Ethernet HWaddr 9A:CD:EE:B9:2C:81 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:15 errors:0 dropped:0 overruns:0 frame:0 TX packets:338 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:500 RX bytes:3141 (3.0 KiB) TX bytes:56217 (54.8 KiB) tap3 Link encap:Ethernet HWaddr 8A:6F:A3:F6:54:37 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:13 errors:0 dropped:0 overruns:0 frame:0 TX packets:339 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:500 RX bytes:2731 (2.6 KiB) TX bytes:56565 (55.2 KiB) virbr0 Link encap:Ethernet HWaddr 00:00:00:00:00:00 inet addr:192.168.122.1 Bcast:192.168.122.255 Mask:255.255.255.0 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:32529 errors:0 dropped:0 overruns:0 frame:0 TX packets:294001 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:0 RX bytes:1824078 (1.7 MiB) TX bytes:432896202 (412.8 MiB Ok, it looks fine, thanks for the info This request was evaluated by Red Hat Product Management for inclusion in the current release of Red Hat Enterprise Linux. Because the affected component is not scheduled to be updated in the current release, Red Hat is unfortunately unable to address this request at this time. Red Hat invites you to ask your support representative to propose this request, if appropriate and relevant, in the next release of Red Hat Enterprise Linux. This request was erroneously denied for the current release of Red Hat Enterprise Linux. The error has been fixed and this request has been re-proposed for the current release. |