Description of problem:
The host shuts down while a stress test is running on the guest.

Version-Release number of selected component (if applicable):
kvm-83-237.el5

How reproducible:
2/2

Steps to Reproduce:
1. Start the guest with this command line:
qemu-kvm -drive file=/images/RHEL-Server-5.6-64-virtio.qcow2,index=0,if=virtio,media=disk,cache=none,boot=on,format=qcow2 -net nic,vlan=0,model=virtio,macaddr='9a:bd:56:49:18:c0' -net tap,vlan=0,script='/usr/autotest/client/tests/kvm/scripts/qemu-ifup-switch',downscript='no' -m 2048 -smp 2,cores=1,threads=1,sockets=2 -cpu qemu64,+sse2 -soundhw ac97 -spice port=8000,disable-ticketing -qxl 1 -rtc-td-hack -M rhel5.6.0 -boot c -usbdevice tablet -no-kvm-pit-reinjection
2. Run stress on the guest:
# stress --cpu 4 --io 4 --vm 4 --vm-bytes 1020487680 --hdd 4 --hdd-bytes 1048576 --timeout 60
3.

Actual results:
The host shuts down.

Expected results:

Additional info:
1. Host kernel: 2.6.18-266.el5
cpuinfo:
processor : 1
vendor_id : AuthenticAMD
cpu family : 15
model : 67
model name : Dual-Core AMD Opteron(tm) Processor 1216
stepping : 3
cpu MHz : 2400.000
cache size : 1024 KB
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt rdtscp lm 3dnowext 3dnow pni cx16 lahf_lm cmp_legacy svm extapic cr8_legacy
bogomips : 4822.46
TLB size : 1024 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 40 bits physical, 48 bits virtual
power management: ts fid vid ttp tm stc
2. Guest: RHEL 5.6, 64-bit
Will try to get a core file and serial console output.
No core file. I could only capture serial output from before the OS went down; the host shut down as if the power had been turned off.

kvm: 3553: cpu0 unhandled wrmsr: 0x0 data 0
kvm: 3553: cpu1 unhandled wrmsr: 0x0 data 0
kvm: 3553: cpu0 unimplemented perfctr wrmsr: 0xc0010004 data 0x0
kvm: 3553: cpu0 unimplemented perfctr wrmsr: 0xc0010000 data 0x130076
kvm: 3553: cpu0 unimplemented perfctr wrmsr: 0xc0010004 data 0xffffffffffe4682d
kvm: 3553: cpu0 unimplemented perfctr wrmsr: 0xc0010000 data 0x530076
kvm: 3553: cpu1 unimplemented perfctr wrmsr: 0xc0010004 data 0x0
kvm: 3553: cpu1 unimplemented perfctr wrmsr: 0xc0010000 data 0x130076
kvm: 3553: cpu1 unimplemented perfctr wrmsr: 0xc0010004 data 0xffffffffffe4682d
kvm: 3553: cpu1 unimplemented perfctr wrmsr: 0xc0010000 data 0x530076
kvm: 3553: cpu0 unimplemented perfctr wrmsr: 0xc0010000 data 0x0
kvm: 6744: cpu0 unimplemented perfctr wrmsr: 0xc0010004 data 0x0
kvm: 6744: cpu0 unimplemented perfctr wrmsr: 0xc0010000 data 0x130076
kvm: 6744: cpu0 unimplemented perfctr wrmsr: 0xc0010004 data 0xffffffffffe467e8
kvm: 6744: cpu0 unimplemented perfctr wrmsr: 0xc0010000 data 0x530076
kvm: 6744: cpu1 unimplemented perfctr wrmsr: 0xc0010004 data 0x0
kvm: 6744: cpu1 unimplemented perfctr wrmsr: 0xc0010000 data 0x130076
kvm: 6744: cpu1 unimplemented perfctr wrmsr: 0xc0010004 data 0xffffffffffe467e8
kvm: 6744: cpu1 unimplemented perfctr wrmsr: 0xc0010000 data 0x530076
kvm: 6744: cpu0 unimplemented perfctr wrmsr: 0xc0010000 data 0x0
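For anyone reading the serial output, here is a rough Python sketch that decodes the "unimplemented perfctr wrmsr" lines. It assumes the usual AMD K8 layout, which is not stated anywhere in this report: MSR 0xc0010000 is the first performance event-select register and 0xc0010004 is the matching counter, with event in bits 0-7, unit mask in bits 8-15, and USR/OS/INT/EN at bits 16/17/20/22. The names `decode_line` and `decode_evtsel` are made up for this example. Under those assumptions the writes select event 0x76 with an interrupt on overflow, which looks consistent with the guest setting up a perf-counter-based NMI watchdog, but that is a guess and not a confirmed cause of the host shutdown.

```python
import re

# Assumed AMD K8 MSR numbers (taken from general AMD documentation,
# not from this bug report).
PERF_CTL0 = 0xC0010000  # event-select register 0
PERF_CTR0 = 0xC0010004  # performance counter 0

LINE_RE = re.compile(
    r"kvm: (?P<pid>\d+): cpu(?P<cpu>\d+) unimplemented perfctr wrmsr: "
    r"(?P<msr>0x[0-9a-f]+) data (?P<data>0x[0-9a-f]+)"
)


def decode_evtsel(value):
    """Split an event-select value into fields (assumed K8 bit layout)."""
    return {
        "event": value & 0xFF,
        "unit_mask": (value >> 8) & 0xFF,
        "usr": bool(value & (1 << 16)),     # count in user mode
        "os": bool(value & (1 << 17)),      # count in kernel mode
        "int": bool(value & (1 << 20)),     # interrupt on overflow
        "enable": bool(value & (1 << 22)),  # counter enabled
    }


def decode_line(line):
    """Decode one kvm perfctr log line; return None if it does not match."""
    m = LINE_RE.search(line)
    if m is None:
        return None
    msr = int(m.group("msr"), 16)
    data = int(m.group("data"), 16)
    out = {"pid": int(m.group("pid")), "cpu": int(m.group("cpu")), "msr": msr}
    if msr == PERF_CTL0:
        out["evtsel"] = decode_evtsel(data)
    elif msr == PERF_CTR0:
        # Interpret the written value as a signed 64-bit integer; a negative
        # preload means the counter overflows after that many events.
        out["counter"] = data - (1 << 64) if data >> 63 else data
    return out
```

Feeding each line of the serial capture to `decode_line` gives one dict per perfctr write; lines such as "unhandled wrmsr: 0x0 data 0" are skipped and return None.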
What does /var/log/messages on the host say? Can you hook the host up to a regular serial console so we can see the complete messages?
Created attachment 512187
messages info

1. This issue still exists after updating the BIOS to the latest version, 1.4.4.
2. Trying to run the stress test on the host to check whether it crashes (will update with the result tomorrow).
The host didn't crash while running the stress test on the host directly. Repeated 10 times:
# stress --cpu 4 --io 4 --vm 4 --vm-bytes 1020487680 --hdd 4 --hdd-bytes 1048576 --timeout 60
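If it helps with repeating the run, a small Python wrapper along these lines could drive the same stress command in a loop and record any non-zero exit codes. This is only a sketch: `stress_argv` and `run_repeated` are hypothetical helper names, the arguments are copied from the command above, and it assumes `stress` is on the PATH. If the machine powers off mid-run the script dies with it, so it only helps with counting clean iterations.

```python
import subprocess


def stress_argv(timeout_s=60):
    """Build the stress command line used in this report."""
    return [
        "stress",
        "--cpu", "4",
        "--io", "4",
        "--vm", "4",
        "--vm-bytes", "1020487680",
        "--hdd", "4",
        "--hdd-bytes", "1048576",
        "--timeout", str(timeout_s),
    ]


def run_repeated(iterations, runner=subprocess.call):
    """Run stress `iterations` times; return (iteration, exit_code) failures.

    `runner` is injectable so the loop can be exercised without
    actually launching stress.
    """
    failures = []
    for i in range(1, iterations + 1):
        rc = runner(stress_argv())
        if rc != 0:
            failures.append((i, rc))
    return failures
```

Calling `run_repeated(10)` would match the ten repetitions described above; an empty list means every iteration exited cleanly.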
Potential duplicate of Bug 681126 - both AMD without NPT.
Does the same test fail on RHEL 6?
Repeated around 100 times and cannot reproduce it any more. Will continue to test it.