Bug 1024257
Summary: | [SVVP]Host crashed while multiple (4) guests running SVVP test on it | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 6 | Reporter: | Min Deng <mdeng> |
Component: | qemu-kvm | Assignee: | Virtualization Maintenance <virt-maint> |
Status: | CLOSED NOTABUG | QA Contact: | Virtualization Bugs <virt-bugs> |
Severity: | unspecified | Docs Contact: | |
Priority: | unspecified | ||
Version: | 6.5 | CC: | acathrow, areis, bcao, bsarathy, gleb, juzhang, michen, mkenneth, qzhang, rhod, virt-maint |
Target Milestone: | rc | ||
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2013-10-29 12:49:59 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Min Deng
2013-10-29 09:04:29 UTC
Assign it to kernel component firstly,feel free to change it if it is wrong.Thanks Memory is faulty: <4>{1}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 229 <4>{1}[Hardware Error]: APEI generic hardware error status <4>{1}[Hardware Error]: severity: 2, corrected <4>{1}[Hardware Error]: section: 0, severity: 2, corrected <4>{1}[Hardware Error]: flags: 0x01 <4>{1}[Hardware Error]: primary <4>{1}[Hardware Error]: section_type: memory error <4>{1}[Hardware Error]: error_status: 0x0000000000000004 <4>{1}[Hardware Error]: physical_address: 0x0000000001a84380 <4>{1}[Hardware Error]: node: 1 <4>{1}[Hardware Error]: card: 2 <4>{1}[Hardware Error]: module: 1 <4>{1}[Hardware Error]: bank: 0 <4>{1}[Hardware Error]: row: 384 <4>{1}[Hardware Error]: column: 164 <4>{1}[Hardware Error]: error_type: 2, single-bit ECC Looks like the machine has hardware problems. The log in the vmcore (comment #2 tarball) has plenty of these: <4>{68}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 229 <4>{68}[Hardware Error]: APEI generic hardware error status <4>{68}[Hardware Error]: severity: 2, corrected <4>{68}[Hardware Error]: section: 0, severity: 2, corrected <4>{68}[Hardware Error]: flags: 0x01 <4>{68}[Hardware Error]: primary <4>{68}[Hardware Error]: section_type: memory error <4>{68}[Hardware Error]: error_status: 0x0000000000000004 <4>{68}[Hardware Error]: physical_address: 0x0000004053529d00 <4>{68}[Hardware Error]: node: 2 <4>{68}[Hardware Error]: card: 4 <4>{68}[Hardware Error]: module: 2 <4>{68}[Hardware Error]: bank: 3 <4>{68}[Hardware Error]: device: 4 <4>{68}[Hardware Error]: row: 21375 <4>{68}[Hardware Error]: column: 328 <4>{68}[Hardware Error]: error_type: 2, single-bit ECC Finally this one (which is where the machine panics): <0>[Hardware Error]: CPU 34: Machine Check Exception: 5 Bank 5: fa00000000400405 <0>[Hardware Error]: TSC 40f9833b5ea MISC 4200 <0>[Hardware Error]: PROCESSOR 0:206e6 TIME 1382931845 SOCKET 2 APIC 41 <0>[Hardware Error]: CPU 2: Machine Check Exception: 5 Bank 5: fa00000000400405 <0>[Hardware Error]: RIP !INEXACT! 10:<ffffffff812e0f91> {intel_idle+0xb1/0x170} <0>[Hardware Error]: TSC 40f9833afd6 MISC 4200 <0>[Hardware Error]: PROCESSOR 0:206e6 TIME 1382931845 SOCKET 2 APIC 40 <0>[Hardware Error]: Machine check: Processor context corrupt <0>Kernel panic - not syncing: Fatal Machine check |