Bug 693993
Summary: | BSOD sometimes happened when run whql NIC jobs | ||||||
---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 6 | Reporter: | Chao Yang <chayang> | ||||
Component: | qemu-kvm | Assignee: | Virtualization Maintenance <virt-maint> | ||||
Status: | CLOSED WORKSFORME | QA Contact: | Virtualization Bugs <virt-bugs> | ||||
Severity: | high | Docs Contact: | |||||
Priority: | high | ||||||
Version: | 6.1 | CC: | afrenkel, dawu, jasowang, juzhang, michen, mkenneth, qzhang, tburke, virt-maint, vrozenfe, yvugenfi | ||||
Target Milestone: | rc | ||||||
Target Release: | --- | ||||||
Hardware: | Unspecified | ||||||
OS: | Unspecified | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2011-04-11 14:21:17 UTC | Type: | --- | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Attachments: |
|
Comment 2
RHEL Program Management
2011-04-06 08:23:54 UTC
Since forgetting to log into bz with my account, use other account to report this bug, actually this bug is reported by dawu,sorry for my mistake. Please refer to the crash dump analysis according to comment #9 in bug 684082,(https://bugzilla.redhat.com/show_bug.cgi?id=684082#c9) Crash dump analysis: 0: kd> !analyze -v ******************************************************************************* * * * Bugcheck Analysis * * * ******************************************************************************* THREAD_STUCK_IN_DEVICE_DRIVER (ea) The device driver is spinning in an infinite loop, most likely waiting for hardware to become idle. This usually indicates problem with the hardware itself or with the device driver programming the hardware incorrectly. If the kernel debugger is connected and running when watchdog detects a timeout condition then DbgBreakPoint() will be called instead of KeBugCheckEx() and detailed message including bugcheck arguments will be printed to the debugger. This way we can identify an offending thread, set breakpoints in it, and hit go to return to the spinning code to debug it further. Because KeBugCheckEx() is not called the .bugcheck directive will not return bugcheck information in this case. The arguments are already printed out to the kernel debugger. You can also retrieve them from a global variable via "dd watchdog!g_WdBugCheckData l5" (use dq on NT64). On MP machines (OS builds <= 3790) it is possible to hit a timeout when the spinning thread is interrupted by hardware interrupt and ISR or DPC routine is running at the time of the bugcheck (this is because the timeout's work item can be delivered and handled on the second CPU and the same time). If this is the case you will have to look deeper at the offending thread's stack (e.g. using dds) to determine spinning code which caused the timeout to occur. Arguments: Arg1: fffffa800348ba80, Pointer to a stuck thread object. Do .thread then kb on it to find the hung location. Arg2: fffffa80034c7990, Pointer to a DEFERRED_WATCHDOG object. Arg3: fffffa80034bf850, Pointer to offending driver name. Arg4: 0000000000000001, Number of times this error occurred. If a debugger is attached, this error is not always fatal -- see DESCRIPTION below. On the blue screen, this will always equal 1. Debugging Details: ------------------ Page 2170e not present in the dump file. Type ".hh dbgerr004" for details Page 23162 not present in the dump file. Type ".hh dbgerr004" for details ERROR - could not read driver name for bugcheck parameter 3 Page 23162 not present in the dump file. Type ".hh dbgerr004" for details Page 23162 not present in the dump file. Type ".hh dbgerr004" for details PEB is paged out (Peb.Ldr = 000007ff`fffdf018). Type ".hh dbgerr001" for details PEB is paged out (Peb.Ldr = 000007ff`fffdf018). Type ".hh dbgerr001" for details FAULTING_THREAD: fffffa800348ba80 FAULTING_IP: framebuf!bInitSURF+47 fffff960`009d22e3 3bc5 cmp eax,ebp IMAGE_NAME: framebuf.dll DEBUG_FLR_IMAGE_TIMESTAMP: 4a5bc587 MODULE_NAME: framebuf FAULTING_MODULE: fffff960009d0000 framebuf DEFAULT_BUCKET_ID: GRAPHICS_DRIVER_FAULT BUGCHECK_STR: 0xEA PROCESS_NAME: csrss.exe CURRENT_IRQL: 1 LAST_CONTROL_TRANSFER: from fffff88001995c54 to fffff800026cd740 STACK_TEXT: fffff880`040684f8 fffff880`01995c54 : 00000000`000000ea fffffa80`0348ba80 fffffa80`034c7990 fffffa80`034bf850 : nt!KeBugCheckEx fffff880`04068500 fffff800`026aa0b7 : fffffa80`0348ba80 fffffa80`0348bad0 00000000`00000001 fffff800`02615939 : VIDEOPRT!WdpKernelApc+0x2e8 fffff880`04068a30 fffff800`026aa477 : 00000000`00000001 00000000`00000000 00000000`00000000 00000000`00000000 : nt!KiDeliverApc+0x1d7 fffff880`04068ab0 fffff800`0262b9d6 : 00000000`00000010 00000000`00000001 fffff800`02614000 fffff800`0262de3e : nt!KiApcInterrupt+0xd7 fffff880`04068c40 fffff800`0262c03f : fffff800`0263bf60 00000000`00000001 00000000`00000000 fffff800`02614000 : hal!x86BiosWriteIoSpace+0x15e fffff880`04068c90 fffff800`0262be30 : fffff800`0263bf60 fffff880`04068d60 00000000`00000000 fffff880`04068d60 : hal!XmEmulateStream+0x10f fffff880`04068cc0 fffff800`026251c5 : 00000000`00000000 00000000`00004115 fffffa80`034c8fc0 fffff880`04068db0 : hal!x86BiosExecuteInterruptShadowed+0xfc fffff880`04068cf0 fffff880`0199f0a7 : 00000000`00000000 00000000`00000000 fffffa80`0278c300 00000000`00000000 : hal!x86BiosCall+0x45 fffff880`04068d30 fffff880`03c6830d : 00000000`00000007 fffffa80`029fe618 00000000`00000000 fffff800`0285c880 : VIDEOPRT!VideoPortInt10+0x9f fffff880`04068d90 fffff880`03c68d81 : fffffa80`0278c380 00000000`00000000 00000000`00000000 fffff880`00e1d9ac : vgapnp!VgaSetMode+0x115 fffff880`04068df0 fffff880`019aaeb3 : fffffa80`0278c500 fffff880`00000000 00000000`00000000 00000000`00000000 : vgapnp!VgaStartIO+0x2e5 fffff880`04068e90 fffff960`00084abe : 00000000`00000004 fffffa80`0278c350 fffff900`c0083de8 fffffa80`029fe618 : VIDEOPRT!pVideoPortDispatch+0x3af fffff880`04068f40 fffff960`001f83a8 : fffff900`c0083db0 00000000`00000000 fffff900`c00d8010 00000000`00000001 : win32k!GreDeviceIoControlEx+0xd6 fffff880`04068fe0 fffff960`009d22e3 : fffff900`c00c4000 00000000`00000000 00000009`0000000c 00000000`00000000 : win32k!EngDeviceIoControl+0x3c fffff880`04069030 fffff960`009d1867 : 00000000`00000000 fffff900`c0083db0 fffff900`c0083db0 fffff960`0008ff9e : framebuf!bInitSURF+0x47 fffff880`040690b0 fffff960`00079cb6 : fffffa80`034b3d60 fffff900`c00d8920 fffff800`02847e80 fffff800`026d9de0 : framebuf!DrvEnableSurface+0x1f fffff880`04069100 fffff960`002e3de2 : fffffa80`034b3d60 fffff900`c00d8920 00000000`40210088 fffff900`64776447 : win32k!WatchdogDrvEnableSurface+0x36 fffff880`04069140 fffff960`000882f2 : fffff900`c00c4000 fffff880`040692b8 fffff900`c00c4000 fffff900`c00bdec0 : win32k!PanEnableSurface+0x5a fffff880`04069210 fffff960`00080fae : fffff900`c00c4000 00000000`00000001 00000000`00000000 fffffa80`029fe070 : win32k!PDEVOBJ::bMakeSurface+0x4a fffff880`04069240 fffff960`0007c1ea : fffff900`c00be420 fffff900`c00be420 00000000`00000001 00000000`00000000 : win32k!hCreateHDEV+0xb8e fffff880`040693a0 fffff960`00234b83 : 00000000`00000000 00000000`ffffffff 00000000`00000000 00000000`00000000 : win32k!DrvCreateMDEV+0x7d6 fffff880`04069640 fffff960`0007a9dd : fffff880`00000000 ffffffff`00000000 00000000`00000000 00000000`00000000 : win32k!DrvInternalChangeDisplaySettings+0x7a3 fffff880`04069880 fffff960`0007a200 : 00000000`00000128 00000000`00000000 00000000`00000000 00000000`00000000 : win32k!DrvChangeDisplaySettings+0x62d fffff880`04069a60 fffff960`000815da : 00000000`00000000 00000000`ffffaecc 00000000`00000000 00000000`00000000 : win32k!InitVideo+0x1f8 fffff880`04069b60 fffff960`00075841 : fffff900`c0600000 fffff8a0`0166ec60 fffffa80`0348ab30 00000000`0000004c : win32k!UserInitialize+0x28a fffff880`04069be0 fffff800`026cc993 : fffffa80`0348ba80 00000000`001f0003 00000000`00000007 fffff880`04069c01 : win32k!NtUserInitialize+0xc1 fffff880`04069c20 000007fe`fd7233da : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : nt!KiSystemServiceCopyEnd+0x13 00000000`001bf4f8 00000000`00000000 : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : 0x7fe`fd7233da (In reply to comment #4) > Please refer to the crash dump analysis according to comment #9 in bug > 684082,(https://bugzilla.redhat.com/show_bug.cgi?id=684082#c9) > > Crash dump analysis: > 0: kd> !analyze -v > Hi Dawn, Not sure whether or not it can help, but could you completely disable the display hardware acceleration, including write combining feature (Start->Control Panel->Display->Settings->Troubleshoot)? Best regards, Vadim. Hi Vadim, I will have a try for run whql job of "NDIS Test 6.0" with disable the display hardware acceleration and update this result later. Thanks Best Regards, Dawn Please try disabling watchdog http://msdn.microsoft.com/en-us/library/ff553890%28v=vs.85%29.aspx (Just wonder how did I miss it) Cheers, Vadim. (In reply to comment #5) > (In reply to comment #4) > > Please refer to the crash dump analysis according to comment #9 in bug > > 684082,(https://bugzilla.redhat.com/show_bug.cgi?id=684082#c9) > > > > Crash dump analysis: > > 0: kd> !analyze -v > > > > Hi Dawn, > > Not sure whether or not it can help, but could you completely > disable the display hardware acceleration, including write > combining feature > (Start->Control Panel->Display->Settings->Troubleshoot)? > Best regards, > Vadim. Hi Vadim, We have tried this method, unfortunately, this issue still reproduced. Best Regards, Dawn (In reply to comment #7) > Please try disabling watchdog > http://msdn.microsoft.com/en-us/library/ff553890%28v=vs.85%29.aspx > (Just wonder how did I miss it) > > Cheers, > Vadim. Hi Vadim, There is no watchdog under regist path of"HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\" on win2k8/win7,but can find it on winxp/win2k3,seems like it's not support by win7/win2k8,but I didn't find related information on microsoft website. Best Regards, Dawn (In reply to comment #9) > (In reply to comment #7) > > Please try disabling watchdog > > http://msdn.microsoft.com/en-us/library/ff553890%28v=vs.85%29.aspx > > (Just wonder how did I miss it) > > > > Cheers, > > Vadim. > > Hi Vadim, > > There is no watchdog under regist path > of"HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\" on win2k8/win7,but can > find it on winxp/win2k3,seems like it's not support by win7/win2k8,but I didn't > find related information on microsoft website. > > Best Regards, > Dawn (In reply to comment #9) > (In reply to comment #7) > > Please try disabling watchdog > > http://msdn.microsoft.com/en-us/library/ff553890%28v=vs.85%29.aspx > > (Just wonder how did I miss it) > > > > Cheers, > > Vadim. > > Hi Vadim, > > There is no watchdog under regist path > of"HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\" on win2k8/win7,but can > find it on winxp/win2k3,seems like it's not support by win7/win2k8,but I didn't > find related information on microsoft website. > > Best Regards, > Dawn It must be under Tdr category on Vista and higher. Try changing TdrDebugMode to 1 http://msdn.microsoft.com/en-us/library/ff569918%28v=vs.85%29.aspx Best regards, Vadim. (In reply to comment #10) > (In reply to comment #9) > > (In reply to comment #7) > > > Please try disabling watchdog > > > http://msdn.microsoft.com/en-us/library/ff553890%28v=vs.85%29.aspx > > > (Just wonder how did I miss it) > > > > > > Cheers, > > > Vadim. > > > > Hi Vadim, > > > > There is no watchdog under regist path > > of"HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\" on win2k8/win7,but can > > find it on winxp/win2k3,seems like it's not support by win7/win2k8,but I didn't > > find related information on microsoft website. > > > > Best Regards, > > Dawn > > (In reply to comment #9) > > (In reply to comment #7) > > > Please try disabling watchdog > > > http://msdn.microsoft.com/en-us/library/ff553890%28v=vs.85%29.aspx > > > (Just wonder how did I miss it) > > > > > > Cheers, > > > Vadim. > > > > Hi Vadim, > > > > There is no watchdog under regist path > > of"HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\" on win2k8/win7,but can > > find it on winxp/win2k3,seems like it's not support by win7/win2k8,but I didn't > > find related information on microsoft website. > > > > Best Regards, > > Dawn > > It must be under Tdr category on Vista and higher. > > Try changing TdrDebugMode to 1 > http://msdn.microsoft.com/en-us/library/ff569918%28v=vs.85%29.aspx > > Best regards, > Vadim. Hi Vadim, 1.According to http://msdn.microsoft.com/en-us/library/ff569918%28v=vs.85%29.aspx, we can not find TdrDebugMode under HKEY_LOCAL_MACHINE\System\CurrentControlSet\Control\GraphicsDrivers, so we manually created "TdrDebugMode" with REG_DWORD type,and set it's value to 1, but still failed with the same BSOD happened when ran NIC related job such as "Sleep Stress with IO","Common Scenario" or "NDISTest 6.5(MPE)". 2.Guest reboot very slowly and need long time to start,almost use 100% cpu resource, but when changed to other NIC such as e1000 instead of virtio-NIC,guest can be reboot soon. Best Regards, Dawn Hi, Vadim I found when boot guest with vhost=on, guest will always hit this problem especially for win2k832 and 64. Guest boot up slowly and got BSOD as bug description. Without vhost, guest boot up normally (using the same image). Now the job "NDISTest 6.5 MPE" is running, I will check if it will suffer the BSOD (error code 0xEA) or not. Hi, dears! I received the exactly same BSOD without NetKvm installed at all on my WinDbg host VM machine. Check http://cleo.tlv.redhat.com/qumrawiki/BSOD%20info Regards Arkady Hi, Arkady And after I use the build in Comment 13, do not meet this problem any more. Best Regards, Qunfang |