Bug 128003
Summary: | firstboot or rhgb hangs at gray screen | ||
---|---|---|---|
Product: | [Fedora] Fedora | Reporter: | Barry K. Nathan <barryn> |
Component: | rhgb | Assignee: | Daniel Veillard <veillard> |
Status: | CLOSED RAWHIDE | QA Contact: | |
Severity: | high | Docs Contact: | |
Priority: | medium | ||
Version: | rawhide | CC: | barryn, bnocera, otaylor |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | i686 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2004-09-16 07:22:37 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 123268 |
Description
Barry K. Nathan
2004-07-16 10:33:00 UTC
Actually, AFAICT sometimes rhgb freezes with a gray screen when it should quit, and sometimes it's firstboot that freezes right after it starts up. (I could be misperceiving the whole situation, however!) Anyway, it turns out that downgrading rhgb from 0.12.2-1 to 0.11.2-7 makes the problem go away... Still happening with 2004-08-22 rawhide snapshot, although there's a traceback first (I'll attach that at some point in the next 48 hours). The traceback I'm getting is probably the same as bug 130567. *** This bug has been marked as a duplicate of 129532 *** My workaround of downgrading rhgb still works, but that doesn't affect the traceback. So, the traceback I mentioned in comment #3 appears to be unrelated to this bug. do we have a bracket of what versions of rhgb do and do not work? No idea, I have snapshot of rawhide from mid-july and so far I have never been able to reproduce that problem. I have seen an X server without any ouput on X86_64, but it wasn't specific to the one running rhgb and after a bunch of nightly update failures. Doing a diff of the source between the version 5 months ago and last week show only: - a change in gtk widget code to accomodate xinerama which I doubt can produce that effect - a patches for close on exec of a socket - correctly unmounting the ramfs needed if exec'ing the X server failed - and a 0 -> NULL pointer cleanup fix. I don't see how any of those can generate the stated result. And I can't reproduce it to chase where this may occur. Daniel err ... I have no snapshot of rawhide from mid-july ... Will try to reproduce this tomorrow and check Quoting from (my) comment #1: > Anyway, it turns out that downgrading rhgb from 0.12.2-1 to 0.11.2-7 > makes the problem go away... 0.12.2-1 breaks, 0.11.2-7 works. Does that help? BTW, 0.11.2-7 also happens to be the rhgb version from FC3 test1. Okay, problem reproduced, from here it is now possible to detect what change messed things up and get a proper fix, Daniel The grey screen appears for 2 reasons in rawhide: - firstboot can simply crash see #129532 - the patch supplied to fix xinerama handling #115209 (or more precisely the part of the patch affecting splash.h and splash.c) make firstboot hang In the later case it's not obvious to find why the stck trace is not very clear: [Switching to Thread -151071648 (LWP 3006)] 0x00d99782 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2 (gdb) where #0 0x00d99782 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2 #1 0x0068f23d in poll () from /lib/tls/libc.so.6 #2 0x0018ff13 in g_main_context_acquire () from /usr/lib/libglib-2.0.so.0 #3 0x0019022f in g_main_loop_run () from /usr/lib/libglib-2.0.so.0 #4 0x00eba1de in gtk_main () from /usr/lib/libgtk-x11-2.0.so.0 #5 0x00282c16 in init_gtk () from /usr/lib/python2.3/site-packages/gtk-2.0/gtk/_gtk.so #6 0x00d1fcb4 in _PyEval_SliceIndex () from /usr/lib/libpython2.3.so.1.0 #7 0x00d210ae in PyEval_EvalCodeEx () from /usr/lib/libpython2.3.so.1.0 #8 0x00cdce6e in PyFunction_SetClosure () from /usr/lib/libpython2.3.so.1.0 #9 0x00cc9617 in PyObject_Call () from /usr/lib/libpython2.3.so.1.0 #10 0x00cd0dac in PyMethod_New () from /usr/lib/libpython2.3.so.1.0 #11 0x00cc9617 in PyObject_Call () from /usr/lib/libpython2.3.so.1.0 #12 0x00d1b2a0 in PyEval_CallObjectWithKeywords () from /usr/lib/libpython2.3.so.1.0 #13 0x00cccaa9 in PyInstance_New () from /usr/lib/libpython2.3.so.1.0 #14 0x00cc9617 in PyObject_Call () from /usr/lib/libpython2.3.so.1.0 #15 0x00d1eccf in _PyEval_SliceIndex () from /usr/lib/libpython2.3.so.1.0 #16 0x00d210ae in PyEval_EvalCodeEx () from /usr/lib/libpython2.3.so.1.0 #17 0x00d21372 in PyEval_EvalCode () from /usr/lib/libpython2.3.so.1.0 #18 0x00d3a8b7 in PyErr_Display () from /usr/lib/libpython2.3.so.1.0 #19 0x00d3b9e2 in PyRun_SimpleFileExFlags () from /usr/lib/libpython2.3.so.1.0 #20 0x00d3ca34 in PyRun_AnyFileExFlags () from /usr/lib/libpython2.3.so.1.0 ---Type <return> to continue, or q <return> to quit--- #21 0x00d4172e in Py_Main () from /usr/lib/libpython2.3.so.1.0 #22 0x080485b2 in main () (gdb) One simple fix is to just reverse that patch. A better way would be to find exactly what in the patch makes the whole thing hang, probably the window manager code of the patch. rhgb runs without a window manager ... except when firstboot starts since firstboot itself starts metacity. Daniel Okay we now have a fix for this it's in http://people.redhat.com/veillard/testing/SRPMS/rhgb-0.12.5-1.src.rpm with that and a quick fix to #129532 from first boot (removing mouse configuration), then I have firstboot back on today's rawhide. I need to push this into rawhide, maybe over the week-end, Daniel This has been pushed to Rawhide, this should be fixed there, thanks, Daniel |