Bug 1367766 - systemctl daemon-reexec causes Xorg to panic
Summary: systemctl daemon-reexec causes Xorg to panic
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Fedora
Classification: Fedora
Component: systemd
Version: 24
Hardware: x86_64
OS: Linux
unspecified
high
Target Milestone: ---
Assignee: systemd-maint
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2016-08-17 12:21 UTC by Ray Holme
Modified: 2016-09-06 18:22 UTC (History)
8 users (show)

Fixed In Version: systemd-231-4.fc25
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2016-09-06 18:22:31 UTC
Type: Bug
Embargoed:


Attachments (Terms of Use)

Description Ray Holme 2016-08-17 12:21:08 UTC
Description of problem: After clean install of Fedora 24 (64 bit) on a computer with a graphics accelerator, dnf update looks like it is succeeding normally and then I am "thrown back" to a login prompt with words like "Xorg failure - tell development".


Version-Release number of selected component (if applicable):


How reproducible: Completely - did three times (last time I did the updates manually after getting a "dnf check-update" list into a file. All components updated fine except for "dnf update systemd" which throws me back to a login prompt. This did NOT happen on another computer which does NOT have a graphics processor.


Steps to Reproduce:
1. Install from DVD.
2. run "dnf update -y"  (better yet "dnf update systemd")
3.

Actual results: If you have a graphics processor (sorry I cannot figure out how to tell you which one - this refurb. just came with one), you will be back to the login screen. If you are doing a full update as I was the first two times, the new kernel will not be installed and probably a lot of other things don't happen. As it is now, I am not sure what has been done with "systemd". DNF tells me all things are up to date, but I highly suspect that systemd is not right. Please tell me how to fix this if possible (downgrade?) and how to send you the core dump (wherever it is) to help you find the problem.


Expected results: Normally no error and I reboot to get a new kernel if built (and it would have been in this case - was on 4.5.5, current 4.6.6 after my manual update commands).


Additional info: I built a non GPI machine and it worked fine. I did full "dnf update -y" twice to be sure, final time used "dnf check-update > /tmp/dnf_list"; then manually ran dnf commands till I broke it with systemd, after that I did the rest - dnf says NOTHING TO DO but systemd is probably hosed. I tried to find the core file, but was not successful.

Comment 1 Zbigniew Jędrzejewski-Szmek 2016-08-17 16:22:22 UTC
Yeah, there's some issue. The problem happens when 'systemctl daemon-reexec' is invoked. On my F24 machine there is no crash, but the screen is completely messed up. After switching to a different tty and back everything goes back to normal. I'm not sure what exactly is the reason for this, but I think it might be related to the recent changes to vconsole setup.

Comment 2 Zbigniew Jędrzejewski-Szmek 2016-08-17 16:55:27 UTC
As I suspected: the "crash" happens when make_console_stdio() is invoked.

Debugging this is a bit tricky, because the issue only happens during the first daemon-reexec. Subsequent ones happen without any issue.

Comment 3 Ray Holme 2016-08-17 22:15:53 UTC
OK, you know what it is. Great. I assume you need nothing more from me

 BUT

 a) if you do, let me know what

 b) do I need to do anything to "fix" the state that dnf left my computer in
    - The only thing that may not be complete is the update of systemd
    I have no idea what physical status it is left in.

Best of luck finding the problem. I understand tricky and am glad that I am retired from all this.

Comment 4 Zbigniew Jędrzejewski-Szmek 2016-08-18 02:43:52 UTC
This is tracked upstream in https://github.com/systemd/systemd/issues/3842.

Comment 5 Ray Holme 2016-08-20 11:45:19 UTC
Track it any way you want. It happened again with "dnf update -y"

I watched closely and on step 93 of 109 - cleanup of systemd ...
I watched the black screen of death then finally got a login screen.

I sure hope you get it - I have no idea what state my system is in anymore.
There is no new kernel installed (not sure if one should be).

will try dnf update -y again after this post but expect it to say nothing to do and be fine even thought it did a half-baked job.

do I have to manually install dnf updates in the future leaving systemd as the last thing to install? PLEASE let me know!!!!!!!!

Comment 6 Ray Holme 2016-08-20 13:35:25 UTC
And for the record, not sure this was caused by a failed update, but

the system was dog slow, I had to reboot

top showed: gdm using 82-97% and gnome-console using 80-90%
  there goes two cpus (my quota)

Comment 7 Zbigniew Jędrzejewski-Szmek 2016-08-20 14:02:53 UTC
The bug is fixed upstream, but not yet in Fedora. There will be an automatic comment in this bug when an update that fixes the bug is created. For now, you should exclude systemd from the upgrade (dnf upgrade --exclude=systemd).

Comment 8 Ray Holme 2016-08-21 16:05:08 UTC
I have noticed a new problem - http and apache are not happy anymore after reboot.

Online notes say this may be related to a bad systemd installation.

I can get them going, but I am wondering if the systemd problem I have has caused this. It did abort the last 20 cleanup steps when it flashed me out.

systemctl status httpd.service yields:

● httpd.service - The Apache HTTP Server
   Loaded: loaded (/usr/lib/systemd/system/httpd.service; enabled; vendor preset: disable
   Active: failed (Result: exit-code) since Sun 2016-08-21 11:33:40 EDT; 26min ago
  Process: 1061 ExecStart=/usr/sbin/httpd $OPTIONS -DFOREGROUND (code=exited, status=1/FA
 Main PID: 1061 (code=exited, status=1/FAILURE)

Aug 21 11:33:38 rainbow systemd[1]: Starting The Apache HTTP Server...
Aug 21 11:33:40 rainbow httpd[1061]: (99)Cannot assign requested address: AH00072: make_s
Aug 21 11:33:40 rainbow httpd[1061]: no listening sockets available, shutting down
Aug 21 11:33:40 rainbow httpd[1061]: AH00015: Unable to open logs
Aug 21 11:33:40 rainbow systemd[1]: httpd.service: Main process exited, code=exited, stat
Aug 21 11:33:40 rainbow systemd[1]: Failed to start The Apache HTTP Server.
Aug 21 11:33:40 rainbow systemd[1]: httpd.service: Unit entered failed state.
Aug 21 11:33:40 rainbow systemd[1]: httpd.service: Failed with result 'exit-code'.

To fix things, I run 3 commands

httpd
apachectl start
systemctl start httpd.service

I can do this every time I reboot until the fix hits the world. Just wanted to make sure the cause was this.

Comment 9 Zbigniew Jędrzejewski-Szmek 2016-08-25 19:52:59 UTC
It turns out that https://github.com/systemd/systemd/issues/3842 must have a different underlying reason.

Comment 10 Ray Holme 2016-08-26 12:23:27 UTC
OK, someone solved the other issue I posted and was worried was related.

Right now, my weekly updates look like.
   dnf update --exclude=systemd
   dnf update systemd

This seems to work for now. If I need to provide you any more information, please let me know - be explicit as I am no longer intimate with all the things I once was.

I do know my systems without a graphics processor has NO problems, but this one has suffered full window crashes with two systemd updates. The first three times till you told me how to separate the updates, the second update was a week later and I am assuming that the problem is not fatal as
  a) the system comes back after I login
  b) the problem happens in the dnf cleanup stage

Good luck.

Comment 11 Fedora Update System 2016-09-01 15:32:16 UTC
systemd-231-4.fc25 has been submitted as an update to Fedora 25. https://bodhi.fedoraproject.org/updates/FEDORA-2016-c1271b0fee

Comment 12 Ray Holme 2016-09-01 18:28:38 UTC
Sure hope you can fix fedora 24 too.

Comment 13 Fedora Update System 2016-09-03 03:54:48 UTC
systemd-231-4.fc25 has been pushed to the Fedora 25 testing repository. If problems still persist, please make note of it in this bug report.
See https://fedoraproject.org/wiki/QA:Updates_Testing for
instructions on how to install test updates.
You can provide feedback for this update here: https://bodhi.fedoraproject.org/updates/FEDORA-2016-c1271b0fee

Comment 14 Fedora Update System 2016-09-06 18:22:23 UTC
systemd-231-4.fc25 has been pushed to the Fedora 25 stable repository. If problems still persist, please make note of it in this bug report.


Note You need to log in before you can comment on or make changes to this bug.