This bug has been migrated to another issue tracking site. It has been closed here and may no longer be being monitored.

If you would like to get updates for this issue, or to participate in it, you may do so at Red Hat Issue Tracker .
RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.
Bug 2119073 - Firefox freezes @ pa_get_runtime_dir
Summary: Firefox freezes @ pa_get_runtime_dir
Keywords:
Status: CLOSED MIGRATED
Alias: None
Product: Red Hat Enterprise Linux 9
Classification: Red Hat
Component: pulseaudio
Version: 9.0
Hardware: x86_64
OS: Linux
unspecified
high
Target Milestone: rc
: ---
Assignee: Wim Taymans
QA Contact: Desktop QE
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-08-17 12:57 UTC by Roger Sewell
Modified: 2023-09-11 17:49 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2023-09-11 17:49:06 UTC
Type: Bug
Target Upstream Version:
Embargoed:
pm-rhel: mirror+


Attachments (Terms of Use)
Various requested items (64.61 KB, application/gzip)
2022-08-18 13:04 UTC, Roger Sewell
no flags Details
Files that cause the bug to be reproduced (1.21 KB, application/gzip)
2022-08-19 08:53 UTC, Roger Sewell
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker   RHEL-3065 0 None Migrated None 2023-09-11 17:49:00 UTC
Red Hat Issue Tracker RHELPLAN-131330 0 None None None 2022-08-17 13:08:20 UTC

Description Roger Sewell 2022-08-17 12:57:33 UTC
Description of problem: 

Since upgrading to 9.0, when logged in as user1 but firefox running under user2 who has display permission (in order to have access to a different set of files), firefox displays OK but right-clicks cause it to hang and trying to use the drop-down menus causes it to hang.


Version-Release number of selected component (if applicable):

firefox-91.12.0-2.el9_0.x86_64.rpm


How reproducible: Always


Steps to Reproduce:
1. Login as user1 using x11 and Gnome-classic.

2. Give user2 permission to display by

xhost +si:localuser:rfsnet

3. su - user2
(appropriate password)

4. firefox &

5. Right-click on any of the links on the default home page, OR hit alt-B to get a drop-down menu to appear (OR alt-anything else).

Actual results: 

Firefox becomes unresponsive, and a window asking whether you want to force-quit appears. Before you can get firefox to run again, you need to not only say force-quit, but also look for the process using ps and explicitly kill it. On occasion the whole system also hangs, but usually it doesn't.


Expected results: 

Appropriate menu appears.


Additional info:

a. The problem persists if one downgrades firefox e.g. to firefox-91.4.0-1.el8_5.x86_64.rpm without changing anything else; this is notable because this latter version worked fine under 8.5 .

b. Under 9.0 okular, gnome-terminal, and other apps run fine under user2, including drop-down menus and right-clicks.

c. Making selinux permissive doesn't improve matters.

It's very hard to be sure that I'm filing this bug under the correct package, as I suspect that this version of firefox would run fine under 8.5 or 8.6 if one could install it; the reason for choosing to file it under firefox is point (b) above.

Comment 1 Tomas Popela 2022-08-17 13:23:30 UTC
Would it be possible to try the Firefox 102 binary from Mozilla (you can get it from https://www.mozilla.org/en-US/firefox/all/#product-desktop-esr) and see whether it's fixed there or not (we are currently working on bringing the Firefox 102 to RHEL so would be good if 102 fixes your problem).

Comment 2 Roger Sewell 2022-08-17 14:03:39 UTC
Tomas,

Yes indeed. I downloaded the 102 .tar.bz2 file from the URL you gave, decompressed it to a directory called firefox in user2's space, made sure everything in it was owned by user2, and ran

./firefox/firefox &

as user2. Sadly exactly the same problem occurred; as before it remains the same with selinux permissive.

Thank you for the suggestion though.

Roger Sewell.

Comment 3 Roger Sewell 2022-08-17 14:37:37 UTC
Caveat to the above: The tarball from Mozilla's website containing firefox 102 contains no installation instructions and no rpm and no ReadMe file. So please understand that all I did is exactly what I said above - and I can't be sure that e.g. no libraries from the pre-existing previous installation were invoked.

Comment 4 Martin Stransky 2022-08-18 10:20:32 UTC
I tried that on VM and I can't reproduce. Please attach your about:support page.

In case you're use HW acceleration try to turn it off:
https://fedoraproject.org/wiki/How_to_debug_Firefox_problems#Check_WebRender_state_(Hardware_acceleration)

The 'window asking whether you want to force-quit' means there's a deadlock and Firefox does not proceed system events. We need to know where is Firefox frozen.

Please install Firefox debug info packages and attach gdb to it when it's frozen:
https://fedoraproject.org/wiki/Debugging_guidelines_for_Mozilla_products#Attach_debugger_to_running_application

and attach a backtrace from gdb.

Thanks.

Comment 5 Roger Sewell 2022-08-18 13:04:23 UTC
Created attachment 1906261 [details]
Various requested items

The README file in the attached tarball details what I did in response to your requests; the various outputs requested are also in the tarball. Please note that, as therein explained, not everything you asked for worked, probably precisely because of this bug.

Many thanks for your help.

Comment 6 Martin Stransky 2022-08-18 15:39:57 UTC
Thanks, the backtrace is here:

#1  0x00007fe3a0c7f087 in nanosleep () from /lib64/libc.so.6
No symbol table info available.
#2  0x00007fe37efe11c7 in pa_msleep () from /lib64/libpulse.so.0
No symbol table info available.
#3  0x00007fe37ef51cbc in pa_get_runtime_dir () from /usr/lib64/pulseaudio/libpulsecommon-15.0.so
No symbol table info available.
#4  0x00007fe37ef51f3a in get_path () from /usr/lib64/pulseaudio/libpulsecommon-15.0.so
No symbol table info available.
#5  0x00007fe37efc15af in pa_context_connect () from /lib64/libpulse.so.0
No symbol table info available.
#6  0x00007fe376813b6b in context_connect () from /usr/lib64/libcanberra-0.30/libcanberra-pulse.so
No symbol table info available.
#7  0x00007fe376814238 in pulse_driver_open () from /usr/lib64/libcanberra-0.30/libcanberra-pulse.so
No symbol table info available.
#8  0x00007fe38565a749 in context_open_unlocked () from /lib64/libcanberra.so.0
No symbol table info available.
#9  0x00007fe38565b122 in ca_context_play_full () from /lib64/libcanberra.so.0
No symbol table info available.
#10 0x00007fe38565b5d9 in ca_context_play () from /lib64/libcanberra.so.0
No symbol table info available.
#11 0x00007fe3963eb95b in nsSound::PlayEventSound (this=<optimized out>, aEventId=6) at /usr/src/debug/firefox-91.12.0-2.el9_0.x86_64/widget/gtk/nsSound.cpp:385
        settings = 0x7fe385531fe0
        ctx = <optimized out>
        settings = <optimized out>
        ctx = <optimized out>
        enable_sounds = <optimized out>
#12 nsSound::PlayEventSound (this=<optimized out>, aEventId=6) at /usr/src/debug/firefox-91.12.0-2.el9_0.x86_64/widget/gtk/nsSound.cpp:357
        settings = <optimized out>
        ctx = <optimized out>
        enable_sounds = <optimized out>
#13 0x00007fe3967df30d in nsMenuPopupFrame::ShowPopup (this=0x7fe377e7d870, aIsContextMenu=<optimized out>) at /usr/src/debug/firefox-91.12.0-2.el9_0.x86_64/layout/xul/nsMenuPopupFrame.cpp:989
        sound = {<nsCOMPtr_base> = {mRawPtr = 0x7fe369ca1e80}, <No data fields>}
        menuFrame = <optimized out>

which looks like https://bugzilla.redhat.com/show_bug.cgi?id=568178

I think it's related to your two user scenario here.

As a workaround you may disable system sounds.

Comment 7 Martin Stransky 2022-08-18 15:41:07 UTC
As it freezes in pulseaudio changing the component.

Comment 8 Martin Stransky 2022-08-18 15:43:20 UTC
Btw. can you install debuginfo for pulseaudio (for /usr/lib64/pulseaudio/libpulsecommon-15.0.so) and attach the bactrace again?
Thanks.

Comment 9 Roger Sewell 2022-08-18 18:01:05 UTC
Martin, 

Thank you very much - I have now solved it (or at least am well on the way to) as a result of your comments. Ironically, the most helpful one was just saying that you weren't able to reproduce it, which got me thinking more on the right lines. 

I copied user2's account then deleted all the .[^.]* files in it, and let the system repopulate them on user2's next login. That leaves user2 with work to do to get his account set up right, but it made the scenario I reported go back to working properly.

I will let you know when I've found the exact file that was responsible (probably tomorrow sometime). The root cause is lazy migration (by me) of user2's account from 8.6 to 9.0 by just copying all the files instead of setting them all up from scratch. So NOTABUG as far as I can see.

Your help was much appreciated. If you would still like the pulseaudio backtrace please say (obviously with the version of user2's account that was present when the "bug" was evident). Otherwise please consider this closed. Many thanks !

Yours, Roger Sewell.

Comment 10 Roger Sewell 2022-08-19 08:53:20 UTC
Created attachment 1906501 [details]
Files that cause the bug to be reproduced

First, I was wrong as to the root cause being careless migration - see below.

Second, when the bug is present, the listing of user2's ~/.pulse directory looks like this (user2 being rfsnet):

  drwx------.  2 rfsnet rfsnet  4096 Aug 14 11:44 .
  drwxrwxr-x. 31 rfsnet rfsnet  4096 Aug 18 21:52 ..
  lrwxrwxrwx.  1 rfsnet rfsnet    23 Aug  1 13:53 48d8a0c1b7554effb9303ff6ee9e0e84-runtime -> /tmp/pulse-Q0hk23NIb7HV
  lrwxrwxrwx.  1 rfsnet rfsnet    23 Aug 14 11:44 48d8a0c1b7554effb9303ff6ee9e0e84-runtime.tmp -> /tmp/pulse-T86EfUVPSyvz
  -rw-r--r--.  1 rfsnet rfsnet 24576 Jan 13  2021 820b143de7cc4bf4864cd49f8053c153-card-database.tdb
  -rw-r--r--.  1 rfsnet rfsnet     1 Jan 13  2021 820b143de7cc4bf4864cd49f8053c153-default-sink
  -rw-r--r--.  1 rfsnet rfsnet     1 Jan 13  2021 820b143de7cc4bf4864cd49f8053c153-default-source
  -rw-r--r--.  1 rfsnet rfsnet  8192 Apr  8  2020 820b143de7cc4bf4864cd49f8053c153-device-volumes.tdb
  lrwxrwxrwx.  1 rfsnet rfsnet    23 Jan 13  2021 820b143de7cc4bf4864cd49f8053c153-runtime -> /tmp/pulse-zgfReFfE8mnM
  -rw-r--r--.  1 rfsnet rfsnet 12288 Apr  8  2020 820b143de7cc4bf4864cd49f8053c153-stream-volumes.tdb

Removal of the file 

48d8a0c1b7554effb9303ff6ee9e0e84-runtime.tmp -> /tmp/pulse-T86EfUVPSyvz 

which points to an existing empty directory, solves the problem.

Moreover, deleting the contents of user2's ~/.pulse directory and replacing it by putting the attached tarball in user2's home directory and doing

cd ~rfsnet
tar -xzf bad.pulse.tar.gz

brings the problem back again - not just in the two user scenario, but also when logged in on the display as rfsnet.

Moreover the date on the file 48d8a0c1b7554effb9303ff6ee9e0e84-runtime.tmp and on the directory it points to are two weeks later than migration time, so the file has been written by the system, not just migrated from 8.6 .

However all my attempts to get the system to produce such a file have failed.

You suggested that this bug might have similarities to bug 568178 on Fedora from 2010; it might, but I'd guess it might have more to do with the change to pipewire in 9.0, and be related to bug 2114896 .

In any event thank you for your help which has been fantastic. I'm clearing the needinfo flag for now as I'm guessing that given the above you no longer need it, but if you or one of your colleagues would still like a further backtrace with the pulseaudio debug info present, I'll be more than happy to provide it.

Yours, Roger.

Comment 11 RHEL Program Management 2023-09-11 17:43:41 UTC
Issue migration from Bugzilla to Jira is in process at this time. This will be the last message in Jira copied from the Bugzilla bug.

Comment 12 RHEL Program Management 2023-09-11 17:49:06 UTC
This BZ has been automatically migrated to the issues.redhat.com Red Hat Issue Tracker. All future work related to this report will be managed there.

Due to differences in account names between systems, some fields were not replicated.  Be sure to add yourself to Jira issue's "Watchers" field to continue receiving updates and add others to the "Need Info From" field to continue requesting information.

To find the migrated issue, look in the "Links" section for a direct link to the new issue location. The issue key will have an icon of 2 footprints next to it, and begin with "RHEL-" followed by an integer.  You can also find this issue by visiting https://issues.redhat.com/issues/?jql= and searching the "Bugzilla Bug" field for this BZ's number, e.g. a search like:

"Bugzilla Bug" = 1234567

In the event you have trouble locating or viewing this issue, you can file an issue by sending mail to rh-issues. You can also visit https://access.redhat.com/articles/7032570 for general account information.


Note You need to log in before you can comment on or make changes to this bug.