Red Hat Bugzilla – Bug 222288
Sun's JVM 1.6.0 crashes after upgrading glibc to 2.5-10
Last modified: 2007-11-30 17:11:53 EST
Description of problem:
After upgrading glibc to 2.5-10 from current Fedora 6 updates, Sun's JVM 1.6.0
receives a segmentation fault in every program that loads libnet.so.
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. Upgrade to glibc 2.5-10
2. Compile the attached test case using Sun's JDK
3. Run the resulting class file using Sun's JVM 1.6.0
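The attached test case is not reproduced in this report. As a rough stand-in only, a minimal program that touches java.net could look like the sketch below. The class name `LibnetSmokeTest` is hypothetical, and it is an assumption that initializing `java.net.InetAddress` is what makes Sun's JVM load libnet.so; the real attachment may do something different.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;

// Hypothetical stand-in for the attached test case (not the attachment itself).
public class LibnetSmokeTest {
    // Touches java.net so the JVM initializes its native networking code
    // (assumed here to be what loads libnet.so).
    static String run() {
        try {
            // A literal IP address is parsed directly, so no DNS lookup is made.
            InetAddress addr = InetAddress.getByName("127.0.0.1");
            return addr.isLoopbackAddress() ? "OK" : "unexpected address";
        } catch (UnknownHostException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(run());
    }
}
```

On an unaffected system this should print "OK"; on an affected one the JVM would be expected to crash before reaching the print.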
JVM receives SIGSEGV and outputs:
# An unexpected error has been detected by Java Runtime Environment:
# SIGSEGV (0xb) at pc=0x00002aaaaaab4202, pid=7276, tid=1076017472
# Java VM: Java HotSpot(TM) 64-Bit Server VM (1.6.0-b105 mixed mode)
# Problematic frame:
# C [ld-linux-x86-64.so.2+0x9202]
# An error report file with more information is saved as hs_err_pid7276.log
# If you would like to submit a bug report, please visit:
"OK" should be outputted and no segmentation fault should happen.
This bug was also reported to Sun with internal review ID 885210.
Created attachment 145346 [details]
Test case which causes JVM to crash with glibc 2.5-10
Created attachment 145347 [details]
Java error report
This problem did not happen before upgrading to the latest Fedora update (with
glibc 2.5-10). However, downgrading back to the original Fedora Core 6 glibc
(2.5-3) did not fix the problem.
This doesn't correspond to the dynamic linker in glibc-2.5-10.x86_64.rpm:
The dump says:
0x00002aaaaaab41f2: ff 02 00 00 48 8b 8d 20 ff ff ff 31 d2 45 31 d2
0x00002aaaaaab4202: 48 8b 01 48 85 c0 0f 84 47 02 00 00 48 8b 9d 20
2aaaaaaab000-2aaaaaac5000 r-xp 00000000 fd:00 31653891
So the crash is at offset 0x9202 within ld-2.5.so. But glibc-2.5-10's ld.so has:
91ef: 4c 8b 69 10 mov 0x10(%rcx),%r13
91f3: 48 8b 85 38 ff ff ff mov 0xffffffffffffff38(%rbp),%rax
91fa: 48 8d 0d 96 ce 00 00 lea 52886(%rip),%rcx # 16097
9201: 48 89 b5 40 ff ff ff mov %rsi,0xffffffffffffff40(%rbp)
9208: 48 8d 35 9e ce 00 00 lea 52894(%rip),%rsi # 160ad
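For reference, the 0x9202 offset comes from subtracting the start of the r-xp mapping from the faulting pc in the crash report. A small sketch of that arithmetic, using the two addresses quoted in this bug (the class and method names are made up for illustration):

```java
// Computes the offset of a faulting pc inside a mapped object,
// given the start address of its mapping from the memory map.
public class FrameOffset {
    static long offsetInMapping(long pc, long mappingStart) {
        return pc - mappingStart;
    }

    public static void main(String[] args) {
        long pc = 0x00002aaaaaab4202L;       // pc from the SIGSEGV line
        long mappingStart = 0x2aaaaaaab000L; // start of the r-xp mapping
        // Prints 0x9202, matching the "Problematic frame" line.
        System.out.println("0x" + Long.toHexString(offsetInMapping(pc, mappingStart)));
    }
}
```

An instruction boundary in the installed ld.so should exist at that offset; the disassembly above has boundaries at 0x9201 and 0x9208 instead, which is why the binaries do not appear to match.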
Created attachment 145365 [details]
Another java crash report
I downgraded back to 2.5-3 to test, so the error report might have been
generated after the downgrade. This report should contain traces that match
Oops, sorry, I was looking at glibc-2.5-10 rather than glibc-2.5-10.fc6.
This sounds like the same problem as #210748/#215377. There is a race
condition in the dynamic linker (one that has been in glibc forever): it
triggers when a threaded program calls dlopen on many libraries in some threads
while other threads call functions for the first time with lazy binding on.
Unfortunately, it is quite a difficult problem to solve without slowing down
dynamic linking too much. Rawhide has some initial attempts to solve it, but
they still contain ABBA deadlock possibilities.
As a workaround, you can run the application with LD_BIND_NOW=1.
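Setting LD_BIND_NOW=1 in the environment makes the dynamic linker resolve all symbols at load time instead of lazily on first call, which avoids the lazy-binding side of the race. If the JVM is started from another Java program rather than from a shell, the variable can be set on the child process. A minimal sketch, where `BindNowLauncher` and the default command `java TestCase` are placeholders and not names from this report (`inheritIO` needs Java 7 or later):

```java
// Launches a child process with LD_BIND_NOW=1 in its environment.
public class BindNowLauncher {
    static ProcessBuilder withBindNow(String... command) {
        ProcessBuilder pb = new ProcessBuilder(command);
        // Ask the dynamic linker to bind all symbols at load time.
        pb.environment().put("LD_BIND_NOW", "1");
        // Pass the child's stdout/stderr through to this process.
        pb.inheritIO();
        return pb;
    }

    public static void main(String[] args) throws Exception {
        // "java TestCase" is a placeholder command; pass the real one as arguments.
        String[] cmd = args.length > 0 ? args : new String[] {"java", "TestCase"};
        Process child = withBindNow(cmd).start();
        System.exit(child.waitFor());
    }
}
```

Eager binding can make startup slower for programs that link against many libraries, so it is best treated as a stopgap.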
*** This bug has been marked as a duplicate of 215377 ***