The following has be reported by IBM LTC: SPEC2000 gap benchmark hangs in __read_nocancel in libc. Hardware Environment:pSeries: problem has been reproduced on PWR4, PWR3, and RS64IV processors. Software Environment: RHEL 3 beta 2 and RC1. gap compiled with gcc3.2.3 or xlc hangs at all opt levels. Steps to Reproduce: 1. compile SPEC2000 gap benchmark with gcc-3.2.3 without optimizations. 2. run with ref data. Actual Results: Hangs in __read_nocancel during runtime. This is the traceback when using gcc-3.2.3. 0x0fedba94 in __read_nocancel () from /lib/tls/libc.so.6 (gdb) where #0 0x0fedba94 in __read_nocancel () from /lib/tls/libc.so.6 #1 0x1005c218 in syGetch () #2 0x1005a1ac in SyFgets () #3 0x100520e4 in GetLine () #4 0x100531f0 in GetSymbol () #5 0x100536fc in Match () #6 0x1004e970 in ReadIt () #7 0x10026414 in main () Expected Results: Normal completion. Additional Information: This seems to be an intermittent problem. It has been reproduced on three different types of machines. However, sometimes the program executes normally. This may prevent generating SPEC2000 gap benchmark results with gcc and xlc.I have a couple of questions: -- Why are you compiling without optimizations ? -- When you said "hang", is it just the benchmark or the whole system ? Thanks.1. The problem occurs at all opt levels in gcc and xlc. 2. It's just the benchmark that hangs. Glen/Greg - please submit this to Red Hat as this is a RHEL3 RC1 bug. Thanks. Sirish - does this problem happen on RHAS 2.1 or previous RHEL3 beta1 ?I have run the gap benchmark only on RHEL 3 beta 2 and RC1. I am not sure, if we have RHAS 2.1 machine around here. I will talk to John tomorrow and see if that machine is available. If it is available, I will run the benchmark and confirm with you the restults.
__read_nocancel is the read syscall, there is nothing in userland which could hang in this function.
------ Additional Comments From khoa.com 2003-22-10 18:35 ------- Sirish - based on Red Hat's comments, I suspect that there is a problem with the traceback you provided. Since RHEL3 GA is today, can you please try to see if the problem occurs on RHEL3 GA ? If so, please re-open this bug report. Thanks.
Can you please provide further information to help with the diagnosis. What platforms have you seen this on (pSeries, iSeries)? What device are you reading from? Please provide a traceback of the kernel portion of the stack when this happens (sysrq-T would be *very* helpful). And please reproduce with RHEL 3 RC4. Thanks.
Glen, can you please update this bugzilla? There isn't sufficient information for us to make progress.
Due to lack of information, we will have to assume that this is no longer a problem.
changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |mranweil.com Status|REJECTED |CLOSED ------- Additional Comments From mranweil.com 2005-06-14 19:10 EDT ------- Non-duplicate rejected bug that's been rejected for over 6 months. Marking closed, you can re-open if needed.