Created attachment 397632 [details]
Source file that exhibits the problem.

Description of problem:
Multithreaded applications competing for a mutex often crash with:
pthread_mutex_lock.c:87: __pthread_mutex_lock: Assertion `mutex->__data.__owner == 0' failed.
I've attached source code that reproduces the exact problem.

Version-Release number of selected component (if applicable):
Environment 1: Fedora Core 10
2.6.27.24-170.2.68.fc10.i686.PAE #1 SMP Wed May 20 22:58:30 EDT 2009
gcc (GCC) 4.3.2 20081105 (Red Hat 4.3.2-7)
glibc 2.9 i386

Environment 2: Fedora Core 11
2.6.29.4-167.fc11.i586 #1 SMP Wed May 27 17:14:37 EDT 2009
gcc (GCC) 4.4.1 20090725 (Red Hat 4.4.1-2)
glibc 2.10.2 i686

How reproducible:
Always. Just start some pthreads competing for a lock on a default-initialized mutex; the crash occurs inside pthread_mutex_lock.

Steps to Reproduce:
1. Compile the attachment like this:
$ g++ -g -Wall -Werror -pipe -O3 -Wno-deprecated break_pthreads.cpp -o break_pthreads.o -c
$ g++ -g -pthread break_pthreads.o -o break_pthreads
$ rm break_pthreads.o
2. Run ./break_pthreads
3. On my fc10 and fc11 systems it takes less than a second for the crash to happen.

Actual results:
break_pthreads: pthread_mutex_lock.c:62: __pthread_mutex_lock: Assertion `mutex->__data.__owner == 0' failed.
Aborted (core dumped)

Expected results:
The program should run forever.

Additional info:
I've already read:
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=479952
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29415
Those links seem to be related.
The program has undefined behaviour. Use PTHREAD_ERRORCHECK_MUTEX_INITIALIZER_NP.
Can you please explain why it has undefined behaviour? Is it OK to use PTHREAD_ERRORCHECK_MUTEX_INITIALIZER_NP in production code?
The code locks the mutex once and then unlocks it twice on some code paths; you can't unlock a mutex that isn't locked. It is OK to use error-checking mutexes in production code, they are just slower than the normal ones. With an error-checking mutex the second pthread_mutex_unlock will simply fail with EPERM. It's much better to just fix the bug, though.
Thanks a lot Jakub! The production code is completely different but runs into the same issue. I think I'm going to put wrapper calls around pthread_mutex_lock and pthread_mutex_unlock so I can better trace any failing path through their return values. Regards!