709393 – (CVE-2011-2491) CVE-2011-2491 kernel: rpc task leak after flock()ing NFS share

Bug 709393 (CVE-2011-2491) - CVE-2011-2491 kernel: rpc task leak after flock()ing NFS share

Summary: CVE-2011-2491 kernel: rpc task leak after flock()ing NFS share

Keywords:
Status:	CLOSED ERRATA
Alias:	CVE-2011-2491
Product:	Security Response
Classification:	Other
Component:	vulnerability
Sub Component:
Version:	unspecified
Hardware:	Unspecified
OS:	Linux
Priority:	high
Severity:	high
Target Milestone:	---
Assignee:	Red Hat Product Security
QA Contact:	Norm Murray
Docs Contact:
URL:
Whiteboard:
Depends On:	709546 709547 709548 709549 709550 752999
Blocks:
TreeView+	depends on / blocked

Reported:	2011-05-31 15:31 UTC by Vasily Averin
Modified:	2023-05-12 22:05 UTC (History)
CC List:	18 users (show)
Fixed In Version:
Clone Of:
Environment:
Last Closed:	2015-07-29 12:48:36 UTC
Embargoed:

Attachments	(Terms of Use)
dmesg output with NFS/NLM/RPC debug (208.82 KB, text/plain) 2011-05-31 15:31 UTC, Vasily Averin	no flags	Details
patch for 2.6.39 kernel (1.42 KB, patch) 2011-05-31 15:36 UTC, Vasily Averin	no flags	Details \| Diff
Proposed patch from Trond (2.98 KB, patch) 2011-06-01 01:00 UTC, Eugene Teo (Security Response)	no flags	Details \| Diff
View All

Links
System	ID	Priority	Status	Summary	Last Updated
Red Hat Product Errata	RHSA-2011:1189	normal	SHIPPED_LIVE	Important: kernel security, bug fix, and enhancement update	2011-08-23 14:41:43 UTC
Red Hat Product Errata	RHSA-2011:1212	normal	SHIPPED_LIVE	Important: kernel security and bug fix update	2011-09-06 21:41:35 UTC
Red Hat Product Errata	RHSA-2011:1253	normal	SHIPPED_LIVE	Important: kernel-rt security and bug fix update	2011-09-12 19:43:48 UTC
Red Hat Product Errata	RHSA-2011:1813	normal	SHIPPED_LIVE	Important: kernel security and bug fix update	2011-12-14 02:28:38 UTC

Description Vasily Averin 2011-05-31 15:31:26 UTC

Created attachment 502027 [details]
dmesg output with NFS/NLM/RPC debug

btw. I'm reporting this issue to akpm@ and to Trond Myklebust because other vendors and upstream linux kernel is affected too.

Description of problem:
unprivileged user can use flock syscall for NFS share to trigger unlimited resource leak and this could cause node DoS.

Details:
flock on NFS calls nlmclnt_lock() function, where nlmclnt_call() executes synchronous RPC task NLMPROC_LOCK. Via several subcalls (call_bind) it successfully connects to NFS server and decodes its reply, however because server did not set up r_port rpcb_getport_done() returns -EACCES, and following rpc subcall call_bind_status() in this case repeats call_bind() after 3 sec timeout.

It isn't a big problem, and this cycle could be broken by signal, however error path of nlmclnt_lock() executes identical asynchronous RPC call to unlock taken lock. Waiting of result of this call is also canceled by signal, however in this case RPC call is asynchronous, it cycles too and never finished.

I would like to ask Trond to recheck my analyze and review my patch with fix of this issue or propose some better and more correct solution.

My idea is to break cycle described above, return -EOPNOTSUPP to nlmclnt_lock(), check it on error path and do not execute NLMPROC_UNLOCK call because we already know that server does not support locking operations.

PS. one more separate issue: NLMPROC_UNLOCK can be cycled in another place: nlmclnt_unlock_callback() restarts RPC task in case of any errors. Therefore we cannot execute NLMPROC_UNLOCK on error path in case of server errors. I did not reproduce this issue, however I believe it's possible.

PPS. Also I've found similar cycles in other nlm callbacks, however I have no ideas is it possible to exploit it. However I would like to ask Trond to review this code too.

Version-Release number of selected component (if applicable):
2.6.32-131.0.15.el6.x86_64
I've reproduced this issue on RHEL5, RHEL6, ubuntu 11.04 and upstream linux kernel 2.6.39. 

How reproducible:
100%

Actual results:
flock hangs, can be canceled by signal, as result one more rpc task leaks
(FYI: sysctl -w sunrpc.rpc_debug=0 dumps status of active RPC taks) 

Expected results:
no hangs, no leaks

Additional info:
dmesg output with NFS/NLM/RPC debug and concept of patch are attached.
Because  vendor-sec is no longer in use I'm preparing private report to akpm@ and to Trond Myklebust, probably they can propose some better solution

Comment 1 Vasily Averin 2011-05-31 15:36:57 UTC

Created attachment 502030 [details]
patch for 2.6.39 kernel

Comment 3 Eugene Teo (Security Response) 2011-06-01 01:00:30 UTC

Created attachment 502142 [details]
Proposed patch from Trond

How about something like the appended fix? It should deal with errors
such as the rebind issue, but only after retrying a few times.

Cheers
 Trond

Comment 7 Eugene Teo (Security Response) 2011-06-01 01:12:53 UTC

Acknowledgements:

Red Hat would like to thank Vasily Averin for reporting this issue.

Comment 8 Eugene Teo (Security Response) 2011-06-23 07:09:42 UTC

Upstream commit:
http://git.kernel.org/linus/0b760113a3a155269a3fba93a409c640031dd68f

Comment 15 errata-xmlrpc 2011-08-23 14:41:53 UTC

This issue has been addressed in following products:

  Red Hat Enterprise Linux 6

Via RHSA-2011:1189 https://rhn.redhat.com/errata/RHSA-2011-1189.html

Comment 16 errata-xmlrpc 2011-09-12 19:44:31 UTC

This issue has been addressed in following products:

  MRG for RHEL-6 v.2

Via RHSA-2011:1253 https://rhn.redhat.com/errata/RHSA-2011-1253.html

Comment 17 errata-xmlrpc 2011-09-13 15:44:05 UTC

This issue has been addressed in following products:

  Red Hat Enterprise Linux 5

Via RHSA-2011:1212 https://rhn.redhat.com/errata/RHSA-2011-1212.html

Comment 19 errata-xmlrpc 2011-12-13 21:28:57 UTC

This issue has been addressed in following products:

  Red Hat Enterprise Linux 5.6.Z - Server Only

Via RHSA-2011:1813 https://rhn.redhat.com/errata/RHSA-2011-1813.html

Note You need to log in before you can comment on or make changes to this bug.