Bug 904870 - [abrt]: kernel BUG at net/sunrpc/svc.c:559!
Summary: [abrt]: kernel BUG at net/sunrpc/svc.c:559!
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Fedora
Classification: Fedora
Component: kernel
Version: 18
Hardware: x86_64
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
Assignee: nfs-maint
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard: abrt_hash:2917b60288624a584838aae524c...
: 910457 921062 (view as bug list)
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2013-01-28 00:27 UTC by Qiao
Modified: 2013-03-22 00:14 UTC (History)
12 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2013-03-22 00:14:35 UTC
Type: ---
Embargoed:


Attachments (Terms of Use)

Description Qiao 2013-01-28 00:27:47 UTC
Description of problem:
i don't see any problem.

Additional info:
kernel BUG at net/sunrpc/svc.c:559!
invalid opcode: 0000 [#1] SMP 
Modules linked in: nfsv4 auth_rpcgss nfs dns_resolver fscache lockd sunrpc bnep bluetooth rfkill mpt2sas raid_class scsi_transport_sas coretemp kvm_intel iTCO_wdt iTCO_vendor_support r8169 kvm mii lpc_ich mfd_core microcode ppdev i2c_i801 parport_pc parport snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_page_alloc snd_timer snd soundcore uinput i915 video i2c_algo_bit drm_kms_helper drm i2c_core
CPU 0 
Pid: 756, comm: mount.nfs Not tainted 3.7.2-204.fc18.x86_64 #1 O.E.M O.E.M/G41MXE-V
RIP: 0010:[<ffffffffa0365362>]  [<ffffffffa0365362>] svc_destroy+0x132/0x140 [sunrpc]
RSP: 0018:ffff8800bef9b888  EFLAGS: 00010202
RAX: ffff8800366e1438 RBX: ffff8800366e1400 RCX: 0000000000000085
RDX: ffff8800c2bc5478 RSI: 0000000000000286 RDI: 0000000000000286
RBP: ffff8800bef9b898 R08: dead000000200200 R09: ffff8800c3c02e00
R10: 000000000000001c R11: 0000000000000000 R12: ffff8800bf2ac000
R13: ffff88003642a480 R14: ffff8800366e1400 R15: ffff8800bef9bae0
FS:  00007f8cf26ea840(0000) GS:ffff8800c7a00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00007f3a98fa0000 CR3: 00000000beca5000 CR4: 00000000000407f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process mount.nfs (pid: 756, threadinfo ffff8800bef9a000, task ffff8800befd4560)
Stack:
 ffff8800366e1400 ffff88003642a484 ffff8800bef9b8c8 ffffffffa0365776
 0000000000000000 0000000000000000 ffffffff81ca3b00 ffff8800c0fce000
 ffff8800bef9b8f8 ffffffffa0404063 0000000000000000 ffff8800bebb0400
Call Trace:
 [<ffffffffa0365776>] svc_exit_thread+0xa6/0xb0 [sunrpc]
 [<ffffffffa0404063>] nfs_callback_down+0x73/0xe0 [nfsv4]
 [<ffffffffa0407619>] nfs4_free_client+0x59/0xc0 [nfsv4]
 [<ffffffffa03afd2b>] nfs_put_client+0xeb/0x110 [nfs]
 [<ffffffffa040781e>] nfs4_init_client+0x19e/0x2e0 [nfsv4]
 [<ffffffff812e850f>] ? idr_get_new_above_int+0x3f/0x50
 [<ffffffffa0362fe4>] ? rpc_init_wait_queue+0x64/0xa0 [sunrpc]
 [<ffffffffa04074f9>] ? nfs4_alloc_client+0xf9/0x1c0 [nfsv4]
 [<ffffffffa03b0229>] nfs_get_client+0x299/0x350 [nfs]
 [<ffffffffa0407378>] nfs4_set_client.isra.5+0x78/0x100 [nfsv4]
 [<ffffffffa040820c>] nfs4_create_server+0xec/0x390 [nfsv4]
 [<ffffffffa0400f59>] nfs4_remote_mount+0x39/0x70 [nfsv4]
 [<ffffffff81198aa3>] mount_fs+0x43/0x1b0
 [<ffffffff81152040>] ? __alloc_percpu+0x10/0x20
 [<ffffffff811b1d7f>] vfs_kern_mount+0x6f/0x100
 [<ffffffffa0400ed1>] nfs_do_root_mount+0x91/0xe0 [nfsv4]
 [<ffffffffa0401244>] nfs4_try_mount+0x44/0xc0 [nfsv4]
 [<ffffffffa03b1340>] ? get_nfs_version+0x30/0x90 [nfs]
 [<ffffffffa03bc1a5>] nfs_fs_mount+0x265/0xcf0 [nfs]
 [<ffffffffa03bcd80>] ? nfs_clone_super+0x150/0x150 [nfs]
 [<ffffffffa03b9940>] ? nfs_clone_sb_security+0x60/0x60 [nfs]
 [<ffffffff81198aa3>] mount_fs+0x43/0x1b0
 [<ffffffff81152040>] ? __alloc_percpu+0x10/0x20
 [<ffffffff811b1d7f>] vfs_kern_mount+0x6f/0x100
 [<ffffffff811b27f4>] do_kern_mount+0x54/0x110
 [<ffffffff811b4355>] do_mount+0x315/0x8e0
 [<ffffffff811b3eaa>] ? copy_mount_options+0x3a/0x180
 [<ffffffff811b49ae>] sys_mount+0x8e/0xe0
 [<ffffffff8163ded9>] system_call_fastpath+0x16/0x1b
Code: 37 a0 31 c0 e8 82 64 2c e1 e9 22 ff ff ff 48 8b 07 8b 57 14 48 c7 c7 c7 ab 37 a0 48 8b 70 20 31 c0 e8 65 64 2c e1 e9 ec fe ff ff <0f> 0b 0f 0b 66 2e 0f 1f 84 00 00 00 00 00 66 66 66 66 90 55 48 
RIP  [<ffffffffa0365362>] svc_destroy+0x132/0x140 [sunrpc]
 RSP <ffff8800bef9b888>

Comment 1 Qiao 2013-01-28 00:40:28 UTC
in my fstab file have this line:
192.168.1.202:/share/smb	/home/qiao/202	nfs	defaults	0 0

and server is CentOS 5.7, samba-3.0.33-3.29.el5_6.2

Comment 2 J. Bruce Fields 2013-01-28 23:14:15 UTC
Looking at upstream v3.7.2, not the Fedora source, but I suspect the line numbers are the same, in which case this is:

  BUG_ON(!list_empty(&serv->sv_tempsocks));

which looks similar to the problem reported in http://mid.gmane.org/<50F42F85.50907>.

Comment 3 Stanislav Kinsburskiy 2013-01-29 11:05:58 UTC
Bruce, please have a look at "SUNRPC: protect transport processing with rw sem" patch in NFS mailing list.
This one is marked with RFC, because the solution is rough. But it seems much more simple and safe, then playing with transports bits, etc.

Comment 4 mauricio.esguerra 2013-02-18 07:57:58 UTC
I get a similar problem, but get a mount.nfs tainted message.
This started to happen after I switched from NFS4 to NFS3 to avoid kernel BUG at net/sunrpc/svc_xprt.c:921!

I'm mounting a RHEL Release 6.2



kernel BUG at net/sunrpc/svc.c:559!
invalid opcode: 0000 [#2] SMP 
Modules linked in: fuse nfsv3 nfs_acl nfsv4 auth_rpcgss nfs dns_resolver fscache ip6table_filter ip6_tables ebtable_nat ebtables ipt_MASQUERADE iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack xt_CHECKSUM iptable_mangle bridge st
p llc lockd sunrpc rfcomm bnep be2iscsi iscsi_boot_sysfs bnx2i cnic uio cxgb4i cxgb4 cxgb3i cxgb3 mdio libcxgbi ib_iser rdma_cm ib_addr iw_cm ib_cm ib_sa ib_
mad ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi snd_hda_codec_analog snd_hda_intel snd_hda_codec btusb snd_hwdep snd_seq bluetooth snd_seq_d
evice snd_pcm rfkill snd_page_alloc snd_timer snd coretemp soundcore mei iTCO_wdt iTCO_vendor_support i2c_i801 lpc_ich dell_wmi dcdbas mfd_core serio_raw spa
rse_keymap e1000e microcode vhost_net tun macvtap macvlan kvm_intel kvm binfmt_misc uinput ata_generic pata_acpi wmi i915 video i2c_algo_bit drm_kms_helper d
rm i2c_core
CPU 0 
Pid: 26429, comm: mount.nfs Tainted: G      D      3.7.3-101.fc17.x86_64 #1 Dell Inc. OptiPlex 780           
      /0DFRFW
 RIP: 0010:[<ffffffffa04e5362>]  [<ffffffffa04e5362>] svc_destroy+0x132/0x140 [sunrpc]
 RSP: 0018:ffff88010f023888  EFLAGS: 00010202
 RAX: ffff8800b486ba38 RBX: ffff8800b486ba00 RCX: 0000000000000069
 RDX: ffff880111ee15c8 RSI: 0000000000000286 RDI: 0000000000000286
 RBP: ffff88010f023898 R08: dead000000200200 R09: ffff880117beafa0
 R10: 00000000000000e0 R11: 0000000000000000 R12: ffff8801033de000
 R13: ffff880069daff00 R14: ffff8800b486ba00 R15: ffff88010f023ae0
 FS:  00007f8f20514840(0000) GS:ffff880117c00000(0000) knlGS:0000000000000000
 CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
 CR2: 00007f8f1fbb16f5 CR3: 0000000110666000 CR4: 00000000000407f0
 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
 DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
 Process mount.nfs (pid: 26429, threadinfo ffff88010f022000, task ffff880013862e40)
 Stack:
  ffff8800b486ba00 ffff880069daff04 ffff88010f0238c8 ffffffffa04e5776
  0000000000000000 0000000000000000 ffffffff81ca3980 ffff88010cffb800
  ffff88010f0238f8 ffffffffa062e063 0000000000000000 ffff88001ab22c00
 Call Trace:
  [<ffffffffa04e5776>] svc_exit_thread+0xa6/0xb0 [sunrpc]
  [<ffffffffa062e063>] nfs_callback_down+0x73/0xe0 [nfsv4]
  [<ffffffffa0631619>] nfs4_free_client+0x59/0xc0 [nfsv4]
  [<ffffffffa05cad2b>] nfs_put_client+0xeb/0x110 [nfs]
  [<ffffffffa063181e>] nfs4_init_client+0x19e/0x2e0 [nfsv4]
  [<ffffffff812e64bf>] ? idr_get_new_above_int+0x3f/0x50
  [<ffffffffa04e2fe4>] ? rpc_init_wait_queue+0x64/0xa0 [sunrpc]
  [<ffffffffa06314f9>] ? nfs4_alloc_client+0xf9/0x1c0 [nfsv4]
  [<ffffffffa05cb229>] nfs_get_client+0x299/0x350 [nfs]
  [<ffffffffa0631378>] nfs4_set_client.isra.5+0x78/0x100 [nfsv4]
  [<ffffffffa063220c>] nfs4_create_server+0xec/0x390 [nfsv4]
  [<ffffffffa062af59>] nfs4_remote_mount+0x39/0x70 [nfsv4]
  [<ffffffff81198443>] mount_fs+0x43/0x1b0
  [<ffffffff811519d0>] ? __alloc_percpu+0x10/0x20
  [<ffffffff811b171f>] vfs_kern_mount+0x6f/0x100
  [<ffffffffa062aed1>] nfs_do_root_mount+0x91/0xe0 [nfsv4]
  [<ffffffffa062b244>] nfs4_try_mount+0x44/0xc0 [nfsv4]
  [<ffffffffa05cc340>] ? get_nfs_version+0x30/0x90 [nfs]
  [<ffffffffa05d71a5>] nfs_fs_mount+0x265/0xcf0 [nfs]
  [<ffffffffa05d7d80>] ? nfs_clone_super+0x150/0x150 [nfs]
  [<ffffffffa05d4940>] ? nfs_clone_sb_security+0x60/0x60 [nfs]
  [<ffffffff81198443>] mount_fs+0x43/0x1b0
  [<ffffffff811519d0>] ? __alloc_percpu+0x10/0x20
  [<ffffffff811b171f>] vfs_kern_mount+0x6f/0x100
  [<ffffffff811b2194>] do_kern_mount+0x54/0x110
  [<ffffffff811b3cf5>] do_mount+0x315/0x8e0
  [<ffffffff811b384a>] ? copy_mount_options+0x3a/0x180
  [<ffffffff811b434e>] sys_mount+0x8e/0xe0
  [<ffffffff81639799>] system_call_fastpath+0x16/0x1b
 Code: 4f a0 31 c0 e8 32 1d 14 e1 e9 22 ff ff ff 48 8b 07 8b 57 14 48 c7 c7 c7 ab 4f a0 48 8b 70 20 31 c0 e8 1
5 1d 14 e1 e9 ec fe ff ff <0f> 0b 0f 0b 66 2e 0f 1f 84 00 00 00 00 00 66 66 66 66 90 55 48 
 RIP  [<ffffffffa04e5362>] svc_destroy+0x132/0x140 [sunrpc]
RSP <ffff88010f023888>

Comment 5 Stanislav Kinsburskiy 2013-02-18 08:02:14 UTC
You can try these patches:
http://www.spinics.net/lists/linux-nfs/msg35363.html
http://www.spinics.net/lists/linux-nfs/msg35364.html

Comment 6 J. Bruce Fields 2013-02-18 15:10:41 UTC
*** Bug 910457 has been marked as a duplicate of this bug. ***

Comment 7 Jason Tibbitts 2013-02-28 18:48:59 UTC
Just wanted to note that I just now saw this (or, more correctly, the "kernel BUG at net/sunrpc/svc_xprt.c:921" backtrace I reported in one of the duplicated) on F17 with 3.7.9-104.fc17.x86_64.  I had not previously seen this issue on F17.

Comment 8 Stanislav Kinsburskiy 2013-03-01 05:43:37 UTC
(In reply to comment #7)
> Just wanted to note that I just now saw this (or, more correctly, the
> "kernel BUG at net/sunrpc/svc_xprt.c:921" backtrace I reported in one of the
> duplicated) on F17 with 3.7.9-104.fc17.x86_64.  I had not previously seen
> this issue on F17.

This is a race. I.e. it doesn't happen on every run.
So it's ok.
And wanna note, that the patches, fixing the problem, wasn't yet included into stable branch.

Comment 9 Jason Tibbitts 2013-03-01 06:34:37 UTC
Yeah, I know.  Unfortunately I don't think the patches even got much of a review, or at least if they did I was dropped from the CC list.  They're certainly not in Linus's tree.  I would petition for them to get into Fedora's kernel but without any indication of movement towards upstream acceptance I don't think they'd agree.  I'd really rather not have to roll out a custom kernel across all of my desktops, but I'm not sure what other option I have at this point; even a rare bug happens pretty commonly when you get up over a hundred desktops.

Comment 10 Stanislav Kinsburskiy 2013-03-01 06:44:00 UTC
The patches are in maintaners's tree and will become a part of 3.9 kernel.
Also, they are requested to be included into stable kernels.
Once they will be accepted by Greg, I (or Bruce) will inform you about stable kernel version you can use.

Comment 11 Josh Boyer 2013-03-01 13:42:58 UTC
They're in Linus' tree as of this morning.  Fedora can carry them as patches until they work into 3.8.y.

Comment 12 Josh Boyer 2013-03-01 14:01:09 UTC
Applied to both F17 and F18.

Comment 13 Fedora Update System 2013-03-08 22:14:49 UTC
kernel-3.8.2-105.fc17 has been submitted as an update for Fedora 17.
https://admin.fedoraproject.org/updates/kernel-3.8.2-105.fc17

Comment 14 Fedora Update System 2013-03-10 01:02:26 UTC
Package kernel-3.8.2-105.fc17:
* should fix your issue,
* was pushed to the Fedora 17 testing repository,
* should be available at your local mirror within two days.
Update it with:
# su -c 'yum update --enablerepo=updates-testing kernel-3.8.2-105.fc17'
as soon as you are able to, then reboot.
Please go to the following url:
https://admin.fedoraproject.org/updates/FEDORA-2013-3638/kernel-3.8.2-105.fc17
then log in and leave karma (feedback).

Comment 15 Josh Boyer 2013-03-13 12:25:27 UTC
*** Bug 921062 has been marked as a duplicate of this bug. ***

Comment 16 Fedora Update System 2013-03-14 15:17:57 UTC
kernel-3.8.2-105.fc17 has been submitted as an update for Fedora 17.
https://admin.fedoraproject.org/updates/FEDORA-2013-3638/kernel-3.8.2-105.fc17

Comment 17 Fedora Update System 2013-03-14 22:54:52 UTC
kernel-3.8.3-101.fc17 has been submitted as an update for Fedora 17.
https://admin.fedoraproject.org/updates/kernel-3.8.3-101.fc17

Comment 18 Fedora Update System 2013-03-16 01:27:35 UTC
Package kernel-3.8.3-101.fc17:
* should fix your issue,
* was pushed to the Fedora 17 testing repository,
* should be available at your local mirror within two days.
Update it with:
# su -c 'yum update --enablerepo=updates-testing kernel-3.8.3-101.fc17'
as soon as you are able to, then reboot.
Please go to the following url:
https://admin.fedoraproject.org/updates/FEDORA-2013-3909/kernel-3.8.3-101.fc17
then log in and leave karma (feedback).

Comment 19 Fedora Update System 2013-03-22 00:14:40 UTC
kernel-3.8.3-103.fc17 has been pushed to the Fedora 17 stable repository.  If problems still persist, please make note of it in this bug report.


Note You need to log in before you can comment on or make changes to this bug.