Bug 679407
| Summary: | [5.7] niu: Fix races between up/down and get_stats. | |||
|---|---|---|---|---|
| Product: | Red Hat Enterprise Linux 5 | Reporter: | Flavio Leitner <fleitner> | |
| Component: | kernel | Assignee: | Jiri Pirko <jpirko> | |
| Status: | CLOSED ERRATA | QA Contact: | Weibing Zhang <atzhang> | |
| Severity: | high | Docs Contact: | ||
| Priority: | high | |||
| Version: | 5.5 | CC: | agospoda, anton, cye, hjia, jpirko, jwest, kzhang, mjenner, qcai, rkhan, sassmann | |
| Target Milestone: | rc | |||
| Target Release: | --- | |||
| Hardware: | Unspecified | |||
| OS: | Linux | |||
| Whiteboard: | ||||
| Fixed In Version: | Doc Type: | Bug Fix | ||
| Doc Text: | Story Points: | --- | ||
| Clone Of: | ||||
| : | 683069 (view as bug list) | Environment: | ||
| Last Closed: | 2011-07-21 10:10:31 UTC | Type: | --- | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Embargoed: | ||||
| Bug Depends On: | ||||
| Bug Blocks: | 683069 | |||
|
Description
Flavio Leitner
2011-02-22 14:12:59 UTC
Processes racing:
PID: 8348 TASK: ffff8103eaffe7e0 CPU: 3 COMMAND: "ifenslave"
#0 [ffff81041fcbbf20] crash_nmi_callback at ffffffff8007bce5
#1 [ffff81041fcbbf40] do_nmi at ffffffff800658c5
#2 [ffff81041fcbbf50] nmi at ffffffff80064eaf
[exception RIP: niu_init_hw+2199]
RIP: ffffffff8825c700 RSP: ffff8103ea48bc88 RFLAGS: 00000086
RAX: ffffc20013600160 RBX: 0000000000000000 RCX: 000000000000000c
RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
RBP: ffff81041df74500 R8: 0000000000000001 R9: 0000000000000002
R10: 0000e00800000000 R11: 0000000000000006 R12: ffff81041df7565c
R13: ffff81041df755d4 R14: 0000000000000160 R15: 000000000000002c
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
--- <NMI exception stack> ---
#3 [ffff8103ea48bc88] niu_init_hw at ffffffff8825c700
#4 [ffff8103ea48bcc0] niu_open at ffffffff88263704
#5 [ffff8103ea48bcf0] dev_open at ffffffff80234b54
#6 [ffff8103ea48bd00] bond_enslave at ffffffff885df6a5
#7 [ffff8103ea48bd80] bond_do_ioctl at ffffffff885e165c
#8 [ffff8103ea48bde0] dev_ioctl at ffffffff802347a5
#9 [ffff8103ea48be90] sock_ioctl at ffffffff8022a58c
#10 [ffff8103ea48beb0] do_ioctl at ffffffff8004226a
#11 [ffff8103ea48bed0] vfs_ioctl at ffffffff8003026e
#12 [ffff8103ea48bf40] sys_ioctl at ffffffff8004c73b
#13 [ffff8103ea48bf80] tracesys at ffffffff8005d28d (via system_call)
RIP: 0000003c2e6cc647 RSP: 00007fffac465908 RFLAGS: 00000246
RAX: ffffffffffffffda RBX: ffffffff8005d28d RCX: ffffffffffffffff
RDX: 00007fffac465b00 RSI: 0000000000008990 RDI: 000000000000000b
RBP: 00007fffac4670c3 R8: 00007fffac465b15 R9: 0000000000000003
R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
R13: 0000000000000000 R14: 00007fffac4670c9 R15: 00007fffac465b00
ORIG_RAX: 0000000000000010 CS: 0033 SS: 002b
PID: 8444 TASK: ffff8103eafa9100 CPU: 4 COMMAND: "ifconfig"
#0 [ffff8103ea34dbf0] crash_kexec at ffffffff800af85a
#1 [ffff8103ea34dcb0] __die at ffffffff80065117
#2 [ffff8103ea34dcf0] do_page_fault at ffffffff8006748d
#3 [ffff8103ea34dde0] error_exit at ffffffff8005dde9
[exception RIP: niu_get_stats+121]
RIP: ffffffff88262156 RSP: ffff8103ea34de90 RFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff81041df74000 RCX: 0000000000000001
RDX: ffff81041df74500 RSI: 0000000000000000 RDI: 0000000000000000
RBP: ffff81041d702e40 R8: 0000000000000000 R9: 0000000000000006
R10: 0000000000000004 R11: 0000000000000000 R12: 00000000000002b4
R13: 0000000000001000 R14: 0000000000000000 R15: 00002b3ff19f4000
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
#4 [ffff8103ea34de90] dev_seq_show at ffffffff8023256d
#5 [ffff8103ea34deb0] seq_read at ffffffff8003f858
#6 [ffff8103ea34df10] vfs_read at ffffffff8000b787
#7 [ffff8103ea34df40] sys_read at ffffffff80011c5c
#8 [ffff8103ea34df80] tracesys at ffffffff8005d28d (via system_call)
RIP: 0000003c2e6c5ff0 RSP: 00007fff078ab918 RFLAGS: 00000246
RAX: ffffffffffffffda RBX: ffffffff8005d28d RCX: ffffffffffffffff
RDX: 0000000000001000 RSI: 00002b3ff19f4000 RDI: 0000000000000005
RBP: 000000001e6005a0 R8: 00000000ffffffff R9: 0000000000000000
R10: 0000000000000022 R11: 0000000000000246 R12: 0000000000000000
R13: 0000000000000000 R14: 000000001e6005a0 R15: 00000000078ab9a8
ORIG_RAX: 0000000000000000 CS: 0033 SS: 002b
This request was evaluated by Red Hat Product Management for inclusion in a Red Hat Enterprise Linux maintenance release. Product Management has requested further review of this request by Red Hat Engineering, for potential inclusion in a Red Hat Enterprise Linux Update release for currently deployed products. This request is not yet committed for inclusion in an Update release. Adding Stefan to the CC list as he is the niu maintainer and should have been assigned this bug. in kernel-2.6.18-246.el5 You can download this test kernel (or newer) from http://people.redhat.com/jwilson/el5 Detailed testing feedback is always welcomed. Reproduced on kernel-2.6.18-238.el5, execute "while true; do cat /proc/net/dev; done", kernel panic was hit after several ifup and ifdown on niu NIC. On kernel-2.6.18-246.el5, running "while true; do cat /proc/net/dev; done" and "while true; do ifdown eth5; sleep 3; ifup eth5; sleep 3 ; done" in two ssh sessions, no panic is triggered in test test for about an hour. Set Verified. An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on therefore solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHSA-2011-1065.html |