| Summary: | 6.2 kernel shows WARNING: at kernel/sched.c:5914 | ||||||
|---|---|---|---|---|---|---|---|
| Product: | Red Hat Enterprise Linux 6 | Reporter: | Rik Theys <rik.theys> | ||||
| Component: | kernel | Assignee: | Red Hat Kernel Manager <kernel-mgr> | ||||
| Status: | CLOSED DUPLICATE | QA Contact: | Red Hat Kernel QE team <kernel-qe> | ||||
| Severity: | medium | Docs Contact: | |||||
| Priority: | medium | ||||||
| Version: | 6.2 | CC: | ajb, akrherz, chref, Colin.Simpson, fahnoe, igeorgex, ilmis, jeder, jwest, mailroom, mkelly, osbugs, pasteur, pza, ralph, spurrier, ychavan | ||||
| Target Milestone: | rc | ||||||
| Target Release: | --- | ||||||
| Hardware: | Unspecified | ||||||
| OS: | Unspecified | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2012-01-04 20:23:55 UTC | Type: | --- | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Attachments: |
|
||||||
|
Description
Rik Theys
2011-12-24 13:59:13 UTC
I posted this on rhbz#767127, which this may or may not be a dupe of. As with Rik, I am seeing this oops fairly often, but I am running a fully up to date RHEL 6.2. I am seeing this on both nodes in my cluster (identical hardware). Dec 25 15:40:12 an-node01 kernel: ------------[ cut here ]------------ Dec 25 15:40:12 an-node01 kernel: WARNING: at kernel/sched.c:5914 thread_return+0x232/0x79d() (Tainted: G W ---------------- ) Dec 25 15:40:12 an-node01 kernel: Hardware name: empty Dec 25 15:40:12 an-node01 kernel: Modules linked in: gfs2 iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 drbd(U) dlm configfs ip6table_filter ip6_tables iptable_filter ip_tables ebtable_nat ebtables sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf bridge stp llc bonding ipv6 vhost_net macvtap macvlan tun kvm_intel kvm microcode shpchp i2c_i801 i2c_core sg iTCO_wdt iTCO_vendor_support e1000e ext4 mbcache jbd2 sd_mod crc_t10dif ahci dm_mirror dm_region_hash dm_log dm_mod [last unloaded: scsi_wait_scan] Dec 25 15:40:12 an-node01 kernel: Pid: 1343, comm: bond1 Tainted: G W ---------------- 2.6.32-220.2.1.el6.x86_64 #1 Dec 25 15:40:12 an-node01 kernel: Call Trace: Dec 25 15:40:12 an-node01 kernel: [<ffffffff81069997>] ? warn_slowpath_common+0x87/0xc0 Dec 25 15:40:12 an-node01 kernel: [<ffffffff810699ea>] ? warn_slowpath_null+0x1a/0x20 Dec 25 15:40:12 an-node01 kernel: [<ffffffff814eccc5>] ? thread_return+0x232/0x79d Dec 25 15:40:12 an-node01 kernel: [<ffffffff8107d068>] ? add_timer+0x18/0x30 Dec 25 15:40:12 an-node01 kernel: [<ffffffff8108be79>] ? queue_delayed_work_on+0xb9/0x120 Dec 25 15:40:12 an-node01 kernel: [<ffffffffa0269650>] ? bond_mii_monitor+0x0/0x610 [bonding] Dec 25 15:40:12 an-node01 kernel: [<ffffffff8108b15c>] ? worker_thread+0x1fc/0x2a0 Dec 25 15:40:12 an-node01 kernel: [<ffffffff81090a10>] ? autoremove_wake_function+0x0/0x40 Dec 25 15:40:12 an-node01 kernel: [<ffffffff8108af60>] ? worker_thread+0x0/0x2a0 Dec 25 15:40:12 an-node01 kernel: [<ffffffff810906a6>] ? kthread+0x96/0xa0 Dec 25 15:40:12 an-node01 kernel: [<ffffffff8100c14a>] ? child_rip+0xa/0x20 Dec 25 15:40:12 an-node01 kernel: [<ffffffff81090610>] ? kthread+0x0/0xa0 Dec 25 15:40:12 an-node01 kernel: [<ffffffff8100c140>] ? child_rip+0x0/0x20 Dec 25 15:40:12 an-node01 kernel: ---[ end trace 705d5c1db0fb1e00 ]--- The nodes are also Intel Xeons, but they are the much more modest E3-1220. The mainboard is a Tyan S5510 with 8GB of DDR3 ECC memory. Also getting this on most of my boxes after rebooting to kernel-2.6.32-220.2.1.el6.x86_64. Different stacks each time, triggered by different processes (bash, squid, cpuspeed, java, etc). All stacks have warn_slowpath_common at the top. There are also several reports on that on the CentOS bug tracker, see http://bugs.centos.org/view.php?id=5371 Red Hat has a KB article on this now: https://access.redhat.com/kb/docs/DOC-68014 which references this BZ: https://bugzilla.redhat.com/show_bug.cgi?id=766051 private BZ, grrrrrrrrrrrrrrr Thanks for the KB article. Can the above referenced bug be made open? It is starting to get old having all these abortd emails coming to me (since I forward all root emails from all my machines to me :) ) Created attachment 550159 [details]
my bug report
Comment on attachment 550159 [details]
my bug report
I hope more info will help. Thanks.
I uploaded my report. I hope this issue will be fixed :) Thanks! On 6.1 we get spammed with: /etc/cron.hourly/mcelog.cron: read: No such device , every time a system is reboooted (which I hope is fixed in 6.2). But now in 6.2 we get spammed with "[abrt] full crash report"s everytime a system is rebooted. It would nice for this to be fixed so we get "quiet" RHEL6 systems. *** This bug has been marked as a duplicate of bug 766051 *** that is a private bug! Jeremy, Can you clone the private bug (minus customer data) so that those of us also affected by this issue can track it? It would be much appreciated. *** Bug 770431 has been marked as a duplicate of this bug. *** Why is redhat closing other bugs in reference to this one, when this one is closed and references a private bug? I do see redhat removed the link from the KB article at least. |