Bug 1306341
Summary: | spinning rt tasks: hung of jbd2 kworkers | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 7 | Reporter: | Daniel Bristot de Oliveira <daolivei> |
Component: | kernel-rt | Assignee: | Clark Williams <williams> |
kernel-rt sub component: | Other | QA Contact: | Jiri Kastner <jkastner> |
Status: | CLOSED WONTFIX | Docs Contact: | |
Severity: | high | ||
Priority: | unspecified | CC: | bhu, lcapitulino |
Version: | 7.1 | ||
Target Milestone: | rc | ||
Target Release: | 7.3 | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Last Closed: | 2017-11-29 16:55:27 UTC | Type: | Bug |
Bug Blocks: | 1442258 |
Description
Daniel Bristot de Oliveira
2016-02-10 15:40:53 UTC
We're debugging a KVM-RT issue that looks similar: Bug 1448770 - several tasks blocked for more than 600 seconds (see the stack trace in bug 1448770 comment 25). However, we haven't been able to get a working vmcore yet, and I haven't been able to reproduce it myself. Do you have a reproducer?

Unfortunately, we do not have a reproducer. Should we talk to storage/fs people?

If they can help get a reproducer, yes. But I think it's possible that bug 1448770 is the same issue, and we have a reproducer for that one. I also suspect that this issue is caused by workqueue NUMA scheduling, but I don't have enough data to confirm this yet (if confirmed, that would be very good news, since workqueue NUMA scheduling can be easily disabled).

Never mind the workqueue NUMA scheduling hypothesis, at least for bug 1448770. The issue can be reproduced even when workqueue NUMA scheduling is disabled.

This bug has not been seen in months and can be worked around with the RT_RUNTIME_GREED feature. An actual fix to avoid starving kworkers/softirqd threads will require upstream RT architecture changes. Closing WONTFIX.