Bug 1150585
| Summary: | numad dies after number of "Could not write 1 to /cgroup/cpuset/libvirt/qemu/vm_name/emulator/cpuset.mems -- errno: 13" errors | ||||||
|---|---|---|---|---|---|---|---|
| Product: | Red Hat Enterprise Linux 6 | Reporter: | Alexandros Gkesos <agkesos> | ||||
| Component: | numad | Assignee: | Bill Gray <bgray> | ||||
| Status: | CLOSED ERRATA | QA Contact: | qe-baseos-daemons | ||||
| Severity: | medium | Docs Contact: | |||||
| Priority: | medium | ||||||
| Version: | 6.5 | CC: | bgray, jprokes, jscotka, jsynacek, juergen_thomann, ppostler, psklenar, tlavigne | ||||
| Target Milestone: | rc | Keywords: | Regression | ||||
| Target Release: | --- | ||||||
| Hardware: | Unspecified | ||||||
| OS: | Linux | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | |||||||
| : | 1235164 (view as bug list) | Environment: | |||||
| Last Closed: | 2015-07-22 07:46:24 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Attachments: |
|
||||||
|
Description
Alexandros Gkesos
2014-10-08 13:08:15 UTC
Created attachment 945210 [details]
Hypervisor's sosreport
We have the same problem and it is caused by too many open file descriptors. I'm now testing the following patch:
--- a/numad.c
+++ b/numad.c
@@ -1111,6 +1111,7 @@ int write_to_cpuset_file(char *fname, char *s) {
numad_log(LOG_DEBUG, "Writing %s to: %s\n", s, fname);
if (write(fd, s, strlen(s)) <= 0) {
numad_log(LOG_CRIT, "Could not write %s to %s -- errno: %d\n", s, fname, errno);
+ close(fd);
return -1;
}
close(fd);
Looks like the patch in Comment 6 would have prevented numad from crashing by fixing the failure to close the file during error conditions when attempting to write to the file. Thanks. The new version of numad no longer uses cpusets, entirely eliminating the writes to the cpuset control files. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHBA-2015-1441.html |