Bug 689483
Summary: | crash in ganglia moddisk.so | ||
---|---|---|---|
Product: | [Fedora] Fedora | Reporter: | Terje Røsten <terje.rosten> |
Component: | ganglia | Assignee: | Kostas Georgiou <k.georgiou> |
Status: | CLOSED ERRATA | QA Contact: | Fedora Extras Quality Assurance <extras-qa> |
Severity: | medium | Docs Contact: | |
Priority: | unspecified | ||
Version: | 15 | CC: | bernard, cjg9411, k.georgiou, kjell.m.randa |
Target Milestone: | --- | Keywords: | Reopened |
Target Release: | --- | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | ganglia-3.1.7-4.fc15 | Doc Type: | Bug Fix |
Last Closed: | 2011-07-15 01:24:00 UTC | Type: | --- |
Description
Terje Røsten
2011-03-21 16:19:18 UTC
I can't reproduce locally, unfortunately :( Would it be possible to install the debuginfo packages and try again from inside gdb?

Hm, strange, I can't reproduce it any longer. From yum history I see that glibc has been updated to glibc-2.13.90-7.x86_64. Do you use glibc-2.13.90-7 too?

I tested on a VM running Fedora 15 Alpha x86_64 and couldn't reproduce it either. Also using glibc-2.13.90-7.x86_64.

I tested with glibc-2.13.90-6 actually. We can't rule out a stack overflow somewhere, but until it shows up again I'll put it down as a "random" rawhide failure.

Yeah, you can close it, I will reopen if I see the problem again. Thanks for the quick reply.

Reopening since I just saw the problem.

(gdb) where
#0 0x0000003d9cc362c5 in raise () from /lib64/libc.so.6
#1 0x0000003d9cc37bdb in abort () from /lib64/libc.so.6
#2 0x0000003d9cc722c3 in __libc_message () from /lib64/libc.so.6
#3 0x0000003d9ccf7a87 in __fortify_fail () from /lib64/libc.so.6
#4 0x0000003d9ccf7a50 in __stack_chk_fail () from /lib64/libc.so.6
#5 0x00007ffff1d02ef0 in find_disk_space (total_size=0x7fffffffe2e8, total_free=0x7fffffffe2e0) at metrics.c:1264
#6 0x00007ffff1d02f40 in disk_total_func () at metrics.c:1289
#7 0x00007ffff1d005aa in disk_metric_handler (metric_index=<optimized out>) at mod_disk.c:36
#8 disk_metric_handler (metric_index=<optimized out>) at mod_disk.c:27

Unfortunately, after I set up some breakpoints and tried to find where it dies, it magically fixed itself! My bet for the cause is the sscanf in find_disk_space. I can't see anything else that could overwrite the stack in the function.

sscanf(procline, "%s %s %s %s ", device, mount, type, mode);

Got it, here is an entry from /proc/mounts (which was getting expired and removed by the automounter every time I was trying to debug the problem, so I missed it the first few times):

foobar:/a/b/c /vols/c nfs rw,nosuid,nodev,relatime,vers=3,rsize=32768,wsize=32768,namlen=255,hard,proto=tcp,timeo=600,retrans=2,sec=sys,mountaddr=10.10.10.10,mountvers=3,mountport=22194,mountproto=udp,local_lock=none,addr=10.10.10.10

char mount[128], device[128], type[32], mode[128];
rc=sscanf(procline, "%s %s %s %s ", device, mount, type, mode);

Not f15 specific it seems, it's just better at detecting stack corruptions.

Any fix available for this? It's beginning to annoy me a little :-)

ganglia-3.1.7-4.fc15 has been submitted as an update for Fedora 15.
https://admin.fedoraproject.org/updates/ganglia-3.1.7-4.fc15

Package ganglia-3.1.7-4.fc15:
* should fix your issue,
* was pushed to the Fedora 15 testing repository,
* should be available at your local mirror within two days.
Update it with:
# su -c 'yum update --enablerepo=updates-testing ganglia-3.1.7-4.fc15'
as soon as you are able to. Please go to the following url:
https://admin.fedoraproject.org/updates/ganglia-3.1.7-4.fc15
then log in and leave karma (feedback).

ganglia-3.1.7-4.fc15 has been pushed to the Fedora 15 stable repository. If problems still persist, please make note of it in this bug report.
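
For readers following the diagnosis above: the options field of the quoted /proc/mounts entry is a little over 200 characters long, while mode is declared as char[128], so an unbounded %s writes past the end of the buffer and trips the stack protector. One common way to guard against this is to give each %s a maximum field width of the buffer size minus one. The sketch below only illustrates that technique and is not necessarily the change shipped in ganglia-3.1.7-4.fc15; the struct mount_entry type and the parse_mount_line helper are hypothetical names, and only the buffer sizes come from the report.

```c
#include <stdio.h>

/* Buffer sizes as quoted in the report; the struct itself is hypothetical. */
struct mount_entry {
    char device[128];
    char mount[128];
    char type[32];
    char mode[128];
};

/*
 * Hypothetical helper: parse the first four fields of one /proc/mounts line.
 * Each %s carries a maximum field width of sizeof(buffer) - 1, so sscanf
 * stores at most that many characters plus the terminating NUL. An over-long
 * field is truncated instead of overflowing the stack.
 * Returns the number of fields converted (4 when all fields are present).
 */
static int parse_mount_line(const char *procline, struct mount_entry *e)
{
    return sscanf(procline, "%127s %127s %31s %127s",
                  e->device, e->mount, e->type, e->mode);
}
```

With the NFS entry quoted above, this version still converts four fields, but mode holds only the first 127 characters of the option string. That is enough if the caller only inspects the leading options; a caller that needs the full option list would need a larger or dynamically sized buffer.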