Description of problem:
Some of our oprofile tests regularly fail on certain hardware. I am starting to report these so that either our tests can be fixed (if they are wrong) or a genuine bug in oprofile can be confirmed.

Some of our tests collect data with oprofile and then run commands that work on the dumped data. This usually works, but it sometimes fails, and the failures are usually bound to certain hardware or a certain box.

The failing test follows the oprofile tutorial for Fedora 12 at http://people.redhat.com/wcohen/Fedora12OprofileTutorial.txt

::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
:: [ LOG ] :: Oprofile configuration
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
:: [ PASS ] :: Running 'opcontrol --init'
:: [07:21:48] :: x86-64/family10
:: [ PASS ] :: Running 'opcontrol --setup --no-vmlinux'
:: [ PASS ] :: Running 'ophelp --get-default-event'
:: [07:21:48] :: Using CPU_CLK_UNHALTED:100000:0:1:1 as default event
:: [ PASS ] :: Running 'opcontrol --setup --no-vmlinux --event=CPU_CLK_UNHALTED:100000:0:1:1'
:: [ PASS ] :: Running 'opcontrol --setup --vmlinux=/usr/lib/debug/lib/modules/2.6.32-211.el6.i686/vmlinux --event=CPU_CLK_UNHALTED:100000:0:1:1'
Using 2.6+ OProfile kernel interface.
Reading module info.
Using log file /var/lib/oprofile/samples/oprofiled.log
Daemon started.
Profiler running.
:: [ PASS ] :: Running 'opcontrol --start'
:: [ PASS ] :: Running 'ps aux | grep oprofiled'
:: [ PASS ] :: Running 'opcontrol --shutdown'
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
:: [ LOG ] :: Collecting and analysing data
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
:: [ PASS ] :: Running 'opcontrol --start'
:: [07:21:56] :: Sleeping for a while
:: [ PASS ] :: Running 'opcontrol --dump'
:: [ PASS ] :: File /var/lib/oprofile/samples/current should exist
:: [ PASS ] :: Running 'opcontrol --shutdown'
:: [ PASS ] :: Running 'opcontrol --save=junk'
:: [ PASS ] :: Running 'opcontrol --start'
:: [07:22:08] :: Executing the benchmark
<output snipped>
:: [07:22:10] :: Benchmark finished
:: [ PASS ] :: Running 'opcontrol --dump'
:: [07:22:11] :: Sleeping for a while
opreport error: No sample file found: try running opcontrol --dump or specify a session containing sample files
:: [ FAIL ] :: Running 'opreport --long-filenames --threshold 1' (Expected 0, got 1)
<same error>
:: [ FAIL ] :: Running 'opreport --long-filenames --threshold 1 | grep linpack1000d' (Expected 0, got 1)
<same error>
:: [ FAIL ] :: Running 'opreport -l image:/mnt/testarea/tests/oprofile/Sanity/fedora12-tutorial/linpack1000d --threshold .1' (Expected 0, got 1)
<same error>
:: [ FAIL ] :: Running 'opreport -l image:/mnt/testarea/tests/oprofile/Sanity/fedora12-tutorial/linpack1000d --threshold .1 | grep daxpy' (Expected 0, got 1)
<same error>
:: [ FAIL ] :: Running 'opreport -l image:/mnt/testarea/tests/oprofile/Sanity/fedora12-tutorial/linpack1000d --include-symbols=daxpy_ --details' (Expected 0, got 1)
<same error>
:: [ FAIL ] :: Running 'opannotate --source /mnt/testarea/tests/oprofile/Sanity/fedora12-tutorial/linpack1000d -o session_annote' (Expected 0, got 1)
:: [ FAIL ] :: File session_annote should exist
:: [ PASS ] :: Running 'opcontrol --shutdown'
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
:: [ LOG ] :: Saving profiling data
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
:: [ PASS ] :: Running 'opcontrol --start'
:: [07:22:28] :: Executing the benchmark
<output snipped>
:: [07:22:30] :: Benchmark finished
:: [ PASS ] :: Running 'opcontrol --save=linpack'
:: [ PASS ] :: Running 'opcontrol --dump'
:: [07:22:33] :: Sleeping for a while
error: no sample files found: profile specification too strict ?
:: [ FAIL ] :: Running 'opreport session:linpack --long-filenames --threshold 1' (Expected 0, got 1)
error: no sample files found: profile specification too strict ?
:: [ FAIL ] :: Running 'opreport session:linpack --long-filenames --threshold 1 | grep linpack1000d' (Expected 0, got 1)
error: no sample files found: profile specification too strict ?
:: [ FAIL ] :: Running 'opreport session:linpack -l image:/mnt/testarea/tests/oprofile/Sanity/fedora12-tutorial/linpack1000d --threshold .1' (Expected 0, got 1)
error: no sample files found: profile specification too strict ?
:: [ FAIL ] :: Running 'opreport session:linpack -l image:/mnt/testarea/tests/oprofile/Sanity/fedora12-tutorial/linpack1000d --threshold .1 | grep daxpy' (Expected 0, got 1)
error: no sample files found: profile specification too strict ?
:: [ FAIL ] :: Running 'opreport session:linpack -l image:/mnt/testarea/tests/oprofile/Sanity/fedora12-tutorial/linpack1000d --include-symbols=daxpy_ --details' (Expected 0, got 1)
error: no sample files found: profile specification too strict ?
:: [ FAIL ] :: Running 'opannotate session:linpack --source /mnt/testarea/tests/oprofile/Sanity/fedora12-tutorial/linpack1000d -o linpack_annote' (Expected 0, got 1)
:: [ FAIL ] :: File linpack_annote should exist

Version-Release number of selected component (if applicable):
oprofile-0.9.6-21.el6.i686

How reproducible:
always on hp-bl495cg5-01.rhts.bos.redhat.com (4 cores, family 16=10h, model 2, arch i686)

Steps to Reproduce:
1. see above

Actual results:
opreport and opannotate do not show anything after opcontrol --dump

Expected results:
something useful

Additional info:
This is very environment specific. The identical test passed on a different box with the same CPU family/model running x86_64. I was not able to test whether the real factor is the box itself or x86-64ness. This is not a regression.
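To compare runs across boxes, it may help to reduce a log like the one above to its PASS/FAIL lines. The following is only a sketch: the names `summarize` and `RESULT_RE` are made up for illustration, and it assumes the log has one `:: [ PASS ] ::` or `:: [ FAIL ] ::` result per line, as in the test output quoted in this report.

```python
import re
from collections import Counter

# Matches result lines such as ":: [ PASS ] :: Running 'opcontrol --init'".
# Timestamp lines (":: [07:21:48] :: ...") and "[ LOG ]" headers do not match.
RESULT_RE = re.compile(r"::\s*\[\s*(PASS|FAIL)\s*\]\s*::\s*(.*)")


def summarize(log_text):
    """Return (counts, failed) for the PASS/FAIL result lines in a test log.

    counts is a Counter keyed by "PASS" / "FAIL"; failed is the list of
    messages from the FAIL lines, in the order they appear.
    """
    counts = Counter()
    failed = []
    for line in log_text.splitlines():
        match = RESULT_RE.search(line)
        if match is None:
            continue  # command output, timestamps, separators
        status, message = match.group(1), match.group(2).strip()
        counts[status] += 1
        if status == "FAIL":
            failed.append(message)
    return counts, failed
```

Feeding it the saved log of a passing box and of a failing box should make it easy to see at which command the two runs first diverge.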
Hm, I think I have just seen a similar issue on an 8-core ppc :/ Removing the arch-specific attributes.

::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
:: [ LOG ] :: Environment and preparing samples
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
Unloading oprofile module
:: [ PASS ] :: Compiling the testcase
:: [ PASS ] :: Running 'opcontrol --init
:: [20:41:17] :: Shutting oprofile daemon for the case it is running
Daemon not running
:: [ PASS ] :: Running 'opcontrol --shutdown'
:: [ PASS ] :: Running 'opcontrol --no-vmlinux'
Using default event: CYCLES:100000:0:1:1
Daemon started.
Profiler running.
:: [ PASS ] :: Starting oprofile daemon
Signalling daemon... done
:: [ PASS ] :: Resetting to abstract away previous tests
:: [20:41:25] :: Running something for oprofile to annotate
source_address: fffe78a4003
dest_address: fffe78a6007
1000000000 interations of memcpy(d+7.s+3,65) requires 42.745 seconds
:: [ PASS ] :: Dumping samples from oprofile
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
:: [ LOG ] :: Actual testing
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
opannotate error: No sample file found: try running opcontrol --dump or specify a session containing sample files
:: [ FAIL ] :: Running opannotate (Expected 0, got 1)
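One way to narrow down the "No sample file found" errors might be to check, right after 'opcontrol --dump', whether anything was written under the session directory at all. The sketch below is a suggestion rather than part of the test suite: `count_sample_files` is a hypothetical helper, the default path is the /var/lib/oprofile/samples/current directory that appears in the first log, and it assumes that every regular file below that directory is a sample file.

```python
import os


def count_sample_files(session_dir="/var/lib/oprofile/samples/current"):
    """Count regular files anywhere below an oprofile session directory.

    Assumption: every file under the session directory is a sample file.
    A missing directory is treated the same as an empty one (returns 0).
    """
    total = 0
    for _root, _dirs, files in os.walk(session_dir):
        total += len(files)
    return total
```

If this returns 0 after a dump, the daemon wrote no samples and the problem is probably on the collecting side. A non-zero count together with a failing opreport would instead point at how the tools look up the session.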
oprofile fails the same way on SMP x86_64, and I am running the upstream 3.2-rc1 kernel.

$rpm -q oprofile
oprofile-0.9.6-21.fc15.x86_64

$uname -a
Linux Fedora15 3.2.0-rc1 #21 SMP Wed Nov 9 02:24:59 EST 2011 x86_64 x86_64 x86_64 GNU/Linux

$cat /proc/cpuinfo
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 26
model name : Intel(R) Xeon(R) CPU X5550 @ 2.67GHz
stepping : 5
microcode : 0x11
cpu MHz : 2660.301
cache size : 8192 KB
fpu : yes
fpu_exception : yes
cpuid level : 11
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss syscall nx rdtscp lm constant_tsc arch_perfmon pebs bts nopl xtopology tsc_reliable aperfmperf pni ssse3 cx16 sse4_1 sse4_2 popcnt hypervisor lahf_lm ida dts
bogomips : 5320.60
clflush size : 64
cache_alignment : 64
address sizes : 40 bits physical, 48 bits virtual
power management:

processor : 1
vendor_id : GenuineIntel
cpu family : 6
model : 26
model name : Intel(R) Xeon(R) CPU X5550 @ 2.67GHz
stepping : 5
microcode : 0x11
cpu MHz : 2660.301
cache size : 8192 KB
fpu : yes
fpu_exception : yes
cpuid level : 11
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss syscall nx rdtscp lm constant_tsc arch_perfmon pebs bts nopl xtopology tsc_reliable aperfmperf pni ssse3 cx16 sse4_1 sse4_2 popcnt hypervisor lahf_lm ida dts
bogomips : 5320.60
clflush size : 64
cache_alignment : 64
address sizes : 40 bits physical, 48 bits virtual
power management:
Upstream oprofile at git://oprofile.git.sourceforge.net/gitroot/oprofile/oprofile works well. Maybe you should just merge the upstream code into the RHEL/Fedora RPMs...
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. http://rhn.redhat.com/errata/RHBA-2012-0966.html