Bug 821213

Summary: Nepomuk cannot index most pdf files.
Product: [Fedora] Fedora Reporter: addammo
Component: strigiAssignee: Deji Akingunola <dakingun>
Status: CLOSED ERRATA QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: medium Docs Contact:
Priority: unspecified    
Version: 16CC: dakingun, jreznik, kevin, ltinkl, mbriza, rdieter, rnovacek, ry, smparrish, than
Target Milestone: ---Keywords: Triaged
Target Release: ---   
Hardware: i386   
OS: Linux   
Whiteboard:
Fixed In Version: strigi-0.7.7 Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2012-07-12 17:00:27 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
pdf for testing none

Description addammo 2012-05-13 07:34:10 UTC
Created attachment 584109 [details]
pdf for testing

Description of problem: most pdf files are not indexed. Since version 4.7.3 nepomuk uses pdttotext to index pdf files.


Version-Release number of selected component (if applicable):
4.7.4

How reproducible:
Always

Steps to Reproduce:
1. Save the joined file somewhare into a directory indexed by nepomuk/strigi;
2. 
3.
  
Actual results:
File not indexed by nepomuk

Expected results:
File indexed by nepomuk

Additional info:
I read a discussion about this issue here:
https://bugs.kde.org/show_bug.cgi?id=231936

But, does the patch really works?

I join a sample pdf file.

Comment 1 addammo 2012-05-13 07:36:56 UTC
The program pdftotext works with the file i joined.

Comment 2 Martin Bříza 2012-07-12 14:36:28 UTC
Hello,
the upstream (in the external tracker) bug claims the issue to be fixed in KDE 4.7.3 with strigi 0.7.7. What versions did you test, please? Is the bug still present?
There are some reports on the upstream Bugzilla that the bug wasn't fixed entirely. If you still do experience the problems, please let us and upstream know.
Thank you.

Comment 3 addammo 2012-07-12 14:51:18 UTC
(In reply to comment #2)
> Hello,
> the upstream (in the external tracker) bug claims the issue to be fixed in
> KDE 4.7.3 with strigi 0.7.7. What versions did you test, please? Is the bug
> still present?
> There are some reports on the upstream Bugzilla that the bug wasn't fixed
> entirely. If you still do experience the problems, please let us and
> upstream know.
> Thank you.

Yes, I updated strigi to version 0.7.7 and now pdf files are indexed. I forgot to update my info here, sorry. You can close this bug.