Bug 1065810

Summary: PDF text cannot be found with search in acroreader, but is visible in PDF.
Product: [Community] Publican Reporter: Scott Mumford <smumford>
Component: publicanAssignee: Jeff Fearn 🐞 <jfearn>
Status: CLOSED CURRENTRELEASE QA Contact: Ruediger Landmann <rlandman>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 4.0CC: aigao, mmcallis, rlandman
Target Milestone: 4.1   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: 4.1.0 Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2014-05-05 06:03:32 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Scott Mumford 2014-02-17 02:56:16 UTC
Description of problem:

From bug 965738:

"There is some bug in the PDF generated text that it doesn't find the word (Ctrl+F in adobe reader) "ExtendedFormAuthenticator" but it finds "ExtendedForm Authenticator". However looking into the text I cannot see a space between them.

The word is contained in page 127 in a source code section.
https://access.redhat.com/site/documentation/en-US/JBoss_Enterprise_Application_Platform/6.1/pdf/Security_Guide/JBoss_Enterprise_Application_Platform-6.1-Security_Guide-en-US.pdf"

Version-Release number of selected component (if applicable):
Issue confirmed using Acroread 9.5.5 on Fedora 18. 
Publican version installed locally is 4.0.0, however this was not used in the creation of the book (build performed remotely via Pressgang).

How reproducible:
100% (When using Acroread. Bug does not present in any other PDF viewer)

Steps to Reproduce:
1. Download latest version of document: http://documentation-devel.engineering.redhat.com/site/documentation/en-US/JBoss_Enterprise_Application_Platform/6.3/pdf/Security_Guide/JBoss_Enterprise_Application_Platform-6.3-Security_Guide-en-US.pdf
2. Open the above PDF in Acroread (v 9.x)
3. Search for "ExtendedFormAuthenticator"


Actual results:
No instances of "ExtendedFormAuthenticator" found.

Expected results:
Two instances of the term should be found.

Additional info:
The bug was originally raised against the Red Hat JBoss EAP Security Guide, however the ECS documentation team is unable to correct this.

Comment 1 Jeff Fearn 🐞 2014-03-05 04:29:13 UTC
Testing with acroreader 9.1.0 on Fedora 19 verifies the non-searchability of text.

PDFs generated with a newer release of wkhtmltopdf, 0.12.0, appear to be searchable in acroreader 9.1.0.

Changes subject to indicate this issue is specific to Adobe products.

Comment 2 Jeff Fearn 🐞 2014-03-11 00:05:03 UTC
Upgrading to wkhtmltopdf 0.12 resolves this issue.

To ssh://git.fedorahosted.org/git/publican.git
   50fff7b..64a5160  devel -> devel

Comment 3 Jeff Fearn 🐞 2014-05-05 06:03:32 UTC
A fix for this shipped in Publican 4.1.0.