Bug 994385

Summary: Invisible text not preserved by pdfwrite
Product: Red Hat Enterprise Linux 5
Reporter: Simon Matter <simon.matter>
Component: ghostscript
Assignee: Tim Waugh <twaugh>
Status: CLOSED WONTFIX
QA Contact: QE Internationalization Bugs <qe-i18n-bugs>
Severity: unspecified
Docs Contact:
Priority: low
Version: 5.9
CC: lijli, simon.matter
Target Milestone: rc
Keywords: i18n, Patch
Target Release: ---
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
: 994452
Environment:
Last Closed: 2013-11-01 12:10:19 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:
Bug Depends On:
Bug Blocks: 994452
Attachments:
PDF with invisible text (flags: none)

Description Simon Matter 2013-08-07 07:38:53 UTC
Description of problem:
When gs is run with the pdfwrite device on a scanned and OCR'ed PDF file, the invisible text, which is used for copy/paste, is not preserved in the output file.

Version-Release number of selected component (if applicable):
ghostscript-8.70-14.el5_8.1

How reproducible:
always

Steps to Reproduce:
1. gs -dBATCH -dNOPAUSE -q -sDEVICE=pdfwrite -sOutputFile=all.pdf 201308070*.pdf

Actual results:
Some or all of the invisible text is missing from all.pdf (depending on the kind of font used in the input files).

Expected results:
All invisible text should still be present in all.pdf.

Additional info:
See the Ghostscript bug for more info. The upstream commit is here: http://ghostscript.com/pipermail/gs-cvs/2010-October/011807.html

I've tested the patch and it fixes the issue mentioned above.

Regards,
Simon

Comment 1 Tim Waugh 2013-08-07 10:28:40 UTC
Thank you for the report.

Are you able to attach a small example document that reproduces the problem?

Comment 2 Simon Matter 2013-08-07 12:04:34 UTC
Created attachment 783858
PDF with invisible text

If the file is run through ghostscript like this

gs -dBATCH -dNOPAUSE -q -sDEVICE=pdfwrite -sOutputFile=test-gs.pdf test.pdf

the output file doesn't include the invisible text anymore.
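
One way to confirm the loss without opening a viewer is to compare how much text can be extracted before and after the round trip. The following is a minimal sketch, not part of the original report: it assumes pdftotext (from poppler-utils or xpdf) is installed, and the helper names text_words and compare_counts are made up for illustration. A word count is only a rough proxy for "the invisible text survived", but a sharp drop matches the behaviour described above.

```shell
#!/bin/sh
# Sketch: check whether a pdfwrite round trip drops extractable text.
# Assumption: pdftotext is available; it is not mentioned in the report.

# Count whitespace-separated words in the extractable text of a PDF.
text_words() {
    pdftotext "$1" - | wc -w | tr -d ' '
}

# Compare two word counts and report whether text was lost.
# Returns 1 when the output has fewer words than the input.
compare_counts() {
    before=$1
    after=$2
    if [ "$after" -lt "$before" ]; then
        echo "text lost: $before words before, $after after"
        return 1
    fi
    echo "text preserved: $before words before, $after after"
}

# Run the round trip only when an existing input file is given,
# e.g.: sh check.sh test.pdf
if [ -n "${1:-}" ] && [ -f "$1" ]; then
    gs -dBATCH -dNOPAUSE -q -sDEVICE=pdfwrite -sOutputFile=test-gs.pdf "$1"
    compare_counts "$(text_words "$1")" "$(text_words test-gs.pdf)"
fi
```

With the attached test.pdf, an affected ghostscript build would be expected to report lost text, and a build with the upstream patch applied should report it as preserved.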

Comment 3 RHEL Program Management 2013-08-07 14:39:59 UTC
This request was evaluated by Red Hat Product Management for
inclusion in the current release of Red Hat Enterprise Linux.
Because the affected component is not scheduled to be updated
in the current release, Red Hat is unable to address this
request at this time.

Red Hat invites you to ask your support representative to
propose this request, if appropriate, in the next release of
Red Hat Enterprise Linux.

Comment 4 Tim Waugh 2013-11-01 12:10:19 UTC
This bug has been reviewed by Red Hat and is not planned to be addressed in Red Hat Enterprise Linux 5, and will therefore be closed. If this bug is critical to production systems, please contact your Red Hat support representative.

This issue is being tracked for Red Hat Enterprise Linux 6 as bug #994452.