Bug 225188 - Charset handling is less than perfect
Product: Fedora
Classification: Fedora
Component: unrtf
Hardware: All
OS: Linux
Priority: medium
Severity: medium
Assigned To: Aurelien Bompard
QA Contact: Fedora Extras Quality Assurance
Reported: 2007-01-29 12:53 EST by Robert Vogelgesang
Modified: 2007-11-30 17:11 EST
Doc Type: Bug Fix
Last Closed: 2007-03-03 05:00:41 EST

Attachments
Workaround/fix for codepage problems (824 bytes, patch)
2007-01-29 12:53 EST, Robert Vogelgesang

Description Robert Vogelgesang 2007-01-29 12:53:41 EST
Description of problem:
Recent versions of unrtf have "improved" codepage support, but when
converting RTF files that use, e.g., codepage 1252, umlauts are not handled
properly.  Earlier versions of unrtf converted these characters correctly.

Version-Release number of selected component (if applicable):
0.20.2 and earlier

How reproducible:
Always, when converting RTF files that use codepage 1252 and contain umlauts.

Steps to Reproduce:
1. unrtf --html RTF.rtf

Actual results:
The generated HTML contains HTML comments that give the hex codes of the
umlaut characters instead of the characters themselves.

Expected results:
No added HTML comments, but instead properly converted umlauts.
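
To illustrate the expected mapping, here is a minimal Python sketch. It is not unrtf's code and not the attached patch; the function name rtf_hex_to_html is hypothetical, and the sketch assumes the umlauts appear in the RTF source as \'xx hex escapes encoded in codepage 1252.

```python
import re
from html.entities import codepoint2name

# Matches an RTF hex escape such as \'fc (backslash, apostrophe, two hex digits).
HEX_ESCAPE = re.compile(r"\\'([0-9a-fA-F]{2})")


def rtf_hex_to_html(text, codepage="cp1252"):
    """Replace RTF \\'xx escapes with HTML entities (hypothetical helper).

    Assumption: each escape is a single byte in the given codepage.
    """
    def replace(match):
        byte = bytes([int(match.group(1), 16)])
        # Bytes that are undefined in the codepage become U+FFFD.
        char = byte.decode(codepage, errors="replace")
        name = codepoint2name.get(ord(char))
        # Prefer a named entity; fall back to a numeric one.
        return f"&{name};" if name else f"&#{ord(char)};"

    return HEX_ESCAPE.sub(replace, text)


if __name__ == "__main__":
    print(rtf_hex_to_html(r"M\'fcnchen"))
```

Under these assumptions, the byte 0xFC in codepage 1252 comes out as the entity for u-umlaut rather than as an HTML comment carrying the hex code.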

Additional info:
See the attached patch for a fix.  This patch was also submitted to the
upstream maintainers, but there has been no response from them since
October 2006.
Comment 1 Robert Vogelgesang 2007-01-29 12:53:44 EST
Created attachment 146844
Workaround/fix for codepage problems
Comment 2 Aurelien Bompard 2007-03-03 05:00:41 EST
Patch applied in release 2, thanks
