Red Hat Bugzilla – Bug 225188
Charset handling is less than perfect
Last modified: 2007-11-30 17:11:54 EST
Description of problem:
Recent versions of unrtf have "improved" codepage support, but when
converting RTFs e. g. using codepage 1252, the "umlauts" are not properly
handled. Earlier versions of unrtf did manage to convert these characters.
Version-Release number of selected component (if applicable):
0.20.2 and earlier
Always when converting RTFs using codepage 1252 with umlauts's.
Steps to Reproduce:
1. unrtf --html RTF.rtf
The generated HTML contains HTML comments that specify the hex code of the
No added HTML comments, but instead properly converted umlauts.
See the attached patch for a fix. This patch was also submitted to the
upstream maintainers, but without response from them since October 2006.
Created attachment 146844 [details]
Workaround/fix for codepage problems
Patch applied in release 2, thanks