Bug 225188

Summary:

Charset handling is less than perfect

Product:

[Fedora] Fedora

Reporter:

Robert Vogelgesang <vogel>

Component:

unrtf

Assignee:

Aurelien Bompard <gauret>

Status:

CLOSED NEXTRELEASE

QA Contact:

Fedora Extras Quality Assurance <extras-qa>

Severity:

medium

Priority:

medium

Hardware:

All

OS:

Linux

Doc Type:

Bug Fix

Last Closed:

2007-03-03 10:00:41 UTC

Attachments:

Description	Flags
Workaround/fix for codepage problems	none

Description Robert Vogelgesang 2007-01-29 17:53:41 UTC

Description of problem:
Recent versions of unrtf have "improved" codepage support, but when
converting RTF files that use, e.g., codepage 1252, umlauts are not handled
properly.  Earlier versions of unrtf converted these characters correctly.

Version-Release number of selected component (if applicable):
0.20.2 and earlier

How reproducible:
Always, when converting RTF files that use codepage 1252 and contain umlauts.

Steps to Reproduce:
1. unrtf --html RTF.rtf
  
Actual results:
The generated HTML contains HTML comments that specify the hex code of the
umlaut characters.

Expected results:
No added HTML comments, but instead properly converted umlauts.
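
To illustrate the expected behaviour, here is a minimal sketch in Python. It is
not unrtf's code and not the attached patch. It assumes the umlauts appear in
the RTF source as \'hh hex escapes (one byte in the document's ANSI codepage)
and that numeric character references are acceptable in the HTML output. The
helper name rtf_hex_escapes_to_html is hypothetical.

```python
import re

# Matches an RTF hex escape such as \'e4: one byte in the document's ANSI codepage.
HEX_ESCAPE = re.compile(r"\\'([0-9a-fA-F]{2})")


def rtf_hex_escapes_to_html(text, codepage="cp1252"):
    # Hypothetical helper, not part of unrtf: replaces each \'hh escape
    # with an HTML numeric character reference.
    def replace(match):
        byte = bytes.fromhex(match.group(1))
        # Bytes undefined in the codepage become U+FFFD instead of raising.
        char = byte.decode(codepage, errors="replace")
        return "&#%d;" % ord(char)

    return HEX_ESCAPE.sub(replace, text)


print(rtf_hex_escapes_to_html(r"Gr\'fc\'dfe aus K\'f6ln"))
# prints: Gr&#252;&#223;e aus K&#246;ln
```

Decoding through the codepage rather than copying the byte value matters for
codepage 1252 bytes in the 0x80-0x9f range: for example, \'80 is the euro sign
(U+20AC), not U+0080.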

Additional info:
See the attached patch for a fix.  This patch was also submitted to the
upstream maintainers, but there has been no response from them since
October 2006.

Comment 1 Robert Vogelgesang 2007-01-29 17:53:44 UTC

Created attachment 146844
Workaround/fix for codepage problems

Comment 2 Aurelien Bompard 2007-03-03 10:00:41 UTC

Patch applied in release 2, thanks.