Bug 227260

Summary: ISO-8859 manual pages mis-coded
Product: [Fedora] Fedora Reporter: Nick Lamb <redhat>
Component: man-pagesAssignee: Ivana Varekova <varekova>
Status: CLOSED RAWHIDE QA Contact: Ben Levenson <benl>
Severity: medium Docs Contact:
Priority: medium    
Version: 6   
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2007-02-13 14:46:46 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Nick Lamb 2007-02-04 03:37:01 UTC
Description of problem:
During the transition to UTF-8 some manual pages have been damaged.

Version-Release number of selected component (if applicable): man-pages-2.39-5

How reproducible:

Steps to Reproduce:
1. Switch to an English locale (I used en_GB.UTF-8)
2. from a shell type e.g. exactly 'man iso_8859-15' (NB underscore and dash)
3. Examine the EURO SIGN
  
Actual results:

The obscure universal currency symbol ¤ is shown

Expected results:

The Euro currency symbol € should be shown

Additional info:

These manual pages were casualties of an otherwise mostly successful battle to
get rid of encoding problems in manual pages.

/usr/share/man/en/man7/iso_8859-2.7.gz
/usr/share/man/en/man7/iso_8859-7.7.gz
/usr/share/man/en/man7/iso_8859-9.7.gz
/usr/share/man/en/man7/iso_8859-15.7.gz
/usr/share/man/en/man7/iso_8859-16.7.gz

Each of these pages was converted to UTF-8 but with the assumption that their
original character set was ISO-8859-1 (or maybe Windows 1252). In fact they were
each in their respective native encoding. A tool like 'iconv' can fix this,
firstly undo the conversion to UTF-8, then redo it but choosing the correct
source encoding e.g.

iconv -f UTF-8 -t ISO-8859-1 < iso_8859-9.7 > tmpfile
iconv -t UTF-8 -f ISO-8859-9 < tmpfile > iso_8859-9.7

Meanwhile the ASCII versions of these pages of course don't do a very good job
of illustrating characters which explicitly aren't in ASCII. Since we can't
illustrate them properly we should just remove this column, there is a text
description of each character included so it's still useful as pure ASCII.

Comment 1 Ivana Varekova 2007-02-13 14:46:46 UTC
Fixed in man-peges-2.43-6.fc7 and man-pages-2_39-7_fc6.