|
Doc Text:
|
Cause:
The IBM256, IBM277, IBM278, IBM280, IBM284, IBM297, and IBM424 character sets mapped the EBCDIC codepoint 0xBC to the Unicode character U+203E OVERLINE.
Consequence:
When using the `iconv' program provided by glibc, converting text in those character sets fails if the text contains the 0xBC codepoint and the target is a non-Unicode character set such as ISO-8859-1, because such character sets cannot encode the U+203E OVERLINE character.
Fix:
The mapping for the 0xBC codepoint for those IBM character sets is changed to U+00AF MACRON.
Result:
Input in the IBM277, IBM278, IBM280, IBM284, IBM297 character sets can now be converted to ISO-8859-1 in all cases. For the IBM256 and IBM424 character sets, conversion no longer fails if the input text contains the 0xBC codepoint, and U+00AF MACRON is used in the output.
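
The difference between the old and new mapping targets can be illustrated with a minimal sketch. It uses Python's built-in ISO-8859-1 codec rather than glibc's `iconv', so it only shows which of the two Unicode characters is representable in ISO-8859-1; it does not exercise the IBM character set tables themselves. The helper name latin1_encodable is hypothetical.

```python
import unicodedata

OVERLINE = "\u203e"  # previous mapping target for EBCDIC 0xBC
MACRON = "\u00af"    # new mapping target for EBCDIC 0xBC


def latin1_encodable(ch: str) -> bool:
    """Return True if ch can be represented in ISO-8859-1 (hypothetical helper)."""
    try:
        ch.encode("iso-8859-1")
        return True
    except UnicodeEncodeError:
        return False


for ch in (OVERLINE, MACRON):
    # Print the code point, its Unicode name, and whether ISO-8859-1 can hold it.
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}: {latin1_encodable(ch)}")
```

U+00AF lies within the U+0000 to U+00FF range that ISO-8859-1 covers, while U+203E does not, which is why the changed mapping lets the conversion succeed.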
|