Bug 192929
Summary: | iconv() converts into UCS-4 as little-endian | ||||||
---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 4 | Reporter: | Gregory Brodsky <gbrodsky> | ||||
Component: | gcc3 | Assignee: | Jakub Jelinek <jakub> | ||||
Status: | CLOSED NOTABUG | QA Contact: | |||||
Severity: | medium | Docs Contact: | |||||
Priority: | medium | ||||||
Version: | 4.0 | CC: | drepper | ||||
Target Milestone: | --- | ||||||
Target Release: | --- | ||||||
Hardware: | i586 | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2006-05-24 14:34:47 UTC | Type: | --- | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Attachments: |
|
Description
Gregory Brodsky
2006-05-24 10:16:16 UTC
Created attachment 129908 [details]
C++ source to reproduce the problem
Can you cite why you think say UCS-2 is an alias for UTF-16? Certainly http://www.unicode.org/reports/tr17/index.html doesn't suggest anything like that, it has always been a different encoding. That's nonsense. UCS-2 and UCS-4 are standalone encodings. (In reply to comment #2) > Can you cite why you think say UCS-2 is an alias for UTF-16? > Certainly http://www.unicode.org/reports/tr17/index.html > doesn't suggest anything like that, it has always been a different encoding. > You are correct, my opinion is based on different source. Unicode v4.0 Standard book, page 1350: "As a conseguence, UCS-4 can now be taken effectively as an alias for the Unicode encoding form UTF-32...". In page 1352, list of encodings: "UTF-8, UTF-16 or UCS-4 (=UTF-32)" There is a similar statement about UTF-16 vs UCS-2, but I did not find an exact citate. (In reply to comment #3) > That's nonsense. UCS-2 and UCS-4 are standalone encodings. Well, but according to Unicode v4.0 Standard book, page 32, an endian order for both of them is platform dependent. Please notice, since (unlike UTF-16 and UTF-32) UCS-2 and UCS-4 converted data is generated witout BOM, customer does not have another way to expect their endian order but by platform. That's why I don't understand why endian order of UCS-2 and UCS-4 might be different in the same system. |