Red Hat Bugzilla – Bug 481179
pirut package lists do not sort packages using locale specific infromation
Last modified: 2013-04-12 16:02:16 EDT
Created attachment 329720 [details]
screenshot of the error
Description of problem:
during system installation, when selecting individual packages, these are sorted in alphabetical order, but accented letters are put after the letter Z
this may seem as some packages are missing ...
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. start system installation
2. select non English locale which uses accented letters (e.g. Czech)
3. choose to modify packages list immediately
4. choose Languages and scroll the list below letter Z
see the attached screenshot
A+acute is between A and B, C+caron is between C and D ...
changing summary to match the correct component ... sorry, I did not realize that it is a package manager run by the installer and not the installer itself, as I prefer using commandline yum
so the error is reproducible anytime running pirut, not just during installation
Created attachment 329843 [details]
Fix for pirut (and pirut in anaconda) sorting by "packages strings"
This is the fix we've tested internally. It might go out for 5.4
Ok, this bug is different ...
The issue with "accented letters are put after the letter Z" is
due to the ascii [ISO 8859-1 and ISO 8859-2 of the international
standards] value of the characters.
ISO 8859-1 is West European languages (Latin-1)
ISO 8859-2 is Central and East European languages (Latin-2)
The ascii value of 'A-Z' is decimal value of '64 to 90';
and the "acute" accented 'A' has the value of '193':
Oct Dec Hex Char Description:
300 192 C0 À LATIN CAPITAL LETTER A WITH GRAVE
301 193 C1 Á LATIN CAPITAL LETTER A WITH ACUTE
Therefore, the sorting/ordering of the letters [component names]
are corrected based on the international standards.
Hence, this issue should be closed as NOTABUG.
(In reply to comment #9)
> The issue with "accented letters are put after the letter Z" is
> due to the ascii [ISO 8859-1 and ISO 8859-2 of the international
> standards] value of the characters.
ahem, aren't we using Unicode (UTF-8) rather then those of ISO-8859 family?
> Therefore, the sorting/ordering of the letters [component names]
> are corrected based on the international standards.
as far as my knowledge of those standards go, these are all about _encoding_ characters, not about _sorting_
these are two distinct things
to quote http://www.unicode.org/faq/collation.html
"Linguistically meaningful sorting is done not by comparing code point values (an approach which would fail even for English) ..."
> Hence, this issue should be closed as NOTABUG.
considering which version this concerns and how long this was untouched, I'd agree on closing - but rather WONTFIX
This should be fixed in the RHEL-6 package GUI.
This request was evaluated by Red Hat Engineering for inclusion in a Red
Hat Enterprise Linux maintenance release.
Red Hat does not currently plan to provide this change in a Red Hat
Enterprise Linux update release for currently deployed products.
With the goal of minimizing risk of change for deployed systems, and in
response to customer and partner requirements, Red Hat takes a
conservative approach when evaluating enhancements for inclusion in
maintenance updates for currently deployed products. The primary
objectives of update releases are to enable new hardware platform
support and to resolve critical defects.