Bug 196711 - LC_COLLATE for cs (Czech) locale is buggy wrt. uppercase letters
Summary: LC_COLLATE for cs (Czech) locale is buggy wrt. uppercase letters
Keywords:
Status: CLOSED RAWHIDE
Alias: None
Product: Fedora
Classification: Fedora
Component: glibc
Version: 5
Hardware: All
OS: Linux
medium
low
Target Milestone: ---
Assignee: Jakub Jelinek
QA Contact: Brian Brock
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2006-06-26 17:04 UTC by Jan "Yenya" Kasprzak
Modified: 2007-11-30 22:11 UTC (History)
1 user (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2007-10-04 08:03:25 UTC
Type: ---
Embargoed:


Attachments (Terms of Use)

Description Jan "Yenya" Kasprzak 2006-06-26 17:04:20 UTC
Description of problem:
LC_COLLATE for cs (Czech) locale is buggy wrt. uppercase letters, it incorrectly
sorts uppercase letters before lowercase ones.

Version-Release number of selected component (if applicable):
glibc-2.4-8

Steps to Reproduce:
1. LC_ALL=cs_CZ.UTF-8 sort <<EOF
jakub
Jakub
EOF
  
Actual results:

Jakub
jakub


Expected results:

jakub
Jakub

Additional info:
The Czech sorting algorithm should put lowercase letters first (when the sorted
words are otherwise the same except possible non-alphanumeric characters).

For more info about the Czech sorting, see Petr Olsak's article in the CSTUG
bulletin (in Czech).

http://math.feld.cvut.cz/ftp/olsak/bulletin/csr.ps

Or see the examples from Jan Pazdziora's Czech sorting page:

http://www.fi.muni.cz/~adelton/l10n/cssort/sort.tab

Except for the above error, I think the Czech locale provides correct sorting.

This behaviour is actually documented as known bug in
/usr/share/i18n/locales/cs_CZ (see the comment labeled "3.7").

Comment 1 Ulrich Drepper 2006-07-30 22:25:14 UTC
It's trivial to change but you have the get the last maintainer (Vladimir Michl)
to agree.  Only if he does or doesn't reply will I change the file.

Comment 2 Jan "Yenya" Kasprzak 2006-11-28 12:53:15 UTC
Sorry for the delay - I have now tried to cotnact Vladimir Michl, and asked him
to add a note to this bug or contact you by email.

Comment 3 Vladimir Michl 2006-11-29 10:14:22 UTC
Hello,

as I see Yenya is right. And I think we should change it as he requests.
Just after change, so lowercase letters are before uppercase, can we delete
point 3.7 at the start as well.

Thanks

Vladimir Michl

Comment 4 Ulrich Drepper 2007-09-30 18:45:26 UTC
Should be fixed in upstream cvs.

Comment 5 Jakub Jelinek 2007-10-04 08:03:25 UTC
In glibc-2.6.90-17 in rawhide, please test.

Comment 6 Jan "Yenya" Kasprzak 2007-10-05 12:44:04 UTC
Works for me, thanks!


Note You need to log in before you can comment on or make changes to this bug.