Bug 196711

Summary: LC_COLLATE for cs (Czech) locale is buggy wrt. uppercase letters
Product: [Fedora] Fedora Reporter: Jan "Yenya" Kasprzak <kas>
Component: glibcAssignee: Jakub Jelinek <jakub>
Status: CLOSED RAWHIDE QA Contact: Brian Brock <bbrock>
Severity: low Docs Contact:
Priority: medium    
Version: 5CC: drepper
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2007-10-04 08:03:25 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Jan "Yenya" Kasprzak 2006-06-26 17:04:20 UTC
Description of problem:
LC_COLLATE for cs (Czech) locale is buggy wrt. uppercase letters, it incorrectly
sorts uppercase letters before lowercase ones.

Version-Release number of selected component (if applicable):
glibc-2.4-8

Steps to Reproduce:
1. LC_ALL=cs_CZ.UTF-8 sort <<EOF
jakub
Jakub
EOF
  
Actual results:

Jakub
jakub


Expected results:

jakub
Jakub

Additional info:
The Czech sorting algorithm should put lowercase letters first (when the sorted
words are otherwise the same except possible non-alphanumeric characters).

For more info about the Czech sorting, see Petr Olsak's article in the CSTUG
bulletin (in Czech).

http://math.feld.cvut.cz/ftp/olsak/bulletin/csr.ps

Or see the examples from Jan Pazdziora's Czech sorting page:

http://www.fi.muni.cz/~adelton/l10n/cssort/sort.tab

Except for the above error, I think the Czech locale provides correct sorting.

This behaviour is actually documented as known bug in
/usr/share/i18n/locales/cs_CZ (see the comment labeled "3.7").

Comment 1 Ulrich Drepper 2006-07-30 22:25:14 UTC
It's trivial to change but you have the get the last maintainer (Vladimir Michl)
to agree.  Only if he does or doesn't reply will I change the file.

Comment 2 Jan "Yenya" Kasprzak 2006-11-28 12:53:15 UTC
Sorry for the delay - I have now tried to cotnact Vladimir Michl, and asked him
to add a note to this bug or contact you by email.

Comment 3 Vladimir Michl 2006-11-29 10:14:22 UTC
Hello,

as I see Yenya is right. And I think we should change it as he requests.
Just after change, so lowercase letters are before uppercase, can we delete
point 3.7 at the start as well.

Thanks

Vladimir Michl

Comment 4 Ulrich Drepper 2007-09-30 18:45:26 UTC
Should be fixed in upstream cvs.

Comment 5 Jakub Jelinek 2007-10-04 08:03:25 UTC
In glibc-2.6.90-17 in rawhide, please test.

Comment 6 Jan "Yenya" Kasprzak 2007-10-05 12:44:04 UTC
Works for me, thanks!