196711 – LC_COLLATE for cs (Czech) locale is buggy wrt. uppercase letters

Summary: LC_COLLATE for cs (Czech) locale is buggy wrt. uppercase letters

Keywords:
Status:	CLOSED RAWHIDE
Alias:	None
Product:	Fedora
Classification:	Fedora
Component:	glibc
Sub Component:
Version:	5
Hardware:	All
OS:	Linux
Priority:	medium
Severity:	low
Target Milestone:	---
Assignee:	Jakub Jelinek
QA Contact:	Brian Brock
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:

Reported:	2006-06-26 17:04 UTC by Jan "Yenya" Kasprzak
Modified:	2007-11-30 22:11 UTC
Fixed In Version:
Doc Type:	Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed:	2007-10-04 08:03:25 UTC
Type:	---
Embargoed:
Dependent Products:

Description Jan "Yenya" Kasprzak 2006-06-26 17:04:20 UTC

Description of problem:
LC_COLLATE for the cs (Czech) locale is buggy wrt. uppercase letters: it
incorrectly sorts uppercase letters before lowercase ones.

Version-Release number of selected component (if applicable):
glibc-2.4-8

Steps to Reproduce:
1. LC_ALL=cs_CZ.UTF-8 sort <<EOF
jakub
Jakub
EOF
  
Actual results:

Jakub
jakub


Expected results:

jakub
Jakub
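
The same check can be run from a script. The following is only a sketch: the helper name sort_with_locale is made up for illustration, and it assumes the cs_CZ.UTF-8 locale is installed (it returns None when it is not). It sorts with the system's LC_COLLATE tables through locale.strxfrm, so the result depends on the installed glibc version.

```python
import locale


def sort_with_locale(words, locale_name="cs_CZ.UTF-8"):
    """Sort words using LC_COLLATE of locale_name.

    Hypothetical helper for illustration; returns None if the
    requested locale is not installed on this system.
    """
    # Calling setlocale with only a category queries the current setting.
    previous = locale.setlocale(locale.LC_COLLATE)
    try:
        locale.setlocale(locale.LC_COLLATE, locale_name)
    except locale.Error:
        return None
    try:
        # strxfrm transforms each string so that plain comparison
        # follows the collation rules of the active locale.
        return sorted(words, key=locale.strxfrm)
    finally:
        # Restore whatever collation setting was active before.
        locale.setlocale(locale.LC_COLLATE, previous)


print(sort_with_locale(["jakub", "Jakub"]))
```

With an affected glibc this should print the uppercase form first, matching the actual results above; with a fixed locale file the lowercase form should come first.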

Additional info:
The Czech sorting algorithm should put lowercase letters first (when the sorted
words are otherwise the same, except possibly for non-alphanumeric characters).
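
To illustrate just this tie-break rule, here is a rough sketch. It is not the full Czech collation: it ignores diacritics, the "ch" digraph, and non-alphanumeric characters, and the function name lowercase_first_key is made up for the example. Words are compared case-insensitively first, and case decides only when they are otherwise equal.

```python
def lowercase_first_key(word):
    """Sort key: case-insensitive order first, lowercase before uppercase on ties.

    Simplified illustration only; not the real cs_CZ collation.
    """
    # Primary key: the case-folded word, so "jakub" and "Jakub" tie here.
    primary = word.casefold()
    # Secondary key: per character, 0 for non-uppercase and 1 for uppercase,
    # so the lowercase variant sorts first among otherwise equal words.
    secondary = tuple(1 if ch.isupper() else 0 for ch in word)
    return (primary, secondary)


words = ["Jakub", "jakub", "Jana", "jana"]
print(sorted(words, key=lowercase_first_key))
# prints ['jakub', 'Jakub', 'jana', 'Jana']
```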

For more info about the Czech sorting, see Petr Olsak's article in the CSTUG
bulletin (in Czech).

http://math.feld.cvut.cz/ftp/olsak/bulletin/csr.ps

Or see the examples from Jan Pazdziora's Czech sorting page:

http://www.fi.muni.cz/~adelton/l10n/cssort/sort.tab

Except for the above error, I think the Czech locale provides correct sorting.

This behaviour is actually documented as a known bug in
/usr/share/i18n/locales/cs_CZ (see the comment labeled "3.7").

Comment 1 Ulrich Drepper 2006-07-30 22:25:14 UTC

It's trivial to change, but you have to get the last maintainer (Vladimir Michl)
to agree.  I will change the file only if he agrees or does not reply.

Comment 2 Jan "Yenya" Kasprzak 2006-11-28 12:53:15 UTC

Sorry for the delay - I have now tried to contact Vladimir Michl, and asked him
to add a note to this bug or contact you by email.

Comment 3 Vladimir Michl 2006-11-29 10:14:22 UTC

Hello,

As far as I can see, Yenya is right, and I think we should change it as he
requests. Once the change is made, so that lowercase letters sort before
uppercase ones, can we delete point 3.7 at the start of the file as well?

Thanks

Vladimir Michl

Comment 4 Ulrich Drepper 2007-09-30 18:45:26 UTC

Should be fixed in upstream cvs.

Comment 5 Jakub Jelinek 2007-10-04 08:03:25 UTC

The fix is in glibc-2.6.90-17 in rawhide; please test.

Comment 6 Jan "Yenya" Kasprzak 2007-10-05 12:44:04 UTC

Works for me, thanks!