Bug 1997589

Summary: glibc: Switch to upstream version of C.UTF-8
Product: Red Hat Enterprise Linux 9 Reporter: Carlos O'Donell <codonell>
Component: glibcAssignee: glibc team <glibc-bugzilla>
Status: CLOSED CURRENTRELEASE QA Contact: qe-baseos-tools-bugs
Severity: medium Docs Contact:
Priority: unspecified    
Version: 9.0CC: ashankar, codonell, dj, fweimer, mnewsome, pfrankli, sipoyare
Target Milestone: rcKeywords: Bugfix, Patch, Triaged
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: glibc-2.34-6.el9 Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-10-01 14:04:33 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1994653    
Bug Blocks:    

Description Carlos O'Donell 2021-08-25 13:35:51 UTC
The current implementation of C.UTF-8 in RHEL9 has several defects in the higher code-point ranges (incorrect sorting).

The C.UTF-8 upstream update (v8) fixes all of these defects and further reduces the size of the locale from ~2MiB down to ~400KiB (saves ~1.6MiB):

https://sourceware.org/pipermail/libc-alpha/2021-August/130501.html

The new version of C.UTF-8 will most likely be included in glibc 2.35, with a backport to glibc 2.34.

We should include glibc 2.34 in RHEL9 regardless of the upstream decision to backport. We want to make this change before RHEL9 GA, after RHEL9 GA we would not want to change the collation of code points because of the impact it has on sorting data that customers have (we only change this at X-stream boundaries).

Therefore I think this should be fixed immediately.

Comment 3 Carlos O'Donell 2021-09-08 19:01:43 UTC
Fixed upstream with these two commits.

commit 466f2be6c08070e9113ae2fdc7acd5d8828cba50
Author: Carlos O'Donell <carlos>
Date:   Wed Sep 1 15:19:19 2021 -0400

    Add generic C.UTF-8 locale (Bug 17318)
...    

commit f5117c6504888fab5423282a4607c552b90fd3f9
Author: Carlos O'Donell <carlos>
Date:   Thu Jul 29 22:45:39 2021 -0400

    Add 'codepoint_collation' support for LC_COLLATE.
...