1986428 – A character range too hungry in C.UTF-8 with glibc-2.33.9000-53.fc35

Summary: A character range too hungry in C.UTF-8 with glibc-2.33.9000-53.fc35

Keywords:
Status:	CLOSED DUPLICATE of bug 1986421
Alias:	None
Product:	Fedora
Classification:	Fedora
Component:	sed
Sub Component:
Version:	rawhide
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	unspecified
Target Milestone:	---
Assignee:	Jakub Martisko
QA Contact:	Fedora Extras Quality Assurance
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:

Reported:	2021-07-27 14:01 UTC by Petr Pisar
Modified:	2021-07-27 14:12 UTC
CC List:	10 users
Fixed In Version:
Clone Of:
Environment:
Last Closed:	2021-07-27 14:04:19 UTC
Type:	Bug
Embargoed:
Dependent Products:

Description Petr Pisar 2021-07-27 14:01:37 UTC

There is a regression: in C.UTF-8 the range [4-4] now also matches the following "A".

Fedora 34:

$ printf '4A\n' | LC_ALL=C.UTF-8 sed 's/[4-4]\+/X/'
XA

Fedora 35:

$ printf '4A\n' | LC_ALL=C.UTF-8 sed 's/[4-4]\+/X/'
X

The behavior depends on the locale. Fedora 35 with en_US.UTF-8:

$ printf '4A\n' | LC_ALL=en_US.UTF-8 sed 's/[4-4]\+/X/'
XA

It also depends on the regular expression; for example, a character set without a range works:

$ printf '4A\n' | LC_ALL=C.UTF-8 sed 's/[4]\+/X/'
XA
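
The cases above can be collected into one small script, which may make it easier to compare systems. This is only a sketch: run_case is a hypothetical helper name introduced here, and it assumes GNU sed (for \+) and that the listed locales are installed. If a locale is missing, sed typically falls back to the C locale, so that row would not be meaningful.

```shell
#!/bin/sh
# run_case LOCALE BRACKET
# Hypothetical helper (not part of sed or glibc): runs the substitution
# from this report under the given locale with the given bracket expression.
run_case() {
    locale_name=$1
    bracket=$2
    # "\\+" inside double quotes reaches sed as "\+" (GNU extension).
    printf '4A\n' | LC_ALL="$locale_name" sed "s/${bracket}\\+/X/"
}

# Print one line per locale/expression combination.
for loc in C.UTF-8 en_US.UTF-8; do
    for br in '[4-4]' '[4]'; do
        printf '%-12s %-6s -> %s\n' "$loc" "$br" "$(run_case "$loc" "$br")"
    done
done
```

According to the outputs quoted in this report, every row should end in XA on an unaffected system, while the C.UTF-8 / [4-4] row ends in X on the affected Fedora 35 build.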

This is very probably triggered by recent glibc changes. Tested with:

glibc-common-2.33.9000-53.fc35.x86_64
glibc-gconv-extra-2.33.9000-53.fc35.x86_64
glibc-langpack-cs-2.33.9000-53.fc35.x86_64
glibc-langpack-en-2.33.9000-53.fc35.x86_64
glibc-langpack-tr-2.33.9000-53.fc35.x86_64
glibc-2.33.9000-53.fc35.x86_64
glibc-headers-x86-2.33.9000-53.fc35.noarch
glibc-devel-2.33.9000-53.fc35.x86_64
glibc-static-2.33.9000-53.fc35.x86_64
sed-4.8-7.fc34.x86_64

Comment 1 Petr Pisar 2021-07-27 14:04:19 UTC


*** This bug has been marked as a duplicate of bug 1986421 ***