Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.

Bug 799863

Summary:

inconsistent \w and [[:alnum:]] behaviour

Product:

Red Hat Enterprise Linux 6

Reporter:

Lukáš Zachar <lzachar>

Component:

grep

Assignee:

Jaroslav Škarvada <jskarvad>

Status:

CLOSED ERRATA

QA Contact:

Jan Kepler <jkejda>

Severity:

medium

Docs Contact:

Priority:

medium

Version:

6.2

CC:

dkutalek, jkejda

Target Milestone:

Keywords:

Patch

Target Release:

---

Hardware:

Unspecified

OS:

Unspecified

Whiteboard:

Fixed In Version:

grep-2.20-1.el6

Doc Type:

Bug Fix

Doc Text:

Cause: Previously the behavior of \w and \W symbols in regular expressions were inconsistent with the behavior of [:alnum:] character class. Consequence: Some expressions could cause incorrect match/non-match. Fix: Upstream fix was backported that makes the behavior consistent. Result: Now \w is synonym for [_[:alnum:]] and \W for [^_[:alnum:]].

Story Points:

---

Clone Of:

Clones:

1159012 1187243 (view as bug list)

Environment:

Last Closed:

2015-07-22 06:17:47 UTC

Type:

---

Regression:

---

Mount Type:

---

Documentation:

---

CRM:

Verified Versions:

Category:

---

oVirt Team:

---

RHEL 7.3 requirements from Atomic Host:

Cloudforms Team:

---

Target Upstream Version:

Embargoed:

Bug Depends On:

Bug Blocks:

1159012, 1187243

Attachments:

Description	Flags
Backported fix	none
grep-2.20 fix	none

Description Lukáš Zachar 2012-03-05 10:02:23 UTC

Description of problem:

\w and [[:alnum:]] seems to match different set of characters:

$ echo 'á' | grep '\w'
$ echo 'á' | grep '[[:alnum:]]'
$ á

Their negations are inconsistent as well:

$ echo 'á' | grep '[^[:alnum:]]'
$ echo 'á' | grep '\W'
$ á

This doesn't seem to be problem of a locale (I tried it with the en_US.UTF-8 and cs_CZ.UTF-8, both made the same results).

Affected are accented characters Á Č Ď É Ě Í Ň Ó Ř Š Ť Ú Ů Ý Ž (I have tested just these)


Version-Release number of selected component (if applicable):
grep-2.6.3-2

How reproducible:
always

Steps to Reproduce:
1. echo 'á' | grep '\w'
2.
3.
  
Actual results:
-empty-

Expected results:
á

Additional info:

Comment 2 Jaroslav Škarvada 2012-03-20 15:37:46 UTC

Upstream ticket:
http://savannah.gnu.org/bugs/?19637

Also reproducible with latest grep-2.11.

Comment 3 RHEL Program Management 2012-09-07 05:04:46 UTC

This request was evaluated by Red Hat Product Management for
inclusion in the current release of Red Hat Enterprise Linux.
Because the affected component is not scheduled to be updated
in the current release, Red Hat is unable to address this
request at this time.

Red Hat invites you to ask your support representative to
propose this request, if appropriate, in the next release of
Red Hat Enterprise Linux.

Comment 4 Jaroslav Škarvada 2014-10-30 16:57:42 UTC

Created attachment 952258 [details]
Backported fix

Comment 6 Jaroslav Škarvada 2014-10-30 17:00:29 UTC

This can be also resolved by rebase to grep > 2.20.

Comment 7 Jaroslav Škarvada 2015-01-29 14:36:03 UTC

Created attachment 985631 [details]
grep-2.20 fix

(In reply to Jaroslav Škarvada from comment #6)
> This can be also resolved by rebase to grep > 2.20.

This is now preferred way, patch for grep-2.20 is attached.

Comment 8 Jaroslav Škarvada 2015-01-29 15:37:21 UTC

RHEL-7 is also affected by this, thus cloning to RHEL-7, not to have regression there.

Comment 12 errata-xmlrpc 2015-07-22 06:17:47 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHSA-2015-1447.html