Bug 443270

Summary: segfault with certain utf8 string as regexp
Product: [Fedora] Fedora Reporter: Ariel T. Glenn <ariel>
Component: perlAssignee: Marcela Mašláňová <mmaslano>
Status: CLOSED WORKSFORME QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: medium Docs Contact:
Priority: low    
Version: 9CC: kasal, rc040203, robin.norwood, splewako, tcallawa
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2008-06-26 14:03:58 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Ariel T. Glenn 2008-04-19 23:50:52 UTC
Problem: The supplied code segfaults.  

Version-Release number of selected component (if applicable):
perl-5.8.8-28.fc7

How reproducible: run this program:

#!/usr/bin/perl                                                                
                                                                        
binmode(STDOUT, ":utf8");
binmode(STDIN, ":utf8");

use encoding(UTF8);

$title="test";
if ($title =~ /'élé'/) {
    print "it matches with title $title\n";
}
else {
    print "it does not match with title $title\n";
}
exit 0;

My locale, if it makes a difference: el_GR.UTF-8

Note that taking out the single quotes, or one of the accented e's, or putting 
other text in front of the string (but after the quote) makes it work fine.

Comment 1 Bug Zapper 2008-05-14 09:45:20 UTC
Changing version to '9' as part of upcoming Fedora 9 GA.
More information and reason for this action is here:
http://fedoraproject.org/wiki/BugZappers/HouseKeeping

Comment 2 Marcela Mašláňová 2008-06-09 14:14:33 UTC
Have you perl-5.8.8 and F-9?

It's working for me with perl-5.8.8 and perl-5.10.0. I tried different locales,
but no segfault.