From Bugzilla Helper: User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.5a) Gecko/20030718 Description of problem: Some of RH's patches makes regexp engine broken (compared to vanilla perl compiled from sources). The following script should match 'oo' in foo via f([^\s]+). It does, if "foo" is taken as raw scalar, but fails if taken as UTF-8 flagged scalar. Similar problems appear with negated character-sets [^...] and \S,\w,\W. #!/usr/bin/perl use Encode; my $exp="foo"; # raw variant (ok) print "(raw): "; print $exp=~/f([^\s]+)/ ? "Matches '$1'\n" : "no match\n"; # utf8 variant (BROKEN!!!) print "(utf8): "; my $exp=decode('iso-8859-1',$exp); print $exp=~/f([^\s]+)/ ? "Matches '$1'\n" : "no match\n"; Version-Release number of selected component (if applicable): perl-5.8.0-88 How reproducible: Always Steps to Reproduce: 1.run the script from Description Actual Results: (raw): Matches 'oo' (utf8): no-match Expected Results: (raw): Matches 'oo' (utf8): Matches 'oo' Additional info: Bug is present in perl shipped with both RedHat Linux 8 and RedHat Linux 9.
I get correct behavior using the perl in rawhide. can you test that to confirm it behaves properly for you?
This smells like 104540. This bug causes the "mirror" program (http://sunsite.org.uk/packages/mirror/) to fail if LANG includes utf8 (default in RHL9).