Bug 1092015 - mojomojo-1.10-2.fc21 FTBFS
Summary: mojomojo-1.10-2.fc21 FTBFS
Alias: None
Product: Fedora
Classification: Fedora
Component: mojomojo
Version: 20
Hardware: Unspecified
OS: Unspecified
Target Milestone: ---
Assignee: Petr Pisar
QA Contact: Fedora Extras Quality Assurance
URL: https://github.com/mojomojo/mojomojo/...
Depends On:
Blocks: 1000256
TreeView+ depends on / blocked
Reported: 2014-04-28 13:59 UTC by Petr Pisar
Modified: 2014-07-02 13:17 UTC (History)
2 users (show)

Fixed In Version: mojomojo-1.10-3.fc21
Doc Type: Bug Fix
Doc Text:
Clone Of:
Last Closed: 2014-07-02 13:17:22 UTC
Type: Bug

Attachments (Terms of Use)
Prpoposed fix (1.36 KB, patch)
2014-04-29 10:50 UTC, Petr Pisar
no flags Details | Diff

System ID Private Priority Status Summary Last Updated
CPAN 87267 0 None None None Never

Description Petr Pisar 2014-04-28 13:59:50 UTC
mojomojo-1.10-2.fc21 fails to build because 'Unicode wikilinks' t/unicode.t test fails with current Encode:

$ CATALYST_CONFIG=t/var/mojomojo.yml prove -l -v t/unicode.t
ok 6 - POST /.jsrpc/render
ok 7 - basic Unicode: page content
[error] Caught exception in MojoMojo::Controller::Jsrpc->render "Cannot decode string with wide characters at /usr/lib64/perl5/vendor_perl/Encode.pm line 215."
not ok 8 - Unicode wikilinks

#   Failed test 'Unicode wikilinks'
#   at t/unicode.t line 52.
#          got: "<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Trans"...
#       length: 3850
#     expected: "<p><span class="newWikiWord"><a title="Not found. "...
#       length: 133
#     strings begin to differ at char 2 (line 1 column 2)
ok 9 - restore original formatter
# Looks like you failed 1 test of 9.

$ rpm -q perl-Encode

Comment 1 Petr Pisar 2014-04-28 14:19:41 UTC
The test is:

$test = 'Unicode wikilinks';
my $unicode_string = 'განეკუთვნება';
$content = "[[$unicode_string]]";
$mech->post('/.jsrpc/render', { content => $content });
$mech->content_is(<<"HTML", $test);
<p><span class="newWikiWord"><a title="Not found. Click to create this page." href="/$unicode_string.edit">$unicode_string?</a></span></p>

The Encode::decode_utf8() complains on:


$ perl -MEncode -e 'decode_utf8(qq{\x{10d2}\x{10d0}\x{10dc}\x{10d4}\x{10d9}\x{10e3}\x{10d7}\x{10d5}\x{10dc}\x{10d4}\x{10d1}\x{10d0}}, 1)'
Cannot decode string with wide characters at /usr/lib64/perl5/vendor_perl/Encode.pm line 215.

The \x{} notation corresponds to the 'განეკუთვნება' string.

It looks like the unicode string is double-decoded, second decoding is performed on string with UTF-8 flag up instead of on bit-stream.

Comment 2 Petr Pisar 2014-04-29 07:17:49 UTC
This bug is triggered by upgrading Encode from 2.52 to 2.53, more precisely by commit:

commit ff65c71aa64c0efd285e6905ac68ba4e2cb25541
Author: Tatsuhiko Miyagawa <miyagawa@bulknews.net>
Date:   Sun Aug 25 19:02:16 2013 -0700

    Do not short-circuit decode_utf8 with utf8 flags

diff --git a/Encode.pm b/Encode.pm
index aea404a..5cee760 100644
--- a/Encode.pm
+++ b/Encode.pm
@@ -209,7 +209,6 @@ my $utf8enc;
 sub decode_utf8($;$) {
     my ( $octets, $check ) = @_;
-    return $octets if is_utf8($octets);
     return undef unless defined $octets;
     $octets .= '' if ref $octets;
     $check   ||= 0;

The former behavior was to return success on UTF-8-flagged string immediately. Now, it checks the argument is correct UTF-8 byte-string.

Comment 3 Petr Pisar 2014-04-29 08:18:17 UTC
Reported to mojomojo upstream as <https://github.com/mojomojo/mojomojo/issues/121>.

Comment 4 Petr Pisar 2014-04-29 08:21:40 UTC
Fedora 20 is affected too.

Comment 5 Petr Pisar 2014-04-29 10:50:48 UTC
Created attachment 890741 [details]
Prpoposed fix

Comment 6 Fedora Update System 2014-04-29 11:24:58 UTC
mojomojo-1.10-3.fc20 has been submitted as an update for Fedora 20.

Comment 7 Fedora Update System 2014-05-08 10:00:46 UTC
mojomojo-1.10-3.fc20 has been pushed to the Fedora 20 stable repository.  If problems still persist, please make note of it in this bug report.

Note You need to log in before you can comment on or make changes to this bug.