Bug 702205

Summary: Multibyte characters in a po cannot be processed by Zanata properly
Product: [Retired] Zanata Reporter: Kenichi Takemura <ktakemur>
Component: Component-LogicAssignee: zanata-dev-internal <zanata-dev-internal>
Status: CLOSED NOTABUG QA Contact: Ding-Yi Chen <dchen>
Severity: medium Docs Contact:
Priority: medium    
Version: 1.3-alpha-1CC: ebaak, sflaniga, zanata-bugs
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2011-05-05 07:23:09 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:

Description Kenichi Takemura 2011-05-05 03:59:25 UTC
Description of problem:

Accidentally I was trying to import a pot file that contains a single multibyte character and mvn zanata:publican-push produces Embedded error: operation returned 500 (Internal Server Error).

As a client user this looks like a zanata system error but in fact from the server.log zanata cannot deal with multibyte character.

This should be rectified in either way

- More appropriate error message in order for user to fix the book.
- Zanata supports multibyte characters

Caused by: java.sql.SQLException: Incorrect string value: '\xE2\x88\x97, B...' for column 'content' at row 1
        at com.mysql.jdbc.SQLError.createSQLException(SQLError.java:1055)

The msgstr of the po looks

msgid "Xen and the Art of Virtualization, Paul Barham∗, Boris Dragovic. University of Cambridge SOSP'03 paper"
msgstr ""

$ xxd RHEL_5-7/ja-JP/Chapters/Test_design/Block_device_model.po | grep e288
0002e80: 2050 6175 6c20 4261 7268 616d e288 972c   Paul Barham...,

'∗' is a multibyte character.

Version-Release number of selected component (if applicable):
Zanata version 1.3-alpha-1 (20110330-1649)

How reproducible:
always

Steps to Reproduce:
1.
2.
3.
  
Actual results:
import ends up with internal error

Expected results:
- More appropriate error message in order for user to fix the book.
- Zanata supports multibyte characters

Additional info:

Comment 1 Sean Flanigan 2011-05-05 05:36:31 UTC
Was this the same system which was using a latin1 database?

Comment 2 Kenichi Takemura 2011-05-05 07:23:09 UTC
Tested again. My mistake. Sorry for disturbing.