Bug 75192 - tcsh incorrectly localized (can not handle UTF-8)
Summary: tcsh incorrectly localized (can not handle UTF-8)
Keywords:
Status: CLOSED RAWHIDE
Alias: None
Product: Fedora
Classification: Fedora
Component: tcsh
Version: rawhide
Hardware: All
OS: Linux
medium
high
Target Milestone: ---
Assignee: Miloslav Trmač
QA Contact:
URL:
Whiteboard:
: 73627 (view as bug list)
Depends On: 73627
Blocks: FC2Blocker FC3Target
TreeView+ depends on / blocked
 
Reported: 2002-10-05 09:19 UTC by Tomasz Kepczynski
Modified: 2007-11-30 22:10 UTC (History)
2 users (show)

Fixed In Version: 6.13-1
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2004-08-18 10:10:51 UTC


Attachments (Terms of Use)
Automatically convert message encoding (13.47 KB, patch)
2004-07-17 01:18 UTC, Miloslav Trmač
no flags Details | Diff

Description Tomasz Kepczynski 2002-10-05 09:19:34 UTC
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.0.1) Gecko/20020830

Description of problem:
Polish output from tcsh in pl_PL.utf8 locale lacks Polish letters. Additionaly
tcsh source rpm doesn't compile under .utf8 locales (which is most likely the
root cause of the problem).

Version-Release number of selected component (if applicable):


How reproducible:
Always

Steps to Reproduce:
1. Set locale to pl_PL.utf8, start unicode on console
2. run: tcsh -h
3. check the output
4. set locale to pl_PL, stop unicode on console
5. run: tcsh -h
6. compare results
7. This will not work under X windows! Why? Well, someone already suggested in
some other bug, that locales in X are handled differently

Actual Results:  Nieznana opcja: `-h'
Uycie: tcsh [ -bcdefilmnqstvVxX ] [ argument ... ].


Expected Results:  Nieznana opcja: `-h'
U?ycie: tcsh [ -bcdefilmnqstvVxX ] [ argument ... ].


Additional info:

in actual results &zabovedot; is not visible with unicode.
Anyway - utf8 support seems to be rushed through the door. I can understand that
some 3rd party utilities may not work with it as you mention in release notes
but the fact that some packages in distribution only work correctly with unicode
and some without it is unacceptable. I'm rolling back to 7.3 which is a pity as
8.0 has some nice features.

Comment 1 Aleksey Nogin 2003-11-14 20:49:52 UTC
This is still the case with tcsh-6.12-5 in Fedora Core. The way tcsh
is localized is that it always assumes a specific charset for every
localization. As a result, when utf-8 is used, it just does not work.
For example, under ru_RU.UTF-8 all tcsh messages are just empty!

P.S. The only work-around I know for this is to erase
/usr/share/locale/ru/LC_MESSAGES/tcsh (or at least rename it into
/usr/share/locale/ru_RU.KOI8-R/LC_MESSAGES/tcsh). This way I get all
the messages in English, which is still way better than not getting
them at all.

Comment 2 Aleksey Nogin 2004-02-18 06:19:54 UTC
Still there (tcsh-6.12-7 :-( ).

Comment 3 Mikko Paananen 2004-02-22 22:17:08 UTC
You really should complain at www.tcsh.org. Last release seems to from
2002, so it's now really a surprise UTF-8 doesn't work.

May be translations should be left out since it seems they cause more
problems than have advantages.


Comment 4 Aleksey Nogin 2004-02-24 02:59:42 UTC
Re: comment #3.

There are two sides to this bug. First, there are no localizations
that would work under UTF-8 - this is obviously a tcsh bug. Second,
the localization files that do work (just not under UTF-8) are
packaged in a way that they get picked up under UTF-8 locales. The
second one is a packaging bug, and not tcsh's fault (at least not
directly).

Comment 5 Aleksey Nogin 2004-02-25 06:35:24 UTC
triage->easyfix (packaging the locale files under correct directories
in /usr/share/locale - see comment #1 - would make things a lot easier).

Comment 6 Simos Xenitellis 2004-04-13 05:53:34 UTC
tcsh does not use gettext to internationalise the messages. 
It uses some older technique which apparently does not support utf-8 
properly (??). The same issue exists with greek (LANG=el_GR.UTF-8). 
tcsh should be updated to use gettext. Talk to www.tcsh.org to help 
get this done. 
 
If you cannot write under a utf-8 locale with tcsh, please look at 
http://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=89549 
and get someone apply the patch. 

Comment 7 Miloslav Trmač 2004-07-17 01:18:31 UTC
Created attachment 101991 [details]
Automatically convert message encoding

The attached patch against tcsh-6.13.00:
* reinstates use of the "$ codeset" extension so that gencat doesn't barf
  on a non-8-bit locale (like *.UTF-8 or POSIX)
* embeds the catalog encoding information in the generated catalogs
* automatically converts messages from catalog encoding to the
  LC_CTYPE-specified encoding.

[Largest part of the patch is just the result of rerunning autoconf]

With this patch the messages are output correctly in UTF-8 locales for me.

Comment 8 Miloslav Trmač 2004-08-17 14:37:14 UTC
*** Bug 73627 has been marked as a duplicate of this bug. ***

Comment 9 Miloslav Trmač 2004-08-18 10:10:51 UTC
Patch added to tcsh-6.13-1, which should show up in rawhide soon.


Note You need to log in before you can comment on or make changes to this bug.