Red Hat Bugzilla – Bug 88313
man does'nt display accented characters
Last modified: 2007-04-18 12:52:52 EDT
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.3) Gecko/20030314
Description of problem:
man does'nt display accented characters of french manpages. Is there still
something like MANCHARSET or LESSCHARSET to set? If ye, what are the admissibles
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1.upgrade from rh7.3 to rh9
2.run man -a <manpage>
Actual Results: ï¿½ ï¿½ ï¿½ etc... show as ?.
Expected Results: correct display as before
Reading my mail, I feel bugzilla has the same problem; LoL -:)
Is there anybody out there (Pink Floyd, the Wall)?
what locale are you using? (type "locale" at the command line)
Still hoping for an answer. Thanks
LANG=fr_FR.UTF-8@euro man ls
I tried LANG=fr_FR.UTF-8@euro: it qgaves things like rÃÂ©pertoires instead of
rÃ©pertoire. But LANG=fr_FR works, I am sure it didn't with RH7.3), and I still
have the euro sign â¬. I don't understand, but thanks anyway.
Is your terminal set to display UTF-8? If it isn't you're receive gargage.
The reason fr_FR works and fr_FR@euro doesn't is because they use different
character sets: fr_FR uses iso-8859-1 (aka Latin-1) and fr_FR@euro uses
iso-8859-15 (aka Latin-9 or Latin-0)
The default character encoding for Red Hat Linux 8 and beyond (for European
language-- other languages (esp CJK) will be migrated to UTF-8 in later
versions) is UTF-8, which is different from 7.3.
I have been trying (very hard) to tell you that UTF8 does'nt work at all with
any west european locale (and possibly others). Please try them. For your
information, my last comment said ini works, but I have to use kde konsole (in
gnome !!!).Gnome terminal won't accept fr_FR locale.
> Is your terminal set to display UTF-8?
> If it isn't you're receive gargage.
reporter, you didn't answer this question. However, I confirmed
that 'man' doesn't work the way it should. There are two problems.
One of them is not the fault of man but that of less (see bug 88868).
To see that, you can do the following:
1. $ env LC_ALL=fr_FR.UTF-8 xterm &
2. In a new xterm window,
$ env LANG=fr_FR.UTF-8 man man -P more
You'll see that letters with diacritic marks are
3. In the same xterm,
$ env LANG=fr_FR.UTF-8 man man
would not work.
Because the default pager for 'man' is 'less -isr',
this is a clear indication that 'less' is to blame.
Now, what's the problem of 'man' I wrote about above?
That is, it's not compliant to the POSIX-specified locale
resolution-order. When LC_ALL is specified, it should have
the highest priority, but it still selects the man page based
on LANG. I'll file it as a separate bug.
sorry. the first part of my comment #10 is wrong. I didn't relaize that
JLESSCHARSET was set to 'ko'. With that unset,
LANG=fr_FR.UTF-8 /usr/bin/man man
works fine in a UTF-8 terminal.
So, this bug as reported should be closed.
On the other hand, I have a trouble with fr_FR (fr_FR@euro) in a ISO-8859-1 or
ISO-8859-15 terminal. Letters with diacritic marks come up as question
marks.Using '-P more' didn't change this.
When I use fr_FR.ISO-8859-1 or fr_FR.ISO-8859-15, they come up correctly.
Because '-P more' change didn't make any difference, I suspect that either man
or groff is to blame for not using nl_langinfo(CODESET) but relying on parsing
the value of LANG to determine the codeset.
as per comment #11, closing