Red Hat Bugzilla – Bug 75280
xchat doesn't accept danish characters fxeFXE
Last modified: 2008-05-01 11:38:04 EDT
Description of Problem:
X-Chat doesn't accept input of danish characters fxeFXE when
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. export LANG=en_US.UTF-8 (this is set in /etc/sysconfig/i18n)
2. Login with gdm
3. Run xchat
4. Try entering danish characters fxeFXE in the text entry
Letters fxeFXE shows up as other characters (C|C,C%C etc)
(note: when copied and inserted into other application they show up right)
You should be able to use danish charaters even though the locale isn't set
This doesn't happen when LANG=da_DK.ISO_8859_1
seems opera messed up the danish characters...
any clue to how I can get them right on the bug report?
This is not an xchat bug. When multiple people are communicating, either
via IRC, email, or otherwise, or when any to applications are communicating
streams of text, both ends of communication MUST be using an agreed upon
character encoding. If both ends are not using the same encoding, then
garbled characters will result. This is expected behaviour.
The solution to this "problem", is to either:
1) disable UTF-8, and set your locale to a non unicode locale, or start
xchat with a non unicode locale, ie: "LANG=de_DE xchat"
2) enable "fontset" support by default in xchat. Many people are happy
3) force everyone else you communicate with to also use UTF-8
The IRC protocol itself specifies an 8bit encoding, but doesn't specify
a particular encoding. This is most unfortunate, since it leaves
encoding undefined. Any encoding is therefore theoretically valid.
The unfortunate consequence of that, is that any people who are
communicating with each other, must use the same encoding, otherwise
any non-ASCII characters will appear as random junk. This includes
using UTF-8 as well.
There is no way to solve this unfortunate problem. It's just a flaw
in the IRC protocol itself. Hopefully at some point the IRC protocol
will be officially enhanced and will specify UTF-8 as the encoding.
That would enable worldwide communication via IRC with a single
official standard 8 bit character encoding. Of course, it would
likely take years for all IRC servers and clients to become properly
UTF-8 aware at that point.
*** Bug 73801 has been marked as a duplicate of this bug. ***
*** Bug 75681 has been marked as a duplicate of this bug. ***
*** Bug 68141 has been marked as a duplicate of this bug. ***
*** Bug 76775 has been marked as a duplicate of this bug. ***