Red Hat Bugzilla – Bug 125362
python-doc html files have japanese character encoding
Last modified: 2007-11-30 17:10:44 EST
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.6)
Description of problem:
Quite a few of the python documentation HTML pages specify the charset
"EUC-JP", causing Mozilla to display them in an odd font. For example,
has the following line in the header porition:
<meta http-equiv="Content-Type" content="text/html; charset=EUC-JP">
The top level documentation files are not affected. For example,
uses the following encoding:
<meta http-equiv="Content-Type" content="text/html;
and displays normally.
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. Make sure the python-docs rpm is installed
2. Start mozilla
3. Open file:///usr/share/doc/python-docs-2.3.3/html/lib/module-os.html
Actual Results: The page loaded with an odd looking, non standard
font corresponding to the japanese character encoding. If you change
the character encoding to ISO-8859-1 (western) via the View->Character
Encoding menu, the page is rendered in the standard font.
Expected Results: The pages should be rendered in the ISO-8859-1
character set and appear the same as other pages.
I tried to figure out why, but no progress. I'll keep chasing it.
Note that the standard docs on the python web page don't have this
problem... the following command can be used to quickly resolve this
(though obviously the rpms should be fixed)
grep -irl 'charset=EUC-JP' . | xargs sed -i
Yeah, I have to identify why it happens though...
Here's one place to check - the latex2html conf file. Since the
documents are generated using latex2html, I took a look at the
configuration file shipped as default with FC2
(/usr/share/latex2html/l2hconf.pm). If you grep for the line with
"$default_language", you'll see that english is
commented out and japanese is set. Looking at the same file on one of
my FC1 machices shows that it used to default to english.
This parameter causes the corresponding file in styles to be used.
Looking at styles/japanese.perl, the charset to be definted as "EUC-JP".
Seems to be fixed in 2.3.4-7, can you please confirm?
This is fixed. Closing.