Red Hat Bugzilla – Bug 160349
FC4 Release Notes appear to have bad encoding
Last modified: 2007-04-18 13:27:59 EDT
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.8) Gecko/20050513 Fedora/1.0.4-1.3.1 Firefox/1.0.4
Description of problem:
Using a default FC3 install with Firefox in ISO-8859-1 or UTF-8 encoding, the new release notes contain a number of odd characters that leave the page readable but ugly. I suspect that this is some sort of export problem, as I often see the same sort of garbage when exporting .doc files to HTML.
These characters appear in the HTML source, not just the rendered HTML.
The copyright symbol should also probably be represented as "&copy;".
As a nit while we're on the subject, the W3C validator suggests that this page should be labelled as HTML 4.01 Transitional rather than Strict, because it's not legal 4.01 Strict...
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. FC3, Western Encoding, Firefox
2. Browse to http://fedora.redhat.com/docs/release-notes/fc4/
Expected Results: Release notes displayed without garbage
While it's not a crasher bug, if more people see the release notes like this it's not going to be good PR.
Ooo, trying to submit this is generating internal server errors. I'll try removing my examples and sending them as an attachment instead.
It looks like "smart quotes" are once again the most common culprit.
Created attachment 115416 [details]
I used the chart at
to do a quick search and replace, using &trade;, &mdash;, etc. instead of the
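For anyone wanting to repeat that kind of search and replace, here is a rough sketch in Python. The mapping and the function name to_entities are made up for illustration; this is not the chart mentioned above, and it only covers a handful of common characters.

```python
# Hypothetical mapping from typographic characters to named HTML entities.
# This is an illustrative subset, not the chart referenced in the comment.
ENTITY_MAP = {
    "\u2018": "&lsquo;",  # left single "smart" quote
    "\u2019": "&rsquo;",  # right single "smart" quote
    "\u201c": "&ldquo;",  # left double "smart" quote
    "\u201d": "&rdquo;",  # right double "smart" quote
    "\u2014": "&mdash;",  # em dash
    "\u2122": "&trade;",  # trademark sign
    "\u00a9": "&copy;",   # copyright sign
}


def to_entities(text: str) -> str:
    """Replace each mapped character with its named HTML entity."""
    for char, entity in ENTITY_MAP.items():
        text = text.replace(char, entity)
    return text


print(to_entities("\u201cFedora\u2122\u201d \u00a9 2005"))
```

Because the output is plain ASCII, it should display the same way whichever charset the server or browser assumes.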
*** Bug 160346 has been marked as a duplicate of this bug. ***
*** Bug 160347 has been marked as a duplicate of this bug. ***
*** Bug 160348 has been marked as a duplicate of this bug. ***
Adding common blocking tracker bug, as specified.
There are a number of layers happening here.
I think the primary cause is using a UTF-8 environment variable setting on the
build system, in this case, my laptop. This is not bad in itself, but it causes
problems when you combine it with the situation we had on fedora.redhat.com up
until this morning.
The PHP includes did not have a charset attribute in the <META> tag, and
presumably httpd was using the default ISO-8859-1 from /etc/httpd/conf/httpd.conf.
We added charset=utf-8 to the include so all pages on f.r.c are served using utf-8.
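To illustrate the mismatch, here is a small Python sketch of what probably happened. The sample string is invented, and decoding as strict ISO-8859-1 is a simplification of what a browser does.

```python
# Invented sample text with curly quotes and a copyright sign.
text = "\u201cFedora\u201d \u00a9 2005"

# The build system writes the page out as UTF-8 bytes.
utf8_bytes = text.encode("utf-8")

# A client that assumes ISO-8859-1 turns every byte into one character,
# so each multi-byte UTF-8 sequence becomes two or three odd characters.
garbled = utf8_bytes.decode("iso-8859-1")

# A client told charset=utf-8 recovers the original text.
correct = utf8_bytes.decode("utf-8")

print(repr(garbled))
print(repr(correct))
```

The bytes on disk are the same in both cases; only the declared charset changes how they are read.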
Now this breaks a few other docs that I will rebuild using UTF-8 and hope they
Interested if the release notes look better now.
Long term, we need to coordinate the documentation better from writing to
Changing the stated character set has indeed resolved this from the perspective
of my browser.
Making the global changes to the fedora.redhat.com pages was sufficient. This
is now consistent with a default install of FC.