Bug 71921

Summary: ignore locale cs_CZ
Product: [Retired] Red Hat Linux Reporter: Vaclav Dusek <vaclav.dusek>
Component: htdigAssignee: Phil Knirsch <pknirsch>
Status: CLOSED RAWHIDE QA Contact: David Lawrence <dkl>
Severity: medium Docs Contact:
Priority: medium    
Version: 7.3CC: grdetil, rvokal
Target Milestone: ---   
Target Release: ---   
Hardware: i386   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2004-01-13 13:32:59 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Vaclav Dusek 2002-08-20 08:28:14 UTC
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.0.0) Gecko/20020607

Description of problem:
in htdig.conf is ignored tag
locale: cs_CZ
version htdig 1.x is OK

Version-Release number of selected component (if applicable):


How reproducible:
Always

Steps to Reproduce:
1.czech WWW site with Apache
2.in htdig.conf is direktive locale: cs_CZ
3.this directive is ignored with htsearch
	

Additional info:

Comment 1 Phil Knirsch 2002-08-20 11:41:12 UTC
Could you give a more specific and detailed description on how to reproduce this
problem?

With that little information i don't know how i should be able to reproduce it,
and reproduction of bugs locally is essential ;-).

So if you could give me step by step description of what i should try and what
the expected result and the actual results are i'd greatelly appreciate it.

Thanks,

Read ya, Phil

Comment 2 Vaclav Dusek 2002-08-20 18:19:41 UTC
htdig.conf

locale: cs_CZ
database_dir:           /var/lib/htdig/dusek
start_url:              http://dusek.upce.cz/

when i using search via WWW, i don't have czech characters in result - in result
is incorect special charcters &...; - not czech characters in ISO8859-2

Comment 3 Phil Knirsch 2003-09-02 12:51:49 UTC
Could you give the latest releases a try on a current Red Hat? Since 8.0 we have
utf-8 as default and this should be enough to have a consistent charset over all
apps.

Thanks,

Read ya, Phil

Comment 4 Gilles Detillieux 2003-10-30 15:53:22 UTC
This bug should be fixed in snapshots of ht://Dig 3.2.0b4 on or after 20030615
(June 15, 2003), with the addition of the translate_latin1 attribute.  You'd
need to add

translate_latin1: false

to your htdig.conf file, to disable the translation of the ISO Latin 1 SGML
entities from your documents into upper end 8-bit characters in the database,
and then back to the Latin 1 SGML entities in htsearch results. Snapshots before
June 15 didn't have this attribute, and always did the translation. This might
require building your own RPM of htdig, as the most recent snapshot available as
a Red Hat update for 7.3 is the 011302 snapshot (Jan. 13, 2002).  Even rawhide
is still at 20030601.  Watch for ht://Dig 3.2.0rc1 coming hopefully in early
November.

Comment 5 Phil Knirsch 2004-01-13 13:32:59 UTC
Thanks for the info, i've included the change in the latest Fedora
Core package for htdig now.

Read ya, Phil

Comment 6 Gilles Detillieux 2004-01-14 20:52:23 UTC
Actually, what was released in November was 3.2.0b5.  Release Candidate 1 is still 
apparently a few months off.  However, 3.2.0b5 is far more solid than any of the 
3.2.0b4 development snapshots that have been RPM'ed.