Bug 513998 - Review Request: html2txt - Produces a text-only file from an HTML file
Summary: Review Request: html2txt - Produces a text-only file from an HTML file
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Fedora
Classification: Fedora
Component: Package Review
Version: rawhide
Hardware: All
OS: Linux
medium
medium
Target Milestone: ---
Assignee: Nobody's working on this, feel free to take it
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
Depends On:
Blocks: KyaPanel
TreeView+ depends on / blocked
 
Reported: 2009-07-27 15:35 UTC by Hugo Cisneiros
Modified: 2009-07-28 19:48 UTC (History)
5 users (show)

Fixed In Version:
Clone Of:
Environment:
Last Closed: 2009-07-28 19:48:12 UTC
Type: ---
Embargoed:


Attachments (Terms of Use)

Description Hugo Cisneiros 2009-07-27 15:35:23 UTC
Spec URL: http://www.devin.com.br/fedora-devel/SPECS/html2txt.spec
SRPM URL: http://www.devin.com.br/fedora-devel/html2txt-0.99.1-1.fc11.src.rpm
Description:
html2txt doesn't do any conversion of your file, but only stripes
ANYTHING between the '<' and '>' signs in the original html-file
and writes the result to the specified text file. The html file
remains untouched.

--

I don't know if I'll need a new sponsor because I didn't package and work with Fedora team since 2006. FedoraProject Brazil asked for help on packaging the KyaPanel, so I'm trying to "come back" to help.

Although KyaPanel is a noarch package, it uses this html2txt for some reporting generation. I'm studying with the author the possibility to substitute this single utility for a sed script, for compliance with the noarch nature of the whole package. But since this is not the case at this time, I think that packaging this program won't hurt :)

The current KyaPanel review request is:
https://bugzilla.redhat.com/show_bug.cgi?id=485401

And it's missing the spec, etc. I'll work through the week on the spec and its dependencies.

Comment 1 Hugo Cisneiros 2009-07-27 16:09:06 UTC
Koji scratch build on dist-f11:
http://koji.fedoraproject.org/koji/taskinfo?taskID=1541252

Comment 2 Bruno Wolff III 2009-07-27 16:49:18 UTC
html2txt doesn't seem to be a very good tool for converting html to plain text.
Could you use some other tool that is already in Fedora to do that? lynx will do it, but there are probably other tools that could do it as well. You can test "lynx -force_html -dump" and see if it does what you need.

Comment 3 Jason Tibbitts 2009-07-27 17:25:46 UTC
There's also html2text, already in Fedora.

Comment 4 Hugo Cisneiros 2009-07-27 22:58:49 UTC
I could also use sed to do it, but since the KyaPanel (goal) upstream does not use it in the current version, I can't create a package for it. For this, one question: what do you think of creating a patch for not using this tool on the main package, even when upstream didn't use the patch?

I ask this because I'll sure push a patch like that to him in the future :)

If using a patch is a good thing, I'll just dump this package.

Comment 5 Bruno Wolff III 2009-07-28 03:48:20 UTC
I think the typical solution for this kind of thing is to have a way to configure it at run time. Either read a file (global and/or local) and/or look at an environment variable that says what command to use to change html to plain text. There should also be a fallback. The fallback is the only thing you would need to change for Fedora if you can get the general scheme upstreamed.

Comment 6 Hugo Cisneiros 2009-07-28 19:48:12 UTC
Ok, since nobody wants this package included, I'll dump it and use another approach. If anyone changes opinion, re-open ticket. Thanks.


Note You need to log in before you can comment on or make changes to this bug.