html stripper

  • Thread starter: Dan Epstein
Dan said:
I'm looking for a simple, lightweight html stripper. What's your
favorite?

What a coincidence. I was looking for one myself today. Here's what I found:

http://www.flintmich.com/techtips/ecleaner.htm

Click on the link titled "ECleaner v. 2.02" and save the file to your
hard drive. I scanned it and it's clean of viruses. Besides, this looks
like a reputable site. I installed the download just a little while ago
and my drive came up clean after an AV scan. The program works like a charm.
 
I believe NoteTab Light has that ability. In fact, if you learn a little bit
about its scripting language, you can fine-tune it. I use the Pro version
and I developed a script to strip scripts, iframes, tables, etc., while
leaving the good stuff intact. I think the Light version can do that too.
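
For anyone who would rather script it outside NoteTab, here is a rough
Python sketch of the same idea: drop scripts, iframes and tables, keep the
rest of the text. It is not my NoteTab script, just an illustration built
on the standard library's html.parser; the names SelectiveStripper and
strip_html, and the choice of which tags to skip or break on, are
assumptions made for the example.

```python
from html.parser import HTMLParser


class SelectiveStripper(HTMLParser):
    """Collect page text, ignoring everything inside the SKIP elements."""

    SKIP = {"script", "style", "iframe", "table"}      # dropped with their content
    BREAKS = {"p", "br", "div", "li", "tr",
              "h1", "h2", "h3", "h4", "h5", "h6"}      # become line breaks

    def __init__(self):
        super().__init__(convert_charrefs=True)        # &amp; -> & and so on
        self.depth = 0                                 # nesting level inside SKIP
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.depth += 1
        elif self.depth == 0 and tag in self.BREAKS:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in self.SKIP:
            if self.depth > 0:
                self.depth -= 1
        elif self.depth == 0 and tag in self.BREAKS:
            self.parts.append("\n")

    def handle_data(self, data):
        if self.depth == 0:
            self.parts.append(data)


def strip_html(html):
    """Return the visible text of `html`, one non-empty line per block."""
    parser = SelectiveStripper()
    parser.feed(html)
    parser.close()
    lines = (" ".join(line.split()) for line in "".join(parser.parts).splitlines())
    return "\n".join(line for line in lines if line)
```

One caveat: a skipped element that is never closed (a stray <iframe>, say)
will swallow the rest of the page, so messy HTML may need a more forgiving
approach.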

www.notetab.com

M
 
John said:
What a coincidence. I was looking for one myself today. Here's what I
found:

http://www.flintmich.com/techtips/ecleaner.htm

Click on the link titled "ECleaner v. 2.02" and save the file to your
hard drive. I scanned it and it's clean of viruses. Besides, this looks
like a reputable site. I installed the download just a little while ago
and my drive came up clean after an AV scan. The program works like a
charm.
I use Web2Text first, then run the files through eCleaner. Works well.

Mike Sa
------------------
Web2Text
<http://www.all4you.dk/FreewareWorld/links.php?id=12760>
http://www.jetman.dircon.co.uk/software/web2text.html
Web2Text version 1.6/32-bit: runs under MS Windows 95/98/NT. Has some
features the 16-bit version doesn't including conditional conversion.
Web2Text version 1.2/16-bit: runs under MS Windows 3.1 and above.
***No longer being developed or supported due to lack of time and demand
for it.
Download: w2t16v12.zip (132,917 bytes, also to be found on Simtel sites).
 
I'm looking for a simple, lightweight html stripper. What's your
favorite?

The bestest text editor (NoteTab Light) does that, amongst a million
other things.

HTH.
 
eCleaner is Pricelessware.
http://www.pricelesswarehome.org/2005/PL2005TEXT.php#0901-PW
eCleaner's home page: http://ecleaner.tripod.com/

Some HTML strippers are listed here:

http://www.pricelesswarehome.org/acf/P_TEXT.php#2.03Convert:HTMLToFormattedText
HTML2TXT
HTMLDOC (v 1.8.23)

http://www.pricelesswarehome.org/acf/P_TEXT.php#2.03Convert:HTMLToText
html2ps
HTMLAsText
KILL<HTML>
Web2Text

http://www.pricelesswarehome.org/acf/P_TEXT.php#2.03Convert:HTMLToTextAndTables
HTML2Table
HTMStrip

http://www.pricelesswarehome.org/acf/P_TEXT.php#2.03HTMLViewer;Converter
ViewHTML

I'll second Mike Sa's Web2Text suggestion. There are some configuration choices - if you don't need
to change those you can drag and drop files on to the app and it will create a text file with the
same name as the HTML file in the directory the HTML file is in. Can't get much simpler than that. :)
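
If you want the same drop-a-file-get-a-text-file behavior from a script,
something like the Python sketch below would do it. This is only a guess at
the general idea, not how Web2Text works internally; TextOnly, html_to_text
and convert_file are names invented for the example, and it assumes the
input files are UTF-8. It is deliberately naive: it keeps all text,
including anything inside script or style blocks.

```python
import sys
from html.parser import HTMLParser
from pathlib import Path


class TextOnly(HTMLParser):
    """Naive stripper: throw away tags, keep every piece of text."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def html_to_text(html):
    parser = TextOnly()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts)


def convert_file(path):
    """Write page.txt next to page.html and return the new path."""
    src = Path(path)
    dst = src.with_suffix(".txt")                      # same name, same directory
    html = src.read_text(encoding="utf-8", errors="replace")
    dst.write_text(html_to_text(html), encoding="utf-8")
    return dst


if __name__ == "__main__":
    # Files dropped onto a script are normally passed as command-line arguments.
    for arg in sys.argv[1:]:
        print(convert_file(arg))
```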

Susan
--
Posted to alt.comp.freeware
Search alt.comp.freeware (or read it online):
http://www.google.com.gr/groups?q=+group:alt.comp.freeware&hl=en
Pricelessware & ACF: http://www.pricelesswarehome.org
Pricelessware: http://www.pricelessware.org (not maintained)
 
Susan said:
I'll second Mike Sa's Web2Text suggestion. There are some configuration
choices - if you don't need to change those you can drag and drop files
on to the app and it will create a text file with the same name as the
HTML file in the directory the HTML file is in. Can't get much simpler
than that. :)

Maybe so, but I often use eCleaner's ability to strip attribution from
third or more generation forwarded jokes.
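
For the curious, the core of that cleanup is easy to sketch in Python,
assuming "attribution" here means the stacked ">" quote markers that pile
up with each forward. This is not how eCleaner does it, just a minimal
illustration; strip_quote_markers is a made-up name.

```python
import re

# One or more ">" markers at the start of a line, with optional spaces
# or tabs before, between and after them.
QUOTE_PREFIX = re.compile(r"^[ \t]*(?:>[ \t]*)+")


def strip_quote_markers(text):
    """Remove leading '>' quote markers from every line of `text`."""
    return "\n".join(QUOTE_PREFIX.sub("", line) for line in text.splitlines())
```

Lines without a leading ">" pass through untouched, so it is safe to run
over a whole message.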
 
OK, but I will second myself because of the nicely shaped layout from
Links, which is by far the closest to the original. Look:

(end of the http://www.lemonde.fr page)

WEB2TEXT :


Vos recommandés


Chronique d'une exaspération ordinaire, par Jean-Louis Andreani
OpinionsSur
le déclin, exactement, par Maurice Lévy OpinionsM. de Villepin
résiste à la
concurrence de M. Sarkozy FranceAl-Qaida affirme avoir tué les
diplomates
algériens enlevés en Irak InternationalEn Ile-de-France, la
délinquance
progresse dans les territoires les plus défavorisés Société
"Lizzy" ou
l'exploit, par Marie-Béatrice Baudet HorizonsLes femmes, victimes d'un
"préjugé négatif des employeurs", selon le Céreq SociétéAttentats
de Londres :
neuf nouvelles arrestations Les attentats de LondresLes services
secrets
égyptiens auraient été informés de la préparation d'attentats
AfriqueEn
Algérie, le débat sur l'amnistie des islamistes est relancé
Proche-Orient


LINKS :

Vos recommandes

Vivre avec le Sur le declin,
terrorisme, par exactement, par
Jean-Marie Colombani Maurice Levy
Opinions Opinions
Chronique d'une L'"Havhingsten" ou
exasperation la conquete, par
ordinaire, par Jean-Franc,ois
Jean-Louis Andreani Augereau Horizons
Opinions Mais ou va
Esclavage domestique l'aristocratie
: la France britannique ?
condamnee par la Europe
Cour europeenne des Sur les plages,
droits de l'homme l'UMP invente
Organisations l'adhesion par SMS
internationales France
"Nous sommes Al-Qaida affirme
confrontes `a la avoir tue les
pire des haines" diplomates
Proche-Orient algeriens enleves
M. de Villepin en Irak
resiste `a la International
concurrence de M.
Sarkozy France



Arghh, of course the Links output has no accents :) Character set problem.
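
That kind of accent loss is what you get whenever text is squeezed into
7-bit ASCII. A small Python sketch of the effect, under the assumption that
this is roughly what happened here (Links itself transliterates a bit
differently, e.g. "c," for "ç"; to_ascii is a made-up helper name):

```python
import unicodedata


def to_ascii(text):
    """Drop accents by decomposing characters and discarding non-ASCII marks."""
    decomposed = unicodedata.normalize("NFKD", text)   # "é" -> "e" + combining accent
    return decomposed.encode("ascii", "ignore").decode("ascii")
```

Keeping the output in an encoding that can represent the accents (UTF-8 or
Latin-1 for French) avoids the problem altogether.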

Lh
 
I'm looking for a simple, lightweight html stripper. What's your
favorite?

The one I use is NoteTab Light. It's a text editor, but it also has
an HTML tag stripper that works very well. It will strip the tags
out of a book-length HTML file in less than a second.
 
Stop Growing Carets!
http://groups.msn.com/HappyHousekee...ssage=123282&LastModified=4675516170686573748
Windows 98/ME/2000/XP, Outlook Express 5 and 6

Open Outlook Express and click "Tools", "Options", "Send". Next to "Mail
sending format", click "HTML Settings" and uncheck the box that says
"Indent messages on reply", then click OK. Next, click "Plain text
settings", uncheck "Indent the original text with > when replying and
forwarding", and click OK. Now click "Apply", then "OK", and close Outlook
Express. The next time you open it, you won't be adding your own carets to
the carets that are already there.

(Note: This is not a tip for removing existing carets. It is a tip for
preventing you from adding more carets every time you reply to an email.
There are several programs that will remove the carets already there (not
automatically), and we've covered some of them in our Information Avenue
newsletters. The best way to stop the spread of carets is to stop
spreading them yourself and to tell your friends how to stop sending
carets. If everyone did this, eventually we'd have no carets!)
 