Re: html to text
Bruce Harper (bharper@VT.EDU)
Tue, 23 Sep 1997 09:53:04 -0400
>Does anyone know of any freeware or shareware that will convert HTML to
>text or rtf or some other more usable word processor format?
The easiest way to convert from HTML to text is with a web browser. Open
the HTML file, then use the "Save as ..." feature to save the displayed
file to a new file, specifying "Text" as the desired format (instead of
"HTML" or "Source"). As others have mentioned, an HTML file is just
a plain text file, so you can also open it in any text editor and
strip out the HTML tags manually (with heavy use of "search and
replace"). If you are using a Macintosh, BBEdit is about the best text
editor going for working with HTML. I use BBEdit 4.0 (the commercial
version), which has a built-in extension that strips the tags from an
HTML file; I'm not sure whether BBEdit Lite includes it.
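If you would rather script the tag stripping than do it by hand, here
is a minimal sketch in Python using only its standard html.parser
module. This is not how BBEdit or any browser does it; the names
TagStripper, BLOCK_TAGS and html_to_text are made up for illustration,
and the choice of which tags start a new line is my own assumption.

```python
from html.parser import HTMLParser

# Assumption: tags that should start a new line in the text output.
BLOCK_TAGS = {"title", "p", "br", "div", "li", "tr",
              "h1", "h2", "h3", "h4", "h5", "h6"}


class TagStripper(HTMLParser):
    """Collects the text between tags and throws the tags away."""

    def __init__(self):
        super().__init__(convert_charrefs=True)  # turns &amp; into &
        self.parts = []
        self._skip = 0  # nesting depth inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in ("script", "style"):
            if self._skip:
                self._skip -= 1
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        # Keep text unless it is script or stylesheet content.
        if not self._skip:
            self.parts.append(data)


def html_to_text(html):
    """Return the visible text of an HTML string, one block per line."""
    parser = TagStripper()
    parser.feed(html)
    parser.close()
    text = "".join(parser.parts)
    # Collapse runs of whitespace and drop empty lines.
    lines = [" ".join(line.split()) for line in text.splitlines()]
    return "\n".join(line for line in lines if line)


if __name__ == "__main__":
    sample = "<h1>Troop News</h1><p>Campout is <b>Saturday</b>.</p>"
    print(html_to_text(sample))
```

Like a manual search and replace, this loses all formatting (headings,
bold text, tables), so it is only a fit when plain text is what you
want.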
If you want to go one step further and convert the file to a word
processor file, there is a DOS/Windows program that will create rtf (Rich
Text Format) output. HTML2RTF.COM is fast and free and can be downloaded
from http://www.cena.dgac.fr/~sagnier/html2rtf.htm . The last update was
on July 24, 1997, so it is current and available. I have used it and it
works well. Once in rtf format, the file can be imported into almost any
word processor with most of the original HTML formatting (headings, bold
text, etc.) pretty much intact.
You can find this and other conversion programs through the World Wide
Web Consortium web site ( http://www.w3.org/ ), on its HTML Converters
page ( http://www.w3.org/Tools/Filters.html ).
Hope this helps.
Bruce in Blacksburg
Assistant Scoutmaster, Troop 56, Blacksburg, Virginia
New River District -- Blue Ridge Mountains Council
Bruce B. Harper (540)231-4360 firstname.lastname@example.org
Distributed Information Systems Virginia Tech Computing Center
1700 Pratt Drive Blacksburg, Virginia 26060
Virginia Tech home page http://www.vt.edu/