Converting from HTML
Up to full list of filters
Browsers
Most www browsers will convert html to plain text, for example:
www -na "some-URL" > my-text
(See the Linemode
browser and
command line options).
lynx -dump "some-URL" > my-text
(See the Lynx documentation)
Mosaic
will let you "save as" plain text, and in some versions also in other formats
including formatted text and PostScript.
SGML tools
Some SGML tools will allow you to convert HTML to other formats. For instance:
-
gf. A general-purpose SGML compiler.
-
HyperHelp Bridge from
Bristol Technology will convert to RTF.
-
SGML2TeX will convert SGML to TeX on the PC.
-
Fred will convert SGML to
HTML, TeX (PostScript), ASCII etc.
-
sgml2 will convert SGML to other formats.
-
instant from OSF can be used with sgmls to produce
various output formats from standard SGML inputs.
- An HTML to ICADD Transformation Service
translates HTML into the ICADD DTD, suitable for further translation to Braille,
large print or voice synthesis.
Further information is available on
SGML resources and tools.
Other tools
- PostScript
- html2ps is a Perl script
to convert HTML to PostScript. It handles most tags but not in-line images.
Contact: jan@tdb.uu.se (Jan Karrman)
- Here is information about another script to convert
HTML to PostScript.
-
And here is a service to do the job for you. Try it!
- Frame
-
www_and_frame
will convert HTML to MML for FrameMaker.
See also an HTML to MIFconverter and toolkit
by the same author.
See
Support Info for current situation and sources. Contact: connolly@w3.org (Dan Connolly)
- html2mif is an HTML to Framemaker MIF
converter written in Tcl. Contact: faustus@remarque.berkeley.edu (Wayne A. Christopher)
- LaTeX etc
-
html2latex is a program by Nathan Torkington based on the
NCSA html parser.
- Another html2latex
by Frans J. Faase is able to combine
several HTML files into a single LaTeX file, where links between the
files are converted to references. External URL's can be converted
into footnotes or into a bibliography (sorted on URL).
- htmlatex.pl
is a perl script to do the conversion (may be moving soon).
Contact: n9146070@cc.wwu.edu (Jake Kesinger)
- There is also a
sed script
to convert HTML into LaTeX.
- Plain text and setext
- Other formats
Check out word processor filters, some of
which work both ways, and also HTML editors.
__________________________________________________________________
MS,
CERN
10 August 1995