TeX4ht

TeX4ht is a configurable converter that translates TeX and LaTeX documents to HTML and certain XML formats. Most notably, TeX4ht is used to convert (La)TeX documents to formats used by word processors. It was developed by Eitan M. Gurari. The program is published under the LaTeX Project Public License (LPPL).


History

TeX4ht was developed in the 1990s to convert (La)TeX to HTML, helping to publish scientific documents written in (La)TeX on the World Wide Web for display in a web browser. In particular, hypertext features were supported, so it became possible to include hyperlinks in the web version of documents. Support for more XML-based formats was added gradually. As of 2019, HTML5, XHTML, MathML, OpenDocument, DocBook, and TEI are supported. JavaHelp can also be generated. TeX4ht is now included preconfigured with all TeX distributions. Since Eitan M. Gurari's death, the program has been maintained by Radhakrishnan CV (no longer active), Karl Berry, and Michal Hoftich, with contributions from many others.


Function

TeX4ht does not directly transform TeX or LaTeX markup into the output markup language (HTML etc.). Instead, an ordinary (La)TeX run first compiles a DVI file from the source, and TeX4ht then processes that DVI file (cf. The LaTeX Web Companion, pp. 169f.). Other converters, most notably LaTeX2HTML and TtH, instead translate the source markup directly. Because TeX itself does the compiling, TeX4ht can deal with essentially any (La)TeX source that compiles successfully. Publicly available macro packages and user-made (perhaps document-specific) commands can also be used for features that go beyond the standard TeX formats, such as bibliography management with BibTeX, because these extensions do not need corresponding implementations in the converter. Mathematical formulae and other characters or symbols that cannot be displayed as text are converted into graphics. Mathematics can also be converted into MathML or into a form suitable for processing with MathJax. TeX4ht can convert LaTeX documents into Microsoft Word's doc format by way of the OpenDocument text format (ODT).
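The conversion described above can be driven from a script. The following is a minimal sketch, not part of TeX4ht itself: it assumes the make4ht build tool (a front end that ships alongside TeX4ht in current TeX distributions, not named in the text above) is installed, and that its `-f` option accepts format names such as `html5` and `odt` and that a trailing comma-separated argument passes options such as `mathml` and `mathjax` to TeX4ht. The helper names `make4ht_command` and `convert` are hypothetical; check `make4ht --help` on your system before relying on the flags.

```python
import shutil
import subprocess


def make4ht_command(source, output_format="html5", options=()):
    """Build the argument list for one make4ht run.

    Assumed command shape: make4ht -f <format> <file.tex> ["opt1,opt2"]
    The format names and option names are assumptions; verify them
    against the make4ht documentation for your installation.
    """
    cmd = ["make4ht", "-f", output_format, str(source)]
    if options:
        # TeX4ht options are passed as one comma-separated argument.
        cmd.append(",".join(options))
    return cmd


def convert(source, output_format="html5", options=()):
    """Run the conversion; raises if make4ht is missing or the run fails."""
    if shutil.which("make4ht") is None:
        raise RuntimeError("make4ht was not found on PATH")
    subprocess.run(make4ht_command(source, output_format, options), check=True)


# Example calls (they only work where make4ht is installed):
#   convert("paper.tex")                                  # HTML output
#   convert("paper.tex", options=("mathml", "mathjax"))   # math as MathML for MathJax
#   convert("paper.tex", output_format="odt")             # OpenDocument text
```

Behind such a call, the tool performs the steps the paragraph describes: (La)TeX compiles the source to a DVI file, and TeX4ht then translates that DVI file into the requested output format. Producing a Word document would still require opening the resulting ODT file in a word processor and saving it in that format.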


External links


* Project Homepage
* Online Manual
* SimpleTeX4ht, a graphical front end for TeX4ht for Mac OS X




Literature

* Translating LaTeX to HTML using TeX4ht, in: Michel Goossens, Sebastian Rahtz, Eitan M. Gurari, Ross Moore, Robert S. Sutor. The LaTeX Web Companion. Integrating TeX, HTML, and XML. 1999. 8th printing January 2006. pp. 155–194.
* Eitan Gurari. HTML Production. TUGboat 25 (2004), No. 1, pp. 39–47.


See also

* LaTeX2HTML