HTML Tidy is a console application for correcting invalid HyperText Markup Language (HTML), detecting potential web accessibility errors, and improving the layout and indent style of the resulting markup. It is also a cross-platform library for computer applications that provides HTML Tidy's features.
History
HTML Tidy was developed by Dave Raggett of the World Wide Web Consortium (W3C). It was released as a SourceForge project in 2003 and was managed by various maintainers.
In 2012, the project was moved to GitHub and maintained by Michael Smith, also of the W3C, where HTML5 support was added.
In 2015, the HTML Tidy Advocacy Community Group (HTACG) was formed as a W3C Community Group to manage and develop HTML Tidy.
HTML Tidy source code is written in ANSI C for portability. It is available under the W3C Software Notice and License, a permissive BSD-style license. Up-to-date versions are available as source code cloned from its Git repository on GitHub, or as binary packages for multiple operating systems from its GitHub Releases page.
Features
Examples of corrections to invalid or poorly constructed HTML:
* Reorder mixed-up tags
* Complete missing or mismatched end tags
* Add missing syntax elements (some tags, quotes, etc.)
* Report proprietary HTML extensions
* Change layout of markup to a predefined style
* Transform characters from some encodings into HTML entities
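
A few of the corrections listed above (completing missing end tags, adding missing attribute quotes, and transforming characters into HTML entities) can be illustrated with a small sketch. This is not HTML Tidy's code or algorithm: it is a toy built on Python's standard `html.parser` module, and the names `EndTagCompleter`, `to_entities`, and `VOID_ELEMENTS` are hypothetical, chosen only for this example. It also ignores HTML's implied end tag rules (for instance, a new `<li>` closing the previous one), which a real repair tool has to handle.

```python
from html import escape
from html.entities import codepoint2name
from html.parser import HTMLParser

# Assumption for this sketch: a small subset of elements that take no end tag.
VOID_ELEMENTS = {"br", "hr", "img", "input", "link", "meta"}


def to_entities(text):
    """Replace non-ASCII characters with named or numeric HTML entities."""
    out = []
    for ch in text:
        code = ord(ch)
        if code < 128:
            out.append(ch)                          # plain ASCII passes through
        elif code in codepoint2name:
            out.append(f"&{codepoint2name[code]};")  # named entity, e.g. &eacute;
        else:
            out.append(f"&#{code};")                # numeric fallback
    return "".join(out)


class EndTagCompleter(HTMLParser):
    """Toy repairer (hypothetical): closes elements that were left open."""

    def __init__(self):
        # Keep entity references as written instead of decoding them.
        super().__init__(convert_charrefs=False)
        self.open_tags = []  # stack of elements that are still open
        self.out = []        # pieces of the repaired markup

    def handle_starttag(self, tag, attrs):
        # Re-emit attributes with double quotes, adding quotes if missing.
        rendered = "".join(
            f" {name}" if value is None else f' {name}="{escape(value)}"'
            for name, value in attrs
        )
        self.out.append(f"<{tag}{rendered}>")
        if tag not in VOID_ELEMENTS:
            self.open_tags.append(tag)

    def handle_endtag(self, tag):
        if tag not in self.open_tags:
            return  # stray end tag with no matching start tag: drop it
        # Close every element opened after the matching start tag.
        while self.open_tags:
            top = self.open_tags.pop()
            self.out.append(f"</{top}>")
            if top == tag:
                break

    def handle_data(self, data):
        self.out.append(to_entities(data))

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def repair(self, markup):
        self.feed(markup)
        self.close()
        # Anything still open at end of input gets its end tag.
        while self.open_tags:
            self.out.append(f"</{self.open_tags.pop()}>")
        return "".join(self.out)


if __name__ == "__main__":
    print(EndTagCompleter().repair("<p>caf\u00e9 <b>bold<i>both</p>"))
    print(EndTagCompleter().repair("<a href=x.html>link"))
```

The stack-based design mirrors the general idea behind repairing mismatched tags: the repairer tracks which elements are open and emits the end tags needed to restore proper nesting. A fresh `EndTagCompleter` should be created for each document, since the instance accumulates output.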
See also
* Comparison of HTML parsers
External links
* HTML Tidy Project Page
* Current Source Code
* Official Binary Releases
* HTML Tidy Project Page (legacy)
* HTML Tidy service: an online version of HTML Tidy at the W3C. The service was decommissioned on 29 November 2017.