HOME

TheInfoList



OR:

In
computing Computing is any goal-oriented activity requiring, benefiting from, or creating computer, computing machinery. It includes the study and experimentation of algorithmic processes, and the development of both computer hardware, hardware and softw ...
, formatted text, styled text, or rich text, as opposed to
plain text In computing, plain text is a loose term for data (e.g. file contents) that represent only characters of readable material but not its graphical representation nor other objects ( floating-point numbers, images, etc.). It may also include a lim ...
, is digital text which has styling information beyond the minimum of semantic elements: colours, styles ( boldface, italic),
sizes Size in general is the Magnitude (mathematics), magnitude or dimensions of a thing. More specifically, ''geometrical size'' (or ''spatial size'') can refer to three geometrical measures: length, area, or volume. Length can be generalized ...
, and special features in
HTML Hypertext Markup Language (HTML) is the standard markup language for documents designed to be displayed in a web browser. It defines the content and structure of web content. It is often assisted by technologies such as Cascading Style Sheets ( ...
(such as
hyperlink In computing, a hyperlink, or simply a link, is a digital reference providing direct access to Data (computing), data by a user (computing), user's point and click, clicking or touchscreen, tapping. A hyperlink points to a whole document or to ...
s).


Beginnings of formatted text

Formatted text has its genesis in the pre-computer use of underscoring to embolden passages in typewritten manuscripts. In the first interactive systems of early computer technology, underlining was not possible, and users made up for this lack (and the lack of formatting in ASCII) by using certain symbols as substitutes. Emphasis, for example, could be achieved in ASCII in a number of ways: * Capitalization: * Surrounding with underscores: * Surrounding with asterisks: * Spacing: Surrounding by underscores was also used for book titles:


Markup languages

Formatting can be marked by tags distinguished from the body text by special characters, such as angle brackets in
HTML Hypertext Markup Language (HTML) is the standard markup language for documents designed to be displayed in a web browser. It defines the content and structure of web content. It is often assisted by technologies such as Cascading Style Sheets ( ...
. For example, this text: :The dog is classified as ''Canis familiaris'' in taxonomy. is marked up in
HTML Hypertext Markup Language (HTML) is the standard markup language for documents designed to be displayed in a web browser. It defines the content and structure of web content. It is often assisted by technologies such as Cascading Style Sheets ( ...
thus:

The dog is classified as Canis familiaris in taxonomy.

The italicised text is enclosed by an opening and a closing italics tag. In
LaTeX Latex is an emulsion (stable dispersion) of polymer microparticles in water. Latices are found in nature, but synthetic latices are common as well. In nature, latex is found as a wikt:milky, milky fluid, which is present in 10% of all floweri ...
, the text would be marked up like this: The dog is classified as \textit in taxonomy. Most markup languages can be edited with any text editor, needing no special
software Software consists of computer programs that instruct the Execution (computing), execution of a computer. Software also includes design documents and specifications. The history of software is closely tied to the development of digital comput ...
. Many markup languages can also be edited with specialized software designed to automate some functions or present the output as WYSIWYG.


Formatted document files

Since the invention of MacWrite, the first WYSIWYG word processor, in which the typist codes the formatting visually rather than by inserting textual markup, word processors have tended to save to binary files. Opening such files with a text editor reveals them embedded with various binary characters, either around the formatted text (e.g. in WordPerfect) or separate from it, at the beginning or end of the file (e.g. in
Microsoft Word Microsoft Word is a word processor program, word processing program developed by Microsoft. It was first released on October 25, 1983, under the name Multi-Tool Word for Xenix systems. Subsequent versions were later written for several other platf ...
). Formatted text documents in binary files have, however, the disadvantages of formatting scope and secrecy. Whereas the extent of formatting is accurately marked in markup languages, WYSIWYG formatting is based on memory, that is, keeping for example your pressing of the boldface button until cancelled. This can lead to formatting mistakes and maintenance troubles. As for secrecy, formatted text document file formats tend to be proprietary and undocumented, leading to difficulty in coding compatibility by third parties, and also to unnecessary upgrades because of version changes. WordStar was a popular word processor that did not use binary files with hidden characters. OpenOffice.org Writer saves files in an
XML Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing data. It defines a set of rules for encoding electronic document, documents in a format that is both human-readable and Machine-r ...
format. However, the resultant file is a binary since it is compressed (a tarball equivalent). PDF is another formatted text file format that is usually binary (using compression for the text, and storing graphics and fonts in binary). It is generally an end-user format, written from an application such as
Microsoft Word Microsoft Word is a word processor program, word processing program developed by Microsoft. It was first released on October 25, 1983, under the name Multi-Tool Word for Xenix systems. Subsequent versions were later written for several other platf ...
or OpenOffice.org Writer, and not editable by the user once done.


See also

*
Character encoding Character encoding is the process of assigning numbers to graphical character (computing), characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical v ...
* Online rich-text editor *
Prepress Prepress is the term used in the printing and publishing industries for the processes and procedures that occur between the creation of a print layout and the final printing. The prepress process includes the preparation of artwork for press, media ...
*
Word processor A word processor (WP) is a device or computer program that provides for input, editing, formatting, and output of text, often with some additional features. Early word processors were stand-alone devices dedicated to the function, but current word ...


References

{{reflist , refs= {{cite journal , title=RFC1855: Netiquette Guidelines , first=Sally , last=Hambridge , date=October 1995 , website=IETF Datatracker, Internet Engineering Task Force , url=https://datatracker.ietf.org/doc/html/rfc1855 , access-date=2024-02-04 {{cite web , title=Structured Text , date=2015-07-26 , first=Ed , last=Mullen , website=edmullen.net , url=https://edmullen.net/mozilla/moz_stext.php , access-date=2024-02-04 Computer file formats Publishing