Revisable-Form Text
   HOME

TheInfoList



OR:

Document Content Architecture, or DCA for short, is a standard developed by IBM for text documents in the early 1980s. DCA was used on
mainframe A mainframe computer, informally called a mainframe or big iron, is a computer used primarily by large organizations for critical applications like bulk data processing for tasks such as censuses, industry and consumer statistics, enterprise ...
and
IBM i IBM i (the ''i'' standing for ''integrated'') is an operating system developed by IBM for IBM Power Systems. It was originally released in 1988 as OS/400, as the sole operating system of the IBM AS/400 line of systems. It was renamed to i5/OS in ...
systems, and formed the basis of
DisplayWrite DisplayWrite (sometimes written as Displaywrite) was a word processor software application that IBM developed and marketed for the IBM PC and PCjr. It was among the company's first internally developed, commercially sold PC software titles. Disp ...
's file format. DCA was later extended as MO:DCA (Mixed Object Document Content Architecture), which added embedded
data files A data file is a computer file which stores data to be used by a computer application or system, including input and output data. A data file usually does not contain instructions or code to be executed (that is, a computer program). Most of th ...
. The original purpose of DCA was to provide a common document format that could be used across multiple IBM word processing platforms, such as the
IBM PC The IBM Personal Computer (model 5150, commonly known as the IBM PC) is the first microcomputer released in the IBM PC model line and the basis for the IBM PC compatible de facto standard. Released on August 12, 1981, it was created by a team ...
,
IBM mainframes IBM mainframes are large computer systems produced by IBM since 1952. During the 1960s and 1970s, IBM dominated the large computer market. Current mainframe computers in IBM's line of business computers are developments of the basic design of th ...
, the Displaywriter System, and the IBM 5520 Administrative System. DCA defines two types of documents: * ''Revisable-Form Text'' (DCA/RFT) which is editable. * ''Final-Form Text'' (DCA/FFT) which is "formatted for a particular output device and cannot be changed."


Description

DCA defines a data stream representing a
document A document is a written, drawn, presented, or memorialized representation of thought, often the manifestation of non-fictional, as well as fictional, content. The word originates from the Latin ''Documentum'', which denotes a "teaching" or ...
. MO:DCA is the wrapper or container for various ''objects'' that can make up the document. Each object is defined by its own subordinate architecture. The architectures are: * Presentation Text Object Content Architecture (PTOCA) describes formatted text, including text attributes such as font or color. * Image Object Content Architecture (IOCA) describes resolution-independent images. * Graphics Object Content Architecture (GOCA) describes
vector graphic Vector graphics is a form of computer graphics in which visual images are created directly from geometric shapes defined on a Cartesian plane, such as points, lines, curves and polygons. The associated mechanisms may include vector display a ...
images. A variation of GOCA, AFP GOCA, is used in
Advanced Function Presentation Advanced Function Presentation (AFP) is a presentation architecture and family of associated printer software and hardware that provides for document and information presentation independent of specific applications and devices. Using AFP, users c ...
environments. * Bar Code Object Content Architecture (BCOCA) describes
bar code A barcode or bar code is a method of representing data in a visual, Machine-readable data, machine-readable form. Initially, barcodes represented data by varying the widths, spacings and sizes of parallel lines. These barcodes, now commonly refe ...
s in a number of different formats. * Font Object Content Architecture (FOCA) describes fonts to be used in the document * Color Management Object Content Architecture (CMOCA) describes required color management information. Each architecture uses a series of binary structured fields to describe its corresponding object.


Revisable-Form Text

Revisable-Form Text (abbreviated RFT or RFT-DCA) is part of DCA. It is sometimes referred to as ''Revisable Format Text''. It was used by
IBM DisplayWrite DisplayWrite (sometimes written as Displaywrite) was a word processor software application that IBM developed and marketed for the IBM PC and PCjr. It was among the company's first internally developed, commercially sold PC software titles. Dis ...
4 and 5
word processor A word processor (WP) is a device or computer program that provides for input, editing, formatting, and output of text, often with some additional features. Word processor (electronic device), Early word processors were stand-alone devices ded ...
s on
System/360 The IBM System/360 (S/360) is a family of mainframe computer systems that was announced by IBM on April 7, 1964, and delivered between 1965 and 1978. It was the first family of computers designed to cover both commercial and scientific applica ...
and 370
mainframe computers A mainframe computer, informally called a mainframe or big iron, is a computer used primarily by large organizations for critical applications like bulk data processing for tasks such as censuses, industry and consumer statistics, enterpris ...
, and OfficeVision/400 to allow transfer of formatted documents to other systems. RFT has a counterpart Final-Form Text (abbreviated FFT or FFT-DCA), which was not intended to be editable and was output-only.


History

The drive to initiate international standards for the DCAs was initiated in 1980 at the IBM Rochester facility. A team, consisting of two MODCA architects, an RTOCA architect, and a PTOCA architect, was assembled. These architects were responsible for forming IBM consensus for the design of the data streams and to take the work into the international standards arena. There was a concerted effort to bring the international community into the development. This decision was based in part on the experience gained over the acceptance of GML into an international SGML standard. To avoid the long delay of creating the architecture, they wanted to get everyone involved early.
SGML The Standard Generalized Markup Language (SGML; ISO 8879:1986) is a standard for defining generalized markup languages for documents. ISO 8879 Annex A.1 states that generalized markup is "based on two postulates": * Declarative: Markup should des ...
standardization had taken many years to develop. IBM's work with document content had been pushed by the needs of main frame computers where GML and DCA were in use, but that experience was pointing to a need for standardized component architectures for revisable and non-revisable text in particular. In 1981, shortly after its inception, the group was moved along with the IBM 5280 Distributed Data System to IBM Austin near Round Rock, TX, where the work continued with mixed success. As the architectures were becoming more firmly positioned on the international stage, the team was moved again in 1987 to The IBM Dallas Programming Center, where in 1998 it was disbanded and the work on the DCA architectures discontinued due mainly to the PC community which had gone in a different direction of necessity. The DCA architectures were fully completed, but not totally agreed upon after 18 years. There were no active implementations in sight. The PC world had decided on
HTML The HyperText Markup Language or HTML is the standard markup language for documents designed to be displayed in a web browser. It can be assisted by technologies such as Cascading Style Sheets (CSS) and scripting languages such as JavaScri ...
(believed to be an application of the
SGML The Standard Generalized Markup Language (SGML; ISO 8879:1986) is a standard for defining generalized markup languages for documents. ISO 8879 Annex A.1 states that generalized markup is "based on two postulates": * Declarative: Markup should des ...
international standard) and used portions of it for their purposes. Microsoft Word eventually used the similar datastream for the internal working datastream for storage of editable content. Even though the SGML standard was available, it was impractical to use the full SGML parser implementation, so a potential subset of it became the de facto standard for revisable text used today in the PC arena. At about the same time, Adobe Systems designed and produced the printable document encoding
PDF Portable Document Format (PDF), standardized as ISO 32000, is a file format developed by Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, hardware, and operating systems. ...
, which has become the standard for PC-produced printable documents. The international standard was set in 2008, with input from the users, who decided to use the products offered in great numbers. The decision was driven by the need for the product, and the solution found was far more acceptable than the standards committees could design. Over 10 years of work had not produced an acceptable method, and the PC computing community created what they needed in less time. Attempting to achieve a consensus document data stream was quickly out-flanked by the available and usable content provided by the companies who did not attempt to share with others, but created a workable solution and successfully sold it to users. The output of the word processing software is 'printed' into the PDF format provided by the most used presentation product. For example, Microsoft Word provides a printer selection 'Microsoft Print to PDF' in order to produce the requisite output for a PDF document. A similar method could have been used to produce the international standard had one eventually arrived. When IBM disbanded its Dallas Programming Center in 1998, the entire staff of architects retired and left the company, except the manager, who was moved, ending the DCA architecture project for the foreseeable future at IBM.


See also

*
Advanced Function Presentation Advanced Function Presentation (AFP) is a presentation architecture and family of associated printer software and hardware that provides for document and information presentation independent of specific applications and devices. Using AFP, users c ...
*
Rich Text Format ) As an example, the following RTF code would be rendered as follows: This is some bold text. Character encoding A standard RTF file can only consist of 7-bit ASCII characters, but can use escape sequences to encode other characters. Th ...
(RTF) – another formatting code system that is sometimes confused with Revisable-Form Text. *
List of document markup languages The following is a list of document markup languages. You may also find the List of markup languages of interest. Well-known document markup languages * HyperText Markup Language (HTML) – the original markup language that was defined as a part o ...


References


External links


AFP Consortium homepage
* * United States patent number 5,513,323 {{Document markup languages IBM software Advanced Function Presentation