Data processing is the
collection and manipulation of digital data to produce meaningful information.
Data processing is a form of ''
information processing'', which is the modification (processing) of information in any manner detectable by an observer.
[Data processing is distinct from '']word processing
A word is a basic element of language that carries an objective or practical meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consen ...
'', which is manipulation of text specifically rather than data generally.
The term "Data Processing", or "DP" has also been used to refer to a department within an organization responsible for the operation of data processing programs.
Data processing functions
Data processing may involve various processes, including:
*
Validation – Ensuring that supplied data is correct and relevant.
*
Sorting – "arranging items in some sequence and/or in different sets."
*
Summarization(statistical) or
(automatic) – reducing detailed data to its main points.
*
Aggregation
Aggregation may refer to:
Business and economics
* Aggregation problem (economics)
* Purchasing aggregation, the joining of multiple purchasers in a group purchasing organization to increase their buying power
* Community Choice Aggregation, the ...
– combining multiple pieces of data.
*
Analysis – the "collection,
organization
An organization or organisation (Commonwealth English; see spelling differences), is an entity—such as a company, an institution, or an association—comprising one or more people and having a particular purpose.
The word is derived fro ...
, analysis, interpretation and presentation of data."
*
Reporting – list detail or summary data or computed information.
*
Classification – separation of data into various categories.
History
The
United States Census Bureau
The United States Census Bureau (USCB), officially the Bureau of the Census, is a principal agency of the U.S. Federal Statistical System, responsible for producing data about the American people and economy
An economy is an area of th ...
history illustrates the evolution of data processing from manual through electronic procedures.
Manual data processing
Although widespread use of the term ''data processing'' dates only from the 1950's,
data processing functions have been performed manually for millennia. For example,
bookkeeping
Bookkeeping is the recording of financial transactions, and is part of the process of accounting in business and other organizations. It involves preparing source documents for all transactions, operations, and other events of a business. ...
involves functions such as posting transactions and producing reports like the
balance sheet and the
cash flow statement. Completely manual methods were augmented by the application of
mechanical or electronic
calculators. A person whose job was to perform calculations manually or using a calculator was called a "
computer."
The
1890 United States Census schedule was the first to gather data by individual rather than
household. A number of questions could be answered by making a check in the appropriate box on the form. From 1850 to 1880 the Census Bureau employed "a system of tallying, which, by reason of the increasing number of combinations of classifications required, became increasingly complex. Only a limited number of combinations could be recorded in one tally, so it was necessary to handle the schedules 5 or 6 times, for as many independent tallies."
"It took over 7 years to publish the results of the 1880 census"
using manual processing methods.
Automatic data processing
The term ''
automatic data processing'' was applied to operations performed by means of
unit record equipment, such as
Herman Hollerith's application of
punched card
A punched card (also punch card or punched-card) is a piece of stiff paper that holds digital data represented by the presence or absence of holes in predefined positions. Punched cards were once common in data processing applications or to di ...
equipment for the
1890 United States Census. "Using Hollerith's punchcard equipment, the Census Office was able to complete tabulating most of the 1890 census data in 2 to 3 years, compared with 7 to 8 years for the 1880 census. It is estimated that using Hollerith's system saved some $5 million in processing costs"
in 1890 dollars even though there were twice as many questions as in 1880.
Electronic data processing
Computerized data processing, or
Electronic data processing represents a later development, with a computer used instead of several independent pieces of equipment. The Census Bureau first made limited use of
electronic computers for the
1950 United States Census, using a
UNIVAC I system,
delivered in 1952.
Other developments
The term ''data processing'' has mostly been subsumed by the more general term ''
information technology
Information technology (IT) is the use of computers to create, process, store, retrieve, and exchange all kinds of data . and information. IT forms part of information and communications technology (ICT). An information technology system ...
'' (IT).
The older term "data processing" is suggestive of older technologies. For example, in 1996 the ''
Data Processing Management Association'' (DPMA) changed its name to the ''Association of Information Technology Professionals." Nevertheless, the terms are approximately synonymous.
Applications
Commercial data processing
Commercial data processing involves a large volume of input data, relatively few computational operations, and a large volume of output. For example, an insurance company needs to keep records on tens or hundreds of thousands of policies, print and mail bills, and receive and post payments.
Data analysis
In science and engineering, the terms ''data processing'' and ''
information systems'' are considered too broad, and the term ''data processing'' is typically used for the initial stage followed by a
data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making. Data analysis has multiple facets and approaches, en ...
in the second stage of the overall data handling.
Data analysis uses specialized
algorithm
In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific problems or to perform a computation. Algorithms are used as specifications for performing ...
s and
statistical calculations that are less often observed in a typical general business environment. For data analysis, software suites like
SPSS or
SAS
SAS or Sas may refer to:
Arts, entertainment, and media
* ''SAS'' (novel series), a French book series by Gérard de Villiers
* ''Shimmer and Shine'', an American animated children's television series
* Southern All Stars, a Japanese rock ba ...
, or their free counterparts such as
DAP,
gretl or
PSPP are often used.
See also
*
Data processing system
*
Big data
*
Computation
*
Decision-making software
*
Information and communications technology
Information and communications technology (ICT) is an extensional term for information technology (IT) that stresses the role of unified communications and the integration of telecommunications ( telephone lines and wireless signals) and comput ...
*
Information technology
Information technology (IT) is the use of computers to create, process, store, retrieve, and exchange all kinds of data . and information. IT forms part of information and communications technology (ICT). An information technology system ...
*
Computer science
Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to practical disciplines (includin ...
Notes
External links
References
Further reading
*Bourque, Linda B.; Clark, Virginia A. (1992) ''Processing Data: The Survey Example''. (Quantitative Applications in the Social Sciences, no. 07-085). Sage Publications.
*Levy, Joseph (1967)
Punched Card Data Processing'. McGraw-Hill Book Company.
{{Authority control
Computer data