Data collection system (DCS) is a
computer application
Application software is any computer program that is intended for end-user use not operating, administering or programming the computer. An application (app, application program, software application) is any program that can be categorized as ...
that facilitates the process of
data collection
Data collection or data gathering is the process of gathering and measuring information on targeted variables in an established system, which then enables one to answer relevant questions and evaluate outcomes. Data collection is a research com ...
, allowing specific, structured information to be gathered in a systematic fashion, subsequently enabling
data analysis
Data analysis is the process of inspecting, Data cleansing, cleansing, Data transformation, transforming, and Data modeling, modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making. Da ...
to be performed on the information. Typically a DCS displays a form that accepts data input from a user and then validates that input prior to committing the data to persistent storage such as a database.
Many computer systems implement data entry forms, but data collection systems tend to be more complex, with possibly many related forms containing detailed user input fields, data validations, and navigation links among the forms.
DCSs can be considered a specialized form of
content management system
A content management system (CMS) is computer software used to manage the creation and modification of digital content ( content management).''Managing Enterprise Content: A Unified Content Strategy''. Ann Rockley, Pamela Kostur, Steve Manning. New ...
(CMS), particularly when they allow the information being gathered to be published, edited, modified, deleted, and maintained. Some general-purpose CMSs include features of DCSs.
Importance
Accurate data collection is essential to many
business process
A business process, business method, or business function is a collection of related, structured activities or tasks performed by people or equipment in which a specific sequence produces a service or product (that serves a particular business g ...
es, to the enforcement of many government
regulation
Regulation is the management of complex systems according to a set of rules and trends. In systems theory, these types of rules exist in various fields of biology and society, but the term has slightly different meanings according to context. Fo ...
s, and to maintaining the integrity of scientific research.
Data collection systems are an end-product of
software development
Software development is the process of designing and Implementation, implementing a software solution to Computer user satisfaction, satisfy a User (computing), user. The process is more encompassing than Computer programming, programming, wri ...
. Identifying and categorizing software or a software sub-system as having aspects of, or as actually being a "Data collection system" is very important. This categorization allows encyclopedic knowledge to be gathered and applied in the design and implementation of future systems. In
software design
Software design is the process of conceptualizing how a software system will work before it is implemented or modified.
Software design also refers to the direct result of the design process the concepts of how the software will work which co ...
, it is very important to identify generalizations and
patterns
A pattern is a regularity in the world, in human-made design, or in abstract ideas. As such, the elements of a pattern repeat in a predictable manner. A geometric pattern is a kind of pattern formed of geometric shapes and typically repeated li ...
and to
re-use existing knowledge whenever possible.
Types
Generally the computer software used for
data collection
Data collection or data gathering is the process of gathering and measuring information on targeted variables in an established system, which then enables one to answer relevant questions and evaluate outcomes. Data collection is a research com ...
falls into one of the following categories of practical application.
*
Surveys or questionnaires
* Data registries
*
Case management systems
*
Performance measurement systems
* Exams and
quiz
A quiz is a form of mind sport in which people attempt to answer questions correctly on one or several topics. Quizzes can be used as a brief Educational assessment, assessment in education and similar fields to measure growth in knowledge, abil ...
zes
* Online forms and form filing and reporting systems
Vocabulary
There is a
taxonomic scheme associated with data collection systems, with readily-identifiable synonyms used by different industries and organizations. Cataloging the most commonly used and widely accepted vocabulary improves efficiencies, helps reduce variations, and improves data quality.
The vocabulary of data collection systems stems from the fact that these systems are often a software representation of what would otherwise be a paper data collection
form
Form is the shape, visual appearance, or configuration of an object. In a wider sense, the form is the way something happens.
Form may also refer to:
*Form (document), a document (printed or electronic) with spaces in which to write or enter dat ...
with a complex internal structure of sections and sub-sections. Modeling these structures and relationships in software yields technical terms describing the
hierarchy
A hierarchy (from Ancient Greek, Greek: , from , 'president of sacred rites') is an arrangement of items (objects, names, values, categories, etc.) that are represented as being "above", "below", or "at the same level as" one another. Hierarchy ...
of
data containers, along with a set of industry-specific synonyms.
Collection synonyms
A
collection (used as a noun) is the topmost container for grouping related documents,
data model
A data model is an abstract model that organizes elements of data and standardizes how they relate to one another and to the properties of real-world entities. For instance, a data model may specify that the data element representing a car be ...
s, and
dataset
A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record o ...
s. Typical vocabulary at this level includes the terms:
Data model synonyms
Each document or
dataset
A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record o ...
within a
collection is modeled in software. Constructing these models is part of designing or "authoring" the expected data to be collected. The terminology for these
data model
A data model is an abstract model that organizes elements of data and standardizes how they relate to one another and to the properties of real-world entities. For instance, a data model may specify that the data element representing a car be ...
s includes:
Sub-collection or master-detail synonyms
Data model
A data model is an abstract model that organizes elements of data and standardizes how they relate to one another and to the properties of real-world entities. For instance, a data model may specify that the data element representing a car be ...
s are often
hierarchical
A hierarchy (from Greek: , from , 'president of sacred rites') is an arrangement of items (objects, names, values, categories, etc.) that are represented as being "above", "below", or "at the same level as" one another. Hierarchy is an importan ...
, containing sub-collections or
master–detail structures described with terms such as:
Data element synonyms
At the lowest level of the
data model
A data model is an abstract model that organizes elements of data and standardizes how they relate to one another and to the properties of real-world entities. For instance, a data model may specify that the data element representing a car be ...
are the
data element
In metadata, the term data element is an atomic unit of data that has precise meaning or precise semantics. A data element has:
# An identification such as a data element name
# A clear data element definition
# One or more representation term ...
s that describe individual pieces of data. Synonyms include:
Data point synonyms
Moving from the abstract,
domain model
In software engineering, a domain model is a conceptual model of the domain (software engineering), domain that incorporates both behavior and data.Fowler, Martin. "P of EAA - Domain Model"/ref> In ontology engineering, a domain model is a Knowl ...
ling facet to that of the concrete, actual data: the lowest level here is the
data point
In statistics, a unit of observation is the unit described by the data that one analyzes. A study may treat groups as a unit of observation with a country as the unit of analysis, drawing conclusions on group characteristics from data collected a ...
within a
dataset
A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record o ...
. Synonyms for ''data point'' include:
Dataset synonyms
Finally, the synonyms for
dataset
A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record o ...
include:
See also
*
Data management
Data management comprises all disciplines related to handling data as a valuable resource, it is the practice of managing an organization's data so it can be analyzed for decision making.
Concept
The concept of data management emerged alongsi ...
*
Survey data collection
*
Case report form
A case report form (or CRF) is a paper or electronic questionnaire specifically used in clinical trial research. The case report form is the tool used by the sponsor of the clinical trial to collect data from each participating patient. All data on ...
*
Safety data sheet
A safety data sheet (SDS), material safety data sheet (MSDS), or product safety data sheet (PSDS) is a document that lists information relating to occupational safety and health for the use of various substances and products. SDSs are a wide ...
*
Data mining
Data mining is the process of extracting and finding patterns in massive data sets involving methods at the intersection of machine learning, statistics, and database systems. Data mining is an interdisciplinary subfield of computer science and ...
*
Web mining
*
Crowdsourcing
Crowdsourcing involves a large group of dispersed participants contributing or producing goods or services—including ideas, votes, micro-tasks, and finances—for payment or as volunteers. Contemporary crowdsourcing often involves digit ...
*
Collaborative software
Collaborative software or groupware is application software designed to help people working on a common task to attain their goals. One of the earliest definitions of groupware is "intentional group processes plus software to support them."
Regar ...
References
External links
{{Statistics, collection
Data collection
Data collection or data gathering is the process of gathering and measuring information on targeted variables in an established system, which then enables one to answer relevant questions and evaluate outcomes. Data collection is a research com ...
Survey methodology