Enterprise information integration (EII) is the ability to support a unified view of data and information for an entire organization. In a data virtualization application of EII, a process of information integration uses data abstraction to provide a unified interface (known as uniform data access) for viewing all the data within an organization, and a single set of structures and naming conventions (known as uniform information representation) to represent this data. The goal of EII is to get a large set of heterogeneous data sources to appear to a user or system as a single, homogeneous data source.
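As a minimal sketch of the two ideas named above, the following Python code wraps a CSV text source and an SQLite table behind one interface (uniform data access) and renames each source's columns to one shared set of field names (uniform information representation). The `DataSource` class, its subclasses, the field names, and the sample data are all assumptions made for illustration and do not come from any particular EII product.

```python
import csv
import io
import sqlite3
from abc import ABC, abstractmethod


class DataSource(ABC):
    """Hypothetical uniform access interface: every source yields dict records."""

    # Maps source-specific column names to the shared field names.
    field_map = {}

    @abstractmethod
    def raw_rows(self):
        """Yield records using the source's own column names."""

    def rows(self):
        # Uniform information representation: rename columns to shared names.
        for raw in self.raw_rows():
            yield {self.field_map[k]: v for k, v in raw.items() if k in self.field_map}


class CsvSource(DataSource):
    field_map = {"cust_name": "name", "town": "city"}

    def __init__(self, text):
        self.text = text

    def raw_rows(self):
        yield from csv.DictReader(io.StringIO(self.text))


class SqliteSource(DataSource):
    field_map = {"full_name": "name", "city_name": "city"}

    def __init__(self, conn, table):
        self.conn = conn
        self.table = table  # illustrative only: the table name is trusted input

    def raw_rows(self):
        cur = self.conn.execute(f"SELECT * FROM {self.table}")
        cols = [d[0] for d in cur.description]
        for row in cur:
            yield dict(zip(cols, row))


def unified_view(sources):
    """Present several heterogeneous sources as one stream of records."""
    for source in sources:
        yield from source.rows()


# Invented sample data: one relational table and one CSV document.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE clients (full_name TEXT, city_name TEXT)")
conn.execute("INSERT INTO clients VALUES ('Grace', 'Boston')")

csv_text = "cust_name,town\nAda,London\n"
records = list(unified_view([CsvSource(csv_text), SqliteSource(conn, "clients")]))
print(records)
```

A consumer of `unified_view` sees only `name` and `city` fields and does not need to know which store a record came from.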
Overview
Data within an enterprise can be stored in heterogeneous formats, including relational databases (which themselves come in a large number of varieties), text files, XML files, spreadsheets and a variety of proprietary storage methods, each with its own indexing and data access methods.
Standardized data access APIs have emerged that offer a specific set of commands to retrieve and modify data from a generic data source. Many applications exist that implement these APIs' commands across various data sources, most notably relational databases. Such APIs include ODBC, JDBC, XQJ, OLE DB, and more recently ADO.NET.
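None of the APIs listed above is a Python API. As an analogous illustration only, Python's DB-API 2.0 (PEP 249) plays a similar role, and the standard-library `sqlite3` module implements it. The sketch below writes the retrieval logic once against the generic connection and cursor commands. The `orders` table, its contents, and the `fetch_names` helper are invented for the example, and the `?` placeholder style is what `sqlite3` uses; other DB-API drivers may use a different parameter style.

```python
import sqlite3


def fetch_names(conn, min_total):
    """Query through generic DB-API calls: cursor(), execute(), fetchall()."""
    cur = conn.cursor()
    cur.execute(
        "SELECT customer FROM orders WHERE total >= ? ORDER BY customer",
        (min_total,),
    )
    return [row[0] for row in cur.fetchall()]


# Set up an in-memory database with invented sample data.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE orders (customer TEXT, total REAL)")
cur.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("Ada", 120.0), ("Grace", 35.5), ("Linus", 80.0)],
)
conn.commit()

big_spenders = fetch_names(conn, 50)
print(big_spenders)
```

Because `fetch_names` touches only the connection and cursor objects, the same function body could in principle run against another driver that follows the same specification and placeholder style.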
There are also standard formats for representing data within a file that are very important to information integration. The best-known of these is XML, which has emerged as a standard universal representation format. There are also more specific XML "grammars" defined for specific types of data, such as Geography Markup Language for expressing geographical features and Directory Service Markup Language for holding directory-style information. In addition, non-XML standard formats exist, such as iCalendar for representing calendar information and vCard for business card information.
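To illustrate why standard file formats help integration, the following sketch reads one contact from an XML document and one from a vCard and maps both to the same record shape. The XML element names (`contact`, `name`, `email`) are an invented schema, and the vCard parser is a deliberate simplification that handles only plain `PROPERTY:value` lines, not folded lines or property parameters.

```python
import xml.etree.ElementTree as ET

# Invented XML schema for a contact record.
XML_TEXT = "<contact><name>Ada Lovelace</name><email>ada@example.org</email></contact>"

# A small vCard using the standard FN (formatted name) and EMAIL properties.
VCARD_TEXT = """BEGIN:VCARD
VERSION:3.0
FN:Grace Hopper
EMAIL:grace@example.org
END:VCARD"""


def contact_from_xml(text):
    root = ET.fromstring(text)
    return {"name": root.findtext("name"), "email": root.findtext("email")}


def contact_from_vcard(text):
    # Simplified: split each line at the first colon into property and value.
    props = dict(line.split(":", 1) for line in text.splitlines() if ":" in line)
    return {"name": props.get("FN"), "email": props.get("EMAIL")}


contacts = [contact_from_xml(XML_TEXT), contact_from_vcard(VCARD_TEXT)]
print(contacts)
```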
Enterprise information integration (EII) applies data integration commercially. Despite the theoretical problems of data integration, the private sector is more concerned with data integration as a viable product. EII emphasizes neither correctness nor tractability, but speed and simplicity.
; Combining disparate data sets : Each data source is disparate and as such is not designed to support EII. Therefore, data virtualization as well as data federation depends upon accidental data commonality to support combining data and information from disparate data sets. Because of this lack of data value commonality across data sources, the return set may be inaccurate, incomplete, and impossible to validate.
: One solution is to recast disparate databases to integrate these databases without the need for ETL. The recast databases support commonality constraints where referential integrity may be enforced between databases. The recast databases provide designed data access paths with data value commonality across databases.
; Simplicity of deployment : Even if recognized as a solution to a problem, EII currently takes time to apply and offers complexities in deployment. Proposed schema-less solutions include "Lean Middleware".
; Handling higher-order information : Analysts experience difficulty, even with a functioning information integration system, in determining whether the sources in the database will satisfy a given application. Answering these kinds of questions about a set of repositories requires semantic information like metadata and/or ontologies.
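The data commonality problem described under "Combining disparate data sets" can be made concrete with a small sketch. Two invented sources identify the same customers with differently formatted keys, so a join on the raw values returns nothing. Normalizing both keys to a common form at query time is used here only as a simple stand-in for the idea of designed data value commonality; it is not the database recasting approach itself, and all names and data are hypothetical.

```python
# Two invented sources that describe the same customers with different keys.
crm = [
    {"customer_id": "C-001", "name": "Ada"},
    {"customer_id": "C-002", "name": "Grace"},
]
billing = [
    {"cust": "1", "balance": 40.0},
    {"cust": "2", "balance": 0.0},
    {"cust": "3", "balance": 12.5},
]


def join(left, left_key, right, right_key):
    """Inner join of two lists of dicts using the given key functions."""
    index = {left_key(row): row for row in left}
    return [
        {**index[right_key(row)], **row}
        for row in right
        if right_key(row) in index
    ]


def crm_key(row):
    # "C-001" -> 1
    return int(row["customer_id"].split("-")[1])


def billing_key(row):
    # "1" -> 1
    return int(row["cust"])


# Naive join on raw values: the key formats differ, so nothing matches.
naive = join(crm, lambda r: r["customer_id"], billing, lambda r: r["cust"])

# Join after mapping both keys to a common integer form.
recast = join(crm, crm_key, billing, billing_key)

# Billing rows with no CRM counterpart cannot be validated from this data alone.
known = {crm_key(r) for r in crm}
unmatched = [r["cust"] for r in billing if billing_key(r) not in known]

print(naive, recast, unmatched)
```

The empty naive result and the leftover billing row show how a return set can be incomplete and hard to validate when commonality between sources is accidental.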
Applications
EII products enable loose coupling between homogeneous-data consuming client applications and services and heterogeneous-data stores. Such client applications and services include desktop productivity tools (spreadsheets, word processors, presentation software, etc.), development environments and frameworks (Java EE, .NET, Mono, SOAP or RESTful web services, etc.), business intelligence (BI), business activity monitoring (BAM) software, enterprise resource planning (ERP), customer relationship management (CRM), business process management (BPM and/or BPEL) software, and web content management (CMS).
Data access technologies
* Service Data Objects (SDO) for Java, C++ and .NET clients and any type of data source
* XQuery and XQuery API for Java