Document AI, also known as Document Intelligence, refers to a field of technology that employs machine learning (ML) techniques, such as natural language processing (NLP). These techniques are used to develop computer models capable of analyzing documents in a manner akin to human review.
Through NLP, computer systems are able to understand relationships and contextual nuances in document contents, which facilitates the extraction of information and insights. Additionally, this technology enables the categorization and organization of the documents themselves.
The applications of Document AI extend to processing and parsing a variety of semi-structured documents, such as forms, tables, receipts, invoices, tax forms, contracts, loan agreements, and financial reports.
Key Features
Machine learning is utilized in Document AI to extract information from both digital and printed documents. This technology recognizes text, characters, and images in various languages, aiding in the extraction of insights from unstructured documents. The use of this technology can improve the speed and quality of decision-making in document analysis. Additionally, the automation of data extraction and validation can contribute to increased efficiency in document analysis processes.
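The extraction and validation steps described above can be illustrated with a minimal sketch. It assumes that the document text has already been recognized, and it uses regular expressions in place of a learned extraction model purely for brevity. The invoice text, the field names, and the functions `extract_fields` and `validate` are hypothetical.

```python
import re
from decimal import Decimal

# Made-up text, as a text-recognition step might return it for a simple invoice.
invoice_text = """Invoice No: INV-2024-0017
Subtotal: 200.00
Tax: 40.00
Total: 240.00"""


def extract_fields(text):
    """Extract the invoice number and the monetary amounts from the text."""
    fields = {}
    number = re.search(r"Invoice No:\s*(\S+)", text)
    if number:
        fields["invoice_number"] = number.group(1)
    for key in ("Subtotal", "Tax", "Total"):
        # Anchor at line start so that "Total" does not match inside "Subtotal".
        match = re.search(rf"^{key}:\s*(\d+\.\d{{2}})$", text, flags=re.MULTILINE)
        if match:
            fields[key.lower()] = Decimal(match.group(1))
    return fields


def validate(fields):
    """Check that all fields are present and that the amounts add up."""
    required = ("invoice_number", "subtotal", "tax", "total")
    if any(name not in fields for name in required):
        return False
    return fields["subtotal"] + fields["tax"] == fields["total"]


fields = extract_fields(invoice_text)
is_valid = validate(fields)
print(fields["invoice_number"], is_valid)
```

In a Document AI system the extraction step would typically be performed by a trained model rather than by fixed patterns, while a consistency check of this kind can still be applied to the extracted values.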
Example

A business letter contains information in the form of text, as well as other types of information, such as the position of the text. For instance, a typical letter contains two addresses before the body of the text. The address at the very top (sometimes aligned to the right) is the ''sender'' address. This is normally followed by the date of the letter, with the place of writing. After this, the ''receiver'' address is listed.
The distinction between the sender address and the receiver address is conveyed solely by the position of the address on the page, i.e. there is no textual indication such as "Sender:" in front of the addresses.
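The role of position in this example can be sketched in a few lines of code. The sketch assumes that an earlier step has already located the two address blocks and recorded their page coordinates. The `TextBlock` structure, the function `label_addresses`, and the sample addresses are hypothetical, and a fixed top-to-bottom rule stands in for what a trained model would learn from data.

```python
from dataclasses import dataclass


@dataclass
class TextBlock:
    """A block of text plus its position on the page (hypothetical structure)."""
    text: str
    top: float   # distance from the top edge of the page, in points
    left: float  # distance from the left edge of the page, in points


def label_addresses(blocks):
    """Label address blocks by vertical order alone: the topmost is the sender."""
    ordered = sorted(blocks, key=lambda block: block.top)
    labels = {}
    if len(ordered) >= 1:
        labels["sender"] = ordered[0].text
    if len(ordered) >= 2:
        labels["receiver"] = ordered[1].text
    return labels


# Two address blocks with made-up contents and coordinates.
blocks = [
    TextBlock("Jane Roe, 5 Oak Avenue, Leeds", top=210.0, left=72.0),
    TextBlock("John Doe, 12 Elm Street, York", top=60.0, left=380.0),
]
labels = label_addresses(blocks)
print(labels["sender"])    # John Doe, 12 Elm Street, York
print(labels["receiver"])  # Jane Roe, 5 Oak Avenue, Leeds
```

Neither address text contains any hint of its role; the labels follow from the `top` coordinate alone, which is the kind of non-textual signal the example describes.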
Data dimensions & ML architecture
Data is often divided into spatial data and time-series data. The former includes images, maps, and graphs; the latter includes, for example, stock prices or voice recordings. Document AI combines text data, which has a sequential (time-like) dimension, with other types of data, such as the position of an address in a business letter, which is spatial.
Historically, machine learning analyzed spatial data using a convolutional neural network and temporal data using a recurrent neural network. With the advent of the transformer architecture, which is agnostic to dimension type, these two types of dimension can be combined more easily. Document AI is an example of this.
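One way in which a transformer input can carry both kinds of dimension is to add, for every token, an embedding of its place in the reading order and embeddings of its coordinates on the page. The following is a minimal sketch of that idea under stated assumptions: the table sizes, the quantization of page coordinates to a 50 × 50 grid, and the function `embed` are hypothetical, and the lookup tables are random rather than learned.

```python
import random

random.seed(0)

DIM = 8  # embedding width (hypothetical)


def make_table(rows, dim=DIM):
    """Create a random lookup table; a real model would learn these values."""
    return [[random.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(rows)]


token_table = make_table(100)  # one row per vocabulary entry
seq_table = make_table(32)     # one row per position in the reading order
x_table = make_table(50)       # one row per quantized horizontal coordinate
y_table = make_table(50)       # one row per quantized vertical coordinate


def embed(token_ids, xs, ys):
    """Sum token, reading-order, and page-position embeddings for each token."""
    vectors = []
    for position, (token, x, y) in enumerate(zip(token_ids, xs, ys)):
        parts = (token_table[token], seq_table[position], x_table[x], y_table[y])
        vectors.append([sum(values) for values in zip(*parts)])
    return vectors


# Three tokens with made-up ids and quantized (x, y) page coordinates.
inputs = embed(token_ids=[7, 42, 3], xs=[40, 42, 5], ys=[2, 2, 10])
print(len(inputs), len(inputs[0]))  # 3 8
```

Because the sequential and the spatial information end up in the same vector, the subsequent attention layers can treat both uniformly, without a separate convolutional or recurrent component for each type of dimension.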
Common Uses
* Enhancing the reliability of business information by reducing manual data entry errors
* Utilizing AI to identify anomalies in new invoices from established customers
* Accelerating the mortgage workflow process
* Automating the monitoring of loan portfolios for credit risk management
* Enabling employee focus on higher-value tasks
* Detecting counterfeit currency and fraudulent checks
* Extracting and analyzing data previously inaccessible in document silos for informed business decisions
* Streamlining the processing of receipts on a global scale
* Assisting firms in automating the assessment of regulatory change impacts on contracts
* In the real estate sector, aiding in developing standardized document classification and automated information extraction[1]
References
1. Bodenbender, Mario; Kurzrock, Björn-Martin; Müller, Philipp Maximilian (April 2019). "Broad application of artificial intelligence for document classification, information extraction and predictive analytics in real estate". Journal of General Management. 44 (3): 170–179. doi:10.1177/0306307018823113. ISSN 0306-3070. http://journals.sagepub.com/doi/10.1177/0306307018823113