Action model learning (sometimes abbreviated action learning) is an area of
machine learning
Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence.
Machine ...
concerned with creation and modification of
software agent
In computer science, a software agent or software AI is a computer program that acts for a user or other program in a relationship of agency, which derives from the Latin ''agere'' (to do): an agreement to act on one's behalf. Such "action on behal ...
's knowledge about ''effects'' and ''preconditions'' of the ''actions'' that can be executed within its ''environment''. This knowledge is usually represented in logic-based
action description language
In artificial intelligence, action description language (ADL) is an automated planning and scheduling system in particular for robots. It is considered an advancement of STRIPS. Edwin Pednault (a specialist in the field of data abstraction and mod ...
and used as the input for
automated planners.
Learning action models is important when goals change. When an agent acted for a while, it can use its accumulated knowledge about actions in the domain to make better decisions. Thus, learning action models differs from
reinforcement learning
Reinforcement learning (RL) is an area of machine learning concerned with how intelligent agents ought to take actions in an environment in order to maximize the notion of cumulative reward. Reinforcement learning is one of three basic machine ...
. It enables reasoning about actions instead of expensive trials in the world.
[
] Action model learning is a form of
inductive reasoning, where new knowledge is generated based on agent's ''observations''. It differs from standard
supervised learning
Supervised learning (SL) is a machine learning paradigm for problems where the available data consists of labelled examples, meaning that each data point contains features (covariates) and an associated label. The goal of supervised learning alg ...
in that correct input/output pairs are never presented, nor imprecise action models explicitly corrected.
Usual motivation for action model learning is the fact that manual specification of action models for planners is often a difficult, time consuming, and error-prone task (especially in complex environments).
Action models
Given a
training set
In machine learning, a common task is the study and construction of algorithms that can learn from and make predictions on data. Such algorithms function by making data-driven predictions or decisions, through building a mathematical model from ...
consisting of examples
, where
are observations of a world state from two consecutive time steps
and
is an ''action instance'' observed in time step
, the goal of action model learning in general is to construct an ''action model''
, where
is a description of domain dynamics in action description formalism like
STRIPS,
ADL
Adl ( ar, عدل, ) is an Arabic word meaning ' justice', and is also one of the names of God in Islam. It is equal to the concept of ''Insaf'' انصاف (lit. sense of justice) in the Baháʼí Faith.
Adil ( ar, عادل, ), and Adeel ( ar, � ...
or
PDDL and
is a probability function defined over the elements of
.
[
]
However, many state of the art ''action learning methods'' assume determinism and do not induce
. In addition to determinism, individual methods differ in how they deal with other attributes of domain (e.g. partial observability or sensoric noise).
Action learning methods
State of the art
Recent action learning methods take various approaches and employ a wide variety of tools from different areas of
artificial intelligence
Artificial intelligence (AI) is intelligence—perceiving, synthesizing, and inferring information—demonstrated by machines, as opposed to intelligence displayed by animals and humans. Example tasks in which this is done include speech r ...
and
computational logic
Computational logic is the use of logic to perform or reason about computation. It bears a similar relationship to computer science and engineering as mathematical logic bears to mathematics and as philosophical logic bears to philosophy. It is ...
. As an example of a method based on propositional logic, we can mention SLAF (Simultaneous Learning and Filtering) algorithm,
which uses agent's observations to construct a long propositional formula over time and subsequently interprets it using a
satisfiability (SAT) solver. Another technique, in which learning is converted into a satisfiability problem (weighted
MAX-SAT In computational complexity theory, the maximum satisfiability problem (MAX-SAT) is the problem of determining the maximum number of clauses, of a given Boolean formula in conjunctive normal form, that can be made true by an assignment of truth val ...
in this case) and SAT solvers are used, is implemented in ARMS (Action-Relation Modeling System).
[
]
Two mutually similar, fully declarative approaches to action learning were based on logic programming paradigm
Answer Set Programming
Answer set programming (ASP) is a form of declarative programming oriented towards difficult (primarily NP-hard) search problems. It is based on the stable model (answer set) semantics of logic programming. In ASP, search problems are reduced ...
(ASP) and its extension, Reactive ASP. In another example, bottom-up
inductive logic programming approach was employed. Several different solutions are not directly logic-based. For example, the action model learning using a
perceptron algorithm or the multi level
greedy search
A greedy algorithm is any algorithm that follows the problem-solving heuristic of making the locally optimal choice at each stage. In many problems, a greedy strategy does not produce an optimal solution, but a greedy heuristic can yield locally ...
over the space of
possible action models. In the older paper from 1992,
the action model learning was studied as an extension of
reinforcement learning
Reinforcement learning (RL) is an area of machine learning concerned with how intelligent agents ought to take actions in an environment in order to maximize the notion of cumulative reward. Reinforcement learning is one of three basic machine ...
.
Literature
Most action learning research papers are published in journals and conferences focused on
artificial intelligence
Artificial intelligence (AI) is intelligence—perceiving, synthesizing, and inferring information—demonstrated by machines, as opposed to intelligence displayed by animals and humans. Example tasks in which this is done include speech r ...
in general (e.g. Journal of Artificial Intelligence Research (JAIR), Artificial Intelligence, Applied Artificial Intelligence (AAI) or AAAI conferences). Despite mutual relevance of the topics, action model learning is usually not addressed on
planning
Planning is the process of thinking regarding the activities required to achieve a desired goal. Planning is based on foresight, the fundamental capacity for mental time travel. The evolution of forethought, the capacity to think ahead, is c ...
conferences like ICAPS.
See also
*
Machine learning
Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence.
Machine ...
*
Automated planning and scheduling
Automation describes a wide range of technologies that reduce human intervention in processes, namely by predetermining decision criteria, subprocess relationships, and related actions, as well as embodying those predeterminations in machines ...
*
Action language
In computer science, an action language is a language for specifying state transition systems, and is commonly used to create formal models of the effects of actions on the world. Action languages are commonly used in the artificial intelligence ...
*
PDDL
*
Architecture description language
Architecture description languages (ADLs) are used in several disciplines: system engineering, software engineering, and enterprise modelling and engineering.
The system engineering community uses an architecture description language as a languag ...
*
Inductive reasoning
*
Computational logic
Computational logic is the use of logic to perform or reason about computation. It bears a similar relationship to computer science and engineering as mathematical logic bears to mathematics and as philosophical logic bears to philosophy. It is ...
*
Knowledge representation
Knowledge representation and reasoning (KRR, KR&R, KR²) is the field of artificial intelligence (AI) dedicated to representing information about the world in a form that a computer system can use to solve complex tasks such as diagnosing a medic ...
References
{{reflist
Inductive reasoning
Machine learning
Data mining